Re: [Qemu-block] backup notifier fail policy

From:	Stefan Hajnoczi
Subject:	Re: [Qemu-block] backup notifier fail policy
Date:	Mon, 3 Oct 2016 14:11:51 +0100
User-agent:	Mutt/1.7.0 (2016-08-17)

On Fri, Sep 30, 2016 at 09:59:16PM +0300, Vladimir Sementsov-Ogievskiy wrote:
> On 30.09.2016 20:11, Vladimir Sementsov-Ogievskiy wrote:
> > Hi all!
> > 
> > Can somebody please explain why we fail the guest request when an I/O
> > error occurs in the write notifier? I think guest consistency is more
> > important than the success of an unfinished backup. Or what am I missing?
> > 
> > I'm talking about this code:
> > 
> > static int coroutine_fn backup_before_write_notify(
> >         NotifierWithReturn *notifier,
> >         void *opaque)
> > {
> >     BackupBlockJob *job = container_of(notifier, BackupBlockJob,
> >                                        before_write);
> >     BdrvTrackedRequest *req = opaque;
> >     int64_t sector_num = req->offset >> BDRV_SECTOR_BITS;
> >     int nb_sectors = req->bytes >> BDRV_SECTOR_BITS;
> > 
> >     assert(req->bs == blk_bs(job->common.blk));
> >     assert((req->offset & (BDRV_SECTOR_SIZE - 1)) == 0);
> >     assert((req->bytes & (BDRV_SECTOR_SIZE - 1)) == 0);
> > 
> >     return backup_do_cow(job, sector_num, nb_sectors, NULL, true);
> > }
> > 
> > So, what about something like
> > 
> > ret = backup_do_cow(job, ...
> > if (ret < 0 && job->notif_ret == 0) {
> >    job->notif_ret = ret;
> > }
> > 
> > return 0;
> > 
> > and fail the block job if notif_ret < 0 elsewhere in the backup code?
> > 
> 
> And a second question about notifiers in the backup block job. If the block
> job is paused, the notifiers still work and can copy data. Is that OK? The
> user thinks the job is paused, so they may do something with the target
> disk, but that 'something' will actually race with the write notifiers. So
> what assumptions may the user make about a paused backup job? Is there any
> agreement on this? Also, query-block-jobs will report job.busy = false
> while a copy-on-write may actually be in flight.

I agree that the job should fail and the guest should continue running.

The backup job cannot do the usual ENOSPC stop/resume error handling
since we lose snapshot consistency once guest writes are allowed to
proceed.  Backup errors need to be fatal; resuming is usually not
possible.  The user will have to retry the backup operation.
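
The policy discussed above could be sketched roughly as follows. This is a
standalone illustration, not QEMU code: MockBackupJob, mock_do_cow(),
mock_before_write_notify() and mock_job_step() are hypothetical stand-ins,
and only the notif_ret field name is taken from the proposal quoted above.
The notifier records the first copy-on-write error and lets the guest write
proceed; the job later treats that recorded error as fatal.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical stand-in for BackupBlockJob; only what the sketch needs. */
typedef struct MockBackupJob {
    int notif_ret;        /* first error seen by the notifier, 0 if none */
    int cow_should_fail;  /* knob that makes the mock copy-on-write fail */
} MockBackupJob;

/* Mock of the copy-on-write step: returns 0 or a negative errno value. */
static int mock_do_cow(MockBackupJob *job)
{
    return job->cow_should_fail ? -EIO : 0;
}

/* Before-write notifier: never propagates a backup error to the guest. */
static int mock_before_write_notify(MockBackupJob *job)
{
    int ret;

    assert(job != NULL);
    ret = mock_do_cow(job);
    if (ret < 0 && job->notif_ret == 0) {
        job->notif_ret = ret;   /* keep only the first error */
    }
    return 0;                   /* the guest request always proceeds */
}

/*
 * One step of the job's own loop: a recorded notifier error is fatal,
 * because the snapshot is no longer consistent once the guest write
 * has gone through without its copy-on-write.
 */
static int mock_job_step(MockBackupJob *job)
{
    assert(job != NULL);
    if (job->notif_ret < 0) {
        return job->notif_ret;  /* fail the job; the user must retry */
    }
    return 0;
}
```

In this sketch the error is sticky: once notif_ret is set, every later call
to mock_job_step() fails, which matches the point that resuming is usually
not possible.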

Stefan
