qemu-devel

Re: [PATCH 00/13] block: remove aio_disable_external() API


From: Paolo Bonzini
Subject: Re: [PATCH 00/13] block: remove aio_disable_external() API
Date: Tue, 4 Apr 2023 15:43:20 +0200
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.9.0

On 4/3/23 20:29, Stefan Hajnoczi wrote:
The aio_disable_external() API temporarily suspends file descriptor monitoring
in the event loop. The block layer uses this to prevent new I/O requests from
being submitted by the guest and elsewhere between bdrv_drained_begin() and
bdrv_drained_end().

While the block layer still needs to prevent new I/O requests in drained
sections, the aio_disable_external() API can be replaced with
.drained_begin/end/poll() callbacks that have been added to BdrvChildClass and
BlockDevOps.

This newer .drained_begin/end/poll() approach is attractive because it works
without referring to a specific AioContext. The block layer is moving towards
multi-queue, which means multiple AioContexts may be processing I/O
simultaneously.

The aio_disable_external() API was always somewhat hacky. It suspends all file
descriptors that were registered with is_external=true, even if they have
nothing to do with the BlockDriverState graph nodes that are being drained.
It's better to solve a block layer problem in the block layer than to have an
odd event loop API solution.

That covers the motivation for this change, now on to the specifics of this
series:

While it would be nice if a single conceptual approach could be applied to all
is_external=true file descriptors, I ended up looking at callers on a
case-by-case basis. There are two general ways I migrated code away from
is_external=true:

1. Block exports are typically best off unregistering fds in .drained_begin()
    and registering them again in .drained_end(). The .drained_poll() function
    waits for in-flight requests to finish using a reference counter.

2. Emulated storage controllers like virtio-blk and virtio-scsi are a little
    simpler. They can rely on BlockBackend's feature of queuing requests during
    drain. Guest I/O request coroutines are suspended in a drained section and
    resume when the drained section ends.

Sorry, I disagree with this.

Request queuing was shown to cause deadlocks; Hanna's latest patch piles another hack on top of it. In my opinion we should instead move in the direction of relying _less_ (or not at all) on request queuing.

I am strongly convinced that request queuing must apply only after bdrv_drained_begin has returned, which would also fix the IDE TRIM bug reported by Fiona Ebner. The possible livelock scenario is generally not a problem because 1) outside an iothread you have the BQL anyway, which prevents a vCPU from issuing more I/O operations during bdrv_drained_begin, and 2) in iothreads you have aio_disable_external() instead of .drained_begin().

It is also less tidy to start a request during the drained_begin phase, because a request that has been submitted has to be completed (cancel doesn't really work).

So in an ideal world, request queuing would not only apply after bdrv_drained_begin has returned; it would also log a warning, and .drained_begin() should set things up so that there are no such warnings.

Thanks,

Paolo



