Re: [PATCH v2] block/stream: Drain subtree around graph change

qemu-block

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2] block/stream: Drain subtree around graph change

From:	Hanna Reitz
Subject:	Re: [PATCH v2] block/stream: Drain subtree around graph change
Date:	Mon, 28 Mar 2022 10:09:48 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.5.0

On 28.03.22 09:44, Hanna Reitz wrote:

On 25.03.22 17:37, Vladimir Sementsov-Ogievskiy wrote:
24.03.2022 17:09, Hanna Reitz wrote:
When the stream block job cuts out the nodes between top and base in
stream_prepare(), it does not drain the subtree manually; it fetchesthe
base node, and tries to insert it as the top node's backing node with
bdrv_set_backing_hd(). bdrv_set_backing_hd() however will drain,and sothe actual base node might change (because the base node is actuallynot
part of the stream job) before the old base node passed to
bdrv_set_backing_hd() is installed.

This has two implications:
First, the stream job does not keep a strong reference to the basenode.
Therefore, if it is deleted in bdrv_set_backing_hd()'s drain (e.g.
because some other block job is drained to finish), we will get a
use-after-free.  We should keep a strong reference to that node.

Second, even with such a strong reference, the problem remains that the
base node might change before bdrv_set_backing_hd() actually runsand as
a result the wrong base node is installed.
Hmm.
So, we don't really need a strong reference, as if it helps to avoidsome use-after-free, it means that we'll finish up with wrong blockgraph..
Sure. But I found it better style to strongly reference a node whileit’s used. I’d rather have an outdated block graph (as in: A nodethat was supposed to disappear would still be in use) than ause-after-free.
Graph modifying operations must be somehow isolated from each other.
Both effects can be seen in 030's TestParallelOps.test_overlapping_5()
case, which has five nodes, and simultaneously streams from the middle
node to the top node, and commits the middle node down to the basenode.
As it is, this will sometimes crash, namely when we encounter the
above-described use-after-free.

Taking a strong reference to the base node, we no longer get a crash,
but the resuling block graph is less than ideal: The expected result is
obviously that all middle nodes are cut out and the base node is the
immediate backing child of the top node.  However, if stream_prepare()
takes a strong reference to its base node (the middle node), and then
the commit job finishes in bdrv_set_backing_hd(), supposedly dropping
that middle node, the stream job will just reinstall it again.

Therefore, we need to keep the whole subtree drained in
stream_prepare(), so that the graph modification it performs is
effectively atomic, i.e. that the base node it fetches is still thebase
node when bdrv_set_backing_hd() sets it as the top node's backing node.
Emanuele has similar idea of isolating graph changes from each otherby subtree-drain.
If I understand correctly the idea is that we'll drain all otherblock jobs, so the wouldn't do their block-graph modification duringdrained section. So, we can safely modify the graph.
I don't like this idea:
1. drained section = stop IO. But we don't need to stop IO in thewhole subtree to do a needed block-graph modification.
If you mean to say that draining just the single node should besufficient, I’ll be happy to change it.
Not sure which node, though, because I’d think it would be `base`, butto safely fetch it I’d need to drain it, which seems to bite itself inthe tail. That’s why I went for a subtree drain from `above_base`.
2. Drained section is not a lock, several clients may drain same setof nodes.. So we exploit the fact that concurrent clients will bepaused by drained section and don't proceed to graph-modificationcode.. But are we sure that block-jobs are (and will be?) the onlyconcurrent block-graph modifying clients? Can qmp commands interleavesomehow?
They can under very specific circumstances and that’s a bug. Seehttps://lists.nongnu.org/archive/html/qemu-block/2022-03/msg00582.html .
Can some jobs from other subtree start a block-graph modificationthat touches our subtree?
That would be wrong. A block job shouldn’t change nodes it doesn’town; stream doesn’t own the base, but it also doesn’t change it, itonly needs to have the top node point to it.
If go this way, that would be more safe to drain the wholeblock-graph on any block-graph modification..
I think we'd better have a separate global mechanism for isolatinggraph modifications. Something like a global co-mutex or queue, whereclients waits for their turn in block graph modifications.
Here is my old proposal on that topic:https://patchew.org/QEMU/20201120161622.1537-1-vsementsov@virtuozzo.com/
That would only solve the very specific issue in 030, right? Thestream job isn’t protected from any graph modifications but thosecoming from mirror. Might be a solution going forward (I didn’t lookcloser at it at the time, given I saw you had a discussion withKevin), if we lock every graph change operation (though a global lockhonestly doesn’t sound strictly better than draining subsections ofthe graph, both have their drawbacks), but that doesn’t look like it’dbe something for 7.1.

I wonder whether we could have a short-term version of`BdrvChild.frozen` that’s a coroutine mutex. If `.frozen` is set, youjust can’t change the graph, and you also can’t wait, so that’s just anerror. But if `.frozen_lock` is set, you can wait on it. Here, we’dkeep `.frozen` set for all links between top and above_base, and then inprepare() take `.frozen_lock` on the link between above_base and base.

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz, 2022/03/24
- Re: [PATCH v2] block/stream: Drain subtree around graph change, John Snow, 2022/03/24
  - Re: [PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz, 2022/03/25
- Re: [PATCH v2] block/stream: Drain subtree around graph change, Eric Blake, 2022/03/25
- Re: [PATCH v2] block/stream: Drain subtree around graph change, Vladimir Sementsov-Ogievskiy, 2022/03/25
  - Re: [PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz, 2022/03/28
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz <=
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Vladimir Sementsov-Ogievskiy, 2022/03/28
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz, 2022/03/29
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Vladimir Sementsov-Ogievskiy, 2022/03/29
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Hanna Reitz, 2022/03/29
    - Re: [PATCH v2] block/stream: Drain subtree around graph change, Emanuele Giuseppe Esposito, 2022/03/30

Prev by Date: Re: [PATCH v2] block/stream: Drain subtree around graph change
Next by Date: Re: Proposal for a regular upstream performance testing
Previous by thread: Re: [PATCH v2] block/stream: Drain subtree around graph change
Next by thread: Re: [PATCH v2] block/stream: Drain subtree around graph change
Index(es):
- Date
- Thread