Re: [PATCH v3 2/5] block-copy: improve comments of BlockCopyTask and Blo

qemu-block

From:	Emanuele Giuseppe Esposito
Subject:	Re: [PATCH v3 2/5] block-copy: improve comments of BlockCopyTask and BlockCopyState types and functions
Date:	Thu, 10 Jun 2021 16:21:11 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.10.1



On 10/06/2021 13:12, Vladimir Sementsov-Ogievskiy wrote:

10.06.2021 13:46, Emanuele Giuseppe Esposito wrote:
On 10/06/2021 12:27, Vladimir Sementsov-Ogievskiy wrote:
10.06.2021 13:14, Emanuele Giuseppe Esposito wrote:
On 09/06/2021 11:12, Vladimir Sementsov-Ogievskiy wrote:
08.06.2021 10:33, Emanuele Giuseppe Esposito wrote:
As done in BlockCopyCallState, categorize BlockCopyTask
and BlockCopyState in IN, State and OUT fields.
This is just to understand which field has to be protected with a lock.
.sleep_state is handled in the series "coroutine: new sleep/wake API"
and thus here left as TODO.

Signed-off-by: Emanuele Giuseppe Esposito <eesposit@redhat.com>
---
  block/block-copy.c | 47 ++++++++++++++++++++++++++++++----------------
  1 file changed, 31 insertions(+), 16 deletions(-)

diff --git a/block/block-copy.c b/block/block-copy.c
index d58051288b..b3533a3003 100644
--- a/block/block-copy.c
+++ b/block/block-copy.c
@@ -56,25 +56,33 @@ typedef struct BlockCopyCallState {
      QLIST_ENTRY(BlockCopyCallState) list;
      /* State */
Why is the previous @list field not in the state? For sure it's not an IN parameter and should be protected somehow.
-    int ret;
      bool finished;
-    QemuCoSleep sleep;
-    bool cancelled;
+    QemuCoSleep sleep; /* TODO: protect API with a lock */
      /* OUT parameters */
+    bool cancelled;
      bool error_is_read;
+    int ret;
  } BlockCopyCallState;
  typedef struct BlockCopyTask {
      AioTask task;
+    /*
+     * IN parameters. Initialized in block_copy_task_create()
+     * and never changed.
+     */
      BlockCopyState *s;
      BlockCopyCallState *call_state;
      int64_t offset;
-    int64_t bytes;
-    BlockCopyMethod method;
-    QLIST_ENTRY(BlockCopyTask) list;
+    int64_t bytes; /* only re-set in task_shrink, before running the task */
+    BlockCopyMethod method; /* initialized in block_copy_dirty_clusters() */
Hmm. To be precise, method is initialized in block_copy_task_create().
And after block_copy_task_create() has finished, the task is in the list and can be read by a parallel block_copy_dirty_clusters(). So @bytes is part of State, and we must protect it..
So if I understand correctly, you refer to the fact that a parallel block_copy_dirty_clusters() can create another task and search with find_conflicting_task_locked(), or in general also block_copy_wait_one() can do the same in parallel, correct?
yes
Here there is also another problem: if we add the task to the list and then shrink it in two different critical sections, we are going to have problems because in the meanwhile find_conflicting_tasks can be issued in parallel.
But we shrink the task only once, and we do it under mutex, so we are OK I think?
I think you understood, but just in case: I am thinking of the case where we have:

But maybe I am overcomplicating.
Both shrink and find_ are done under mutex, so they can't intersect. But yes, we should keep in mind that if we do find_ under mutex, and then release the mutex, the information we got from find_ may become incorrect.
Check callers of find_conflicting_task_locked():

block_copy_wait_one has one critical section.
If there are no conflicting tasks we are OK.. Are we? OK, look at the only caller of block_copy_wait_one() - block_copy_common().
Assume block_copy_dirty_clusters() returns 0, so there are no dirty bits at some moment...
Then in a parallel thread some task may finish with failure, leaving some new dirty bits.. Then we check that there are no conflicting tasks.. And then we go out of the loop, when actually we must retry for these new dirty bits.
So I'm afraid you are right, we are not thread-safe yet in block_copy_common(), as we should check conflicting tasks and dirty bits in the same critical section to be consistent.


Wait, we are talking about two different problems:

- What I wanted to point out has to do with @bytes, not (as far as I understand) with the dirty bits. From the example I made below, I assume there are 3 separate non-overlapping critical sections:

T1: block_copy_task_create()
T2: find_conflicting_tasks() <-- sees the initial task
T1: task_shrink() <-- bytes are updated, T2 saw the wrong amount of bytes. This might or might not have consequences, I am not sure.

T1 creates the task, T2 iterates to search for conflicting tasks (called from a parallel block_copy_wait_one), T1 shrinks the current task. I think that T2 in this case misses the updated task, even though the worst that can happen is that the task is smaller, so a false positive (a task is not conflicting but might be marked as conflicting). The outcome is that T2 is waiting for a task it shouldn't, but there is no error there.
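
To make the false-positive argument concrete, here is a minimal sketch in plain C. It is not the QEMU code: Task, ranges_overlap() and find_conflict() are simplified, hypothetical stand-ins for BlockCopyTask and find_conflicting_task_locked(), and locking is left out on purpose. It only shows that a reader using the pre-shrink @bytes can report a conflict that the post-shrink value would not, and never the other way round, because a task can only get smaller.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for BlockCopyTask. */
typedef struct Task {
    int64_t offset;
    int64_t bytes;
} Task;

/* True if [a_off, a_off + a_len) and [b_off, b_off + b_len) intersect. */
static bool ranges_overlap(int64_t a_off, int64_t a_len,
                           int64_t b_off, int64_t b_len)
{
    return a_off < b_off + b_len && b_off < a_off + a_len;
}

/* Model of the conflict search: first overlapping task, or NULL. */
static const Task *find_conflict(const Task *tasks, size_t n,
                                 int64_t offset, int64_t bytes)
{
    for (size_t i = 0; i < n; i++) {
        if (ranges_overlap(tasks[i].offset, tasks[i].bytes, offset, bytes)) {
            return &tasks[i];
        }
    }
    return NULL;
}

/* Model of shrinking: the task may only get smaller, never grow. */
static void task_shrink(Task *t, int64_t new_bytes)
{
    assert(new_bytes > 0 && new_bytes < t->bytes);
    t->bytes = new_bytes;
}
```

With a task covering [0, 1024), a search for [512, 768) conflicts before task_shrink(&t, 256) and no longer conflicts after it, which is the spurious wait described above.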

- Your point is about a task failing between block_copy_dirty_clusters and block_copy_wait_one. The task failing calls block_copy_task_end and sets the dirty bitmap, but at that point block_copy_wait_one won't check it anymore and the bitmap is left dirty. I think the default behavior here should be that block_copy_dirty_clusters() is called and a new task is created. This, as you pointed out, is a proper error.

In this case, we need to stop iterating only when 1) the whole bitmap is clear, and 2) no conflicting task is present.
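
As a rough illustration of that exit condition, the sketch below samples both facts inside one critical section. Everything in it is an assumption made for the example: CopyState, task_end() and copy_is_complete() are hypothetical, a byte counter stands in for the dirty bitmap, a task counter stands in for the task list, and a pthread mutex stands in for whatever lock the series ends up using.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical model of the shared state; not the real BlockCopyState. */
typedef struct CopyState {
    pthread_mutex_t lock;
    int64_t dirty_bytes;   /* stands in for the dirty bitmap */
    int in_flight_tasks;   /* stands in for the task list */
} CopyState;

/*
 * A finishing task leaves the list and, if it failed, re-dirties its
 * region. Both updates happen in the same critical section.
 */
static void task_end(CopyState *s, int64_t bytes, bool failed)
{
    pthread_mutex_lock(&s->lock);
    assert(s->in_flight_tasks > 0);
    s->in_flight_tasks--;
    if (failed) {
        s->dirty_bytes += bytes;
    }
    pthread_mutex_unlock(&s->lock);
}

/*
 * Both conditions are read under one lock, so a task cannot fail
 * between the "bitmap clear" check and the "no task" check.
 */
static bool copy_is_complete(CopyState *s)
{
    bool done;

    pthread_mutex_lock(&s->lock);
    done = (s->dirty_bytes == 0) && (s->in_flight_tasks == 0);
    pthread_mutex_unlock(&s->lock);
    return done;
}
```

If the last task fails, task_end() makes the region dirty again before copy_is_complete() can observe an empty task list, so the caller sees "not complete" and retries.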


Therefore a possible solution can be the one below:

int stop_looping = 0;

...

do {

    // create all the tasks, clears the bitmap but
    // adds tasks to the task list
    block_copy_dirty_clusters();

    /* here a task can fail, but then the dirty map will be set */

    lock();
    // bit 0: no task is running for this operation
    task = find_conflicting_task();
    stop_looping = (task == NULL);
    // bit 1: the dirty bitmap is clear
    stop_looping |= (!bdrv_dirty_bitmap_next_dirty_area()) << 1;
    unlock();

    /* if stop_looping == 3, no task is left, so no task can fail */

    /* if a task fails here, the if below won't see it, but
     * block_copy_dirty_clusters will in the next iteration */

    if (!(stop_looping & 1)) {
        // there is some conflicting task, wait for it
        qemu_co_queue_wait(&task->wait_queue);
    }

} while (stop_looping != 3);

...

What do you think?

Emanuele

So, is there a reason why we don't want
QLIST_INSERT_HEAD(&s->tasks, task, list);
in block_copy_dirty_clusters()?
By doing that, I think we also spare @bytes from the critical section, since it is only read from that point onwards.
This way find_conflicting_tasks will just skip our newly created task.. And we'll get a conflict when we try to add our new task. No, we should add the task to the list in the same critical section where we clear the dirty bits from the bitmap.
I agree with the above.
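
A minimal sketch of that rule, under stated assumptions: the per-cluster bool array, the fixed-size task table and the pthread mutex are hypothetical simplifications, and only the function names loosely mirror the ones discussed here. The point is that clearing the dirty bits and publishing the task happen under the same lock, so a parallel search never sees a clean region with no task covering it.

```c
#include <assert.h>
#include <pthread.h>
#include <stdbool.h>
#include <stddef.h>

#define NB_CLUSTERS 16
#define MAX_TASKS   4

/* Hypothetical task: a run of clusters [first, first + count). */
typedef struct Task {
    int first;
    int count;
    bool used;
} Task;

typedef struct CopyState {
    pthread_mutex_t lock;
    bool dirty[NB_CLUSTERS];   /* stands in for the copy bitmap */
    Task tasks[MAX_TASKS];     /* stands in for the task list */
} CopyState;

/* Caller must hold s->lock. */
static Task *find_conflicting_task_locked(CopyState *s, int first, int count)
{
    for (int i = 0; i < MAX_TASKS; i++) {
        Task *t = &s->tasks[i];
        if (t->used && t->first < first + count &&
            first < t->first + t->count) {
            return t;
        }
    }
    return NULL;
}

/* Clear the dirty bits and publish the task in ONE critical section. */
static Task *task_create(CopyState *s, int first, int count)
{
    Task *t = NULL;

    assert(first >= 0 && count > 0 && first + count <= NB_CLUSTERS);

    pthread_mutex_lock(&s->lock);
    /* region is dirty, so no existing task may cover it */
    assert(!find_conflicting_task_locked(s, first, count));
    for (int i = 0; i < MAX_TASKS && !t; i++) {
        if (!s->tasks[i].used) {
            t = &s->tasks[i];
        }
    }
    if (t) {
        for (int c = first; c < first + count; c++) {
            s->dirty[c] = false;
        }
        *t = (Task){ .first = first, .count = count, .used = true };
    }
    pthread_mutex_unlock(&s->lock);
    return t;
}
```

If the insertion were moved to a later critical section, a search running in between would find neither dirty bits nor a task for the region, which is exactly the window the comment above wants to avoid.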
So to me the most correct solution would be to call create and shrink in the same lock, but this creates a much wider critical section.
Alternatively, I can leave it as it is and just update the comment.
Then we shrink task in another critical section, it should be OK too.
I am also trying to see if I can group some critical sections.
Btw I think we already talked about @bytes and it's not the first time we switch it from IN to STATE and vice-versa...
I mean, I agree with you but it starts to be confusing.
On last review it seemed to me that you actually protect bytes by a critical section where it is needed. So here I'm talking only about the comment..
This also goes against your comment later in patch 4,
@@ -212,7 +222,7 @@ static BlockCopyTask *block_copy_task_create(BlockCopyState *s,
      bytes = QEMU_ALIGN_UP(bytes, s->cluster_size);
        /* region is dirty, so no existent tasks possible in it */
-    assert(!find_conflicting_task(s, offset, bytes));
+    assert(!find_conflicting_task_locked(s, offset, bytes));
        bdrv_reset_dirty_bitmap(s->copy_bitmap, offset, bytes);
      s->in_flight_bytes += bytes;
@@ -248,16 +258,19 @@ static void coroutine_fn block_copy_task_shrink(BlockCopyTask *task,
The function reads task->bytes not under mutex.. It's safe, as only that function is modifying the field, and it's called once. Still, let's make the critical section a little bit wider, just for simplicity. I mean, a simple QEMU_LOCK_GUARD() at the start of the function.
Where, if I understand correctly, it is not safe, because find_conflicting_tasks might search the non-updated task.
find_conflicting_tasks only reads bytes, so it can't do damage.. Anyway, making critical sections a bit wider won't hurt.
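
For illustration only, this is roughly what "take the lock at the start of the function" looks like in a standalone model. It uses a pthread mutex with explicit lock/unlock instead of QEMU_LOCK_GUARD(), and CopyState, Task and the dirty_bytes counter are hypothetical simplifications rather than the real structures. The early-return comparison against task->bytes now happens under the lock too.

```c
#include <assert.h>
#include <pthread.h>
#include <stdint.h>

/* Hypothetical model of the shared state. */
typedef struct CopyState {
    pthread_mutex_t lock;
    int64_t in_flight_bytes;
    int64_t dirty_bytes;     /* stands in for the copy bitmap */
} CopyState;

/* Hypothetical model of a task. */
typedef struct Task {
    int64_t offset;
    int64_t bytes;
} Task;

/*
 * Shrink a task with the whole body inside one critical section,
 * so that even the first read of t->bytes is protected.
 */
static void task_shrink(CopyState *s, Task *t, int64_t new_bytes)
{
    pthread_mutex_lock(&s->lock);
    if (new_bytes != t->bytes) {
        assert(new_bytes > 0 && new_bytes < t->bytes);
        /* the dropped tail is no longer in flight and is dirty again */
        s->in_flight_bytes -= t->bytes - new_bytes;
        s->dirty_bytes += t->bytes - new_bytes;
        t->bytes = new_bytes;
    }
    pthread_mutex_unlock(&s->lock);
}
```

The cost is a slightly longer critical section in the no-op case; the benefit is that nobody has to reason about whether the unlocked read is safe.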
