Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support f

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support f

From:	Geert Jansen
Subject:	Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication]
Date:	Tue, 29 May 2012 13:57:33 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:12.0) Gecko/20120430 Thunderbird/12.0.1

Hi,

On 05/24/2012 04:19 PM, Paolo Bonzini wrote:

Here is how the bitmaps are handled when doing I/O on the source:
- after writing to the source:
   - clear bit in the volatile in-flight bitmap
   - set bit in the persistent dirty bitmap

- after flushing the source:
   - msync the persistent bitmap to disk


Here is how the bitmaps are handled in the drive-mirror coroutine:
- before reading from the source:
   - set bit in the volatile in-flight bitmap

- after writing to the target:
   - if the dirty count will become zero, flush the target
   - if the bit is still set in the in-flight bitmap, clear bit in the
     persistent dirty bitmap
   - clear bit in the volatile in-flight bitmap


I have a few questions, apologies if some of these are obvious..

I assume the target can be any QEmu block driver including e.g. NBD? Anetworked block driver would be required for a continuous replicationsolution.

Does the drive-mirror coroutine send the writes to the target in thesame order as they are sent to the source? I assume so.

Does the drive-mirror coroutine require that writes are acknowledged?I'd assume so, as you mention that the bit from the persistent bitmap iscleared after a write, so you'd need to know the write arrived otherwiseyou cannot safely clear the bit.

If the two above are true (sending in-order, and require acknowledgmentof writes by the target), then I assume there is a need to keep anin-memory list with the IOs that still need to be sent to the target?That list could get too large if i.e. the target cannot keep up orbecomes unavailable. When this happens, the dirty bitmap is needed tore-establish synchronized state again between the two images.

For this re-sync, i think there will be two phases. The first phasewould send blocks marked as dirty by the bitmap. I assume these would besent in arbitrary order, not the order in which they were sent to thesource, right?

After the copy phase is done, in order to avoid race conditions, thebitmap should be reset and mirroring should start directly andatomically. Is that currently handed by your design?

Also probably the target would need some kind of signal that the copyended and that we are now mirroring because this is when writes arein-order again, and therefore only in this phase the solution canprovide crash consistent protection. In the copy phase no crashconsistency can be provided if i am not mistaken.

Finally, again if i am not mistaken, I think that the scenario wheresynchronization is lost with the target is exactly the same as when youneed to do an initial copy, expect that in the latter case all bits inthe bitmap are set, right?


Regards,
Geert

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] Proposal for extensions of block job commands in QEMU 1.2, (continued)
- [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/24
  - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Ori Mamluk, 2012/05/24
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/24
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Dor Laor, 2012/05/24
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/25
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Geert Jansen <=
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/29
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Geert Jansen, 2012/05/30
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/30
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Geert Jansen, 2012/05/31
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/31
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Roni Luxenberg, 2012/05/31
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/31
  - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Eric Blake, 2012/05/24
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Paolo Bonzini, 2012/05/25
    - Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication], Eric Blake, 2012/05/25

Prev by Date: [Qemu-devel] [PATCH 1.1 v2] sheepdog: fix return value of do_load_save_vm_state
Next by Date: [Qemu-devel] [PATCH 1.1] sheepdog: add coroutine_fn markers to coroutine functions
Previous by thread: Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication]
Next by thread: Re: [Qemu-devel] Block job commands in QEMU 1.2 [v2, including support for replication]
Index(es):
- Date
- Thread