Re: [Qemu-devel] [RFC] More robust migration

qemu-devel

From:	Anthony Liguori
Subject:	Re: [Qemu-devel] [RFC] More robust migration
Date:	Fri, 20 Feb 2009 09:15:58 -0600
User-agent:	Thunderbird 2.0.0.19 (X11/20090105)

Hi Andre,

Andre Przywara wrote:

Hi,
after fiddling around with migration (and the data dumped into the stream) I found the current concept possesses some shortcomings.

Yikes :-) FWIW, I focused a lot on robustness in the implementation, so hopefully a lot of what you mention below were conscious decisions with very specific reasoning.

I am interested in your opinions on whether it is worth implementing a new, improved format.

FWIW, the format is sufficiently versioned that it isn't necessary to completely change it (not that I think it needs changing).

Issues I would like to address:
1. Transfer configuration data. Currently there is no VM configuration data transferred with the stream.

Yes, the difficulty here is that we need to transfer the machine configuration but not the host configuration. Management tools should decide how to configure the host on the target side but we should be passing the machine configuration.

If you've been following the config file threads, I've mentioned this as a use case for the current design a number of times. We would pass a flattened device tree as another savevm section with a well known name (like "machine"). Given the semantics of the current migration protocol, this would ensure that the machine generated on the remote node was exactly the same as the source node.
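To make the "machine" section idea concrete, here is a minimal sketch. It is not QEMU code: the tree layout, the JSON encoding, and the helper names `flatten`, `save_machine`, and `load_machine` are all assumptions made for illustration. Only the section name "machine" comes from the text above. The source serializes the guest-visible machine description deterministically, and the destination refuses to continue unless it built an identical machine.

```python
import json

MACHINE_SECTION = "machine"  # well-known section name suggested in the text


def flatten(tree, prefix=""):
    """Flatten a nested dict into (path, value) pairs in a stable order."""
    items = []
    for key in sorted(tree):
        path = f"{prefix}/{key}"
        value = tree[key]
        if isinstance(value, dict):
            items.extend(flatten(value, path))
        else:
            items.append((path, value))
    return items


def save_machine(tree):
    """Serialize the machine description (hypothetical encoding)."""
    return json.dumps(flatten(tree)).encode("utf-8")


def load_machine(payload, local_tree):
    """Abort unless the destination machine matches the source exactly."""
    if payload != save_machine(local_tree):
        raise ValueError("machine configuration mismatch; aborting migration")
```

Host-side settings (disk image paths, tap device names and so on) are deliberately absent from the tree in this sketch, which matches the split between machine and host configuration described above.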

One has to start QEMU/KVM with the _exact_ same parameters on the other side to allow migration. If there were a pseudo-device (transferred first) holding these parameters (and other runtime dependent stuff like kvm_enabled()) this would ease migration a lot.


FWIW, there's nothing preventing migrating from TCG -> KVM.

I think one can debate about whether host config should be migrated too. I'd argue that in the core migration protocol, host config should not be present. I think you can have an easier to use migration protocol (like the old ssh protocol) that also transferred host config. But in the general case, you want management tools to be able to manipulate host config upon migration.

2. Introduce a length field to the header of each device.

IMHO, this would reduce robustness. It's also difficult because of the way savevm registration works. You don't know how large a section is until it's written and migration streams are not seekable.

This would allow skipping unknown (or unwanted) devices.

No good can come from this. If you have an unknown section, you must throw an error and stop the migration. What if this is for a device that the guest is interacting with? The device just disappears after migration? All savevm state is state that affects the functionality of a guest. Throwing away this state will change the functionality of the VM and migration should not affect guest functionality.

I know this imposes a bit of a challenge, because the length is not always known in advance, but one could overcome this (by using the buffer to patch in the length later for instance).
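The buffering workaround proposed here can be sketched as follows. This is an illustration only: the header layout (name length, name, version, payload length) and the functions `write_section` and `read_section` are assumptions, not the actual savevm wire format. Because the output stream is not seekable, the section body is first written into an in-memory buffer, and the header is emitted only once the length is known.

```python
import io
import struct


def write_section(stream, name, version, save_fn):
    """Buffer one section so its length can precede it on the wire."""
    body = io.BytesIO()
    save_fn(body)                      # device writes its state here
    payload = body.getvalue()
    encoded = name.encode("ascii")
    stream.write(struct.pack("!B", len(encoded)))
    stream.write(encoded)
    # Hypothetical header: 32-bit version, 32-bit payload length.
    stream.write(struct.pack("!II", version, len(payload)))
    stream.write(payload)


def read_section(stream):
    """Read back one section written by write_section."""
    (name_len,) = struct.unpack("!B", stream.read(1))
    name = stream.read(name_len).decode("ascii")
    version, length = struct.unpack("!II", stream.read(8))
    return name, version, stream.read(length)
```

The cost is that every section has to fit in memory before it is sent, which is cheap for small device state but not for large sections.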

What are the use cases where you think this would be beneficial? I really see the change in semantics from the old way (throwing away unknown sections) to the new way (requiring strict versioning and validating all sections) as being a huge step toward robustness.
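The strict semantics described above can be illustrated with a small sketch. The class `SectionRegistry`, the exception `MigrationError`, and the handler signature are hypothetical names chosen for this example and do not mirror QEMU's savevm registration API. The point is only the policy: an unknown section name or an unsupported version stops the load instead of being skipped.

```python
class MigrationError(Exception):
    """Raised when the incoming stream cannot be loaded safely."""


class SectionRegistry:
    def __init__(self):
        self._handlers = {}

    def register(self, name, max_version, load_fn):
        """Record the newest version this side understands for a section."""
        self._handlers[name] = (max_version, load_fn)

    def load(self, name, version, payload):
        """Load one section, failing hard on anything unrecognized."""
        if name not in self._handlers:
            raise MigrationError(f"unknown section {name!r}")
        max_version, load_fn = self._handlers[name]
        if version > max_version:
            raise MigrationError(
                f"section {name!r} version {version} is newer than "
                f"supported version {max_version}"
            )
        load_fn(payload, version)
```

A lenient loader would differ only in replacing the first `raise` with a skip, which is exactly the behavior argued against here: the guest would silently lose a device's state.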

3. Make the device versioning really bulletproof. Currently some devices dump different data depending on runtime (or, better, time-of-creation) state (for instance hw/i8254.c: if (s->irq_timer)...).

If you look carefully, s->irq_timer will always be set. The checks are unnecessary.

Another example is the (x86?) CPU state, which differs with KVM en/disabled.


Not in upstream QEMU...

Some devices even dump host system dependent structures (like struct vecio in virtio-blk.c).

That is awful and needs to be fixed. It should have never been committed like that.
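To show why dumping a host-dependent structure is a problem, the sketch below uses Python's `struct` module as a stand-in for C struct layout. The record itself (a 32-bit count followed by a 64-bit offset) and the function names are made up for illustration and are not taken from virtio-blk.c. Native mode (`@`) follows the host compiler's sizes, alignment, and byte order, so the bytes can differ between hosts; the explicit big-endian mode (`!`) uses fixed sizes with no padding, so both ends agree.

```python
import struct


def save_native(count, offset):
    """Host-dependent: size, padding, and byte order follow the host ABI."""
    return struct.pack("@iq", count, offset)


def save_portable(count, offset):
    """Fixed layout: 4-byte count, 8-byte offset, big-endian, no padding."""
    return struct.pack("!iq", count, offset)


def load_portable(data):
    """Inverse of save_portable; works the same on any host."""
    return struct.unpack("!iq", data)
```

`save_portable` always produces 12 bytes, whereas the length and byte order of `save_native` output depend on the machine that ran it, which is what makes such a dump unsafe to send to another host.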

Also one could create some kind of (limited) upward compatibility, so older QEMU versions ignore additional, but optional fields in a device state (similar to the ext2 compatibility scheme). Maybe this could be done by an external converter program.

To me, ignoring is always a bad thing. It's almost always going to be unsafe. Doesn't this decrease robustness by being less conservative?

4. Allow optional devices. Some devices are always started (like HPET), although they don't need to be used by the OS. If one migrates such a guest from say KVM-83 to KVM-81, it will fail, because KVM-81 does not support HPET. One could migrate the device only if it has been used.

There's no way you can migrate from KVM-83 to KVM-81 if you've enabled the HPET. It cannot be made to work.

There is a -no-hpet option though. If you are a management tool that needs to support migration from multiple versions, you should use -no-hpet. Also, if you need to migrate from KVM-81 to KVM-83, you should use -no-hpet with KVM-83 to avoid changing the guest visible state.

In the long run, the machine configuration file will address this in a more thorough manner. FWIW, -no-hpet was added specifically to deal with migration.

In general I would like to know whether QEMU migration is intended to be used in such a flexible manner or whether the requirement of the exact same software version on both sides is not a limitation in everyday use.

My primary goal for migration is robustness. I do not think it's a good idea to support any circumstances that could introduce changes in guest visible state during a live migration.

Live migration is a critical feature for many production environments. To be useful IMHO, it has to be bullet-proof.


Regards,

Anthony Liguori

Awaiting your comments!

Regards,
Andre.
