Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

From:	Avi Kivity
Subject:	Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy
Date:	Sun, 27 Feb 2011 11:10:35 +0200
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.13) Gecko/20101209 Fedora/3.1.7-0.35.b3pre.fc14 Lightning/1.0b3pre Thunderbird/3.1.7

On 02/24/2011 07:58 PM, Anthony Liguori wrote:

If you move the cdrom to a different IDE channel, you have to updatethe stateful non-config file.
Whereas if you do

   $ qemu-img create -f cd-tray -b ~/foo.img ~/foo-media-tray.img
   $ qemu -cdrom ~/foo-media-tray.img

the cd-rom tray state will be tracked in the image file.
Yeah, but how do you move it?

There is no need to move the file at all. Simply point the new drive atthe media tray.

If you do a remove/add through QMP, then the config file will reflectthings just fine.

If all access to the state file is through QMP then it becomes morepalatable. A bit on that later.

If you want to do it outside of QEMU, then you can just ignore theconfig file and manage all of the state yourself. But it's nevergoing to work as well (it will be racy) and you're pushing atremendous amount of knowledge that ultimately belongs in QEMU (whatstate needs to persist) to something that isn't QEMU which means it'sprobably not going to be done correctly.
I know you're a big fan of the omnipotent management tool but myexperience has been that we need to help the management tooling folksmore by expecting less from them.

I thought that's what I'm doing by separating the state out. It's easyfor management to assemble configuration from their database and convertit into a centralized representation (like a qemu command line). It's alot harder to disassemble a central state representation and move itback to the database.

Using QMP is better than directly accessing the state file since qemudoes the disassembly for you (provided the command references the deviceusing its normal path, not some random key). The file just becomes away to survive a crash, and all management needs to know about is tomake it available and back it up. But it means that everything must bedone via QMP, including assembly of the machine, otherwise the statefile can become stale.

Separating the state out to the device is even easier, since managementis already expected to take care of disk images. All that's needed isto create the media tray image once, then you can forget about itcompletely.

Again the question is who is the authoritative source of theconfiguration. Is it the management tool or is it qemu?
QEMU. No question about it. At any point in time, we are theauthoritative source of what the guest's configuration is. There's nodoubt about it. A management tool can try to keep up with us, butultimately we are the only ones that know for sure.
We have all of this information internally. Just persisting it is nota major architectural change. It's something we should have beendoing (arguably) from the very beginning.

That's a huge divergence from how management tools are written.Currently they contain the required guest configuration, arepresentation of what's the current live configuration, and they issuemonitor commands to move the live configuration towards the requiredconfiguration (or just generate a qemu command line). What you'redescribing is completely different, I'm not even sure what it is.

The management tool already has to keep track of (the optional partsof) the guest device tree. It cannot start reading the statefulnon-config file at random points in time. So all that is left is theguest controlled portions of the device tree, which are pretty rare,and random events like live-copy migration. I think that introducinga new authoritative source of information will create a lot of problems.
QEMU has always been the authoritative source. Nothing new has beenintroduced. We never persisted the machine's configuration whichmeant management tools had to try to aggressively keep up with uswhich is intrinsically error prone. Fixing this will only improveexisting management tools.

If you look at management tools, they believe they are the authoritativesource of configuration information (not guest state, which is more orless ignored).

Right, but we should make it easy, not hard.
Yeah, I fail to see how this makes it hard. We conveniently aresaying, hey, this is all the state that needs to be persisted. We'llpersist it for you if you want, otherwise, we'll expose it in acentral location.

The state-in-a-file is just a blob. Don't expect the tool to parse itand reassociate the various bits to its own representation. Exposing itvia QMP commands is a lot better though.

If the tool wants to ignore it and guess based on various combinationsof other commands, more power to it.
I don't see why it doesn't work.  Please explain.
1) guest eject
2) qemu posts eject event
3) qemu acknowledges eject to the guest
4) management tool sees eject event and updates guest config
There's a race between 3 & 4. It can only be addressed byinterposing 4 between 2 and 3 OR making qemu persist this statebetween 2 and 3 such that the management tool can reliably query it.
If "it" is my cd-rom tray block format driver, it works. It's reallythe same in action as the stateful non-config, except it's part ofthe device/image, not a central location.
Because you've introduced a one-off. Having a bunch of one-offs(especially being a bunch of new block formats!) is not going to makethings simpler for management tools.

I don't see it as a one-off. We have some state, we store it in a statefile. Like we store each image in its own file.


Have a tool expose it.  Part of the range is unspecified anyway.


I guess we need to agree to disagree then.


We can just extend our existing disagreement agreement.

Using a block format driver means that we don't have to care about acrash during a write, that we can snapshot it, etc.
Why?


Because block format drivers are already prepared for it.

We always need to care about a crash during write. What I've beenthinking for a config file is the class approach of using a ~ and .#file to make sure that we write out the new file and then atomicallyrename it to get the new contents. Yeah, it's a bit heavy weight butthis shouldn't be a very common thing to update.

Suppose it has information about ide1-cd0's media tray. Now werestart qemu and cold-plug the cdrom into ide0-cd0. What happens tothe information?
Whether media is present is not a property of a blockdev, it's aproperty of a device.

You're making it the property of a device path (I guess you could makeit the property of a device's id).

What does it even mean to use your media-tray format with somethinglike a CMOS device?


Nothing.

Technically, mac address is stored on eeprom and we store that as adevice property today. We can't persist device properties eventhough you can change the mac address of a network card and it doespersist across reboots. Are you advocating that we introduce aneeprom for every network card (all in a slightly different format)and have special tools to manipulate the eeprom to store and viewthe mac address?
Yes -- if we really want to support it. Obviously we have to storethe entire eeprom, not just the portion containing the MAC address,so it's not just a key/value store. A card may even have itsfirmware in flash.
I think that's overengineering. I think we can go very far by justpersisting small amounts of information in a central location. We'renot building a cycle-accurate simulator here afterall.


Well, the whole thing is rather theoretical.

--
error compiling committee.c: too many arguments to function

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy, (continued)

Prev by Date: Re: [Qemu-devel] Stupid question?
Next by Date: Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy
Previous by thread: Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy
Next by thread: Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy
Index(es):
- Date
- Thread