Re: [Qemu-devel] qcow2 performance plan

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] qcow2 performance plan

From:	Anthony Liguori
Subject:	Re: [Qemu-devel] qcow2 performance plan
Date:	Tue, 14 Sep 2010 10:25:24 -0500
User-agent:	Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.1.12) Gecko/20100826 Lightning/1.0b1 Thunderbird/3.0.7

On 09/14/2010 08:07 AM, Avi Kivity wrote:

Here's a draft of a plan that should improve qcow2 performance. It'swritten in wiki syntax for eventual upload to wiki.qemu.org; linesstarting with # are numbered lists, not comments.


= Basics =

At the minimum level, no operation should block the main thread.  This
could be done in two ways: extending the state machine so that each

blocking operation can be performed asynchronously(<code>bdrv_aio_*</code>)

or by threading: each new operation is handed off to a worker thread.
Since a full state machine is prohibitively complex, this document
will discuss threading.

== Basic threading strategy ==

A first iteration of qcow2 threading adds a single mutex to an image.
The existing qcow2 code is then executed within a worker thread,
acquiring the mutex before starting any operation and releasing it
after completion.  Concurrent operations will simply block until the
operation is complete.  For operations which are already asynchronous,
the blocking time will be negligible since the code will call
<code>bdrv_aio_{read,write}</code> and return, releasing the mutex.
The immediate benefit is that currently blocking operations no long block
the main thread, instead they just block the block operation which is
blocking anyway.

== Eliminating the threading penalty ==

We can eliminate pointless context switches by using the worker thread
context we're in to issue the I/O.  This is trivial for synchronous calls

(<code>bdrv_read</code> and <code>bdrv_write</code>); we simply issuethe I/O

from the same thread we're currently in.  The underlying raw block format

driver threading code needs to recognize we're in a worker threadcontext so

it doesn't need to use a worker thread of its own; perhaps using a thread
variable to see if it is in the main thread or an I/O worker thread.

For asynchronous operations, this is harder.  We may add a
<code>bdrv_queue_aio_read</code> and <code>bdrv_queue_aio_write</code> if
to replace a

    bdrv_aio_read()
    mutex_unlock(bs.mutex)
    return;

sequence.  Alternatively, we can just eliminate asynchronous calls.  To
retain concurrency we drop the mutex while performing the operation:
an convert a <code>bdrv_aio_read</code> to:

    mutex_unlock(bs.mutex)
    bdrv_read()
    mutex_lock(bs.mutex)

The incremental version of this is hard for me to understand.bdrv_read() may be implemented in terms of bdrv_aio_read() +qemu_io_wait() which dispatches bottom halves. This is done through ashared resource so if you allow bdrv_read() to be called in parallel,there's a very real possibility that you'll get corruption of a sharedresource.

You'd have to first instrument bdrv_read() to be re-entrant by acquiringbs.mutex() in every bdrv_read() caller. You would then need to modifythe file protocol so that it could safely be called in parallel.

IOW, you've got to make the whole block layer thread safe before you canbegin to make qcow2 thread safe.


Regards,

Anthony Liguori

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [Qemu-devel] qcow2 performance plan, (continued)
- [Qemu-devel] Re: qcow2 performance plan, Kevin Wolf, 2010/09/14
  - Re: [Qemu-devel] Re: qcow2 performance plan, Stefan Hajnoczi, 2010/09/14
- Re: [Qemu-devel] qcow2 performance plan, Anthony Liguori <=
  - Re: [Qemu-devel] qcow2 performance plan, Avi Kivity, 2010/09/14

Prev by Date: Re: [Qemu-devel] qcow2 performance plan
Next by Date: Re: [Qemu-devel] [PATCH] Disable virtio-balloon memory stats interface
Previous by thread: Re: [Qemu-devel] Re: qcow2 performance plan
Next by thread: Re: [Qemu-devel] qcow2 performance plan
Index(es):
- Date
- Thread