qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH] blockdev: avoid acquiring AioContext lock twice


From: Sergio Lopez
Subject: Re: [Qemu-devel] [PATCH] blockdev: avoid acquiring AioContext lock twice at do_drive_backup()
Date: Fri, 13 Sep 2019 11:37:24 +0200
User-agent: mu4e 1.2.0; emacs 26.2

Max Reitz <address@hidden> writes:

> On 12.09.19 18:16, Sergio Lopez wrote:
>> do_drive_backup() acquires the AioContext lock of the corresponding
>> BlockDriverState. This is not a problem when it's called from
>> qmp_drive_backup(), but drive_backup_prepare() also acquires the lock
>> before calling it.
>> 
>> This change adds a BlockDriverState argument to do_drive_backup(),
>> which is used to signal that the context lock is already acquired and
>> to save a couple of redundant calls.
>
> But those redundant calls don’t really hurt (it’s just bdrv_lookup_bs(),
> as far as I can tell).  Wouldn’t it be simpler to just release the
> context lock in drive_backup_prepare() before calling do_drive_backup()?
>  The BDS is drained anyway.

Redundant calls rarely hurt, they're just redundant ;-)

> On top of that, do_backup_common() calls bdrv_try_set_aio_context() to
> bring the target into the source’s AioContext.  However, this function
> must be called with the old AioContext held, and the new context not held.

Is this documented somewhere? I see nothing in the function declaration
nor definition.

I'm starting to get the feeling that the block layer is riddled with
unwritten rules and assumptions that makes every change a lot harder
than it should be.

> Currently, it’s called exactly the other way around: With the new
> context held, but the old one not held.
>
> So I think it indeed actually makes more sense to release the AioContext
> before calling do_drive_backup(), and to move the
> bdrv_try_set_aio_context() call for target_bs to the callers of
> do_backup_common() (where they have not yet taken the AioContext lock).

OK. I see this also happens in external_snapshot_prepare() and
qmp_drive_mirror() too. I guess we should fix these too.

In qmp_drive_mirror(), would it be safe to delay the acquisition of any
context until just before the blockdev_mirror_common()?

> Max
>
>> Signed-off-by: Sergio Lopez <address@hidden>
>> ---
>>  blockdev.c | 54 ++++++++++++++++++++++++++++++++++++++----------------
>>  1 file changed, 38 insertions(+), 16 deletions(-)
>> 
>> diff --git a/blockdev.c b/blockdev.c
>> index fbef6845c8..0cc6c69ceb 100644
>> --- a/blockdev.c
>> +++ b/blockdev.c
>> @@ -1762,8 +1762,10 @@ typedef struct DriveBackupState {
>>      BlockJob *job;
>>  } DriveBackupState;
>>  
>> -static BlockJob *do_drive_backup(DriveBackup *backup, JobTxn *txn,
>> -                            Error **errp);
>> +static BlockJob *do_drive_backup(DriveBackup *backup,
>> +                                 BlockDriverState *backup_bs,
>> +                                 JobTxn *txn,
>> +                                 Error **errp);
>>  
>>  static void drive_backup_prepare(BlkActionState *common, Error **errp)
>>  {
>> @@ -1781,6 +1783,11 @@ static void drive_backup_prepare(BlkActionState 
>> *common, Error **errp)
>>          return;
>>      }
>>  
>> +    if (!bs->drv) {
>> +        error_setg(errp, "Device has no medium");
>> +        return;
>> +    }
>> +
>>      aio_context = bdrv_get_aio_context(bs);
>>      aio_context_acquire(aio_context);
>>  
>> @@ -1789,7 +1796,9 @@ static void drive_backup_prepare(BlkActionState 
>> *common, Error **errp)
>>  
>>      state->bs = bs;
>>  
>> -    state->job = do_drive_backup(backup, common->block_job_txn, &local_err);
>> +    state->job = do_drive_backup(backup, bs,
>> +                                 common->block_job_txn,
>> +                                 &local_err);
>>      if (local_err) {
>>          error_propagate(errp, local_err);
>>          goto out;
>> @@ -3607,7 +3616,9 @@ static BlockJob *do_backup_common(BackupCommon *backup,
>>      return job;
>>  }
>>  
>> -static BlockJob *do_drive_backup(DriveBackup *backup, JobTxn *txn,
>> +static BlockJob *do_drive_backup(DriveBackup *backup,
>> +                                 BlockDriverState *backup_bs,
>> +                                 JobTxn *txn,
>>                                   Error **errp)
>>  {
>>      BlockDriverState *bs;
>> @@ -3625,18 +3636,27 @@ static BlockJob *do_drive_backup(DriveBackup 
>> *backup, JobTxn *txn,
>>          backup->mode = NEW_IMAGE_MODE_ABSOLUTE_PATHS;
>>      }
>>  
>> -    bs = bdrv_lookup_bs(backup->device, backup->device, errp);
>> -    if (!bs) {
>> -        return NULL;
>> -    }
>> +    if (backup_bs) {
>> +        bs = backup_bs;
>> +        /*
>> +         * If the caller passes us a BDS, we assume it has already
>> +         * acquired the context lock.
>> +         */
>> +        aio_context = bdrv_get_aio_context(bs);
>> +    } else {
>> +        bs = bdrv_lookup_bs(backup->device, backup->device, errp);
>> +        if (!bs) {
>> +            return NULL;
>> +        }
>>  
>> -    if (!bs->drv) {
>> -        error_setg(errp, "Device has no medium");
>> -        return NULL;
>> -    }
>> +        if (!bs->drv) {
>> +            error_setg(errp, "Device has no medium");
>> +            return NULL;
>> +        }
>>  
>> -    aio_context = bdrv_get_aio_context(bs);
>> -    aio_context_acquire(aio_context);
>> +        aio_context = bdrv_get_aio_context(bs);
>> +        aio_context_acquire(aio_context);
>> +    }
>>  
>>      if (!backup->has_format) {
>>          backup->format = backup->mode == NEW_IMAGE_MODE_EXISTING ?
>> @@ -3713,7 +3733,9 @@ static BlockJob *do_drive_backup(DriveBackup *backup, 
>> JobTxn *txn,
>>  unref:
>>      bdrv_unref(target_bs);
>>  out:
>> -    aio_context_release(aio_context);
>> +    if (!backup_bs) {
>> +        aio_context_release(aio_context);
>> +    }
>>      return job;
>>  }
>>  
>> @@ -3721,7 +3743,7 @@ void qmp_drive_backup(DriveBackup *arg, Error **errp)
>>  {
>>  
>>      BlockJob *job;
>> -    job = do_drive_backup(arg, NULL, errp);
>> +    job = do_drive_backup(arg, NULL, NULL, errp);
>>      if (job) {
>>          job_start(&job->job);
>>      }
>> 

Attachment: signature.asc
Description: PGP signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]