Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support

From:	Avihai Horon
Subject:	Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support
Date:	Sun, 26 Feb 2023 18:43:50 +0200
User-agent:	Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101 Thunderbird/102.8.0


On 23/02/2023 23:16, Alex Williamson wrote:

External email: Use caution opening links or attachments


On Thu, 23 Feb 2023 17:25:12 +0200
Avihai Horon <avihaih@nvidia.com> wrote:

On 22/02/2023 22:58, Alex Williamson wrote:

External email: Use caution opening links or attachments


On Wed, 22 Feb 2023 19:48:58 +0200
Avihai Horon <avihaih@nvidia.com> wrote:

@@ -302,23 +380,44 @@ static void vfio_save_cleanup(void *opaque)
       trace_vfio_save_cleanup(vbasedev->name);
   }

+static void vfio_state_pending_estimate(void *opaque, uint64_t threshold_size,
+                                        uint64_t *must_precopy,
+                                        uint64_t *can_postcopy)
+{
+    VFIODevice *vbasedev = opaque;
+    VFIOMigration *migration = vbasedev->migration;
+
+    if (migration->device_state != VFIO_DEVICE_STATE_PRE_COPY) {
+        return;
+    }
+
+    /*
+     * Initial size should be transferred during pre-copy phase so stop-copy
+     * phase will not be slowed down. Report threshold_size to force another
+     * pre-copy iteration.
+     */
+    *must_precopy += migration->precopy_init_size ?
+                         threshold_size :
+                         migration->precopy_dirty_size;

This sure feels like we're feeding false data back to the iterator to
spoof it to run another iteration, when the vfio migration protocol
only recommends that initial_bytes reaches zero before proceeding to
stop-copy, it's not a requirement.  What benefit is actually observed
from this?  Why is this required for initial pre-copy support?  It
seems devious.

As previously discussed in the thread that added the pre-copy uAPI [1],
the init_bytes can be used by drivers to reduce the downtime.
For example, mlx5 transfers some metadata to the target so it will be
able to pre-allocate resources etc.

[1]
https://lore.kernel.org/kvm/ae4a6259-349d-0131-896c-7a6ea775cc9e@nvidia.com/

Yes, but how does that become a requirement to QEMU that it must
iterate until the initial segment is complete?  Especially when we need
to trigger that behavior via such nefarious means.  AIUI, QEMU should
be allowed to move to stop-copy at any point.  We should make efforts
that QEMU would never decide on its own to move from pre-copy to
stop-copy without completing the init_bytes (which sounds suspiciously
like the purpose of @must_precopy),

@must_precopy represents the pending bytes that must be transferredduring pre-copy or stop-copy. If it's under the threshold, thenmigration will move to stop-copy and be completed.So simply adding init_bytes to @must_precopy will not guarantee that wesend all init_bytes before moving to stop-copy, since the transition tostop-copy can happen when @must_precopy != 0.

  but if, for instance a user forces a
transition to stop-copy, I don't see that we have any business to
impose a policy to delay that until the init_bytes is complete.


Is there a way a user can force the migration to move to stop-copy?

Looking at migration code, it seems that the only way to move tostop-copy is if @must_precopy is below the threshold.If so, then this is our effort to make QEMU send all init_bytes beforemoving to stop_copy and we can only benefit from it.

Regarding how to do it -- maybe instead of spoofing @must_precopy we canintroduce a new parameter in upper migration layer (e.g., @init_precopy)and add another condition in migration layer that it must be zero tomove to stop-copy.


Thanks.

[Prev in Thread]

Current Thread

[Next in Thread]

Re: [PATCH v2 07/20] vfio/common: Add VFIOBitmap and (de)alloc functions, (continued)
- [PATCH v2 09/20] util: Extend iova_tree_foreach() to take data argument, Avihai Horon, 2023/02/22
- [PATCH v2 05/20] vfio/common: Fix wrong %m usages, Avihai Horon, 2023/02/22
- [PATCH v2 06/20] vfio/common: Abort migration if dirty log start/stop/sync fails, Avihai Horon, 2023/02/22
- [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Avihai Horon, 2023/02/22
  - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Alex Williamson, 2023/02/22
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Avihai Horon, 2023/02/23
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Alex Williamson, 2023/02/23
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Avihai Horon <=
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Alex Williamson, 2023/02/27
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Jason Gunthorpe, 2023/02/27
    - Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support, Alex Williamson, 2023/02/27
- [PATCH v2 08/20] util: Add iova_tree_nnodes(), Avihai Horon, 2023/02/22
- [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Avihai Horon, 2023/02/22
  - Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Alex Williamson, 2023/02/22
    - Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Jason Gunthorpe, 2023/02/22
    - Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Alex Williamson, 2023/02/23
    - Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Jason Gunthorpe, 2023/02/23
    - Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop, Alex Williamson, 2023/02/23

Prev by Date: Re: [PATCH v4] audio/pwaudio.c: Add Pipewire audio backend for QEMU
Next by Date: Re: [PATCH v2 11/20] vfio/common: Add device dirty page tracking start/stop
Previous by thread: Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support
Next by thread: Re: [PATCH v2 03/20] vfio/migration: Add VFIO migration pre-copy support
Index(es):
- Date
- Thread