qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: VM crashed while hot-plugging memory


From: David Hildenbrand
Subject: Re: VM crashed while hot-plugging memory
Date: Mon, 27 Feb 2023 16:49:57 +0100
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.7.2

On 23.02.23 20:32, Igor Mammedov wrote:
On Fri, 10 Feb 2023 09:30:18 +0000
Yangming via <qemu-devel@nongnu.org> wrote:

Hello all:

I found VM crashed while hot-plugging memory.

Base infomation:
qemu version: qemu-master
requirements: hugepages, virtio-gpu

It happens by the following steps:
1. Booting a VM with hugepages and a virtio-gpu device.
2. Connecting VNC of the VM.
3. After the VM booted, hot-plugging 512G memory.
4. Then you can find that the image in vnc is blocked and the worse thing is 
that the VM crashed.

Actually the vcpu is blocked because of dead lock.

Analysis:
As when hot-pluging the BQL is held, at the meanwhile, virtio-gpu is trying to 
hold the BQL for writing date. Then a vcpu is blocked waiting for hugepages 
hot-plugging, specifically, waiting for touching pages. If the blocked vcpu 
stops for several seconds, the soft lockup will happen, if it stops for a long 
time, e.g. 30s, the VM will crash.

I am wandering if there are some ideas to avoid VM soft lockup and even VM 
crash ?

Maybe David can suggest something
(CCed)

Using hugepages usually requires memory preallocation. That preallocation is expensive and can take quite some time, and all hotplugging operations happen under the BQL.

Things that could improve the situation without modifications:

(a) Disable memory preallocation (prealloc=off on the memory backend).
    But that means that if you run out of huge pages, that your VM may
    crash.

(b) Use a file on a hugetlb mount, and preallocate the memory
    externally, outside of QEMU, before plugging creating the memory
    backend and plugging the DIMM. As all memory is already
    preallocated, plugging the DIMM should be fast.

(c) Use multiple, smaller DIMMs.

(d) Parallel preallocation, using multiple preallocation threads.

(e) Use virtio-mem instead of DIMMs, which will add the memory
    incrementally in smaller steps (e.g., 128MiB -- 2 GiB). But it is
    not supported by all guests (especially not under Windows yet).


There are some upstream ideas on how to do preallcoation with hugetlb faster, especially, having a pool of pre-zero'ed huge pages in the kernel, such that allocation of a huge page gets significantly faster -- not upstream.

Further, there was the idea of asynchronous preallocation in QEMU. That could help when first creating the memory backend and waiting until it was asynchronously preallocation. Then, one could plug the DIMM.

--
Thanks,

David / dhildenb




reply via email to

[Prev in Thread] Current Thread [Next in Thread]