Re: [Qemu-devel] crash in vhost-user-bridge on migration


From: Marc-André Lureau
Subject: Re: [Qemu-devel] crash in vhost-user-bridge on migration
Date: Wed, 03 May 2017 10:31:17 +0000

Hi

On Tue, May 2, 2017 at 11:17 PM Dr. David Alan Gilbert <address@hidden>
wrote:

> Hi,
>   I've started playing with vhost-user-bridge and have it
> basically up and going, but I've just tried migration and
> I've got a reliable crash for it; I'm not that sure I've
> got it set up right, so suggestions please:
>
> This is with qemu head, on an f26 host running an f25-ish
> guest.
>
> Program received signal SIGSEGV, Segmentation fault.
> 0x000055c414112ce4 in vring_avail_idx (vq=0x55c41582fd68,
> vq=0x55c41582fd68)
>     at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:940
> 940         vq->shadow_avail_idx = vq->vring.avail->idx;
> (gdb) p vq
> $1 = (VuVirtq *) 0x55c41582fd68
> (gdb) p vq->vring
> $2 = {num = 0, desc = 0x0, avail = 0x0, used = 0x0, log_guest_addr = 0,
> flags = 0}
> (gdb) p vq->shadow_avail_idx
> $3 = 0
>
> #0  0x000055c414112ce4 in vring_avail_idx (vq=0x55c41582fd68,
> vq=0x55c41582fd68)
>     at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:940
> No locals.
> #1  virtqueue_num_heads (idx=0, vq=0x55c41582fd68, dev=0x55c41582fc20)
>     at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:960
>         num_heads = <optimized out>
> #2  vu_queue_get_avail_bytes (dev=0x55c41582fc20, vq=0x55c41582fd68,
> address@hidden,
>     address@hidden,
> address@hidden,
>     address@hidden) at
> /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:1034
>         idx = 0
>         total_bufs = 0
>         in_total = 0
>         out_total = 0
>         rc = <optimized out>
> #3  0x000055c414112fbd in vu_queue_avail_bytes (dev=<optimized out>,
> vq=<optimized out>, in_bytes=0, out_bytes=0)
>     at /home/dgilbert/git/qemu/contrib/libvhost-user/libvhost-user.c:1116
>         in_total = 0
>         out_total = 0
> #4  0x000055c4141114da in vubr_backend_recv_cb (sock=<optimized out>,
> ctx=0x55c41582fc20)
>     at /home/dgilbert/git/qemu/tests/vhost-user-bridge.c:276
>         vubr = 0x55c41582fc20
>         dev = 0x55c41582fc20
>         vq = 0x55c41582fd68
>         elem = 0x0
>         mhdr_sg = {{iov_base = 0x0, iov_len = 0} <repeats 740 times>,
> {iov_base = 0x0, iov_len = 140512740079088}, {
>                    .....}
>
>         mhdr = {hdr = {flags = 0 '\000', gso_type = 0 '\000', hdr_len = 0,
> gso_size = 0, csum_start = 0,
>             csum_offset = 0}, num_buffers = 0}
>         mhdr_cnt = 0
>         hdrlen = 0
>         i = 0
>         hdr = {flags = 0 '\000', gso_type = 0 '\000', hdr_len = 0,
> gso_size = 0, csum_start = 0, csum_offset = 0}
>         __PRETTY_FUNCTION__ = "vubr_backend_recv_cb"
> #5  0x000055c414110ad3 in dispatcher_wait (timeout=200000,
> dispr=0x55c4158300b8)
>     at /home/dgilbert/git/qemu/tests/vhost-user-bridge.c:154
>         e = 0x55c415830180
>
>
This happens because vubr assumes that if a ring is enabled, it can read
and use it. That is not the case: qemu may enable a ring before the slave
has received the ring addresses.

The protocol has evolved in this area. There are several ways to fix this;
I think the simplest is for vu_queue_avail_bytes() to return false if the
ring is not yet available. I'll send a patch.

> That's from the destination bridge; I'm running both on a single
> host and that's happening when I just do:
>    migrate_set_speed 1G
>    migrate tcp:localhost:8888
>
> The destination qemu spits out:
> qemu-system-x86_64: Failed to set msg fds.
> qemu-system-x86_64: vhost VQ 0 ring restore failed: -1: Resource
> temporarily unavailable (11)
> qemu-system-x86_64: Failed to set msg fds.
> qemu-system-x86_64: vhost VQ 1 ring restore failed: -1: Resource
> temporarily unavailable (11)
>
> but I'm not sure if that's before or after the bridge's segfault.
>

That is after, on disconnect: once the bridge has crashed, the vhost-user
socket is gone, so those ring-restore messages fail.

>
> I've got:
>   a) One qemu that just has the -net socket / -net user setup as per the
> docs - but I've got
>      two lots of sockets for either side
>   b) Two qemus for the guests, the second with just the -incoming
>   c) The two vhost-user-bridge instances - the destination being pointed
> at the second set of sockets.
>
> My test is run by doing:
> #!/bin/bash -x
> SESS=vhost
> tmux -L $SESS new-session -d
> tmux -L $SESS set-option -g set-remain-on-exit on
> # Start a router using the system qemu
> tmux -L $SESS new-window -n router qemu-system-x86_64 -M none -nographic
> -net socket,vlan=0,udp=localhost:4444,localaddr=localhost:5555 -net
> socket,vlan=0,udp=localhost:4445,localaddr=localhost:5556 -net user,vlan=0
> # Start source vhost bridge
> tmux -L $SESS new-window -n srcvhostbr ./tests/vhost-user-bridge -u
> /tmp/vubrsrc.sock
> tmux -L $SESS new-window -n source "./x86_64-softmmu/qemu-system-x86_64
> -enable-kvm -m 1G -smp 2 -object
> memory-backend-file,id=mem,size=1G,mem-path=/dev/shm,share=on -numa
> node,memdev=mem -mem-prealloc -chardev
> socket,id=char0,path=/tmp/vubrsrc.sock -netdev
> type=vhost-user,id=mynet1,chardev=char0,vhostforce -device
> virtio-net-pci,netdev=mynet1 /home/vmimages/f25.qcow2 -net none"
> # Start dest vhost bridge
> tmux -L $SESS new-window -n destvhostbr ./tests/vhost-user-bridge -u
> /tmp/vubrdst.sock -l 127.0.0.1:4445 -r 127.0.0.1:5556
> tmux -L $SESS new-window -n dest "./x86_64-softmmu/qemu-system-x86_64
> -enable-kvm -m 1G -smp 2 -object
> memory-backend-file,id=mem,size=1G,mem-path=/dev/shm,share=on -numa
> node,memdev=mem -mem-prealloc -chardev
> socket,id=char0,path=/tmp/vubrdst.sock -netdev
> type=vhost-user,id=mynet1,chardev=char0,vhostforce -device
> virtio-net-pci,netdev=mynet1 /home/vmimages/f25.qcow2 -net none -incoming
> tcp::8888"
>
> (I've got a few added printf's so the lines might be off by a few).
>
>
Thanks for the report
-- 
Marc-André Lureau

