From: Eric Blake
Subject: Re: [Qemu-block] [Qemu-devel] [PULL 21/35] block: fix QEMU crash with scsi-hd and drive_del
Date: Wed, 8 Aug 2018 09:53:43 -0500
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0

On 08/08/2018 09:32 AM, Vladimir Sementsov-Ogievskiy wrote:

What's more, in commit f140e300, we specifically called out in the commit message that maybe it was better to trace when we detect connection closed rather than log it to stdout, and in all cases in that commit, the additional 'Connection closed' messages do not add any information to the error message already displayed by the rest of the code.


Ok, agree, I'll do it in reconnect series.



hmm, do what?

I was going to change these error messages to be traces, but now I'm not sure that it's a good idea.

Traces are fine. They won't show up in iotests, but will show up when debugging a failed connection.
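For illustration only (this is not the actual patch; the event name and file placement are made up), converting such a message into a QEMU trace point would look roughly like this:

  # nbd/trace-events: declare the event (hypothetical name)
  nbd_connection_closed(const char *err) "connection closed: %s"

  /* In the client code, include the generated header and call the trace
   * function instead of printing the message: */
  #include "trace.h"
  ...
      trace_nbd_connection_closed(local_err ? error_get_pretty(local_err) : "");

An event like that stays silent in iotest output but can still be turned on when debugging (e.g. via the tools' --trace option).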

We have a generic errp returned from the function, so why drop it from the logs?

Because it is redundant with the very next line already in the log. Any error encountered when trying to write to a disconnected server is redundant with an already-reported error due to detecting EOF on reading from the server.

Fixing the iotest is not a good reason; it would be better to adjust the iotest itself a bit (just commit the changed output) and forget about it. Is the iotest itself racy? Did you see different output when running iotest 83, rather than only when testing by hand?

The condition for the output of the 'Connection closed' message is racy - it depends entirely on whether the client was able to send() a read request to the server before the server disconnected immediately after negotiation ended. If the client loses the race and detects the server hangup before writing anything, you get one path; if the client wins the race, successfully writes the request, and only later learns that the server has disconnected when trying to read the response to that request, you get the other path. The window for the race changed (and the iotests did not seem to ever expose it short of this particular change to the block layer to do an extra drain), but I could still imagine scenarios where the iotests trigger the opposite path of the race from what is expected, depending on load. I don't see any synchronization point between the two processes: the server hangs up after negotiation without reading the client's request, but the client may or may not have had time to get its request into the server's queue.
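As a standalone illustration of that race (plain POSIX sockets, not QEMU or NBD code; all names here are made up), the two observable paths look roughly like this:

  #include <errno.h>
  #include <signal.h>
  #include <stdio.h>
  #include <string.h>
  #include <sys/socket.h>
  #include <sys/wait.h>
  #include <unistd.h>

  int main(void)
  {
      int sv[2];
      pid_t pid;
      char req[] = "read request";
      char reply[64];

      signal(SIGPIPE, SIG_IGN);         /* get EPIPE instead of a fatal signal */
      if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0) {
          perror("socketpair");
          return 1;
      }

      pid = fork();
      if (pid == 0) {                   /* "server": hang up right after setup */
          close(sv[0]);
          usleep(getpid() % 2000);      /* jitter: the close may land before or
                                           after the client's write() */
          close(sv[1]);                 /* disconnect without reading anything */
          _exit(0);
      }

      close(sv[1]);                     /* "client" keeps the other end */
      if (write(sv[0], req, sizeof(req)) < 0) {
          /* client lost the race: disconnect detected before the request
             was even sent */
          fprintf(stderr, "send failed: %s\n", strerror(errno));
      } else if (read(sv[0], reply, sizeof(reply)) <= 0) {
          /* client won the race: request was sent, EOF only shows up while
             waiting for the reply */
          fprintf(stderr, "no reply: connection closed\n");
      }
      close(sv[0]);
      waitpid(pid, NULL, 0);
      return 0;
  }

Either way the client ends up reporting a failed read; the only difference is whether the failure is noticed at send() time or at reply time, which is why the extra 'Connection closed' line is timing-dependent.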

So even though I have not seen the iotest fail directly because of a race, I think that this commit causing failures in the iotest is evidence that the test is not robust with those extra 'Connection closed' messages being output. Switching the output to be a trace instead should be just fine; overall, the client's attempt to read when the server hangs up will be an EIO failure whether the client managed to send() its request and merely fails to get a reply (server disconnect was slow), or was not even able to send() its request (server disconnect was fast).

--
Eric Blake, Principal Software Engineer
Red Hat, Inc.           +1-919-301-3266
Virtualization:  qemu.org | libvirt.org


