qemu-trivial
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-trivial] [Qemu-devel] [PATCH for 2.1] vfio: use correct runsta


From: Eric Blake
Subject: Re: [Qemu-trivial] [Qemu-devel] [PATCH for 2.1] vfio: use correct runstate
Date: Wed, 02 Jul 2014 07:32:23 -0600
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:24.0) Gecko/20100101 Thunderbird/24.6.0

[adding libvirt]

On 06/27/2014 11:51 AM, Paolo Bonzini wrote:
> ----- Messaggio originale -----
>> Da: "Alex Williamson" <address@hidden>
>> A: "Paolo Bonzini" <address@hidden>
>> Cc: address@hidden, address@hidden
>> Inviato: Venerdì, 27 giugno 2014 18:34:59
>> Oggetto: Re: [PATCH for 2.1] vfio: use correct runstate
>>
>> On Fri, 2014-06-27 at 16:32 +0200, Paolo Bonzini wrote:
>>> io-error is for block device errors; it should always be preceded
>>> by a BLOCK_IO_ERROR event.
>>
>> Where does this requirement come from?  I only see a loose association
>> of IO_ERROR to disk in libvirt and none in QEMU.
> 
> See the RunState enum in qapi-schema.json:
> 
> ##
> # @RunState
> #
> # An enumeration of VM run states.
> #
> # ...
> #
> # @internal-error: An internal error that prevents further guest execution
> # has occurred
> #
> # @io-error: the last IOP has failed and the device is configured to pause
> # on I/O errors
> #
> # @paused: guest has been paused via the 'stop' command
> 
> The point of io-error is that management can look at block devices, see if
> any have an error reported, and then resume execution (see documentation of
> rerror=stop and werror=stop/enospc).  This is counter to the intentions you
> have in vfio.
> 
>>> I think vfio wants to use
>>> RUN_STATE_INTERNAL_ERROR instead.
>>
>> But that seems to put us into an "unknown" paused state in libvirt.
> 
> I think paused is incorrect, because (unlike RUN_STATE_IO_ERROR), you cannot
> resume from RUN_STATE_INTERNAL_ERROR except with a reset.  QEMU enforces that,
> and this matches the error you are reporting:
> 
>     error_report("%s(%04x:%02x:%02x.%x) Unrecoverable error detected.  "
>                  "Please collect any data possible and then kill the guest",
>                  __func__, vdev->host.domain, vdev->host.bus,
>                  vdev->host.slot, vdev->host.function);
> 
> libvirt has a crashed state, I think that's what libvirt should call the
> internal-error runstate.  IIRC on Xen you get to crashed when the processor
> raises an error on vmentry, for example.
> 
> Libvirt only knows about crashed/unknown, but one could add 
> crashed/internal-error
> too.

Yes, we probably need to teach libvirt about this state.

-- 
Eric Blake   eblake redhat com    +1-919-301-3266
Libvirt virtualization library http://libvirt.org

Attachment: signature.asc
Description: OpenPGP digital signature


reply via email to

[Prev in Thread] Current Thread [Next in Thread]