qemu-block
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH 10/13] hw/ide/piix: Reuse PCIIDEState::{cmd,data}_ops


From: Bernhard Beschow
Subject: Re: [PATCH 10/13] hw/ide/piix: Reuse PCIIDEState::{cmd,data}_ops
Date: Fri, 19 May 2023 17:09:50 +0000


Am 18. Mai 2023 14:53:26 UTC schrieb Mark Cave-Ayland 
<mark.cave-ayland@ilande.co.uk>:
>On 13/05/2023 13:21, Bernhard Beschow wrote:
>
>> Am 3. Mai 2023 19:52:41 UTC schrieb Mark Cave-Ayland 
>> <mark.cave-ayland@ilande.co.uk>:
>>> On 27/04/2023 19:15, Bernhard Beschow wrote:
>>> 
>>>> Am 27. April 2023 10:52:17 UTC schrieb Mark Cave-Ayland 
>>>> <mark.cave-ayland@ilande.co.uk>:
>>>>> On 26/04/2023 21:14, Bernhard Beschow wrote:
>>>>> 
>>>>>> Am 26. April 2023 18:18:35 UTC schrieb Bernhard Beschow 
>>>>>> <shentey@gmail.com>:
>>>>>>> 
>>>>>>> 
>>>>>>> Am 26. April 2023 11:37:48 UTC schrieb Mark Cave-Ayland 
>>>>>>> <mark.cave-ayland@ilande.co.uk>:
>>>>>>>> On 22/04/2023 16:07, Bernhard Beschow wrote:
>>>>>>>> 
>>>>>>>>> Now that PCIIDEState::{cmd,data}_ops are initialized in the base class
>>>>>>>>> constructor there is an opportunity for PIIX to reuse these 
>>>>>>>>> attributes. This
>>>>>>>>> resolves usage of ide_init_ioport() which would fall back internally 
>>>>>>>>> to using
>>>>>>>>> the isabus global due to NULL being passed as ISADevice by PIIX.
>>>>>>>>> 
>>>>>>>>> Signed-off-by: Bernhard Beschow <shentey@gmail.com>
>>>>>>>>> ---
>>>>>>>>>      hw/ide/piix.c | 30 +++++++++++++-----------------
>>>>>>>>>      1 file changed, 13 insertions(+), 17 deletions(-)
>>>>>>>>> 
>>>>>>>>> diff --git a/hw/ide/piix.c b/hw/ide/piix.c
>>>>>>>>> index a3a15dc7db..406a67fa0f 100644
>>>>>>>>> --- a/hw/ide/piix.c
>>>>>>>>> +++ b/hw/ide/piix.c
>>>>>>>>> @@ -104,34 +104,32 @@ static void piix_ide_reset(DeviceState *dev)
>>>>>>>>>          pci_set_byte(pci_conf + 0x20, 0x01);  /* BMIBA: 20-23h */
>>>>>>>>>      }
>>>>>>>>>      -static bool pci_piix_init_bus(PCIIDEState *d, unsigned i, 
>>>>>>>>> ISABus *isa_bus,
>>>>>>>>> -                              Error **errp)
>>>>>>>>> +static void pci_piix_init_bus(PCIIDEState *d, unsigned i, ISABus 
>>>>>>>>> *isa_bus)
>>>>>>>>>      {
>>>>>>>>>          static const struct {
>>>>>>>>>              int iobase;
>>>>>>>>>              int iobase2;
>>>>>>>>>              int isairq;
>>>>>>>>>          } port_info[] = {
>>>>>>>>> -        {0x1f0, 0x3f6, 14},
>>>>>>>>> -        {0x170, 0x376, 15},
>>>>>>>>> +        {0x1f0, 0x3f4, 14},
>>>>>>>>> +        {0x170, 0x374, 15},
>>>>>>>>>          };
>>>>>>>>> -    int ret;
>>>>>>>>> +    MemoryRegion *address_space_io = 
>>>>>>>>> pci_address_space_io(PCI_DEVICE(d));
>>>>>>>>>            ide_bus_init(&d->bus[i], sizeof(d->bus[i]), DEVICE(d), i, 
>>>>>>>>> 2);
>>>>>>>>> -    ret = ide_init_ioport(&d->bus[i], NULL, port_info[i].iobase,
>>>>>>>>> -                          port_info[i].iobase2);
>>>>>>>>> -    if (ret) {
>>>>>>>>> -        error_setg_errno(errp, -ret, "Failed to realize %s port %u",
>>>>>>>>> -                         object_get_typename(OBJECT(d)), i);
>>>>>>>>> -        return false;
>>>>>>>>> -    }
>>>>>>>>> +    memory_region_add_subregion(address_space_io, 
>>>>>>>>> port_info[i].iobase,
>>>>>>>>> +                                &d->data_ops[i]);
>>>>>>>>> +    /*
>>>>>>>>> +     * PIIX forwards the last byte of cmd_ops to ISA. Model this 
>>>>>>>>> using a low
>>>>>>>>> +     * prio so competing memory regions take precedence.
>>>>>>>>> +     */
>>>>>>>>> +    memory_region_add_subregion_overlap(address_space_io, 
>>>>>>>>> port_info[i].iobase2,
>>>>>>>>> +                                        &d->cmd_ops[i], -1);
>>>>>>>> 
>>>>>>>> Interesting. Is this behaviour documented somewhere and/or used in one 
>>>>>>>> of your test images at all? If I'd have seen this myself, I probably 
>>>>>>>> thought that the addresses were a typo...
>>>>>>> 
>>>>>>> I first  stumbled upon this and wondered why this code was working with 
>>>>>>> VIA_IDE (through my pc-via branch). Then I found the correct offsets 
>>>>>>> there which are confirmed in the piix datasheet, e.g.: "Secondary 
>>>>>>> Control Block Offset: 0374h"
>>>>>> 
>>>>>> In case you were wondering about the forwarding of the last byte the 
>>>>>> datasheet says: "Accesses to byte 3 of the Control Block are forwarded 
>>>>>> to ISA where the floppy disk controller responds."
>>>>> 
>>>>> Ahhh okay okay I see what's happening here: the PIIX IDE is assuming that 
>>>>> the legacy ioport semantics are in operation here, which as you note 
>>>>> above is where the FDC controller is also accessed via the above byte in 
>>>>> the IDE control block. This is also why you need to change the address 
>>>>> above from 0x3f6/0x376 to 0x3f4/0x374 when trying to use the 
>>>>> MemoryRegions used for the PCI BARs since the PCI IDE controller 
>>>>> specification requires a 4 byte allocation for the Control Block - see 
>>>>> sections 2.0 and 2.2.
>>>> 
>>>> Yes, PIIX assuming that might be the case. Why does it contradict the PCI 
>>>> IDE specification? PIIX seems to apply the apprppriate "workarounds" here.
>>> 
>>> Can you explain a bit more about where you see the contradiction? At first 
>>> glance it looks okay to me.
>>> 
>>>>> And that's fine, because the portio_lists used in ide_init_ioport() set 
>>>>> up the legacy IDE ioports so that FDC accesses done in this way can 
>>>>> succeed, and the PIIX IDE is hard-coded to legacy mode. So in fact PIIX 
>>>>> IDE should keep using ide_init_ioport() rather than trying to re-use the 
>>>>> BAR MemoryRegions so I think this patch should just be dropped.
>>>> 
>>>> I was hoping to keep that patch...
>>> 
>>> Perhaps a different way to think about it is that from QEMU's perspective a 
>>> BAR is a MemoryRegion that can be dynamically assigned/updated and cannot 
>>> overlap, whereas the portio_list implementation also handles unaligned 
>>> accesses and overlapping sparse accesses. Since the latter is the exact 
>>> case here with the IDE/FDC then it seems the existing portio_list solution 
>>> already does the "right thing" instead of having to manually emulate the 
>>> overlapping dispatch.
>> 
>> I've had another look into the "PCI IDE Controller Specification Revision 
>> 1.0" which says:
>
>Interesting: it looks as if we are getting different conclusions from the same 
>document.
>
>> "The Control Block registers consist of two bytes used for control/status of 
>> the IDE device. The second byte of this pair is read-only and has the 
>> interesting quirk where the top bit of this byte is shared with the floppy 
>> controller when the IDE device is mapped at 'compatibility' locations. It 
>> turns out that software controlling IDE devices (BIOS, drivers, etc.) does 
>> not use this register at all.
>
>Just before this section the start of the paragraph reads "The ATA Standard 
>defines two sets of registers known as Control Block Registers and Command 
>Block Registers." which reads to me that the paragraph quoted above is 
>describing the original ATA Standard behaviour, i.e. the expected behaviour 
>for pre-PCI controllers or PCI IDE controllers in compatibility mode.
>
>> The exception for PCI IDE controllers to the ATA Standard is that only the 
>> first of the two bytes defined in the Control Block registers is 
>> implemented. This byte provides Alternate Status on reads and Device Control 
>> on writes. Accesses to the second byte of the Control Block registers (Drive 
>> Address) should be ignored by the PCI IDE controller."
>
>And this paragraph then leads onto the differences for PCI IDE controllers 
>which are that the second (shared) byte in the Control Block is ignored, which 
>again makes sense from a PCI perspective since PCI BARs cannot overlap. But 
>that doesn't matter in PCI native mode because the BIOS/OS will have moved the 
>BAR to a suitable memory address that doesn't clash with the floppy drive.
>
>> So in fact the real PIIX does adhere to this standard and there is no reason 
>> to reject the idea behind this patch -- which is to make our PIIX device 
>> model implement this standard.
>> 
>> It's just that all our other PCI-IDE implementations need to implement this 
>> quirk as long as they implement the standard. And according to the Linux 
>> kernel they all do -- see its CONFIG_ATA_SFF.
>
>Another couple of hints that the registers in PCI IDE controllers in 
>compatibility mode aren't accessed through PCI BARs can also be found: i) the 
>table in section 2.1 for compatibility mode uses fixed addresses whilst the 
>table in section 2.2 references BAs and ii) section 2.4 suggests that PCI 
>controllers in compatibility mode always ignore the BARs.
>
>Now it could be that the description in the PIIX datasheet indicates that the 
>PCI IO address is hardcoded and then the second byte (re)dispatched to the ISA 
>bus, but then I would argue that this is an implementation detail: from QEMU's 
>perspective there is zero difference between this and the existing IDE 
>portio_list, and as a bonus the existing compatibility behaviour is completely 
>unaffected by any PCI BARs.

Right, that was my idea: Trading one implementation detail in PIIX with another 
to have a common "theory" accross all our TYPE_PCI_IDE devices which all 
implement the PCI IDE controller standard. At the same time quite a bit of 
redundant code could be removed.

We could of course extend this theory to consider compatibility and PCI native 
modes to have different implementations. That is, compatibility mode would use 
portio_list semantics while PCI native woulde use the BARs semantics. Then we'd 
have to make at least TYPE_VIA_IDE use portio_list since it is currently 
hardcoded to operate in compatibility mode. Same for cmd646 if it can switch 
modes. And then there is also sil3112...

I guess I'll split this series and only ship the first four patches in v2. 
Meanwhile we can discuss here further on the topic of the PCI IDE controller 
specification which the rest of this series addresses.

>
>> Since this patch actually uncovered a small bug in the other device models 
>> I'd rather fix those, too. One way I could do this is to decrease the size 
>> of the memory region or to map with lower priority. What is the preferred 
>> fix? Any other ideas?
>> 
>> Note that this and the next patch resolve the last dependencies on the 
>> "isabus" global. So after this series we could apply some small patches 
>> posted before and get rid of the global entirely... And have as many ISA and 
>> LPC buses as we want!
>
>This is the part I think we can do better with: both Phil and I have patches 
>that remove the isabus reference from the IDE ioports e.g. 
>https://patchew.org/QEMU/20230302224058.43315-1-philmd@linaro.org/20230302224058.43315-9-philmd@linaro.org/
> so dropping this patch shouldn't affect our ability to remove the isabus 
>global.
>
>Do you have an example of a use-case you have for multiple ISA buses? I'm 
>fairly sure that this wouldn't work on x86 PC machines with a single PCI root 
>bus for example.

The ICH0 (82801AA & 82801AB) datasheet depicts in the "system block diagram" an 
LPC bus for the Super I/O devices and an optional PCI-ISA bridge connected to 
the PCI bus. I guess the use case is for connecting ISA cards while the ICH 
made the switch to the more modern LPC bus. Of course one could attach an ISA 
"card" to the LPC bus in QEMU since they look the same from software. I'd 
assume though that above two buses would be visible somehow.

Other than that I don't see a real use case. It's just a property of a clean 
software design to avoid imposing any artificial limits onto callers and 
instead give them maximum control. ISABus currently assumes that there can only 
be one ISA bus in a QEMU process and above example seems like a counter 
example. Moreover, ISABus not being a singleton may make its testing more 
convenient and who knows, maybe it helps in moving towards heterogenious 
architectures.

Best regards,
Bernhard

>
>
>ATB,
>
>Mark.
>



reply via email to

[Prev in Thread] Current Thread [Next in Thread]