qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [RFC PATCH v2] target/ppc: Enable hardfloat for PPC


From: Howard Spoelstra
Subject: Re: [RFC PATCH v2] target/ppc: Enable hardfloat for PPC
Date: Wed, 19 Feb 2020 19:28:33 +0100

Hi,

I tested with the current ppc-for-5.0 branch and with v1 of the hardfloat patches applied on top of that. There is a noticeable speed improvement in Linux and OSX hosts. Windows 10 host doesn't seem to be impressed at all. I saw no obvious glitches so far. The fpu performance on OSX hosts seems very slow. This was not always the case in the past, when it was on par with Linux performance.

Below are my results.

Best,
Howard

Host Linux (Fedora 31):
Mac OS tests: 9.2 with MacBench 5.0
Baseline(100%): G3 300Mhz
5.0 branch + hardfloat patches: cpu 193%, fpu 86%
5.0 branch: cpu 188%, fpu 57%
Mac OSX tests: 10.5 with Skidmarks 4.0 test
Baseline(100%): G4 1.0Ghz.
5.0 branch + hardfloat patches: Int:131 FP:11 Vec:15
5.0 branch: Int:131 FP:9 Vec:11

Host OSX Sierra:
Mac OS tests: 9.2 with MacBench 5.0
Baseline(100%): G3 300Mhz
5.0 branch + hardfloat patches: cpu 199%, fpu 66%
5.0 branch: cpu 199%, fpu 40%
Mac OSX tests: 10.5 with Skidmarks 4.0 test
Baseline(100%): G4 1.0Ghz.
5.0 branch + hardfloat patches: Int:129 PF:11 Vec:14
5.0 branch: Int:129 FP:8 Vec:9

Host Windows 10:
Mac OS tests: 9.2 with MacBench 5.0
Baseline(100%): G3 300Mhz
5.0 branch + hardfloat patches: cpu 180%, fpu 54%
5.0 branch: cpu 199%, fpu 40%
Mac OSX tests: 10.5 with Skidmarks 4.0 test
Baseline(100%): G4 1.0Ghz.
5.0 branch + hardfloat patches: Int:130 FP:9 Vec:10
5.0 branch: Int:130 FP:10 Vec:11

All tests done on the same host with v1 of the hardfloat patches
Intel i7-4770K at 3.50Ghz. 32Gb memory
All guests set to 1024x768 and "thousands" of colors.
Linux and OSX (with brew) use default compilers.
Windows build cross-compiled from Fedora with x86_64-win64-mingw32

Skidmarks test is a combination of tests:
ParseVid (INT)
MPEG (INT)
PixBlend (INT)
Ellipticrypt (INT)
Rijndael (INT)
Quicksort (INT)
MDCT (FP)
IntToFloat (FP)
Q3 (FP)
FFT (FP)
VolInt (FP)
Quant (VMX)
Galaxy (VMX)
IDCT (VMX)
BigMult (VMX)

reply via email to

[Prev in Thread] Current Thread [Next in Thread]