qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-devel] [PATCH 00/57] target-i386 eflags cleanup and bmi/adx ex


From: Richard Henderson
Subject: Re: [Qemu-devel] [PATCH 00/57] target-i386 eflags cleanup and bmi/adx extensions
Date: Thu, 24 Jan 2013 12:37:21 -0800
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/20130110 Thunderbird/17.0.2

On 01/24/2013 08:57 AM, Laurent Desnogues wrote:
On Thu, Jan 24, 2013 at 5:52 PM, Richard Henderson <address@hidden> wrote:
On 2013-01-24 08:46, Laurent Desnogues wrote:

I gave a quick try a your branch. My host is an x86_64 CPU and I
ran an i386 nbench in user mode.  It works but some parts of the
benchmark are noticeably slower (>10%).  Is that expected?

Nope.  Everything in there should be about speeding up...

I'll have a look at it and see if there's something obvious.

Let me know if you need more information or the binary (I compiled
it some time ago with the oldest compiler I could find, gcc 2.96).

Would you look and see how much variability you're getting? I had a quick look with a (newly built) nbench binary and don't see any speed regressions outside the error bars.

Built with gcc 4.7.2, 4 trials each:
                Master                          Eflags3                 
                Avg             Stddev          Avg             Stddev          
 Change Error
Num S           585.92          18.25           573.79          4.39            
 -2.07% 3.12%
String S        51.14           1.10            51.52           0.13            
  0.73% 2.15%
Bitfield        1.64E+008       4.04E+006       1.62E+008       8.63E+005       
 -1.32% 2.46%
FP Emu  85.65           1.81            114.74          1.18             33.97% 
2.12%
Fourier 1365.03         28.79           2813.78         11.72           106.13% 
2.11%
Assign  14.86           0.24            14.89           0.21              0.22% 
1.62%
Idea            723.70          43.31           884.20          4.55            
 22.18% 5.98%
Huff            495.27          8.72            702.89          3.53            
 41.92% 1.76%
N Net           0.29            0.01            0.73            0.00            
149.99% 1.78%
LU Decomp       9.26            0.16            21.91           0.22            
136.61% 1.70%

I haven't looked to see where the massive fp improvements come from, but my first guess is not storing cc_op so often. Although perhaps it would keep us on the same page if we were talking about the exact same binary...


r~



reply via email to

[Prev in Thread] Current Thread [Next in Thread]