[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [PULL 01/48] target/arm: Vectorize USHL and SSHL
From: |
Peter Maydell |
Subject: |
Re: [Qemu-devel] [PULL 01/48] target/arm: Vectorize USHL and SSHL |
Date: |
Thu, 13 Jun 2019 15:11:43 +0100 |
On Thu, 13 Jun 2019 at 13:14, Peter Maydell <address@hidden> wrote:
>
> From: Richard Henderson <address@hidden>
>
> These instructions shift left or right depending on the sign
> of the input, and 7 bits are significant to the shift. This
> requires several masks and selects in addition to the actual
> shifts to form the complete answer.
>
> That said, the operation is still a small improvement even for
> two 64-bit elements -- 13 vector operations instead of 2 * 7
> integer operations.
>
> Signed-off-by: Richard Henderson <address@hidden>
> Message-id: address@hidden
> Reviewed-by: Peter Maydell <address@hidden>
> Signed-off-by: Peter Maydell <address@hidden>
Hi Richard -- I just noticed that this patch seems to
regress VSHL for AArch32.
160: f3026408 vshl.u8 d6, d8, d2
with
d2: 429de8b85a8dee77
d8: 279cd30880000000
now gives
d6: 00000000008dee77
when previously both QEMU and real Cortex-A7 hardware
gave d6: all-zeroes.
(this is testcases/aarch32-all/insn_VSHL__INC.risu.bin
from Alex's set.)
I'm going to drop this patch from the pullreq.
thanks
-- PMM
- [Qemu-devel] [PULL 00/48] target-arm queue, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 01/48] target/arm: Vectorize USHL and SSHL, Peter Maydell, 2019/06/13
- Re: [Qemu-devel] [PULL 01/48] target/arm: Vectorize USHL and SSHL,
Peter Maydell <=
- [Qemu-devel] [PULL 02/48] target/arm: Use tcg_gen_gvec_bitsel, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 06/48] target/arm: Fix output of PAuth Auth, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 07/48] decodetree: Fix comparison of Field, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 12/48] target/arm: Convert the VSEL instructions to decodetree, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 11/48] target/arm: Explicitly enable VFP short-vectors for aarch32 -cpu max, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 08/48] target/arm: Add stubs for AArch32 VFP decodetree, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 10/48] target/arm: Fix Cortex-R5F MVFR values, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 20/48] target/arm: Convert VFP two-register transfer insns to decodetree, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 05/48] hw/core/bus.c: Only the main system bus can have no parent, Peter Maydell, 2019/06/13
- [Qemu-devel] [PULL 19/48] target/arm: Convert "single-precision" register moves to decodetree, Peter Maydell, 2019/06/13