[PATCH v6 74/82] target/arm: Implement aarch64 SUDOT, USDOT
From: Richard Henderson
Subject: [PATCH v6 74/82] target/arm: Implement aarch64 SUDOT, USDOT
Date: Fri, 30 Apr 2021 13:26:02 -0700

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/cpu.h           |  5 +++++
 target/arm/translate-a64.c | 25 +++++++++++++++++++++++++
 2 files changed, 30 insertions(+)

diff --git a/target/arm/cpu.h b/target/arm/cpu.h
index c75601b221..b2b684df55 100644
--- a/target/arm/cpu.h
+++ b/target/arm/cpu.h
@@ -4206,6 +4206,11 @@ static inline bool isar_feature_aa64_rcpc_8_4(const ARMISARegisters *id)
     return FIELD_EX64(id->id_aa64isar1, ID_AA64ISAR1, LRCPC) >= 2;
 }
 
+static inline bool isar_feature_aa64_i8mm(const ARMISARegisters *id)
+{
+    return FIELD_EX64(id->id_aa64isar1, ID_AA64ISAR1, I8MM) != 0;
+}
+
 static inline bool isar_feature_aa64_ccidx(const ARMISARegisters *id)
 {
     return FIELD_EX64(id->id_aa64mmfr2, ID_AA64MMFR2, CCIDX) != 0;
diff --git a/target/arm/translate-a64.c b/target/arm/translate-a64.c
index a8edd2d281..c875481784 100644
--- a/target/arm/translate-a64.c
+++ b/target/arm/translate-a64.c
@@ -12175,6 +12175,13 @@ static void disas_simd_three_reg_same_extra(DisasContext *s, uint32_t insn)
         }
         feature = dc_isar_feature(aa64_dp, s);
         break;
+    case 0x03: /* USDOT */
+        if (size != MO_32) {
+            unallocated_encoding(s);
+            return;
+        }
+        feature = dc_isar_feature(aa64_i8mm, s);
+        break;
     case 0x18: /* FCMLA, #0 */
     case 0x19: /* FCMLA, #90 */
     case 0x1a: /* FCMLA, #180 */
@@ -12215,6 +12222,10 @@ static void disas_simd_three_reg_same_extra(DisasContext *s, uint32_t insn)
                          u ? gen_helper_gvec_udot_b : gen_helper_gvec_sdot_b);
         return;
 
+    case 0x3: /* USDOT */
+        gen_gvec_op4_ool(s, is_q, rd, rn, rm, rd, 0, gen_helper_gvec_usdot_b);
+        return;
+
     case 0x8: /* FCMLA, #0 */
     case 0x9: /* FCMLA, #90 */
     case 0xa: /* FCMLA, #180 */
@@ -13360,6 +13371,13 @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
             return;
         }
         break;
+    case 0x0f: /* SUDOT, USDOT */
+        if (is_scalar || (size & 1) || !dc_isar_feature(aa64_i8mm, s)) {
+            unallocated_encoding(s);
+            return;
+        }
+        size = MO_32;
+        break;
     case 0x11: /* FCMLA #0 */
     case 0x13: /* FCMLA #90 */
     case 0x15: /* FCMLA #180 */
@@ -13474,6 +13492,13 @@ static void disas_simd_indexed(DisasContext *s, uint32_t insn)
                          u ? gen_helper_gvec_udot_idx_b
                          : gen_helper_gvec_sdot_idx_b);
         return;
+    case 0x0f: /* SUDOT, USDOT */
+        gen_gvec_op4_ool(s, is_q, rd, rn, rm, rd, index,
+                         extract32(insn, 23, 1)
+                         ? gen_helper_gvec_usdot_idx_b
+                         : gen_helper_gvec_sudot_idx_b);
+        return;
+
     case 0x11: /* FCMLA #0 */
     case 0x13: /* FCMLA #90 */
     case 0x15: /* FCMLA #180 */
--
2.25.1
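
The diff above only wires up decode. The gen_helper_gvec_usdot_b, gen_helper_gvec_usdot_idx_b and gen_helper_gvec_sudot_idx_b helpers it calls are not defined in this patch. As a rough illustration of what one 32-bit lane of a mixed-sign dot product is expected to compute, here is a minimal sketch. It assumes the usual I8MM description: four 8-bit products summed into a 32-bit accumulator with wrap-around, where USDOT takes unsigned bytes from the first source and signed bytes from the second, and SUDOT swaps the signedness. The names usdot_lane and sudot_lane are hypothetical and are not QEMU functions.

```c
#include <stdint.h>

/*
 * Hypothetical reference model, not QEMU code.
 * One 32-bit lane of USDOT: n supplies four unsigned bytes,
 * m supplies four signed bytes.  The accumulator is kept as
 * uint32_t so that overflow wraps modulo 2^32 without relying
 * on signed-overflow behaviour in C.
 */
static uint32_t usdot_lane(uint32_t acc, const uint8_t n[4], const int8_t m[4])
{
    for (int i = 0; i < 4; i++) {
        /* Each product fits easily in int32_t: |255 * -128| = 32640. */
        int32_t prod = (int32_t)n[i] * (int32_t)m[i];
        acc += (uint32_t)prod;
    }
    return acc;
}

/*
 * One 32-bit lane of SUDOT: n supplies signed bytes, m unsigned bytes.
 * Multiplication commutes, so this is the same sum with the roles swapped.
 */
static uint32_t sudot_lane(uint32_t acc, const int8_t n[4], const uint8_t m[4])
{
    return usdot_lane(acc, m, n);
}
```

For example, with an accumulator of 10, unsigned bytes {1, 2, 3, 255} and signed bytes {-1, 2, -3, 4}, the products are -1, 4, -9 and 1020, so the lane ends up holding 1024. Treating the 255 as a signed -1 instead would give a different answer, which is why the operand signedness has to be tracked separately from the plain SDOT and UDOT cases.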