[PATCH v2 00/12] target/arm: Implement BFloat16

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[PATCH v2 00/12] target/arm: Implement BFloat16

From:	Richard Henderson
Subject:	[PATCH v2 00/12] target/arm: Implement BFloat16
Date:	Tue, 25 May 2021 15:58:05 -0700

Whee!  All prerequisites are now merged.

Patches missing r-b:

  05-softfpu-Add-float_round_to_odd_inf.patch
  06-target-arm-Implement-bfloat16-dot-product-vector.patch
  07-target-arm-Implement-bfloat16-dot-product-indexed.patch
  11-linux-user-aarch64-Enable-hwcap-bits-for-bfloat16.patch
  12-target-arm-Enable-BFloat16-extensions.patch

Per the question of whether additional checks vs VFP or NEON
are required, I have disabled BF16 when either VFP or NEON are
also disabled.  Which seems like a similarly reasonable choice.


r~


Richard Henderson (12):
  target/arm: Add isar_feature_{aa32,aa64,aa64_sve}_bf16
  target/arm: Unify unallocated path in disas_fp_1src
  target/arm: Implement scalar float32 to bfloat16 conversion
  target/arm: Implement vector float32 to bfloat16 conversion
  softfpu: Add float_round_to_odd_inf
  target/arm: Implement bfloat16 dot product (vector)
  target/arm: Implement bfloat16 dot product (indexed)
  target/arm: Implement bfloat16 matrix multiply accumulate
  target/arm: Implement bfloat widening fma (vector)
  target/arm: Implement bfloat widening fma (indexed)
  linux-user/aarch64: Enable hwcap bits for bfloat16
  target/arm: Enable BFloat16 extensions

 include/fpu/softfloat-types.h |   4 +-
 target/arm/cpu.h              |  15 ++++
 target/arm/helper-sve.h       |   4 +
 target/arm/helper.h           |  15 ++++
 target/arm/neon-dp.decode     |   1 +
 target/arm/neon-shared.decode |  11 +++
 target/arm/sve.decode         |  19 ++++-
 target/arm/vfp.decode         |   2 +
 linux-user/elfload.c          |   2 +
 target/arm/cpu.c              |   3 +
 target/arm/cpu64.c            |   3 +
 target/arm/cpu_tcg.c          |   1 +
 target/arm/sve_helper.c       |   2 +
 target/arm/translate-a64.c    | 142 +++++++++++++++++++++++++++++-----
 target/arm/translate-neon.c   |  91 ++++++++++++++++++++++
 target/arm/translate-sve.c    | 112 +++++++++++++++++++++++++++
 target/arm/translate-vfp.c    |  24 ++++++
 target/arm/vec_helper.c       | 140 ++++++++++++++++++++++++++++++++-
 target/arm/vfp_helper.c       |  12 +++
 fpu/softfloat-parts.c.inc     |   6 +-
 20 files changed, 584 insertions(+), 25 deletions(-)

-- 
2.25.1

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2 00/12] target/arm: Implement BFloat16, Richard Henderson <=
- [PATCH v2 02/12] target/arm: Unify unallocated path in disas_fp_1src, Richard Henderson, 2021/05/25
- [PATCH v2 03/12] target/arm: Implement scalar float32 to bfloat16 conversion, Richard Henderson, 2021/05/25
- [PATCH v2 01/12] target/arm: Add isar_feature_{aa32, aa64, aa64_sve}_bf16, Richard Henderson, 2021/05/25
- [PATCH v2 04/12] target/arm: Implement vector float32 to bfloat16 conversion, Richard Henderson, 2021/05/25
- [PATCH v2 05/12] softfpu: Add float_round_to_odd_inf, Richard Henderson, 2021/05/25
- [PATCH v2 07/12] target/arm: Implement bfloat16 dot product (indexed), Richard Henderson, 2021/05/25
- [PATCH v2 10/12] target/arm: Implement bfloat widening fma (indexed), Richard Henderson, 2021/05/25
- [PATCH v2 08/12] target/arm: Implement bfloat16 matrix multiply accumulate, Richard Henderson, 2021/05/25
- [PATCH v2 06/12] target/arm: Implement bfloat16 dot product (vector), Richard Henderson, 2021/05/25
- [PATCH v2 09/12] target/arm: Implement bfloat widening fma (vector), Richard Henderson, 2021/05/25

Prev by Date: [Bug 1891748] Re: qemu-arm-static 5.1 can't run gcc
Next by Date: [PATCH v2 02/12] target/arm: Unify unallocated path in disas_fp_1src
Previous by thread: [PATCH v2] hw/display/artist: Fix bug in coordinate extraction in artist_vram_read() and artist_vram_write()
Next by thread: [PATCH v2 02/12] target/arm: Unify unallocated path in disas_fp_1src
Index(es):
- Date
- Thread