[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Qemu-devel] [PATCH] target/s390x: Use tcg_gen_gvec_bitsel
From: |
David Hildenbrand |
Subject: |
Re: [Qemu-devel] [PATCH] target/s390x: Use tcg_gen_gvec_bitsel |
Date: |
Mon, 3 Jun 2019 21:41:19 +0200 |
User-agent: |
Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 |
On 03.06.19 18:57, Richard Henderson wrote:
> This replaces the target-specific implementations for VSEL.
>
> Signed-off-by: Richard Henderson <address@hidden>
> ---
> target/s390x/translate_vx.inc.c | 38 ++++++---------------------------
> 1 file changed, 6 insertions(+), 32 deletions(-)
>
> diff --git a/target/s390x/translate_vx.inc.c b/target/s390x/translate_vx.inc.c
> index 7e0bfcb190..a8603cbfd6 100644
> --- a/target/s390x/translate_vx.inc.c
> +++ b/target/s390x/translate_vx.inc.c
> @@ -233,6 +233,9 @@ static void get_vec_element_ptr_i64(TCGv_ptr ptr, uint8_t
> reg, TCGv_i64 enr,
> #define gen_gvec_fn_3(fn, es, v1, v2, v3) \
> tcg_gen_gvec_##fn(es, vec_full_reg_offset(v1), vec_full_reg_offset(v2), \
> vec_full_reg_offset(v3), 16, 16)
> +#define gen_gvec_fn_4(fn, es, v1, v2, v3, v4) \
> + tcg_gen_gvec_##fn(es, vec_full_reg_offset(v1), vec_full_reg_offset(v2), \
> + vec_full_reg_offset(v3), vec_full_reg_offset(v4), 16,
> 16)
>
> /*
> * Helper to carry out a 128 bit vector computation using 2 i64 values per
> @@ -903,40 +906,11 @@ static DisasJumpType op_vsce(DisasContext *s, DisasOps
> *o)
> return DISAS_NEXT;
> }
>
> -static void gen_sel_i64(TCGv_i64 d, TCGv_i64 a, TCGv_i64 b, TCGv_i64 c)
> -{
> - TCGv_i64 t = tcg_temp_new_i64();
> -
> - /* bit in c not set -> copy bit from b */
> - tcg_gen_andc_i64(t, b, c);
> - /* bit in c set -> copy bit from a */
> - tcg_gen_and_i64(d, a, c);
> - /* merge the results */
> - tcg_gen_or_i64(d, d, t);
> - tcg_temp_free_i64(t);
> -}
> -
> -static void gen_sel_vec(unsigned vece, TCGv_vec d, TCGv_vec a, TCGv_vec b,
> - TCGv_vec c)
> -{
> - TCGv_vec t = tcg_temp_new_vec_matching(d);
> -
> - tcg_gen_andc_vec(vece, t, b, c);
> - tcg_gen_and_vec(vece, d, a, c);
> - tcg_gen_or_vec(vece, d, d, t);
> - tcg_temp_free_vec(t);
> -}
> -
Comparing against tcg_gen_bitsel_i64()
1. a and c are switched
2. b is _not_ switched (and() and andc() are switched)
Shouldn't this be
gen_gvec_fn_4(bitsel, ES_8, get_field(s->fields, v1),
get_field(s->fields, v4), get_field(s->fields, v3),
get_field(s->fields, v2));
?
Maybe I am missing something. It was a long day :)
Should I send this patch with the next s390x/tcg pull request?
--
Thanks,
David / dhildenb