qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [RFC PATCH v2 38/44] target/loongarch: Implement vbitsel vset


From: gaosong
Subject: Re: [RFC PATCH v2 38/44] target/loongarch: Implement vbitsel vset
Date: Tue, 11 Apr 2023 19:37:00 +0800
User-agent: Mozilla/5.0 (X11; Linux loongarch64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0

 

在 2023/4/4 上午9:03, Richard Henderson 写道:
+void HELPER(vseteqz_v)(CPULoongArchState *env, uint32_t cd, uint32_t vj)
+{
+    VReg *Vj = &(env->fpr[vj].vreg);
+    env->cf[cd & 0x7] = (Vj->Q(0) == 0);
+}
+
+void HELPER(vsetnez_v)(CPULoongArchState *env, uint32_t cd, uint32_t vj)
+{
+    VReg *Vj = &(env->fpr[vj].vreg);
+    env->cf[cd & 0x7] = (Vj->Q(0) != 0);
+}

This is trivial inline.

e.g

static bool trans_vseteqz_v(DisasContext *ctx, arg_cv *a)
{
    TCGv_i64  t1, t2, al, ah, zero;

    al = tcg_temp_new_i64();
    ah = tcg_temp_new_i64();
    t1 = tcg_temp_new_i64();
    t2 = tcg_temp_new_i64();
    zero = tcg_constant_i64(0);

    get_vreg64(ah, a->vj, 1);
    get_vreg64(al, a->vj, 0);

    CHECK_SXE;
    tcg_gen_setcond_i64(TCG_COND_EQ, t1, al, zero);
    tcg_gen_setcond_i64(TCG_COND_EQ, t2, ah, zero);
    tcg_gen_and_i64(t1, t1, t2);
    tcg_gen_st8_tl(t1, cpu_env, offsetof(CPULoongArchState, cf[a->cd & 0x7]));

    return true;
}

and

static bool trans_vsetnez_v(DisasContext *ctx, arg_cv *a)
{
    TCGv_i64  t1, t2, al, ah, zero;

    al = tcg_temp_new_i64();
    ah = tcg_temp_new_i64();
    t1 = tcg_temp_new_i64();
    t2 = tcg_temp_new_i64();
    zero = tcg_constant_i64(0);

    get_vreg64(ah, a->vj, 1);
    get_vreg64(al, a->vj, 0);

    CHECK_SXE;
    tcg_gen_setcond_i64(TCG_COND_NE, t1, al, zero);
    tcg_gen_setcond_i64(TCG_COND_NE, t2, ah, zero);
    tcg_gen_or_i64(t1, t1, t2);
    tcg_gen_st8_tl(t1, cpu_env, offsetof(CPULoongArchState, cf[a->cd & 0x7]));

    return true;
}

+#define SETANYEQZ(NAME, BIT, E)                                     \
+void HELPER(NAME)(CPULoongArchState *env, uint32_t cd, uint32_t vj) \
+{                                                                   \
+    int i;                                                          \
+    bool ret = false;                                               \
+    VReg *Vj = &(env->fpr[vj].vreg);                                \
+                                                                    \
+    for (i = 0; i < LSX_LEN/BIT; i++) {                             \
+        ret |= (Vj->E(i) == 0);                                     \
+    }                                                               \
+    env->cf[cd & 0x7] = ret;                                        \
+}
+SETANYEQZ(vsetanyeqz_b, 8, B)
+SETANYEQZ(vsetanyeqz_h, 16, H)
+SETANYEQZ(vsetanyeqz_w, 32, W)
+SETANYEQZ(vsetanyeqz_d, 64, D)

These could be inlined, though slightly harder.
C.f. target/arm/sve_helper.c, do_match2 (your n == 0).

Do you mean an inline like trans_vseteqz_v or just an inline helper function?

Thanks.
Song Gao
Anyway, leaving this as-is for now is also ok.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]