[v5,16/57] tcg/tci: Clean up deposit operations

Message ID	20210311143958.562625-17-richard.henderson@linaro.org
State	Superseded
Headers	show Delivered-To: patch@linaro.org Received-SPF: pass (google.com: domain of qemu-devel-bounces+patch=linaro.org@nongnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; From: Richard Henderson <richard.henderson@linaro.org> To: qemu-devel@nongnu.org Subject: [PATCH v5 16/57] tcg/tci: Clean up deposit operations Date: Thu, 11 Mar 2021 08:39:17 -0600 Message-Id: <20210311143958.562625-17-richard.henderson@linaro.org> In-Reply-To: <20210311143958.562625-1-richard.henderson@linaro.org> References: <20210311143958.562625-1-richard.henderson@linaro.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Received-SPF: pass client-ip=2607:f8b0:4864:20::730; envelope-from=richard.henderson@linaro.org; helo=mail-qk1-x730.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action Precedence: list Cc: sw@weilnetz.de, alex.bennee@linaro.org, f4bug@amsat.org Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>
Series	TCI fixes and cleanups \| expand [v5,00/57] TCI fixes and cleanups [v5,01/57] tcg/tci: Remove ifdefs for TCG_TARGET_HAS_ext32[us]_i64 [v5,02/57] tcg/tci: Rename tci_read_r to tci_read_rval [v5,03/57] tcg/tci: Split out tci_args_rrs [v5,04/57] tcg/tci: Split out tci_args_rr [v5,05/57] tcg/tci: Split out tci_args_rrr [v5,06/57] tcg/tci: Split out tci_args_rrrc [v5,07/57] tcg/tci: Split out tci_args_l [v5,08/57] tcg/tci: Split out tci_args_rrrrrc [v5,09/57] tcg/tci: Split out tci_args_rrcl and tci_args_rrrrcl [v5,10/57] tcg/tci: Split out tci_args_ri and tci_args_rI [v5,11/57] tcg/tci: Reuse tci_args_l for calls. [v5,12/57] tcg/tci: Reuse tci_args_l for exit_tb [v5,13/57] tcg/tci: Reuse tci_args_l for goto_tb [v5,14/57] tcg/tci: Split out tci_args_rrrrrr [v5,15/57] tcg/tci: Split out tci_args_rrrr [v5,16/57] tcg/tci: Clean up deposit operations [v5,17/57] tcg/tci: Reduce qemu_ld/st TCGMemOpIdx operand to 32-bits [v5,18/57] tcg/tci: Split out tci_args_{rrm,rrrm,rrrrm} [v5,19/57] tcg/tci: Hoist op_size checking into tci_args_* [v5,20/57] tcg/tci: Remove tci_disas [v5,21/57] tcg/tci: Implement the disassembler properly [v5,22/57] tcg: Build ffi data structures for helpers [v5,23/57] tcg/tci: Use ffi for calls [v5,24/57] tcg/tci: Improve tcg_target_call_clobber_regs [v5,25/57] tcg/tci: Move call-return regs to end of tcg_target_reg_alloc_order [v5,26/57] tcg/tci: Push opcode emit into each case [v5,27/57] tcg/tci: Split out tcg_out_op_rrs [v5,28/57] tcg/tci: Split out tcg_out_op_l [v5,29/57] tcg/tci: Split out tcg_out_op_p [v5,30/57] tcg/tci: Split out tcg_out_op_rr [v5,31/57] tcg/tci: Split out tcg_out_op_rrr [v5,32/57] tcg/tci: Split out tcg_out_op_rrrc [v5,33/57] tcg/tci: Split out tcg_out_op_rrrrrc [v5,34/57] tcg/tci: Split out tcg_out_op_rrrbb [v5,35/57] tcg/tci: Split out tcg_out_op_rrcl [v5,36/57] tcg/tci: Split out tcg_out_op_rrrrrr [v5,37/57] tcg/tci: Split out tcg_out_op_rrrr [v5,38/57] tcg/tci: Split out tcg_out_op_rrrrcl [v5,39/57] tcg/tci: Split out tcg_out_op_{rrm,rrrm,rrrrm} [v5,40/57] tcg/tci: Split out tcg_out_op_v [v5,41/57] tcg/tci: Split out tcg_out_op_np [v5,42/57] tcg/tci: Split out tcg_out_op_r[iI] [v5,43/57] tcg/tci: Reserve r13 for a temporary [v5,44/57] tcg/tci: Emit setcond before brcond [v5,45/57] tcg/tci: Remove tci_write_reg [v5,46/57] tcg/tci: Change encoding to uint32_t units [v5,47/57] tcg/tci: Implement goto_ptr [v5,48/57] tcg/tci: Implement movcond [v5,49/57] tcg/tci: Implement andc, orc, eqv, nand, nor [v5,50/57] tcg/tci: Implement extract, sextract [v5,51/57] tcg/tci: Implement clz, ctz, ctpop [v5,52/57] tcg/tci: Implement mulu2, muls2 [v5,53/57] tcg/tci: Implement add2, sub2 [v5,54/57] tcg/tci: Split out tci_qemu_ld, tci_qemu_st [v5,55/57] tests/tcg: Increase timeout for TCI [v5,56/57] gitlab: Rename ACCEL_CONFIGURE_OPTS to EXTRA_CONFIGURE_OPTS [v5,57/57] gitlab: Enable cross-i386 builds of TCI

Message ID

20210311143958.562625-17-richard.henderson@linaro.org

State

Superseded

Headers

Received-SPF: pass (google.com: domain of
	qemu-devel-bounces+patch=linaro.org@nongnu.org designates
	209.51.188.17 as permitted sender) client-ip=209.51.188.17; 
From: Richard Henderson <richard.henderson@linaro.org>
To: qemu-devel@nongnu.org
Subject: [PATCH v5 16/57] tcg/tci: Clean up deposit operations
Date: Thu, 11 Mar 2021 08:39:17 -0600
Message-Id: <20210311143958.562625-17-richard.henderson@linaro.org>
In-Reply-To: <20210311143958.562625-1-richard.henderson@linaro.org>
References: <20210311143958.562625-1-richard.henderson@linaro.org>
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Received-SPF: pass client-ip=2607:f8b0:4864:20::730;
	envelope-from=richard.henderson@linaro.org;
	helo=mail-qk1-x730.google.com
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
	DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
	RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001,
	SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.23
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Cc: sw@weilnetz.de, alex.bennee@linaro.org, f4bug@amsat.org
Errors-To: qemu-devel-bounces+patch=linaro.org@nongnu.org
Sender: "Qemu-devel" <qemu-devel-bounces+patch=linaro.org@nongnu.org>

Series

TCI fixes and cleanups | expand

Commit Message

Richard Henderson March 11, 2021, 2:39 p.m. UTC

Use the correct set of asserts during code generation.
We do not require the first input to overlap the output;
the existing interpreter already supported that.

Split out tci_args_rrrbb in the translator.
Use the deposit32/64 functions rather than inline expansion.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

---
 tcg/tci/tcg-target-con-set.h |  1 -
 tcg/tci.c                    | 33 ++++++++++++++++-----------------
 tcg/tci/tcg-target.c.inc     | 24 ++++++++++++++----------
 3 files changed, 30 insertions(+), 28 deletions(-)

-- 
2.25.1

Comments

Philippe Mathieu-Daudé March 17, 2021, 12:32 a.m. UTC | #1

On 3/11/21 3:39 PM, Richard Henderson wrote:
> Use the correct set of asserts during code generation.

> We do not require the first input to overlap the output;

> the existing interpreter already supported that.

> 

> Split out tci_args_rrrbb in the translator.

> Use the deposit32/64 functions rather than inline expansion.

> 

> Signed-off-by: Richard Henderson <richard.henderson@linaro.org>

> ---

>  tcg/tci/tcg-target-con-set.h |  1 -

>  tcg/tci.c                    | 33 ++++++++++++++++-----------------

>  tcg/tci/tcg-target.c.inc     | 24 ++++++++++++++----------

>  3 files changed, 30 insertions(+), 28 deletions(-)

> 

> diff --git a/tcg/tci/tcg-target-con-set.h b/tcg/tci/tcg-target-con-set.h

> index f51b7bcb13..316730f32c 100644

> --- a/tcg/tci/tcg-target-con-set.h

> +++ b/tcg/tci/tcg-target-con-set.h

> @@ -13,7 +13,6 @@ C_O0_I2(r, r)

>  C_O0_I3(r, r, r)

>  C_O0_I4(r, r, r, r)

>  C_O1_I1(r, r)

> -C_O1_I2(r, 0, r)

>  C_O1_I2(r, r, r)

>  C_O1_I4(r, r, r, r, r)

>  C_O2_I1(r, r, r)

> diff --git a/tcg/tci.c b/tcg/tci.c

> index 10f58e4f25..3ce2b72316 100644

> --- a/tcg/tci.c

> +++ b/tcg/tci.c

> @@ -168,6 +168,7 @@ static tcg_target_ulong tci_read_label(const uint8_t **tb_ptr)

>   *   tci_args_<arguments>

>   * where arguments is a sequence of

>   *

> + *   b = immediate (bit position)

>   *   i = immediate (uint32_t)

>   *   I = immediate (tcg_target_ulong)

>   *   r = register

> @@ -236,6 +237,16 @@ static void tci_args_rrrc(const uint8_t **tb_ptr,

>      *c3 = tci_read_b(tb_ptr);

>  }

>  

> +static void tci_args_rrrbb(const uint8_t **tb_ptr, TCGReg *r0, TCGReg *r1,

> +                           TCGReg *r2, uint8_t *i3, uint8_t *i4)

> +{

> +    *r0 = tci_read_r(tb_ptr);

> +    *r1 = tci_read_r(tb_ptr);

> +    *r2 = tci_read_r(tb_ptr);

> +    *i3 = tci_read_b(tb_ptr);

> +    *i4 = tci_read_b(tb_ptr);

> +}

> +

>  #if TCG_TARGET_REG_BITS == 32

>  static void tci_args_rrrr(const uint8_t **tb_ptr,

>                            TCGReg *r0, TCGReg *r1, TCGReg *r2, TCGReg *r3)

> @@ -432,11 +443,9 @@ uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,

>          TCGReg r0, r1, r2;

>          tcg_target_ulong t0;

>          tcg_target_ulong t1;

> -        tcg_target_ulong t2;

>          TCGCond condition;

>          target_ulong taddr;

> -        uint8_t tmp8;

> -        uint16_t tmp16;

> +        uint8_t pos, len;

>          uint32_t tmp32;

>          uint64_t tmp64;

>  #if TCG_TARGET_REG_BITS == 32

> @@ -627,13 +636,8 @@ uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,

>  #endif

>  #if TCG_TARGET_HAS_deposit_i32

>          case INDEX_op_deposit_i32:

> -            t0 = *tb_ptr++;

> -            t1 = tci_read_rval(regs, &tb_ptr);

> -            t2 = tci_read_rval(regs, &tb_ptr);

> -            tmp16 = *tb_ptr++;

> -            tmp8 = *tb_ptr++;

> -            tmp32 = (((1 << tmp8) - 1) << tmp16);

> -            tci_write_reg(regs, t0, (t1 & ~tmp32) | ((t2 << tmp16) & tmp32));

> +            tci_args_rrrbb(&tb_ptr, &r0, &r1, &r2, &pos, &len);

> +            regs[r0] = deposit32(regs[r1], pos, len, regs[r2]);

>              break;

>  #endif

>          case INDEX_op_brcond_i32:

> @@ -789,13 +793,8 @@ uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,

>  #endif

>  #if TCG_TARGET_HAS_deposit_i64

>          case INDEX_op_deposit_i64:

> -            t0 = *tb_ptr++;

> -            t1 = tci_read_rval(regs, &tb_ptr);

> -            t2 = tci_read_rval(regs, &tb_ptr);

> -            tmp16 = *tb_ptr++;

> -            tmp8 = *tb_ptr++;

> -            tmp64 = (((1ULL << tmp8) - 1) << tmp16);

> -            tci_write_reg(regs, t0, (t1 & ~tmp64) | ((t2 << tmp16) & tmp64));

> +            tci_args_rrrbb(&tb_ptr, &r0, &r1, &r2, &pos, &len);

> +            regs[r0] = deposit64(regs[r1], pos, len, regs[r2]);

>              break;

>  #endif

>          case INDEX_op_brcond_i64:

> diff --git a/tcg/tci/tcg-target.c.inc b/tcg/tci/tcg-target.c.inc

> index 2c64b4f617..640407b4a8 100644

> --- a/tcg/tci/tcg-target.c.inc

> +++ b/tcg/tci/tcg-target.c.inc

> @@ -126,11 +126,9 @@ static TCGConstraintSetIndex tcg_target_op_def(TCGOpcode op)

>      case INDEX_op_rotr_i64:

>      case INDEX_op_setcond_i32:

>      case INDEX_op_setcond_i64:

> -        return C_O1_I2(r, r, r);

> -

>      case INDEX_op_deposit_i32:

>      case INDEX_op_deposit_i64:

> -        return C_O1_I2(r, 0, r);

> +        return C_O1_I2(r, r, r);

>  

>      case INDEX_op_brcond_i32:

>      case INDEX_op_brcond_i64:

> @@ -480,13 +478,19 @@ static void tcg_out_op(TCGContext *s, TCGOpcode opc, const TCGArg *args,

>          break;

>  

>      CASE_32_64(deposit)  /* Optional (TCG_TARGET_HAS_deposit_*). */

> -        tcg_out_r(s, args[0]);

> -        tcg_out_r(s, args[1]);

> -        tcg_out_r(s, args[2]);

> -        tcg_debug_assert(args[3] <= UINT8_MAX);

> -        tcg_out8(s, args[3]);

> -        tcg_debug_assert(args[4] <= UINT8_MAX);

> -        tcg_out8(s, args[4]);

> +        {

> +            TCGArg pos = args[3], len = args[4];

> +            TCGArg max = opc == INDEX_op_deposit_i32 ? 32 : 64;

> +

> +            tcg_debug_assert(pos < max);

> +            tcg_debug_assert(pos + len <= max);

> +

> +            tcg_out_r(s, args[0]);

> +            tcg_out_r(s, args[1]);

> +            tcg_out_r(s, args[2]);

> +            tcg_out8(s, pos);

> +            tcg_out8(s, len);

> +        }

>          break;

>  

>      CASE_32_64(brcond)

> 


Another KISS :)

Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>

diff --git a/tcg/tci/tcg-target-con-set.h b/tcg/tci/tcg-target-con-set.h
index f51b7bcb13..316730f32c 100644
--- a/tcg/tci/tcg-target-con-set.h
+++ b/tcg/tci/tcg-target-con-set.h
@@ -13,7 +13,6 @@  C_O0_I2(r, r)
 C_O0_I3(r, r, r)
 C_O0_I4(r, r, r, r)
 C_O1_I1(r, r)
-C_O1_I2(r, 0, r)
 C_O1_I2(r, r, r)
 C_O1_I4(r, r, r, r, r)
 C_O2_I1(r, r, r)
diff --git a/tcg/tci.c b/tcg/tci.c
index 10f58e4f25..3ce2b72316 100644
--- a/tcg/tci.c
+++ b/tcg/tci.c
@@ -168,6 +168,7 @@  static tcg_target_ulong tci_read_label(const uint8_t **tb_ptr)
  *   tci_args_<arguments>
  * where arguments is a sequence of
  *
+ *   b = immediate (bit position)
  *   i = immediate (uint32_t)
  *   I = immediate (tcg_target_ulong)
  *   r = register
@@ -236,6 +237,16 @@  static void tci_args_rrrc(const uint8_t **tb_ptr,
     *c3 = tci_read_b(tb_ptr);
 }
 
+static void tci_args_rrrbb(const uint8_t **tb_ptr, TCGReg *r0, TCGReg *r1,
+                           TCGReg *r2, uint8_t *i3, uint8_t *i4)
+{
+    *r0 = tci_read_r(tb_ptr);
+    *r1 = tci_read_r(tb_ptr);
+    *r2 = tci_read_r(tb_ptr);
+    *i3 = tci_read_b(tb_ptr);
+    *i4 = tci_read_b(tb_ptr);
+}
+
 #if TCG_TARGET_REG_BITS == 32
 static void tci_args_rrrr(const uint8_t **tb_ptr,
                           TCGReg *r0, TCGReg *r1, TCGReg *r2, TCGReg *r3)
@@ -432,11 +443,9 @@  uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,
         TCGReg r0, r1, r2;
         tcg_target_ulong t0;
         tcg_target_ulong t1;
-        tcg_target_ulong t2;
         TCGCond condition;
         target_ulong taddr;
-        uint8_t tmp8;
-        uint16_t tmp16;
+        uint8_t pos, len;
         uint32_t tmp32;
         uint64_t tmp64;
 #if TCG_TARGET_REG_BITS == 32
@@ -627,13 +636,8 @@  uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,
 #endif
 #if TCG_TARGET_HAS_deposit_i32
         case INDEX_op_deposit_i32:
-            t0 = *tb_ptr++;
-            t1 = tci_read_rval(regs, &tb_ptr);
-            t2 = tci_read_rval(regs, &tb_ptr);
-            tmp16 = *tb_ptr++;
-            tmp8 = *tb_ptr++;
-            tmp32 = (((1 << tmp8) - 1) << tmp16);
-            tci_write_reg(regs, t0, (t1 & ~tmp32) | ((t2 << tmp16) & tmp32));
+            tci_args_rrrbb(&tb_ptr, &r0, &r1, &r2, &pos, &len);
+            regs[r0] = deposit32(regs[r1], pos, len, regs[r2]);
             break;
 #endif
         case INDEX_op_brcond_i32:
@@ -789,13 +793,8 @@  uintptr_t QEMU_DISABLE_CFI tcg_qemu_tb_exec(CPUArchState *env,
 #endif
 #if TCG_TARGET_HAS_deposit_i64
         case INDEX_op_deposit_i64:
-            t0 = *tb_ptr++;
-            t1 = tci_read_rval(regs, &tb_ptr);
-            t2 = tci_read_rval(regs, &tb_ptr);
-            tmp16 = *tb_ptr++;
-            tmp8 = *tb_ptr++;
-            tmp64 = (((1ULL << tmp8) - 1) << tmp16);
-            tci_write_reg(regs, t0, (t1 & ~tmp64) | ((t2 << tmp16) & tmp64));
+            tci_args_rrrbb(&tb_ptr, &r0, &r1, &r2, &pos, &len);
+            regs[r0] = deposit64(regs[r1], pos, len, regs[r2]);
             break;
 #endif
         case INDEX_op_brcond_i64:
diff --git a/tcg/tci/tcg-target.c.inc b/tcg/tci/tcg-target.c.inc
index 2c64b4f617..640407b4a8 100644
--- a/tcg/tci/tcg-target.c.inc
+++ b/tcg/tci/tcg-target.c.inc
@@ -126,11 +126,9 @@  static TCGConstraintSetIndex tcg_target_op_def(TCGOpcode op)
     case INDEX_op_rotr_i64:
     case INDEX_op_setcond_i32:
     case INDEX_op_setcond_i64:
-        return C_O1_I2(r, r, r);
-
     case INDEX_op_deposit_i32:
     case INDEX_op_deposit_i64:
-        return C_O1_I2(r, 0, r);
+        return C_O1_I2(r, r, r);
 
     case INDEX_op_brcond_i32:
     case INDEX_op_brcond_i64:
@@ -480,13 +478,19 @@  static void tcg_out_op(TCGContext *s, TCGOpcode opc, const TCGArg *args,
         break;
 
     CASE_32_64(deposit)  /* Optional (TCG_TARGET_HAS_deposit_*). */
-        tcg_out_r(s, args[0]);
-        tcg_out_r(s, args[1]);
-        tcg_out_r(s, args[2]);
-        tcg_debug_assert(args[3] <= UINT8_MAX);
-        tcg_out8(s, args[3]);
-        tcg_debug_assert(args[4] <= UINT8_MAX);
-        tcg_out8(s, args[4]);
+        {
+            TCGArg pos = args[3], len = args[4];
+            TCGArg max = opc == INDEX_op_deposit_i32 ? 32 : 64;
+
+            tcg_debug_assert(pos < max);
+            tcg_debug_assert(pos + len <= max);
+
+            tcg_out_r(s, args[0]);
+            tcg_out_r(s, args[1]);
+            tcg_out_r(s, args[2]);
+            tcg_out8(s, pos);
+            tcg_out8(s, len);
+        }
         break;
 
     CASE_32_64(brcond)

[v5,16/57] tcg/tci: Clean up deposit operations

Commit Message

Comments

Patch