[ARM] Use vector wide add for mixed-mode adds

Hi Kyrill,

I made the following changes based on your comments:

1. I rebased the patch so that it applies cleanly on trunk
2. Fixed the dg-add-options as requested to my new test cases
3. Fixed the GNU style issues identified by ./contrib/check_GNU_style.sh

The failure you are seeing on slp-reduc-3.c is a known failure. The test 
case has a xfail with 'xfail { vect_widen_sum_hi_to_si_pattern' which I 
added in my patch. Richard Biener resolved some of these issues with PR 
68333, but 'slp-reduc-3.c' still fails. I will create a new PR.

I retested on the Linaro testing infrastructure with the latest trunk 
and the only failure is 'slp-reduc-3.c'. Okay for GCC 7?

2016-02-12 Michael Collison <michael.collison@linaro.org>

     * config/arm/neon.md (widen_<us>sum<mode>): New patterns where
     mode is VQI to improve mixed mode vectorization.
     * config/arm/neon.md (vec_sel_widen_ssum_lo<VQI:mode><VW:mode>3): New
     define_insn to match low half of signed vaddw.
     * config/arm/neon.md (vec_sel_widen_ssum_hi<VQI:mode><VW:mode>3): New
     define_insn to match high half of signed vaddw.
     * config/arm/neon.md (vec_sel_widen_usum_lo<VQI:mode><VW:mode>3): New
     define_insn to match low half of unsigned vaddw.
     * config/arm/neon.md (vec_sel_widen_usum_hi<VQI:mode><VW:mode>3): New
     define_insn to match high half of unsigned vaddw.
     * config/arm/arm.c (arm_simd_vect_par_cnst_half): New function.
     (arm_simd_check_vect_par_cnst_half_p): Likewise.
     * config/arm/arm-protos.h (arm_simd_vect_par_cnst_half): Prototype
     for new function.
     (arm_simd_check_vect_par_cnst_half_p): Likewise.
     * config/arm/predicates.md (vect_par_constant_high): Support
     big endian and simplify by calling
     arm_simd_check_vect_par_cnst_half
     (vect_par_constant_low): Likewise.
     * testsuite/gcc.target/arm/neon-vaddws16.c: New test.
     * testsuite/gcc.target/arm/neon-vaddws32.c: New test.
     * testsuite/gcc.target/arm/neon-vaddwu16.c: New test.
     * testsuite/gcc.target/arm/neon-vaddwu32.c: New test.
     * testsuite/gcc.target/arm/neon-vaddwu8.c: New test.
     * testsuite/lib/target-supports.exp
     (check_effective_target_vect_widen_sum_hi_to_si_pattern): Indicate
     that arm neon support vector widen sum of HImode TO SImode.

On 02/09/2016 09:27 AM, Kyrill Tkachov wrote:
> Hi Michael,

>

> On 17/12/15 00:02, Michael Collison wrote:

>> Kyrill,

>>

>> I have attached a patch that address your comments. The only change I 

>> would ask you to re-consider renaming is the function 'bool 

>> aarch32_simd_check_vect_par_cnst_half'. This function was copied from 

>> the aarch64 port and I thought it as important to match the naming 

>> for maintenance purposes. I did rename the function to 'bool 

>> arm_simd_check_vect_par_cnst_half_p'. I changed 'aarch32' to 'arm' 

>> and added '_p' per you suggestions. Is this okay?

>>

>

> Ok, that's fine with me.

>

>> I implemented all your other change suggestions.

>>

>

> Thanks, sorry it took a long time to get back to this, I was busy with 

> regression-fixing patches as we're

> in bug-fixing mode...

>

>> 2015-12-16 Michael Collison <michael.collison@linaro.org>

>>

>>     * config/arm/neon.md (widen_<us>sum<mode>): New patterns where

>>     mode is VQI to improve mixed mode vectorization.

>>     * config/arm/neon.md (vec_sel_widen_ssum_lo<VQI:mode><VW:mode>3): 

>> New

>>     define_insn to match low half of signed vaddw.

>>     * config/arm/neon.md (vec_sel_widen_ssum_hi<VQI:mode><VW:mode>3): 

>> New

>>     define_insn to match high half of signed vaddw.

>>     * config/arm/neon.md (vec_sel_widen_usum_lo<VQI:mode><VW:mode>3): 

>> New

>>     define_insn to match low half of unsigned vaddw.

>>     * config/arm/neon.md (vec_sel_widen_usum_hi<VQI:mode><VW:mode>3): 

>> New

>>     define_insn to match high half of unsigned vaddw.

>>     * config/arm/arm.c (arm_simd_vect_par_cnst_half): New function.

>>     (arm_simd_check_vect_par_cnst_half_p): Likewise.

>>     * config/arm/arm-protos.h (arm_simd_vect_par_cnst_half): Prototype

>>     for new function.

>>     (arm_simd_check_vect_par_cnst_half_p): Likewise.

>>     * config/arm/predicates.md (vect_par_constant_high): Support

>>     big endian and simplify by calling

>>     arm_simd_check_vect_par_cnst_half

>>     (vect_par_constant_low): Likewise.

>>     * testsuite/gcc.target/arm/neon-vaddws16.c: New test.

>>     * testsuite/gcc.target/arm/neon-vaddws32.c: New test.

>>     * testsuite/gcc.target/arm/neon-vaddwu16.c: New test.

>>     * testsuite/gcc.target/arm/neon-vaddwu32.c: New test.

>>     * testsuite/gcc.target/arm/neon-vaddwu8.c: New test.

>>     * testsuite/lib/target-supports.exp

>>     (check_effective_target_vect_widen_sum_hi_to_si_pattern): Indicate

>>     that arm neon support vector widen sum of HImode TO SImode.

>>

>

> I've tried this out and I have a few comments.

> The arm.c hunk doesn't apply to current trunk anymore due to context.

> Can you please rebase the patch?

> I've fixed it up manually in my tree so I can build it.

> With this patch I'm seeing two PASS->FAIL on arm-none-eabi:

> FAIL: gcc.dg/vect/slp-reduc-3.c -flto -ffat-lto-objects 

> scan-tree-dump-times vect "vectorizing stmts using SLP" 1

> FAIL: gcc.dg/vect/slp-reduc-3.c scan-tree-dump-times vect "vectorizing 

> stmts using SLP" 1

> My compiler is configured with --with-float=hard --with-cpu=cortex-a9 

> --with-fpu=neon --with-mode=thumb

> Can you please look into these? Maybe it's just the tests that need 

> adjustment?

>

> Also, I'm seeing the new tests give an error:

> ERROR: gcc.target/arm/neon-vaddws16.c: Unrecognized option type: 

> arm_neon_ok for " dg-add-options 3 arm_neon_ok "

> UNRESOLVED: gcc.target/arm/neon-vaddws16.c: Unrecognized option type: 

> arm_neon_ok for " dg-add-options 3 arm_neon_ok "

>

> That've because the dg-add-options argument should be arm_neon rather 

> than arm_neon_ok.

> Also, since the new tests are compile-only the effective target check 

> should be arm_neon_ok rather than arm_neon_hw.

>

> I also see ./contrib/check_GNU_style.sh complaining about some minor 

> style issues like trailing whitespace and

> blocks of whitespace that should be replaced with tabs.

>

> In any case, this patch is GCC 7 material at this point, so I think 

> with the above issues resolved

> (and the FAILs investigated) this should be in good shape.

>

> Thanks,

> Kyrill

-- 
Michael Collison
Linaro Toolchain Working Group
michael.collison@linaro.org

[ARM] Use vector wide add for mixed-mode adds

Commit Message

Patch