clang-p2996

Files

Sushant Gokhale c5672e21ca [AArch64][CostModel] Reduce the cost of fadd reduction with fast flag (#108791 )

fadd reduction with
  1. Fast flag set
2. No of elements in input vector is power of 2 results in series of
faddp instructions. faddp instruction has latency/throughput identical
to fadd instruction and hence, we set relative cost=1 for faddp as well.

The change didn't show any regression with SPEC17-FP(C/C++),
llvm-test-suite on Neoverse-V2.

2024-09-24 14:35:01 +05:30

abs.ll

…

aggregates.ll

…

arith-fp-frem.ll

…

arith-fp-sve.ll

[AArch64] Add invalid 1 x vscale costs for reductions and reduction-operations. (#102105 )

2024-08-09 14:25:07 +01:00

arith-fp.ll

[AArch64] Set scalar fneg to free for fnmul (#104814 )

2024-08-21 18:10:16 +01:00

arith-overflow.ll

…

arith-ssat.ll

…

arith-usat.ll

…

arith-widening.ll

…

arith.ll

…

bitreverse.ll

…

bswap.ll

…

cast.ll

[AArch64] NFC: Rename -force-streaming-compatible-sve to -force-streaming-compatible (#92774 )

2024-05-22 07:58:54 +01:00

cmp.ll

[AArch64] Expand scmp/ucmp vector operations with sub (#108830 )

2024-09-16 18:44:52 +01:00

cost-scalable-vector-gep.ll

…

ctlz.ll

…

ctpop.ll

…

cttz_elts.ll

[AArch64] Add invalid 1 x vscale costs for reductions and reduction-operations. (#102105 )

2024-08-09 14:25:07 +01:00

cttz.ll

…

div_cte.ll

…

div.ll

[llvm][AArch64] Improve the cost model for i128 div's (#107306 )

2024-09-05 07:42:23 -07:00

ext-rhadd.ll

…

extract_float.ll

[AArch64][CostModel] Add NFC tests for extractelement cost (#108941 )

2024-09-17 22:57:05 +05:30

fp-conversions-odd-vector-types.ll

[AArch64] Add more type combinations to vector fp conversion cost tests.

2024-09-06 14:49:45 +01:00

fptoi_sat.ll

[AArch64] Extend costs for fptoi.sat intrinsics.

2024-07-28 10:47:40 +01:00

free-widening-casts.ll

…

fshl.ll

…

fshr.ll

…

gep.ll

…

getIntrinsicInstrCost-vector-reverse.ll

…

insert-extract.ll

…

kryo-inseltpoison.ll

…

lit.local.cfg

…

load-to-trunc.ll

…

logicalop.ll

…

masked_ldst_vls.ll

…

masked_ldst.ll

[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920 )

2024-06-25 15:04:24 +01:00

mem-op-cost-model.ll

…

min-max.ll

…

mul.ll

…

neon-stepvector.ll

Move stepvector intrinsic out of experimental namespace (#98043 )

2024-08-28 12:48:20 +01:00

no-sve-no-neon.ll

[AArch64] Fix assertion failure in getCastInstrCost

2024-07-16 10:43:07 +00:00

reduce-add.ll

…

reduce-and.ll

…

reduce-fadd.ll

[AArch64][CostModel] Reduce the cost of fadd reduction with fast flag (#108791 )

2024-09-24 14:35:01 +05:30

reduce-minmax.ll

…

reduce-or.ll

…

reduce-xor.ll

…

rem.ll

…

select.ll

…

shuffle-broadcast.ll

…

shuffle-load.ll

…

shuffle-other.ll

Revert "[AArch64] Remove special-case inserted shuffle cost."

2024-07-29 11:24:39 +00:00

shuffle-reverse.ll

…

shuffle-select.ll

…

shuffle-store.ll

…

shuffle-transpose.ll

…

splice.ll

…

store-ptr.ll

…

store.ll

…

sve-arith.ll

[AArch64] Add invalid 1 x vscale costs for reductions and reduction-operations. (#102105 )

2024-08-09 14:25:07 +01:00

sve-bitcast.ll

…

sve-cmpsel.ll

…

sve-ext.ll

…

sve-fixed-length.ll

…

sve-fpext.ll

…

sve-fptoi.ll

[AArch64] Auto-generate check-lines in cost model test.

2024-09-06 22:38:02 +01:00

sve-fptrunc.ll

…

sve-gather.ll

[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920 )

2024-06-25 15:04:24 +01:00

sve-illegal-types.ll

…

sve-insert-extract.ll

…

sve-intrinsics.ll

[AArch64] Consider histcnt smaller than i32 in the cost model (#108521 )

2024-09-19 13:56:52 +01:00

sve-ldst.ll

[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920 )

2024-06-25 15:04:24 +01:00

sve-math.ll

…

sve-min-max.ll

[AArch64] Add invalid 1 x vscale costs for reductions and reduction-operations. (#102105 )

2024-08-09 14:25:07 +01:00

sve-remainder.ll

…

sve-scatter.ll

[LoopVectorize][AArch64] Add limited support for scalable vectorisation of i1 types (#95920 )

2024-06-25 15:04:24 +01:00

sve-shuffle-broadcast.ll

…

sve-stepvector.ll

Move stepvector intrinsic out of experimental namespace (#98043 )

2024-08-28 12:48:20 +01:00

sve-trunc.ll

…

sve-vscale.ll

…

sve-widening-instruction.ll

…

vec3-ops.ll

…

vector-reduce.ll

…

vector-select.ll

…