clang-p2996

Files

Philip Reames 0b524efa95 [RISCV][TTI] Reduce cost of a <N x i1> build_vector pattern (#109449)

This is a follow-up to 7f6bbb3. When lowering a <N x i1> build_vector,
we currently choose to extend to i8, perform the build_vector there, and
then truncate back in vector. Our costing, on the other hand, accounts for
it as if we performed a vector extend, an insert, and a vector extract
for every element. This significantly overestimates the cost.

Note that we can likely do better in our build_vector lowering here by
packing the bits in scalar and doing a build_vector of the packed bits.
Regardless, our costing should match our lowering.
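
The gap between the two accountings can be illustrated with a toy model. This is only a sketch under stated assumptions. The unit costs, the function names, and the choice to charge one insert per element for the i8 build_vector are all hypothetical, and none of them are taken from the RISC-V TTI implementation. The last function sketches the scalar bit-packing idea from the note above, which the commit does not implement.

```python
# Toy cost model. All unit costs below are hypothetical placeholders,
# not values from the real RISC-V cost tables.
EXTEND_COST = 1    # assumed cost of one vector extend
INSERT_COST = 1    # assumed cost of one element insert
EXTRACT_COST = 1   # assumed cost of one vector extract
TRUNCATE_COST = 1  # assumed cost of one vector truncate


def per_element_cost(num_elements):
    """Old accounting: an extend, an insert, and an extract for every element."""
    return num_elements * (EXTEND_COST + INSERT_COST + EXTRACT_COST)


def lowering_matched_cost(num_elements):
    """Accounting that mirrors the described lowering.

    Assumes the scalar extension to i8 is free, the <N x i8> build_vector
    costs one insert per element, and a single vector truncate follows.
    """
    return num_elements * INSERT_COST + TRUNCATE_COST


def pack_i1_lanes(bits):
    """Pack i1 lanes into bytes, lane 0 in the least significant bit.

    Illustrates the 'pack the bits in scalar' idea only; the bit order
    is an assumption made for this sketch.
    """
    packed = []
    for start in range(0, len(bits), 8):
        byte = 0
        for offset, bit in enumerate(bits[start:start + 8]):
            byte |= (bit & 1) << offset
        packed.append(byte)
    return packed
```

Under these assumed unit costs, an 8-element vector is charged 24 by the per-element accounting and 9 by the accounting that follows the lowering. The exact numbers are not meaningful; the point is that the first grows with three operations per element while the second pays the truncate only once.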

2024-09-23 07:21:54 -07:00

lit.local.cfg

…

load-widening.ll

…

shuffle-of-intrinsics.ll

[RISCV][TTI] Reduce cost of a <N x i1> build_vector pattern (#109449)

2024-09-23 07:21:54 -07:00

vecreduce-of-cast.ll

[vectorcombine] Pull sext/zext through reduce.or/and/xor (#99548)

2024-07-18 13:56:40 -07:00

vpintrin-scalarization-shufflevector-splat.ll

…

vpintrin-scalarization.ll

[RISCV] Don't cost vector arithmetic fp ops as cheaper than scalar (#99594)

2024-07-22 13:56:10 +08:00