Files
clang-p2996/llvm/test/Transforms/SLPVectorizer/NVPTX
peterbell10 55430f8673 [NVPTX] Customize getScalarizationOverhead (#128077)
We've observed that the SLPVectorizer is too conservative on NVPTX
because it over-estimates the cost to build a vector. PTX has a single
`mov` instruction that can build e.g. `<2 x half>` vectors from scalars,
however the SLPVectorizer over-estimates it as the cost of 2 insert
elements.

To fix this I customize `getScalarizationOverhead` to lower the cost for
building 2x16 types.
2025-03-29 01:31:33 +00:00
..