clang-p2996/llvm/test/CodeGen
Benjamin Kramer 76268ac682 X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available.
pmuludq is slow, but it turns out that all the unpacking and packing of the
scalarized mul is even slower. 10% speedup on loop-vectorized paq8p.

llvm-svn: 170985
2012-12-22 16:07:56 +00:00
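
For readers unfamiliar with the trick named in the commit subject, here is a minimal sketch in C with SSE2 intrinsics of how a <4 x i32> multiply can be built from pmuludq (`_mm_mul_epu32`) when SSE4.1's pmulld is not available. It is the commonly used formulation, not necessarily the exact instruction sequence this commit makes the X86 backend emit. The function names `mul_v4i32_sse2` and `mul_matches_scalar` are hypothetical and chosen for illustration only.

```c
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Hypothetical helper: lane-wise 32-bit multiply using only SSE2.
 * pmuludq multiplies lanes 0 and 2 as unsigned 32-bit values and yields
 * two 64-bit products. The low 32 bits of a product are the same for
 * signed and unsigned inputs, so this also serves a signed i32 mul. */
static __m128i mul_v4i32_sse2(__m128i a, __m128i b)
{
    /* Products of lanes 0 and 2. */
    __m128i even = _mm_mul_epu32(a, b);

    /* Shift right by 4 bytes so lanes 1 and 3 land in lanes 0 and 2,
     * then multiply those. */
    __m128i odd = _mm_mul_epu32(_mm_srli_si128(a, 4),
                                _mm_srli_si128(b, 4));

    /* Move the low 32 bits of each 64-bit product into lanes 0 and 1. */
    __m128i even_lo = _mm_shuffle_epi32(even, _MM_SHUFFLE(0, 0, 2, 0));
    __m128i odd_lo  = _mm_shuffle_epi32(odd,  _MM_SHUFFLE(0, 0, 2, 0));

    /* Interleave to restore lane order: p0, p1, p2, p3. */
    return _mm_unpacklo_epi32(even_lo, odd_lo);
}

/* Hypothetical check: returns 1 if the SSE2 result matches a scalar
 * wrapping multiply in every lane, 0 otherwise. */
static int mul_matches_scalar(const uint32_t a[4], const uint32_t b[4])
{
    uint32_t out[4];
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    _mm_storeu_si128((__m128i *)out, mul_v4i32_sse2(va, vb));

    for (int i = 0; i < 4; ++i) {
        if (out[i] != (uint32_t)(a[i] * b[i]))
            return 0;
    }
    return 1;
}
```

The sketch uses two pmuludq instructions plus a few shuffles and stays entirely in vector registers, which is the alternative the commit message weighs against unpacking to scalars, multiplying, and repacking.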