Files
clang-p2996/llvm/test/CodeGen/X86
Benjamin Kramer 76268ac682 X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available.
pmuludq is slow, but it turns out that all the unpacking and packing of the
scalarized mul is even slower. 10% speedup on loop-vectorized paq8p.

llvm-svn: 170985
2012-12-22 16:07:56 +00:00
..
2012-11-12 06:49:17 +00:00
2012-07-16 19:35:43 +00:00
2012-05-19 23:34:59 +00:00
2012-10-25 17:50:05 +00:00
2012-03-20 17:20:46 +00:00
2012-07-23 08:51:15 +00:00
2012-06-19 02:17:35 +00:00
2012-08-17 12:28:26 +00:00
2012-05-24 22:08:29 +00:00
2012-08-17 12:28:26 +00:00
2012-08-17 12:28:26 +00:00
2012-09-12 21:43:09 +00:00
2012-08-31 20:12:31 +00:00
2012-09-17 18:05:20 +00:00
2012-10-01 16:44:04 +00:00
2012-10-23 21:40:15 +00:00
2012-10-29 17:57:12 +00:00
2012-09-26 08:24:51 +00:00
2012-12-02 15:46:02 +00:00
2012-09-26 08:24:51 +00:00
2012-09-26 08:24:51 +00:00
2012-09-26 08:24:51 +00:00
2012-11-08 07:28:54 +00:00
2012-04-20 23:36:09 +00:00
2012-07-17 19:40:05 +00:00
2012-06-01 05:00:54 +00:00
2012-03-30 00:26:54 +00:00
2012-10-25 17:50:05 +00:00
2012-10-25 17:50:05 +00:00