clang-p2996

Author	SHA1	Message	Date
Sanjay Patel	31017cd10a	[InstCombine] add tests for unsigned add overflow; NFC llvm-svn: 342082	2018-09-12 21:13:37 +00:00
Roman Lebedev	e14b0282bb	[NFC][InstCombine] Drop newly-added interference-tests-for-high-bit-check.ll Now that i have actually double-checked, no, there is no such interference possible... llvm-svn: 342076	2018-09-12 20:06:46 +00:00
Roman Lebedev	91c668a276	[NFC][InstCombine] PR38708 - inefficient pattern for high-bits checking. More complicated, canonical pattern: https://rise4fun.com/Alive/uhA https://godbolt.org/z/o4RB8D Also, we need to be careful not to skip some patterns... https://bugs.llvm.org/show_bug.cgi?id=38708 llvm-svn: 342074	2018-09-12 19:44:26 +00:00
Roman Lebedev	75404fb9f8	[InstCombine] Inefficient pattern for high-bits checking (PR38708) Summary: It is sometimes important to check that some newly-computed value is non-negative and only `n` bits wide (where `n` is a variable.) There are many ways to check that: https://godbolt.org/z/o4RB8D The last variant seems best? (I'm sure there are some other variations i haven't thought of..) Let's handle the second variant first, since it is much simpler. https://rise4fun.com/Alive/LYjY https://bugs.llvm.org/show_bug.cgi?id=38708 Reviewers: spatel, craig.topper, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51985 llvm-svn: 342067	2018-09-12 18:19:43 +00:00
Roman Lebedev	99359f391e	[NFC][InstCombine] PR38708 - inefficient pattern for high-bits checking. The simplest pattern for now: https://rise4fun.com/Alive/LYjY https://godbolt.org/z/o4RB8D https://bugs.llvm.org/show_bug.cgi?id=38708 llvm-svn: 342054	2018-09-12 14:11:37 +00:00
Sanjay Patel	1cf0734b2f	[InstCombine] add folds for unsigned-overflow compares Name: op_ugt_sum %a = add i8 %x, %y %r = icmp ugt i8 %x, %a => %notx = xor i8 %x, -1 %r = icmp ugt i8 %y, %notx Name: sum_ult_op %a = add i8 %x, %y %r = icmp ult i8 %a, %x => %notx = xor i8 %x, -1 %r = icmp ugt i8 %y, %notx https://rise4fun.com/Alive/ZRxI AFAICT, this doesn't interfere with any add-saturation patterns because those have >1 use for the 'add'. But this should be better for IR analysis and codegen in the basic cases. This is another fold inspired by PR14613: https://bugs.llvm.org/show_bug.cgi?id=14613 llvm-svn: 342004	2018-09-11 22:40:20 +00:00
Sanjay Patel	26725bdc50	[InstCombine] add folds for icmp with xor mask constant These are the folds in Alive; Name: xor_ult Pre: isPowerOf2(-C1) %xor = xor i8 %x, C1 %r = icmp ult i8 %xor, C1 => %r = icmp ugt i8 %x, ~C1 Name: xor_ugt Pre: isPowerOf2(C1+1) %xor = xor i8 %x, C1 %r = icmp ugt i8 %xor, C1 => %r = icmp ugt i8 %x, C1 https://rise4fun.com/Alive/Vty The ugt case in its simplest form was already handled by DemandedBits, but that's not ideal as shown in the multi-use test. I'm not sure if these are all of the symmetrical folds, but I adjusted the existing code for one of the folds to try to show the similarities. There's no obvious connection, but this is another preliminary step for PR14613... https://bugs.llvm.org/show_bug.cgi?id=14613 llvm-svn: 341997	2018-09-11 22:00:15 +00:00
Sanjay Patel	c79d964fdd	[InstCombine] add tests for icmp with xor; NFC llvm-svn: 341993	2018-09-11 21:13:20 +00:00
Sanjay Patel	342c3bcf11	[InstCombine] enhance vector demanded elements to look at a vector select condition operand I noticed that we were not back-propagating undef lanes to shuffle masks when we have a shuffle that reduces the vector width. This is part of investigating/solving PR38691: https://bugs.llvm.org/show_bug.cgi?id=38691 The DAG equivalent was proposed with: D51696 Differential Revision: https://reviews.llvm.org/D51433 llvm-svn: 341981	2018-09-11 18:49:00 +00:00
Sanjay Patel	44c1b3a331	[InstCombine] add tests for add-with-overflow compares; NFC llvm-svn: 341979	2018-09-11 18:45:28 +00:00
Craig Topper	4e63db8387	[InstCombine] Fix incorrect usage of getPrimitiveSizeInBits when we should be using the element size for vectors For vectors, getPrimitiveSizeInBits returns the full vector width. This code should be using the element size for vectors. This could be fixed by calling getScalarSizeInBits, but it's even easier to just get it from the APInt we're checking. Differential Revision: https://reviews.llvm.org/D51938 llvm-svn: 341971	2018-09-11 17:57:20 +00:00
Craig Topper	a57bb61a3e	[InstCombine] Support (mul (sext x), cst) --> (sext (mul x, cst')) and (mul (zext x), cst) --> (zext (mul x, cst')) for vectors constants. Similar to D51236, but for mul instead of add. Differential Revision: https://reviews.llvm.org/D51900 llvm-svn: 341961	2018-09-11 16:51:24 +00:00
Craig Topper	3de8d592d1	[InstCombine] Add testcases for (mul (sext x), cst) --> (sext (mul x, cst')) and (mul (zext x), cst) --> (zext (mul x, cst')) for vectors constants. If the multiply won't overflow in the original type we can use a smaller mul and sign extend afterwards. We don't currently support this for vector constants. llvm-svn: 341884	2018-09-10 23:48:21 +00:00
Alina Sbirlea	116caa2920	[InstCombine] Partially revert rL341674 due to PR38897. Summary: Revert min/max changes in rL341674 due to high compile times causing timeouts (PR38897). Checking in to unblock failing builds. Patch available for post-commit review and re-revert once resolved. Working on a smaller reproducer for PR38897. Reviewers: craig.topper, spatel Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D51897 llvm-svn: 341883	2018-09-10 23:47:21 +00:00
Tim Northover	12c1f7675f	InstCombine: move hasOneUse check to the top of foldICmpAddConstant There were two combines not covered by the check before now, neither of which actually differed from normal in the benefit analysis. The most recent seems to be because it was just added at the top of the function (naturally). The older is from way back in 2008 (r46687) when we just didn't put those checks in so routinely, and has been diligently maintained since. llvm-svn: 341831	2018-09-10 14:26:44 +00:00
Sanjay Patel	caa4de72a2	[InstCombine][x86] add tests for possible blendv transform (PR38814); NFC llvm-svn: 341715	2018-09-07 21:40:41 +00:00
Sanjay Patel	c1416b60f2	[InstCombine] narrow vector select with padded condition and extracted result (PR38691) shuf (sel (shuf NarrowCond, undef, WideMask), X, Y), undef, NarrowMask) --> sel NarrowCond, (shuf X, undef, NarrowMask), (shuf Y, undef, NarrowMask) The motivating case from: https://bugs.llvm.org/show_bug.cgi?id=38691 ...is the last regression test. In that case, we're just left with the narrow select. Note that if we do create new shuffles, they use the existing extraction identity mask, so there's no danger that this transform creates arbitrary shuffles. Differential Revision: https://reviews.llvm.org/D51496 llvm-svn: 341708	2018-09-07 21:03:34 +00:00
Craig Topper	040c2b0acf	[InstCombine] Fold (min/max ~X, Y) -> ~(max/min X, ~Y) when Y is freely invertible If the ~X wasn't able to simplify above the max/min, we might be able to simplify it by moving it below the max/min. I had to modify the ~(min/max ~X, Y) transform to prevent getting stuck in a loop when we saw the new ~(max/min X, ~Y) before the ~Y had been folded away to remove the new not. Differential Revision: https://reviews.llvm.org/D51398 llvm-svn: 341674	2018-09-07 16:19:50 +00:00
Florian Hahn	e32ff4b28a	[InstCombine] Do not fold scalar ops over select with vector condition. If OtherOpT or OtherOpF have scalar types and the condition is a vector, we would create an invalid select. Reviewers: spatel, john.brawn, mssimpso, craig.topper Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D51781 llvm-svn: 341666	2018-09-07 14:40:06 +00:00
Sanjay Patel	93bd15a005	[InstCombine] add xor+not folds This fold is needed to avoid a regression when we try to recommit rL300977. We can't see the most basic win currently because demanded bits changes the patterns: https://rise4fun.com/Alive/plpp llvm-svn: 341559	2018-09-06 16:23:40 +00:00
Sanjay Patel	99d732052f	[InstCombine] add tests for xor-not; NFC These tests demonstrate a missing fold that would also be needed to avoid a regression when we try to recommit rL300977. llvm-svn: 341557	2018-09-06 15:35:01 +00:00
David Green	e6918ca2b3	[SLC] Add an alignment to CreateGlobalString Previously the alignment on the newly created global strings was not set, meaning that DataLayout::getPreferredAlignment was free to overalign it to 16 bytes. This caused unnecessary code bloat with the padding between variables. The main example of this happening was the printf->puts optimisation in SimplifyLibCalls, but as the change here is made in IRBuilderBase::CreateGlobalString, other globals using this will now be aligned too. Differential Revision: https://reviews.llvm.org/D51410 llvm-svn: 341527	2018-09-06 08:42:17 +00:00
Sanjay Patel	63cf26cf01	[InstCombine] fix xor-or-xor fold to check uses and handle commutes I'm probably missing some way to use m_Deferred to remove the code duplication, but that can be a follow-up. The improvement in demand_shrink_nsw.ll is an example of missing the fold because the pattern matching was deficient. I didn't try to follow the bits in that test, but Alive says it's correct: https://rise4fun.com/Alive/ugc llvm-svn: 341426	2018-09-04 23:22:13 +00:00
Sanjay Patel	018ce562a9	[InstCombine] update tests checks; NFC llvm-svn: 341424	2018-09-04 23:08:23 +00:00
Sanjay Patel	5bbe8cd7ef	[InstCombine] add tests for xor-or-xor fold; NFC There are 2 bugs shown here that were untested before: 1. We fail to perform the fold in 1/2 the possible commuted variants. 2. When the fold is done, it disregards extra uses. llvm-svn: 341415	2018-09-04 22:10:23 +00:00
Sanjay Patel	0f70f86ce0	[InstCombine] make ((X & C) ^ C) form consistent for vectors It would be better to create a 'not' here, but that's not possible yet. llvm-svn: 341410	2018-09-04 21:17:14 +00:00
Sanjay Patel	664b2e3bd6	[InstCombine] improve xor+and/or tests The tests attempted to check for commuted variants of these folds, but complexity-based canonicalization meant we had no coverage for at least 1/2 of the cases. Also, the folds correctly check hasOneUse(), but there was no coverage for that. llvm-svn: 341394	2018-09-04 19:06:46 +00:00
Nicola Zaghen	9588ad9611	[InstCombine] Fold icmp ugt/ult (add nuw X, C2), C --> icmp ugt/ult X, (C - C2) Support for sgt/slt was added in rL294898, this adds the same cases also for unsigned compares. This is the Alive proof: https://rise4fun.com/Alive/nyY Differential Revision: https://reviews.llvm.org/D50972 llvm-svn: 341353	2018-09-04 10:29:48 +00:00
Sanjay Patel	d75064e6d5	[InstCombine] allow add+not --> sub for arbitrary vector constants. llvm-svn: 341335	2018-09-03 18:21:59 +00:00
Sanjay Patel	faa02b1abb	[InstCombine] consolidate tests for ~(X+C); NFC llvm-svn: 341332	2018-09-03 18:04:21 +00:00
Florian Hahn	cc9dc599ba	[SLC] Support expanding pow(x, n+0.5) to x * x * ... * sqrt(x) Reviewers: evandro, efriedma, spatel Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D51435 llvm-svn: 341330	2018-09-03 17:37:39 +00:00
Sanjay Patel	17e709b66a	[InstCombine] allow not+sub fold for arbitrary vector constants The fold was implemented for the general case with a use limitation, but the later constant version, which didn't check uses, was only matching splat constants. llvm-svn: 341292	2018-09-02 19:31:45 +00:00
Sanjay Patel	04ab22b3f4	[InstCombine] move/add tests for not+sub; NFC llvm-svn: 341291	2018-09-02 19:18:13 +00:00
Evandro Menezes	2123ea7d5c	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 341095	2018-08-30 19:04:51 +00:00
Craig Topper	f0531da109	[InstCombine] Add test cases for D51398 These tests contain the pattern (neg (max ~X, C)) which we should transform to ((min X, ~C) + 1) llvm-svn: 341023	2018-08-30 06:14:54 +00:00
Reid Kleckner	9397c2a23b	Revert r340947 "[InstCombine] Expand the simplification of pow() into exp2()" It broke the clang-cl self-host. llvm-svn: 340991	2018-08-29 22:58:33 +00:00
Sanjay Patel	0f29e953b7	[InstCombine] canonicalize fneg with llvm.sin This is a follow-up to rL339604 which did the same transform for a sin libcall. The handling of intrinsics vs. libcalls is unfortunately scattered, so I'm just adding this next to the existing transform for llvm.cos for now. This should resolve PR38458: https://bugs.llvm.org/show_bug.cgi?id=38458 If the call was already negated, the negates will cancel each other out. llvm-svn: 340952	2018-08-29 18:27:49 +00:00
Sanjay Patel	12a7ea44ed	[InstCombine] add tests for llvm.sin(-x); NFC Also add a corresponding test for llvm.cos with FMF to make sure that was handled correctly. llvm-svn: 340950	2018-08-29 18:11:42 +00:00
Evandro Menezes	22e0bdf4ed	[InstCombine] Expand the simplification of pow() with nested exp{,2}() Expand the simplification of `pow(exp{,2}(x), y)` to all FP types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D51195 llvm-svn: 340948	2018-08-29 17:59:48 +00:00
Evandro Menezes	a3a7b53571	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 340947	2018-08-29 17:59:34 +00:00
Sanjay Patel	3abd9f6bdc	[InstCombine] add test for vector demanded elements + shrinking; NFC llvm-svn: 340933	2018-08-29 15:34:19 +00:00
Matt Arsenault	10de2775bd	AMDGPU: Remove nan tests in class if src is nnan llvm-svn: 340850	2018-08-28 18:10:02 +00:00
Sanjay Patel	60ffc2e9a4	[InstCombine] fix baseline assertions rL340842 contained the wrong version of the check lines. llvm-svn: 340846	2018-08-28 17:23:20 +00:00
Sanjay Patel	c9756e5a23	[InstCombine] add tests for select narrowing (PR38691); NFC llvm-svn: 340842	2018-08-28 16:45:00 +00:00
Craig Topper	a6cd4b9bce	[InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add (zext x), cst) --> (zext (add x, cst')) to work for vectors Differential Revision: https://reviews.llvm.org/D51236 llvm-svn: 340796	2018-08-28 02:02:29 +00:00
Kit Barton	7c80f98b69	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
Craig Topper	e23e8a4f53	[InstCombine] Add test cases for D51236. NFC llvm-svn: 340789	2018-08-27 22:55:49 +00:00
Sanjay Patel	42d31c20a8	[InstCombine] allow shuffle+binop canonicalization with widening shuffles This lines up with the behavior of an existing transform where if both operands of the binop are shuffled, we allow moving the binop before the shuffle regardless of whether the shuffle changes the size of the vector. llvm-svn: 340787	2018-08-27 22:41:44 +00:00
Evandro Menezes	253991cfaf	[PATCH] [InstCombine] Fix issue in the simplification of pow() with nested exp{,2}() Fix the issue of duplicating the call to `exp{,2}()` when it's nested in `pow()`, as exposed by rL340462. Differential revision: https://reviews.llvm.org/D51194 llvm-svn: 340784	2018-08-27 22:11:15 +00:00
Sanjay Patel	57a0b4edd7	[InstCombine] add tests for shuffle+binop transform; NFC llvm-svn: 340683	2018-08-25 14:37:08 +00:00
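
The Alive folds quoted in commits 1cf0734b2f (unsigned-overflow compares) and 26725bdc50 (icmp with xor mask constant) can be sanity-checked by brute force, since i8 has only 256 values. The following is a minimal Python sketch and is not part of the LLVM tree; the function names and the `BITS`/`MASK` constants are hypothetical, and wrap-around is emulated by masking to 8 bits.

```python
# Exhaustive i8 check of the folds quoted in the commit messages above.
# All names here are illustrative; nothing below is LLVM API.
BITS = 8
MASK = (1 << BITS) - 1  # 0xFF, used to emulate i8 wrap-around


def is_power_of_2(v):
    """True if v (already reduced to i8) has exactly one bit set."""
    return v != 0 and (v & (v - 1)) == 0


def check_unsigned_overflow_folds():
    """op_ugt_sum and sum_ult_op: both should equal (y ugt ~x)."""
    for x in range(MASK + 1):
        for y in range(MASK + 1):
            a = (x + y) & MASK       # %a = add i8 %x, %y
            notx = x ^ MASK          # %notx = xor i8 %x, -1
            folded = y > notx        # icmp ugt i8 %y, %notx
            if (x > a) != folded:    # icmp ugt i8 %x, %a
                return False
            if (a < x) != folded:    # icmp ult i8 %a, %x
                return False
    return True


def check_xor_mask_folds():
    """xor_ult (Pre: isPowerOf2(-C1)) and xor_ugt (Pre: isPowerOf2(C1+1))."""
    for c1 in range(MASK + 1):
        ult_pre = is_power_of_2((-c1) & MASK)
        ugt_pre = is_power_of_2((c1 + 1) & MASK)
        for x in range(MASK + 1):
            xor = x ^ c1             # %xor = xor i8 %x, C1
            # xor_ult: icmp ult %xor, C1  =>  icmp ugt %x, ~C1
            if ult_pre and (xor < c1) != (x > (c1 ^ MASK)):
                return False
            # xor_ugt: icmp ugt %xor, C1  =>  icmp ugt %x, C1
            if ugt_pre and (xor > c1) != (x > c1):
                return False
    return True


if __name__ == "__main__":
    print("unsigned-overflow folds hold:", check_unsigned_overflow_folds())
    print("xor-mask folds hold:", check_xor_mask_folds())
```

This only covers the 8-bit case that the commit messages spell out; the linked Alive proofs are the authoritative check for other widths.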

3694 Commits