isImpliedCondition() currently handles and/or on the LHS, but not
on the RHS, resulting in asymmetric behavior. This patch adds two
new implication rules:
* LHS ==> (RHS1 || RHS2) if LHS ==> RHS1 or LHS ==> RHS2
* LHS ==> !(RHS1 && RHS2) if LHS ==> !RHS1 or LHS ==> !RHS2
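A minimal illustration of the first rule (hypothetical values, not
taken from the patch's tests):

  %lhs  = icmp ult i8 %x, 5
  %rhs1 = icmp ult i8 %x, 10
  %rhs  = or i1 %rhs1, %y
  ; %lhs ==> %rhs1, so with the new rule %lhs ==> %rhs as well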
Differential Revision: https://reviews.llvm.org/D125551
This adds two conjugated folds:
* A | B -> B if A implies B (https://alive2.llvm.org/ce/z/R6GU4j)
* A & B -> A if A implies B (https://alive2.llvm.org/ce/z/EGMqyy)
If A and B are icmps themselves, we will usually fold this through
other logic already (though the tests show a couple additional cases
we previously missed). However, isImpliedCondition() also supports A
being of the form X & Y, which allows us to handle cases like
(X & Y) | B where X implies B. This addresses the regression from
D125398.
Something that notably doesn't work yet is the (X | Y) & B case.
This is due to an asymmetry in the isImpliedCondition()
implementation that will have to be addressed separately.
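For illustration, a sketch of the newly handled (X & Y) | B shape
(names are hypothetical):

  %x  = icmp ult i8 %a, 5      ; X
  %y  = icmp ne  i8 %b, 0      ; Y
  %xy = and i1 %x, %y          ; A = X & Y
  %b2 = icmp ult i8 %a, 10     ; B, implied by X
  %r  = or i1 %xy, %b2         ; A implies B, so %r simplifies to %b2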
Differential Revision: https://reviews.llvm.org/D125530
Most insertelement constant folding is blocked if the vector type
is scalable. I believe we can make an exception for inserting null
into an all zeros vector.
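For example (illustrative):

  %v = insertelement <vscale x 4 x i32> zeroinitializer, i32 0, i64 %idx
  ; inserting null into an all-zeros vector leaves it all zeros, so
  ; this can fold to zeroinitializer even for scalable vectors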
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D123413
Currently the fsub optimizations in InstSimplify don't know how to fold
-0.0 - (-X) to X when the constrained intrinsics are used. This adds partial
support. The rest of the support will come later with work on the IR
matchers.
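A sketch of the newly handled pattern (illustrative operands; the
rounding/exception arguments shown are one possible choice):

  %negx = fneg double %x
  %r = call double @llvm.experimental.constrained.fsub.f64(
           double -0.0, double %negx,
           metadata !"round.tonearest", metadata !"fpexcept.ignore")
  ; %r can now fold to %x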
This review is split out from D107285.
Differential Revision: https://reviews.llvm.org/D123396
With opaque pointers, we can eliminate zero-index GEPs even if
they have multiple indices, as this no longer impacts the result
type of the GEP.
This optimization is already done for instructions in InstSimplify,
but we were missing the corresponding constant expression handling.
The constexpr transform is a bit more powerful, because it can
produce a vector splat constant and also handles undef values --
it is an extension of an existing single-index transform.
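For example (illustrative):

  @g = global [4 x [4 x i32]] zeroinitializer
  @p = global ptr getelementptr ([4 x [4 x i32]], ptr @g, i64 0, i64 0, i64 0)
  ; with opaque pointers the multi-index all-zero GEP folds to plain @g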
This change was previously reverted because I forgot to rerun
update_test_checks.py, so the tests did not reflect the actual baseline.
Extracted from: https://reviews.llvm.org/D122757
Currently some optimizations are disabled because llvm::CannotBeNegativeZero()
does not know how to deal with the constrained intrinsics. This patch fixes
that by extending the existing implementation.
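One kind of case this enables (an illustrative sketch, assuming
round-to-nearest; not taken from the patch):

  %a = call double @llvm.experimental.constrained.fadd.f64(
           double %x, double 0.0,
           metadata !"round.tonearest", metadata !"fpexcept.ignore")
  ; under round-to-nearest, x + (+0.0) is never -0.0 (even when x is
  ; -0.0), so %a cannot be negative zero and folds guarded by
  ; llvm::CannotBeNegativeZero() become available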
Differential Revision: https://reviews.llvm.org/D121483
The change fixes the treatment of constrained compare intrinsics
when the compared values are of vector type.
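For example, compares like the following are now handled (illustrative):

  %c = call <2 x i1> @llvm.experimental.constrained.fcmp.v2f64(
           <2 x double> %a, <2 x double> %b,
           metadata !"olt", metadata !"fpexcept.strict")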
Differential Revision: https://reviews.llvm.org/D110322
In D111530, I suggested that we add some relatively basic pattern-matching
folds for shifts and funnel shifts and avoid a more specialized solution
if possible.
We can start by implementing at least one of these in IR because it's
easier to write the code and verify with Alive2:
https://alive2.llvm.org/ce/z/qHpmNn
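A sketch of the kind of shift-plus-funnel-shift pattern involved
(operand names are hypothetical; the Alive2 link above has the
verified form):

  %s = shl i32 %x, %y
  %f = call i32 @llvm.fshl.i32(i32 %x, i32 %z, i32 %y)
  %r = or i32 %s, %f
  ; the funnel shift already contains all of %s's bits,
  ; so %r simplifies to %f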
This will need to be adapted/extended for SDAG to handle the motivating
bug ( #49541 ) because the patterns only appear later with that example
(added some tests: bb850d422b)
This can be extended within InstSimplify to handle cases where we 'and'
with a shift too (in that case, kill the funnel shift).
We could also handle patterns where the shift and funnel shift directions
are inverted, but I think it's better to canonicalize that instead to
avoid pattern-match case explosion.
Differential Revision: https://reviews.llvm.org/D120253
The code was using exact sizing only, but since what we really need is just to make sure the offsets are in bounds, a minimum bound on the object size is sufficient.
To demonstrate the difference, support computing minimum sizes from objects of scalable vector type.
Remove some code which tried to handle the case of comparing two allocas where an object size could not be precisely computed. This code had zero coverage in tree, and at least one nasty bug.
The bug comes from the fact that the code uses the size of the result pointer as a proxy for whether the alloca can be of size zero. Since the result of an alloca is *always* a pointer type, and a pointer type can *never* be empty, this check was a no-op. As a result, we blindly considered a zero offset from two allocas to never be equal. They can in fact be equal when one or more of the allocas is zero sized.
This is particularly ugly because instcombine contains the exact opposite rule. If instcombine reaches the allocas first, it combines them into one (making them equal). If instsimplify reaches the compare first, it considers them not equal. This creates all kinds of fun scenarios where different orders of optimization reach different and contradictory conclusions.
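A sketch of the problematic case (illustrative):

  %a = alloca [0 x i8]
  %b = alloca [0 x i8]
  %c = icmp eq ptr %a, %b
  ; %c is not necessarily false: zero-sized allocas may legally be
  ; placed at the same address (and instcombine may merge them)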
Currently the fsub optimizations in InstSimplify don't know how to fold
X - -0.0 to X when we know X is not zero and the constrained intrinsics
are used. This adds the support.
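A sketch of the pattern (illustrative choice of rounding/exception
arguments):

  ; given that %x is known not to be zero:
  %r = call double @llvm.experimental.constrained.fsub.f64(
           double %x, double -0.0,
           metadata !"round.tonearest", metadata !"fpexcept.ignore")
  ; %r can now fold to %x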
This review is split out from D107285.
Differential Revision: https://reviews.llvm.org/D119746
A more general enhancement needs to add tests and make sure
that intrinsics that return structs are handled correctly. There are also
target-specific intrinsics, and I'm not sure what behavior is
expected for those.
Currently the fsub optimizations in InstSimplify don't know how to fold
X - +0.0 to X when using the constrained intrinsics. This adds the support.
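A sketch of the pattern (illustrative arguments, assuming
round-to-nearest):

  %r = call double @llvm.experimental.constrained.fsub.f64(
           double %x, double 0.0,
           metadata !"round.tonearest", metadata !"fpexcept.ignore")
  ; %r can now fold to %x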
This review is split out from D107285.
Differential Revision: https://reviews.llvm.org/D118928
The change implements constant folding of ‘llvm.experimental.constrained.fcmp’
and ‘llvm.experimental.constrained.fcmps’ intrinsics.
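For example (illustrative; a quiet compare of non-NaN constants cannot
raise an exception, so the result is foldable):

  %r = call i1 @llvm.experimental.constrained.fcmp.f64(
           double 1.0, double 2.0,
           metadata !"olt", metadata !"fpexcept.ignore")
  ; folds to true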
Differential Revision: https://reviews.llvm.org/D110322
With opaque pointers, it's possible to directly call a function with
a different signature, without an intermediate bitcast. However,
lots of code using getCalledFunction() reasonably assumes that the
signatures match (which is always true without opaque pointers).
Add an explicit check to that effect.
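For example (illustrative), this is valid IR with opaque pointers:

  declare void @callee(i32)

  define void @caller() {
    call void @callee()   ; direct call, mismatched signature, no bitcast
    ret void
  }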
The test case is from D105313, where I ran into the problem, but on
further investigation this also affects lots of other code, we just
have little coverage with mismatching signatures. The change from
D105313 is still desirable for other reasons, but this patch
addresses the root problem when it comes to opaque pointers.
Differential Revision: https://reviews.llvm.org/D105733
Use the new PM syntax when specifying the pipeline in regression
tests that previously ran
"opt -newgvn ..."
Instead we now do
"opt -passes=newgvn ..."
Notice that this also changes the aa-pipeline to become the default
aa-pipeline instead of just basic-aa. Since these tests haven't been
explicitly requesting basic-aa in the past (compared to the test cases
updated in a separate patch involving "-basic-aa -newgvn") it is
assumed that the exact aa-pipeline isn't important for the validity
of the test cases. An alternative could have been to add
-aa-pipeline=basic-aa as well to the run lines, but that might just
add clutter in case the test cases do not care about the aa-pipeline.
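For illustration, an updated RUN line and the more explicit
alternative would look like:

  ; RUN: opt -passes=newgvn -S < %s | FileCheck %s
  ; RUN: opt -aa-pipeline=basic-aa -passes=newgvn -S < %s | FileCheck %s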
This is another step to move away from the legacy PM syntax when
specifying passes in opt.
Differential Revision: https://reviews.llvm.org/D118341
The behavior in Analysis (knownbits) implements poison semantics already,
and we expect the transforms (for example, in instcombine) derived from
those semantics, so this patch changes the LangRef and remaining code to
be consistent. This is one more step in removing "undef" from LLVM.
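For example (a sketch, assuming this refers to the ctlz/cttz
intrinsics with their is-zero-poison flag set):

  %r = call i32 @llvm.cttz.i32(i32 0, i1 true)  ; poison, not undef
  %m = and i32 %r, 31   ; with undef this would be confined to [0, 31];
                        ; with poison the mask does not sanitize %m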
Without this, I think https://github.com/llvm/llvm-project/issues/53330
has a legitimate complaint because that report wants to allow subsequent
code to mask off bits, and that is allowed with undef values. The clang
builtins are not actually documented anywhere AFAICT, but we might want
to add that to remove more uncertainty.
Differential Revision: https://reviews.llvm.org/D117912
Peculiarly, the necessary code to handle pointers (including the
check for non-integral address spaces) is already in place,
because we were already allowing vectors of pointers here, just
not plain pointers.
The reinterpret load code will convert undef values into zero.
Check the uniform value case before it to produce a better result
for all-undef initializers.
However, the uniform value handling will return the uniform value
even if the access is out of bounds, while the reinterpret load
code will return undef. Add an explicit check to retain the
previous result in this case.
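For example (illustrative):

  @g = constant <2 x i32> undef
  %v = load i32, ptr @g
  ; now folds to undef via the uniform value path, rather than 0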
We could use knownbits on both operands for even more folds (and there are
already tests in place for that), but this is enough to recover the example
from:
https://github.com/llvm/llvm-project/issues/51934
(the tests are derived from the code in that example)
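For example (illustrative; the known bits of %m bound it below the
divisor):

  %m = and i8 %x, 7
  %d = udiv i8 %m, 8    ; numerator known less than 8 -> folds to 0
  %r = urem i8 %m, 8    ; folds to %m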
I am assuming no noticeable compile-time impact from this because udiv/urem
are rare opcodes.
Differential Revision: https://reviews.llvm.org/D116616