Given a poison constant as input, the dyn_cast to a ConstantInt would
fail, so we would fall through to the generic code that attempts to fold
each element of the input vectors. The inputs to these intrinsics are
not vectors though, leading to a compile-time crash. Instead, bail out
properly for poison values by returning nullptr. This doesn't try to
define what poison means for these intrinsics.
Fixes #56945
The div/rem constant expressions are going away in D129148. Convert
some tests to use InstSimplify instead, to show that the constant
folding still happens.
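For example, a converted test looks roughly like this (a minimal
sketch, not one of the actual tests):

  ; RUN: opt -S -passes=instsimplify < %s | FileCheck %s
  define i32 @sdiv_fold() {
    ; CHECK: ret i32 2
    %r = sdiv i32 6, 3
    ret i32 %r
  }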
These were previously left behind due to the required instruction
renumbering; drop them now. This more accurately represents
opaque pointer input IR.
Also drop duplicate opaque pointer check lines in one SROA test.
With opaque pointers, we end up merging these GEPs and dropping
the inrange attribute (in the last two cases). This did not happen
previously, because typed pointers use less powerful GEP folding logic.
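As an invented illustration (the affected tests use vtable-style
constants; the global and indices here are made up):

  @vt = constant { [3 x ptr] } zeroinitializer
  ; with typed pointers, the inner GEP and its inrange survived:
  @p = global ptr getelementptr inbounds (ptr,
         ptr getelementptr inbounds ({ [3 x ptr] }, ptr @vt,
           i64 0, inrange i32 0, i64 0),
         i64 1)
  ; with opaque pointers, the two GEPs merge into one, and the
  ; inrange marker on the inner index is dropped:
  ;   getelementptr inbounds ({ [3 x ptr] }, ptr @vt, i64 0, i32 0, i64 1)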
I'm a bit unsure whether this is something we need to be concerned
about or not. I believe that generally our stance is that we should
perform folds even if this requires losing poison-generating flags
like inrange.
We can either a) accept this as-is, b) try to inhibit folding if it
requires dropping inrange, or c) try to fold to poison if we know
that inrange is going to be violated.
For now, we accept it as-is.
Differential Revision: https://reviews.llvm.org/D127503
The only interesting test change is in @PR31262, where the following
fold is now performed, while it previously was not:
https://alive2.llvm.org/ce/z/a5Qmr6
llvm/test/Transforms/InstSimplify/ConstProp/gep.ll has not been
updated, because there is a tradeoff between folding and inrange
preservation there that we may want to discuss.
Updates have been performed using:
https://gist.github.com/nikic/98357b71fd67756b0f064c9517b62a34
Most of insertelement constant folding is blocked if the vector type
is scalable. I believe we can make an exception for inserting null
into an all-zeros vector.
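A sketch of the newly allowed fold:

  define <vscale x 4 x i32> @f() {
    ; inserting zero into an all-zeros vector changes nothing, even
    ; though the element count is unknown at compile time:
    %v = insertelement <vscale x 4 x i32> zeroinitializer, i32 0, i64 1
    ret <vscale x 4 x i32> %v   ; folds to zeroinitializer
  }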
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D123413
A more general enhancement needs to add tests and make sure
that intrinsics that return structs are correct. There are also
target-specific intrinsics, and I'm not sure what behavior is
expected for those.
Use the new PM syntax when specifying the pipeline in regression
tests that previously ran
"opt -newgvn ..."
Instead, we now do
"opt -passes=newgvn ..."
Notice that this also changes the aa-pipeline to become the default
aa-pipeline instead of just basic-aa. Since these tests haven't been
explicitly requesting basic-aa in the past (compared to the test cases
updated in a separate patch involving "-basic-aa -newgvn"), it is
assumed that the exact aa-pipeline isn't important for the validity
of the test cases. An alternative could have been to add
-aa-pipeline=basic-aa as well to the run lines, but that might just
add clutter in case the test cases do not care about the aa-pipeline.
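For a typical run line the change looks like this, with the rejected
alternative shown for comparison:

  ; RUN: opt -passes=newgvn -S < %s | FileCheck %s
  ; rather than:
  ; RUN: opt -aa-pipeline=basic-aa -passes=newgvn -S < %s | FileCheck %s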
This is another step to move away from the legacy PM syntax when
specifying passes in opt.
Differential Revision: https://reviews.llvm.org/D118341
The behavior in Analysis (knownbits) implements poison semantics already,
and we expect the transforms (for example, in instcombine) derived from
those semantics, so this patch changes the LangRef and remaining code to
be consistent. This is one more step in removing "undef" from LLVM.
Without this, I think https://github.com/llvm/llvm-project/issues/53330
has a legitimate complaint because that report wants to allow subsequent
code to mask off bits, and that is allowed with undef values. The clang
builtins are not actually documented anywhere AFAICT, but we might want
to add that to remove more uncertainty.
Differential Revision: https://reviews.llvm.org/D117912
Peculiarly, the necessary code to handle pointers (including the
check for non-integral address spaces) is already in place,
because we were already allowing vectors of pointers here, just
not plain pointers.
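So a case like the following now folds as well (a reduced sketch; the
exact shape depends on the surrounding patch):

  @g = global [2 x i64] zeroinitializer
  define ptr @f() {
    %p = load ptr, ptr @g
    ret ptr %p   ; folds to null, as vectors of pointers already did
  }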
The reinterpret load code will convert undef values into zero.
Check the uniform value case before it to produce a better result
for all-undef initializers.
However, the uniform value handling will return the uniform value
even if the access is out of bounds, while the reinterpret load
code will return undef. Add an explicit check to retain the
previous result in this case.
In particular, this also preserves undef when loading from padding,
rather than converting it to zero through a different codepath.
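For the all-undef case, a reduced sketch:

  @g = global i64 undef
  define i32 @f() {
    %v = load i32, ptr @g
    ret i32 %v   ; folds to undef; the reinterpret path returned 0
  }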
This is the remaining part of D115924.
There are a number of places that specially handle loads from a
uniform value where all the bits are the same (zero, one, undef,
poison), because we a) don't care about the load offset in that
case and b) it bypasses casts that might not be legal generally but
do work with uniform values.
We had multiple implementations of this, with a different set of
supported values each time. This replaces two usages with a more
complete helper. Other usages will be replaced separately, because
they have larger impact.
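For instance, an all-zero initializer folds regardless of offset or
result type (a reduced sketch):

  @g = global [8 x i8] zeroinitializer
  define float @f() {
    ; no legal bitcast from the i8 array is needed here:
    %v = load float, ptr getelementptr inbounds (i8, ptr @g, i64 3)
    ret float %v   ; folds to 0.0
  }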
This is part of D115924.
This folded (null + X) == g to false, but of course this is
incorrect if X == g.
Possibly this got confused with the null == g case, which is
already handled elsewhere.
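A sketch of the problematic pattern (the global and index are
illustrative):

  @g = global i8 0
  define i1 @f() {
    ; must not fold to false: the index may equal @g's address
    %c = icmp eq ptr getelementptr (i8, ptr null, i64 ptrtoint (ptr @g to i64)), @g
    ret i1 %c
  }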
This is now testing (null + g3) != g3 and still coming up with
"true" as the answer. The original case was a less obvious
miscompile with index overflow involved.
This fold is not correct, because indices might evaluate to zero
even if they are not a literal zero integer. Additionally, this
fold would be wrong (in the general case) for non-i8 types as well,
due to index overflow.
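To illustrate the overflow issue (assuming 64-bit pointers):

  @g = global i32 0
  define i1 @f() {
    ; 4611686018427387904 * 4 wraps to 0 mod 2^64, so the GEP is
    ; equal to @g even though the index is non-zero:
    %c = icmp eq ptr getelementptr (i32, ptr @g, i64 4611686018427387904), @g
    ret i1 %c   ; true; the old fold would have claimed false
  }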
Drop this fold and instead let the target-dependent constant
folder compute the actual offset and fold the comparison based
on that.
This fold is incorrect, because it assumes that all indices are
non-zero. This happens to be true for the test as written, but
doesn't hold if we use an extern weak global instead, for which
ptrtoint might be zero.
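A sketch of the extern weak case:

  @w = extern_weak global i8
  @g = global i8 0
  define i1 @f() {
    ; ptrtoint(@w) is not a literal zero, but is zero at run time if
    ; @w turns out to be undefined, so this must not fold to false:
    %c = icmp eq ptr getelementptr (i8, ptr @g, i64 ptrtoint (ptr @w to i64)), @g
    ret i1 %c
  }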
Add separate tests for the simple constant int case.
We can fold an equality or unsigned icmp between base+offset1 and
base+offset2 with inbounds offsets by comparing the offsets directly.
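Schematically, with symbolic offsets for illustration (the patch
itself folds the constant-expression form):

  define i1 @f(ptr %base, i64 %o1, i64 %o2) {
    %p1 = getelementptr inbounds i8, ptr %base, i64 %o1
    %p2 = getelementptr inbounds i8, ptr %base, i64 %o2
    %c = icmp ult ptr %p1, %p2
    ret i1 %c   ; equivalent to: icmp ult i64 %o1, %o2
  }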
This replaces a pair of specialized folds that tried to reason
based on the GEP structure instead. One of those folds was plain
wrong (because it does not account for negative offsets), while
the other is unnecessarily complicated and limited (e.g. it will
fail with bitcasts involved).
The disadvantage of this change is that it requires data layout,
so the fold is no longer performed by datalayout-independent
constant folding. I don't think this is a loss in practice, but
it does regress the ConstantExprFold.ll test, which checks folding
without running any passes.
Differential Revision: https://reviews.llvm.org/D116332
An inbounds GEP may still cross the sign boundary, so signed icmps
cannot be folded (https://alive2.llvm.org/ce/z/XSgi4D). This was
previously fixed for other folds in this function, but this one
was missed.
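That is, a comparison like the following must not be folded (sketch):

  define i1 @f(ptr %base) {
    ; not necessarily true: the object may straddle the signed
    ; midpoint of the address space
    %p = getelementptr inbounds i8, ptr %base, i64 1
    %c = icmp sgt ptr %p, %base
    ret i1 %c
  }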
This fixes the assertion failure reported at
https://reviews.llvm.org/D114889#3198921 with a straightforward
check, until the cleaner fix in D115924 can be reapplied.
This reverts commit 9fd4f80e33.
This breaks SingleSource/Regression/C/gcc-c-torture/execute/pr19687.c
in test-suite. Either the test is incorrect, or clang is generating
incorrect union initialization code. I've submitted
https://reviews.llvm.org/D115994 to fix the test, assuming my
interpretation is correct. Reverting this in the meantime as it
may take some time to resolve.
There are a number of places that specially handle loads from a
uniform value where all the bits are the same (zero, one, undef,
poison), because we a) don't care about the load offset in that
case and b) it bypasses casts that might not be legal generally
but do work with uniform values.
We had multiple implementations of this, with a different set of
supported values each time, as well as incomplete type checks in
some cases. In particular, this fixes the assertion reported in
https://reviews.llvm.org/D114889#3198921, as well as a similar
assertion that could be triggered via constant folding.
Differential Revision: https://reviews.llvm.org/D115924
Usually the case where the types are the same ends up being handled
fine because it's legal to do a trivial bitcast to the same type.
However, this is not true for aggregate types. Short-circuit the
whole code if the types match exactly to account for this.
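A reduced sketch of the aggregate case:

  @g = constant { i32, i32 } { i32 1, i32 2 }
  define { i32, i32 } @f() {
    ; same type as the initializer, but aggregates cannot be
    ; bitcast, so this needs the explicit short-circuit:
    %v = load { i32, i32 }, ptr @g
    ret { i32, i32 } %v   ; folds to { i32 1, i32 2 }
  }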
The test is switched to use -instsimplify as it is in the
InstSimplify directory. In this particular case InstCombine does
fold the load (in a very roundabout way), but InstSimplify does not.
This adjusts all the MVE and CDE intrinsics now that v2i1 is a legal
type, to use a <2 x i1> as opposed to emulating the predicate with a
<4 x i1>. The v4i1 workarounds have been removed leaving the natural
v2i1 types, notably in vctp64 which now generates a v2i1 type.
AutoUpgrade code has been added to upgrade old IR, which needs to
convert the old v4i1 to a v2i1 by converting it back and forth to an
integer with arm.mve.v2i and arm.mve.i2v intrinsics. These should be
optimized away in the final assembly.
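The round-trip looks roughly like this (full intrinsic names assumed
from the short names above):

  declare i32 @llvm.arm.mve.pred.v2i.v4i1(<4 x i1>)
  declare <2 x i1> @llvm.arm.mve.pred.i2v.v2i1(i32)

  define <2 x i1> @upgrade(<4 x i1> %old) {
    ; round-trip the old v4i1 predicate through an integer to
    ; obtain the new v2i1 form:
    %int = call i32 @llvm.arm.mve.pred.v2i.v4i1(<4 x i1> %old)
    %new = call <2 x i1> @llvm.arm.mve.pred.i2v.v2i1(i32 %int)
    ret <2 x i1> %new
  }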
Differential Revision: https://reviews.llvm.org/D114455