clang-p2996

Author	SHA1	Message	Date
Luke Lau	275658d1af	[SelectionDAG] Implicitly truncate known bits in SPLAT_VECTOR Now that D139525 fixes the Hexagon infinite loop, the stopgap can be removed to provide more information about known bits in SPLAT_VECTOR whose operands are smaller than the bit width (which is most of the time) Reviewed By: reames Differential Revision: https://reviews.llvm.org/D141075	2023-01-06 15:43:47 +00:00
Roman Lebedev	62fc5f1640	[DAGCombiner] Add a most basic `combineShuffleToZeroExtendVectorInReg()` Sometimes we end up with a shuffles in DAG that would be better represented as a `ISD::ZERO_EXTEND_VECTOR_INREG`, and a failure to do so causes suboptimal codegen in a number of cases, especially when we will then cast vector to scalar. I acknowledge, the test changes here are rather underwhelming, but as with all of codegen, it's always a yak shawing, and this is the most stripped down version of the patch that shows some effect without having insurmountable amount of fallout to deal with. The next change resolves this regression. The transformation will be extended in follow-ups.	2022-12-26 22:54:03 +03:00
Roman Lebedev	1234754bbc	[DAGCombine] `BUILD_VECTOR` can not create undef or poison	2022-12-23 02:26:36 +03:00
Nemanja Ivanovic	cb3f415cd2	[PowerPC] Fix up memory ordering after combining BV to a load The combiner for BUILD_VECTOR that merges consecutive loads into a wide load had two issues: - It didn't check that the input loads all have the same input chain - It didn't update nodes that are chained to the original loads to be chained to the new load This caused issues with bootstrap when `3c4d2a0396` was committed. This patch fixes the issue so it can unblock this commit. Differential revision: https://reviews.llvm.org/D140046	2022-12-16 08:57:36 -06:00
Fangrui Song	67819a72c6	[CodeGen] llvm::Optional => std::optional	2022-12-13 09:06:36 +00:00
Kazu Hirata	f7dffc28b3	Don't include None.h (NFC) I've converted all known uses of None to std::nullopt, so we no longer need to include None.h. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 11:24:26 -08:00
OCHyams	1d1de7467c	[Assignment Tracking][Analysis] Add analysis pass The Assignment Tracking debug-info feature is outlined in this RFC: https://discourse.llvm.org/t/ rfc-assignment-tracking-a-better-way-of-specifying-variable-locations-in-ir Add initial revision of assignment tracking analysis pass --------------------------------------------------------- This patch squashes five individually reviewed patches into one: #1 https://reviews.llvm.org/D136320 #2 https://reviews.llvm.org/D136321 #3 https://reviews.llvm.org/D136325 #4 https://reviews.llvm.org/D136331 #5 https://reviews.llvm.org/D136335 Patch #1 introduces 2 new files: AssignmentTrackingAnalysis.h and .cpp. The two subsequent patches modify those files only. Patch #4 plumbs the analysis into SelectionDAG, and patch #5 is a collection of tests for the analysis as a whole. The analysis was broken up into smaller chunks for review purposes but for the most part the tests were written using the whole analysis. It would be possible to break up the tests for patches #1 through #3 for the purpose of landing the patches seperately. However, most them would require an update for each patch. In addition, patch #4 - which connects the analysis to SelectionDAG - is required by all of the tests. If there is build-bot trouble, we might try a different landing sequence. Analysis problem and goal ------------------------- Variables values can be stored in memory, or available as SSA values, or both. Using the Assignment Tracking metadata, it's not possible to determine a variable location just by looking at a debug intrinsic in isolation. Instructions without any metadata can change the location of a variable. The meaning of dbg.assign intrinsics changes depending on whether there are linked instructions, and where they are relative to those instructions. So we need to analyse the IR and convert the embedded information into a form that SelectionDAG can consume to produce debug variable locations in MIR. The solution is a dataflow analysis which, aiming to maximise the memory location coverage for variables, outputs a mapping of instruction positions to variable location definitions. API usage --------- The analysis is named `AssignmentTrackingAnalysis`. It is added as a required pass for SelectionDAGISel when assignment tracking is enabled. The results of the analysis are exposed via `getResults` using the returned `const FunctionVarLocs `'s const methods: const VarLocInfo single_locs_begin() const; const VarLocInfo single_locs_end() const; const VarLocInfo locs_begin(const Instruction Before) const; const VarLocInfo locs_end(const Instruction *Before) const; void print(raw_ostream &OS, const Function &Fn) const; Debug intrinsics can be ignored after running the analysis. Instead, variable location definitions that occur between an instruction `Inst` and its predecessor (or block start) can be found by looping over the range: locs_begin(Inst), locs_end(Inst) Similarly, variables with a memory location that is valid for their lifetime can be iterated over using the range: single_locs_begin(), single_locs_end() Further detail -------------- For an explanation of the dataflow implementation and the integration with SelectionDAG, please see the reviews linked at the top of this commit message. Reviewed By: jmorse	2022-12-09 16:17:37 +00:00
David Green	f4cc734a33	[DAG] Teach isConstOrConstSplat about SPLAT_VECTORs This teaches the DemandedElts version of isConstOrConstSplat about SPLAT_VECTORS, in the same way as the non-DemandedElts version by calling the demanded-bits version from the non-demanded-bits version. Differential Revision: https://reviews.llvm.org/D128919	2022-12-08 11:53:25 +00:00
Matt Arsenault	47c68904a5	DAG: ComputeNumSignBits from load range metadata The cases where the result type doesn't match the range type are inadequately tested, but I'm not sure how to write such a test. During the pre-legalize combine, any obviously optimizable code gets handled so it's harder to test legalized extloads.	2022-12-05 11:57:13 -05:00
Philip Reames	7969ab85e0	[SDAG] Allow scalable vectors in ComputeKnownBits (try 2) This was previously reverted due to a hang on a Hexagon bot. This turned out to be a bug in the Hexagon backend around how splat_vectors are legalized (which they're using for fixed length vectors!). I adjusted this patch to remove the implicit truncate support. This hides the hexagon bug for now, and unblocks the rest of the change. Original commit message: This is the SelectionDAG equivalent of D136470, and is thus an alternate patch to D128159. The basic idea here is that we track a single lane for scalable vectors which corresponds to an unknown number of lanes at runtime. This is enough for us to perform lane wise reasoning on many arithmetic operations. This patch also includes an implementation for SPLAT_VECTOR as without it, the lane wise reasoning has no base case. The original patch which inspired this (D128159), also included STEP_VECTOR. I plan to do that as a separate patch. Differential Revision: https://reviews.llvm.org/D137140	2022-12-05 08:52:37 -08:00
Kazu Hirata	9f252e5567	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:31:17 -08:00
Kazu Hirata	998960ee1f	[CodeGen] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 20:36:08 -08:00
Philip Reames	fc0efb7e78	[SDAG] Allow scalable vectors in ComputeNumSignBits (try 2) I had reverted this before the holiday week because a problem was reported with a related change (D137140 - scalable vector known bits in DAG). I had initially confused the two patches, and then decided to leave this reverted out an abundance of caution. Now that we're through the holiday week, reapplying. I also roled in fixes for several post commit review comments that hadn't landed with the original change. Original commit message This is a continuation of the series of patches adding lane wise support for scalable vectors in various knownbit-esq routines. The basic idea here is that we track a single lane for scalable vectors which corresponds to an unknown number of lanes at runtime. This is enough for us to perform lane wise reasoning on many arithmetic operations. Differential Revision: https://reviews.llvm.org/D137141	2022-11-29 08:25:05 -08:00
Simon Pilgrim	629f17c516	[DAG] isGuaranteedNotToBeUndefOrPoison - handle FrameIndex/TargetFrameIndex Fixes #58904	2022-11-22 18:16:15 +00:00
Philip Reames	6705d94005	Revert "[SDAG] Allow scalable vectors in ComputeKnownBits" This reverts commit `bc0fea0d55`. There was a "timeout for a Halide Hexagon test" reported. Revert until investigation complete.	2022-11-18 15:29:14 -08:00
Philip Reames	102f05bd34	Revert "[SDAG] Allow scalable vectors in ComputeNumSignBits" and follow up This reverts commits `3fb08d14a6` and `f8c63a7fbf`. There was a "timeout for a Halide Hexagon test" reported. Revert until investigation complete.	2022-11-18 15:25:59 -08:00
Philip Reames	3fb08d14a6	[SDAG] Address post commit review feedback from `f8c63a7f` The major change is falling through to ComputeKnownBits when we don't have an implementation of ComputeNumSignBits due to conservatism over scalable vectors. Right now, we're mostly conservative in the same cases, but this allows our results to improve when we change ComputeKnownBits without also needing to improve ComputeNumSignBits at the same time.	2022-11-18 12:30:10 -08:00
Philip Reames	f8c63a7fbf	[SDAG] Allow scalable vectors in ComputeNumSignBits This is a continuation of the series of patches adding lane wise support for scalable vectors in various knownbit-esq routines. The basic idea here is that we track a single lane for scalable vectors which corresponds to an unknown number of lanes at runtime. This is enough for us to perform lane wise reasoning on many arithmetic operations. Differential Revision: https://reviews.llvm.org/D137141	2022-11-18 10:50:06 -08:00
Philip Reames	bc0fea0d55	[SDAG] Allow scalable vectors in ComputeKnownBits his is the SelectionDAG equivalent of D136470, and is thus an alternate patch to D128159. The basic idea here is that we track a single lane for scalable vectors which corresponds to an unknown number of lanes at runtime. This is enough for us to perform lane wise reasoning on many arithmetic operations. This patch also includes an implementation for SPLAT_VECTOR as without it, the lane wise reasoning has no base case. The original patch which inspired this (D128159), also included STEP_VECTOR. I plan to do that as a separate patch. Differential Revision: https://reviews.llvm.org/D137140	2022-11-18 07:40:32 -08:00
YingChi Long	7a715bf317	[VP] Add support for vp.inttoptr & vp.ptrtoint Add vp.inttoptr & vp.ptrtoint support by lowering them into vp.zext / vp.truncate with in SelectionDAGBuilder. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137169	2022-11-18 10:42:24 +08:00
Philip Reames	4105794e66	[SDAG] Assert we don't see scalable VECTOR_SHUFFLES It was pointed out in review of D137140 that this case should be impossible. This patch converts an existing bailout into an assert instead.	2022-11-17 08:18:51 -08:00
Simon Pilgrim	a92f5a08a1	[DAG] simplifySelect - add support for vselect(0, T, F) -> F fold We still need to add handling for the non-zero T fold (which requires getBooleanContents handling)	2022-11-16 13:11:14 +00:00
Matt Arsenault	116c894d72	DAG: Fix assert on load casted to vector with attached range metadata AMDGPU legalizes i64 loads to loads of <2 x i32>, leaving the i64 MMO with attached range metadata alone. The known bit width was using the scalar element type, and asserting on a mismatch.	2022-11-15 23:28:55 -08:00
Simon Pilgrim	55a11b542e	[VectorUtils] Add getShuffleDemandedElts helper We have similar code to translate a demanded elements mask for a shuffle's operands in multiple places - this patch adds a helper function to VectorUtils and updates a number of locations to use it directly. Differential Revision: https://reviews.llvm.org/D136832	2022-10-30 17:03:55 +00:00
Haohai Wen	21f23a37c6	[SelectionDAG] Clamp stack alignment for memset, memmove memcpy has clamped dst stack alignment to NaturalStackAlignment if hasStackRealignment is false. We should also clamp stack alignment for memset and memmove. If we don't clamp, SelectionDAG may first do tail call optimization which requires no stack realignment. Then memmove, memset in same function may be lowered to load/store with larger alignment leading to PEI emit stack realignment code which is absolutely not correct. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D136456	2022-10-26 16:45:31 +08:00
Kazu Hirata	a1317be28d	[SelectionDAG] Use std::clamp (NFC)	2022-10-24 00:23:51 -07:00
Simon Pilgrim	913f08b74c	[DAG] Add freeze(sign/zero_extend_vector_inreg(x)) -> sign/zero_extend_vector_inreg(freeze(x)) folding	2022-10-23 12:19:42 +01:00
Simon Pilgrim	7511303c4f	[DAG] canCreateUndefOrPoison - add freeze(fsh(x,y,z)) -> fsh(freeze(x),freeze(y),freeze(z)) support The funnel-shift amount is always modulo, so won't introduce poison/undef	2022-10-22 18:39:52 +01:00
Simon Pilgrim	89111707ec	[DAG] canCreateUndefOrPoison - add freeze(rot(x,y)) -> rot(freeze(x),freeze(y)) support The rotation amount is always modulo, so won't introduce poison/undef	2022-10-22 17:24:53 +01:00
Sander de Smalen	02df03c5b7	[AArch64][SME] Add support for arm_locally_streaming functions. Functions with `aarch64_sme_pstatesm_body` will emit a SMSTART at the start of the function, and a SMSTOP at the end of the function, such that all operations use the right value for vscale. Because the placement of these nodes is critically important (i.e. no vscale-dependent operations should be done before SMSTART has been issued), we require glueing the CopyFromReg to the Entry node such that we can insert the SMSTART as part of that glued chain. More details about the SME attributes and design can be found in D131562. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D131582	2022-10-14 13:47:53 +00:00
Mirko Brkusanin	8b8463ef6c	[SelectionDAG] Use consistent type sizes for opcode	2022-10-12 17:33:04 +02:00
Philip Reames	487695e7c9	[SDAG] Treat DemandedElts argument to isSplatVector as splat for scalable vectors [nfc] The previous code used a APInt(1, 0) to represent the demanded elts of a scalable vector, and then ignored that argument if type was scalable. This was inconsistent with the UndefElts parameter which is set to either APInt(1, 0) or APInt(1,1) - that is, implicitly broadcast across all lanes. Particularly since the undef code relied on the DemandedElts parameter having bitwidth 1 to achieve that result! This change switches the demanded parameter to APInt(1,1), documents the broadcast semantics, and takes advantage of it to remove one special case for scalable vectors which is no longer required.	2022-10-11 09:49:28 -07:00
Philip Reames	ac4f3fff8c	[SDAG] Clarify behavior of scalable demanded/undef elts in isSplatValue [nfc] Update comment, and add an assertion to check property expected by sole (non-test) caller. Remove tests which appear to have been copied from fixed vector tests, and whose demanded bits don't correspond to the way this interface is otherwise used.	2022-10-11 07:28:34 -07:00
Philip Reames	04bb32e58a	[DAG] Extract helper for (neg x) [nfc] This is a frequently reoccurring pattern, let's factor it out. Differential Revision: https://reviews.llvm.org/D135301	2022-10-06 13:23:52 -07:00
Pierre van Houtryve	3ec0085c3f	[DAG] Update `isKnownNeverNaN` for `FMA/FMAD` We can still get a NaN even if none of the operands are NaN, e.g. from +inf/-inf. D50804 didn't catch that. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D134854	2022-10-06 06:52:36 +00:00
Philip Reames	a200b0fc25	[DAG] Introduce getSplat utility for common dispatch pattern [nfc] We have a very common pattern of dispatching between BUILD_VECTOR and SPLAT_VECTOR creation repeated in many cases in code. Common the pattern into a utility function.	2022-10-03 12:49:39 -07:00
Markus Böck	36af4c8418	[SelectionDAG] Fix use-after-free introduced in D130881 The code introduced in https://reviews.llvm.org/D130881 has a bug as it may cause a use-after-free error that can be caught by ASAN. The bug essentially boils down to iterator invalidation of `DenseMap`. The expression `SDEI[To] = I->second;` may cause `SDEI` to grow if `To` is inserted for the very first time. When that happens, all existing iterators to the map are invalidated as their backing storage has been freed. Accessing `I->second` is then invalid and attempts to access freed memory (as `I` is an iterator of `SDEI`). This patch fixes that quite simply by first making a copy of `I->second`, and then moving into the possibly newly inserted KV of the ` DenseMap`. No test attached as I am not sure it is practible to test. Differential revision: https://reviews.llvm.org/D135019	2022-10-03 15:09:14 +02:00
Petar Avramovic	1fa2019828	[SelectionDAG] Add check for BUILD_VECTOR in isKnownNeverNaN Includes handling of constants with vector type in isKnownNeverNaN. For AMDGPU results in not making fcanonicalize during legalization for vector inputs to fmaxnum_ieee and fminnum_ieee. Does not affect end result since there is a combine that eliminates fcanonicalize. Differential Revision: https://reviews.llvm.org/D88573	2022-10-03 12:47:07 +02:00
Simon Pilgrim	61dc5014ac	[DAG] Update foldSelectWithIdentityConstant to use llvm::isNeutralConstant D133866 added the llvm::isNeutralConstant helper to track neutral/passthrough constants This patch updates foldSelectWithIdentityConstant to use the helper instead of maintaining its own opcode handling Differential Revision: https://reviews.llvm.org/D134966	2022-09-30 17:46:52 +01:00
Yeting Kuo	1cc02b05b7	[SelectionDAG] Add helper function to check whether a SDValue is neutral element. NFC. Using this helper makes work about neutral elements more easier. Although I only find one case now, I think it will have more chance to be used since so many combine works are related to neutral elements. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D133866	2022-09-30 11:29:11 +08:00
Matt Devereau	0a4771a7e8	[AArch64][SVE] Expand gather index to 32 bits instead of 64 bits For gathers which load in 8 and 16 bit data then use that data as an index, the index can be extended to 32 bits instead of 64 bits Differential Revision: https://reviews.llvm.org/D130692	2022-09-28 14:42:12 +00:00
Marco Elver	4c58b00801	[SelectionDAG] Propagate PCSections through SDNodes Add a new entry to SDNodeExtraInfo to propagate PCSections through SelectionDAG. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130882	2022-09-07 11:22:50 +02:00
Marco Elver	7d63983c65	[SelectionDAG] Properly copy ExtraInfo on RAUW During SelectionDAG legalization SDNodes with associated extra info may be replaced with a new SDNode. Preserve associated extra info on ReplaceAllUsesWith and remove entries in DeallocateNode. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130881	2022-09-06 16:32:50 +02:00
Marco Elver	cc3faf4226	[SelectionDAG] Rename CallSiteDbgInfo to NodeExtraInfo For information infrequently attached to SDNodes, it is useful to provide a way to add this information out-of-line. This is already done for call-site specific information. Rename CallSiteDbgInfo to NodeExtraInfo in preparation of adding additional information not necessarily related to call sites only. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D130880	2022-09-06 16:32:50 +02:00
Simon Pilgrim	4e6783f866	[DAG] getFreeze()/getNode() - account for operand depth when calling isGuaranteedNotToBeUndefOrPoison (PR57554) Similar to #57402 - we were calling isGuaranteedNotToBeUndefOrPoison on the freeze operand (with Depth = 0), but wasn't accounting for the fact that a later isGuaranteedNotToBeUndefOrPoison assertion will call from the new node (with Depth = 0 as well) - which will then recursively call isGuaranteedNotToBeUndefOrPoison for its operands with Depth = 1 Fixes #57554	2022-09-05 11:46:46 +01:00
Simon Pilgrim	62cdfdab4d	[DAG] canCreateUndefOrPoison - add freeze(insert_subvector(x,y,c)) -> insert_subvector(freeze(x),freeze(y),c) support We already have plenty of assertions in place to ensure that the insertion index is constant and inrange	2022-09-03 13:41:33 +01:00
Simon Pilgrim	88c7b16bed	[DAG] Strip poison generating flags in freeze(op()) -> op(freeze()) fold This patch follows the InstCombine approach of stripping poison generating flags (nsw/nuw from add/sub etc.) to allow us to push a freeze() through the op. Unlike InstCombine it doesn't retain any flags, but we have plenty of DAG folds that do the same thing already. We assert that the newly generated op isGuaranteedNotToBeUndefOrPoison. Similar to the ValueTracking approach, isGuaranteedNotToBeUndefOrPoison has been updated to confirm that if an op can't create undef/poison and its operands are guaranteed not to be undef/poison - then its not undef/poison. This is just for the generic opcodes - target specific opcodes will need to do this manually just in case they have some special cases. Differential Revision: https://reviews.llvm.org/D132333	2022-08-26 11:47:51 +01:00
Paul Walker	96c8d615d6	[SVE] Extend findMoreOptimalIndexType so BUILD_VECTORs do not force 64bit indices. Extends findMoreOptimalIndexType to allow ISD::BUILD_VECTOR based indices to be truncated when such truncation is lossless. This can enable the use of 32bit gather/scatter indices thus making it less likely to have to split a gather/scatter in two. Depends on D125194 Differential Revision: https://reviews.llvm.org/D130533	2022-08-18 18:00:53 +01:00
wanglian	fbc4c26e9a	[SelectionDAG][NFC] Fix return type when used isConstantIntBuildVectorOrConstantInt and isConstantFPBuildVectorOrConstantFP Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D131870	2022-08-16 10:07:24 +08:00
Simon Pilgrim	3a73133217	[DAG] canCreateUndefOrPoison - add freeze(sign_extend_inreg(x,vt)) -> sign_extend_inreg(freeze(x),vt) support Guaranteed not to create undef/poison	2022-08-15 12:18:59 +01:00

1 2 3 4 5 ...

2347 Commits