clang-p2996

Author	SHA1	Message	Date
Matt Arsenault	ed6dc62862	DAG: Handle equal size element build_vector promotion (#76213 )	2023-12-23 20:43:14 +07:00
Matt Arsenault	f7c3627338	DAG: Implement promotion for strict_fpextend (#74310 ) Test is a placeholder, will be merged into the existing test after additional bug fixes for illegal f16 targets are fixed.	2023-12-22 17:15:52 +07:00
Matt Arsenault	4d1cd38c95	DAG: Handle promotion of fcanonicalize This avoids a regression in a future commit	2023-12-22 12:50:18 +07:00
Craig Topper	0dcff0db3a	[RISCV] Add codegen support for experimental.vp.splice (#74688 ) IR intrinsics were already defined, but no codegen support had been added. I extracted this code from our downstream. Some of it may have come from https://repo.hca.bsc.es/gitlab/rferrer/llvm-epi/ originally.	2023-12-21 08:38:32 -08:00
yan zhou	cd09f4b951	[CodeGen] This patch fix a bug that may caused error for a self-defined target in SelectionDAG::getNode (#75320 ) we need first judge N1.getNumOperands() > 0. If Lowering Generated SDNode like. ``` v2i32 t20: TargetOpNode. i32 t21: extract_vector_elt t20 0 i32 t22: extract_vector_elt t20 1 ``` will cause a error.	2023-12-21 19:39:05 +07:00
Matt Arsenault	9e574a3936	DAG: Fix expansion of bf16 sourced extloads Also fix assorted vector extload failures for AMDGPU.	2023-12-20 19:24:27 +07:00
Jonas Paulsson	e32e147d6c	[DAGCombiner] Don't drop alignment info of original load. (#75626 ) Pass the original MMO instead of different individual values. getAlign() was used before where actually getOriginalAlign() would have been better, and this patch has the same effect.	2023-12-19 16:30:47 +01:00
Rin	0894c2ee5f	[DAGCombiner] Avoid the pre-truncate of BUILD_VECTOR sources. (#75792 ) Avoid the pre-truncate of BUILD_VECTOR sources when there is more than one use. This can avoid using unnecessary movs later down the instruction selection pipeline.	2023-12-19 15:25:38 +00:00
Wang Pengcheng	9348d437f5	[SelectionDAG] Add space-optimized forms of OPC_EmitRegister (#73291 ) The followed byte of `OPC_EmitRegister` is a MVT type, which is usually i32 or i64. We add `OPC_EmitRegisterI32` and `OPC_EmitRegisterI64` so that we can reduce one byte. Overall this reduces the llc binary size with all in-tree targets by about 10K.	2023-12-19 17:31:49 +08:00
Simon Pilgrim	7b1e4239b3	[DAG] Fold (vt trunc (extload (vt x))) -> (vt load x) (#75229 ) We were only folding cases which remained extloads, but DAG.getExtLoad can also handle the cases which don't need to extend at all (we just can't do truncloads). reduceLoadWidth can handle this for scalar loads, but not for vectors. Noticed while triaging D152928	2023-12-18 16:21:11 +00:00
Ulrich Weigand	82a1bffd34	[SelectionDAG] Do not crash on large integers in CheckInteger (#75787 ) The CheckInteger routine called from TableGen-generated selection logic uses getSExtValue - which will abort if the underlying APInt does not fit into an int64_t. This case is now triggered by the SystemZ back-end since i128 is a legal type on certain machines. While we do not have any regular instructions that take 128-bit immediates (like most other platforms), there are patterns in the .td files that recognize an i128 "xor ..., -1" as a "not". These patterns cause code to be generated that calls the CheckInteger routine on some i128-valued integer, which may trigger the assert. Fix by using trySExtValue instead. Fixes https://github.com/llvm/llvm-project/issues/75710	2023-12-18 14:03:57 +01:00
Philip Reames	e8a15eca92	[RISCV] Prefer whole register loads and stores when VL=VLMAX (#75531 ) If we're lowering a fixed length vector load or store which happens to exactly VLEN in size (when VLEN is exactly known), we can use a whole register load or store instead of the unit strided variants. This doesn't require a vsetvli in some cases, allows additional flexibility of vsetvli cases in others, and doesn't have a runtime dependency on the value of VL.	2023-12-15 09:26:57 -08:00
Craig Topper	2a21260ea8	[SelectionDAG] Use getVectorElementPointer in DAGCombiner::replaceStoreOfInsertLoad. (#74249 ) This ensures we clip the index to be in bounds of the vector we are inserting into. If the index is out of bounds the results of the insert element is poison. If we don't clip the index we can write memory that was not part of the original store. Fixes #74248 #75557.	2023-12-14 20:25:16 -08:00
Simon Pilgrim	39093102ca	[DAG] visitTRUNCATE - format (truncate (load x)) fold code. Reduces diff in #75229	2023-12-14 15:13:38 +00:00
Orlando Cazalet-Hyams	5ee088134f	[DebugInfo][RemoveDIs] Handle dbg.declares in SelectionDAGISel (#73496 ) This is a boring mechanical update to support DPValues that look like dbg.declares in SelectionDAG. The tests will become "live" once #74090 lands (see for more info).	2023-12-12 11:32:19 +00:00
Wang Pengcheng	714417455d	[SelectionDAG] Add OPC_MoveSibling (#73643 ) There are a lot of operations to move current node to parent and then move to another child. So `OPC_MoveSibling` and its space-optimized forms are added to do this "move to sibling" operations. These new operations will be generated when optimizing matcher in `ContractNodes`. Currently `MoveParent+MoveChild` will be optimized to `MoveSibling` and sequences `MoveParent+RecordChild+MoveChild` will be transformed into `MoveSibling+RecordNode`. Overall this reduces the llc binary size with all in-tree targets by about 30K.	2023-12-12 17:48:45 +08:00
Wang Pengcheng	0d5f1cc4d0	[SelectionDAG] Add space-optimized forms of OPC_EmitNode/OPC_MorphNodeTo (#73502 ) If there is only one bit set in EmitNodeInfo, then we can encode it implicitly to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 168K.	2023-12-12 17:45:32 +08:00
Wang Pengcheng	6111f5c592	[SelectionDAG] Add instantiated OPC_CheckChildType (#73297 ) The most common type is i32 or i64 so we add `OPC_CheckChildTypeI32` and `OPC_CheckChildTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 70K.	2023-12-12 17:31:12 +08:00
Wang Pengcheng	cbf1d58820	[SelectionDAG] Add space-optimized forms of OPC_EmitCopyToReg (#73293 ) These new opcodes implicitly indicate the RecNo. The old `OPC_EmitCopyToReg2` is renamed to `OPC_EmitCopyToRegTwoByte`. Overall this reduces the llc binary size with all in-tree targets by about 33K (most are from RISCV target).	2023-12-12 17:25:33 +08:00
Wang Pengcheng	50c174f99f	[SelectionDAG] Add space-optimized forms of OPC_EmitConvertToTarget (#73286 ) These new opcodes implicitly indicate the RecNo. Overall this reduces the llc binary size with all in-tree targets by about 13K.	2023-12-12 17:13:43 +08:00
Wang Pengcheng	e052c68869	[SelectionDAG] Add instantiated OPC_CheckType (#73283 ) The most common type is i32 or i64 so we add `OPC_CheckTypeI32` and `OPC_CheckTypeI64` to save one byte. Overall this reduces the llc binary size with all in-tree targets by about 29K.	2023-12-12 17:12:08 +08:00
Jay Foad	b3000ecb3c	[SelectionDAG] Fix typo in comment	2023-12-09 10:12:32 +00:00
Simon Pilgrim	faecc736e2	[DAG] isSplatValue - node is a splat if all demanded elts have the same whole constant value (#74443 )	2023-12-08 10:53:51 +00:00
Simon Pilgrim	f1200ca7ac	[DAG] visitEXTRACT_VECTOR_ELT - constant fold legal fp imm values (#74304 ) If we're extracting a constant floating point value, and the constant is a legal fp imm value, then replace the extraction with a fp constant.	2023-12-07 14:56:12 +00:00
Harald van Dijk	03edfe6148	Implement SoftPromoteHalf for FFREXP. (#74076 ) `llvm/test/CodeGen/RISCV/llvm.frexp.ll` and `llvm/test/CodeGen/X86/llvm.frexp.ll` contain a number of disabled tests for unimplemented functionality. This implements one missing part of it.	2023-12-07 11:10:17 +00:00
Simon Pilgrim	22df0886a1	[DAG] Don't split f64 constant stores if the fp imm is legal (#74622 ) If the target can generate a specific fp immediate constant, then don't split the store into 2 x i32 stores Another cleanup step for #74304	2023-12-07 10:33:03 +00:00
Philip Reames	a0dde7b8d1	[DAG] Remove (dead) legalization for atomic LoadSDNode and StoreSDNode This should have been part of `943f3e52` which removed the never completed migration code which added these. I left them out because I thought there was more generic SDAG code to cleanup, but I'd forgotten that SystemZ relied on custom legalizing ATOMIC_LOAD to (atomic) LoadSDNode. As a result, we still need the various legality checks on combines and the common infrastructure to suport them.	2023-12-05 12:19:10 -08:00
ZengZhijin	eaba81fd24	[SDAG] Count call argument attributes to reduce unnecessary extension (#73501 ) Count how often the value is with signext/zeroext calls when determining the preferred extension type.	2023-12-05 09:55:52 +01:00
Philip Reames	943f3e52a0	[X86] Remove x86-experimental-unordered-atomic-isel option and associated code This option enables an experimental lowering for unordered atomics I worked on a few years back. It never reached production quality, and hasn't been worked on in years. So let's rip it out. This wasn't a crazy idea, but I hit some stumbling block which prevented me from pushing it across the finish line. From the look of `027aa27`, that change description is probably a good summary. I don't remember the details any longer.	2023-12-04 20:03:38 -08:00
Vitaly Buka	7e3aeee3bf	[NFC][asan] Replace AsanInited/ENSURE_ASAN_INITED with TryAsanInitFromRtl (#74172 )	2023-12-04 14:56:21 -08:00
Craig Topper	5bc391a7c9	[SelectionDAG] Use getVectorElementPointer in DAGCombiner::replaceStoreOfInsertLoad. (#74249 ) This ensures we clip the index to be in bounds of the vector we are inserting into. If the index is out of bounds the results of the insert element is poison. If we don't clip the index we can write memory that was not part of the original store. Fixes #74248.	2023-12-04 11:11:37 -08:00
Wang Pengcheng	d49e9d88a2	[SelectionDAG][NFC] Replace C-style casting with static_cast (#74060 ) Fix comments in #73283.	2023-12-04 15:36:54 +08:00
Simon Pilgrim	625e1ecb7e	Fix MSVC signed/unsigned mismatch warning. NFC.	2023-12-01 18:34:01 +00:00
Ramkumar Ramachandra	9468de48fc	TargetInstrInfo: make getOperandLatency return optional (NFC) (#73769 ) getOperandLatency has the following behavior: it returns -1 as a special value, negative numbers other than -1 on some target-specific overrides, or a valid non-negative latency. This behavior can be surprising, as some callers do arithmetic on these negative values. Change the interface of getOperandLatency to return a std::optional<unsigned> to prevent surprises in callers. While at it, change the interface of getInstrLatency to return unsigned instead of int. This change was inspired by a refactoring in TargetSchedModel::computeOperandLatency.	2023-12-01 11:29:19 +00:00
Francesco Petrogalli	241076fb52	[SelectionDAG] Fix assertion for widening of mask operand in MSTORE. (#73295 ) According to the code in `SelectionDAG::getMaskedStore`, the Mask operator is in position 4, not 3: SDValue Ops[] = {Chain, Val, Base, Offset, Mask};	2023-11-29 12:56:30 +01:00
Simon Pilgrim	0fac9da734	[DAG] getNode() - relax (zext (trunc x)) -> x fold iff the upper bits are known zero. Just leave the (zext (trunc (and x, c))) pattern which is still being used to create some zext_inreg patterns.	2023-11-29 10:38:11 +00:00
Philip Reames	93e156833b	[DAG] Fix a miscompile in insert_subvector undef (insert_subvector undef, ..), idx combine (#73587 ) The combine was implicitly assuming that the index on the outer insert_subvector meant the same thing when the source was switched to be the index of the inner insert_subvector. This is not true if the innermost sub-vector is fixed, and the outer subvector is scalable. I could do a less restrictive fix here - i.e. allow the case where the scalability of the subvectors are the same - but there's no test coverage which shows this transform actually has profit. Given that, go for the simplest fix.	2023-11-27 16:45:29 -08:00
Youngsuk Kim	d8b8aa3a56	[llvm] Replace calls to Type::getPointerTo (NFC) Cleanup work towards removing the method Type::getPointerTo. If a call to Type::getPointerTo is used solely to support an unneeded pointer-cast, remove the call entirely.	2023-11-27 10:49:34 -06:00
Wang Pengcheng	2e6c01be0d	[SelectionDAG] Add instantiated OPC_EmitInteger and OPC_EmitStringInteger (#73241 ) These two opcodes are used to be followed by a MVT operand, which is always one of i8/i16/i32/i64. We add instantiated `OPC_EmitInteger` and `OPC_EmitStringInteger` with i8/i16/i32/i64 so that we can reduce one byte. We reserve `OPC_EmitInteger` and `OPC_EmitStringInteger` in case that we may need them someday, though I haven't found one usage after this change. Overall this reduces the llc binary size with all in-tree targets by about 200K.	2023-11-27 11:08:28 +08:00
Sander de Smalen	81b7f115fb	[llvm][TypeSize] Fix addition/subtraction in TypeSize. (#72979 ) It seems TypeSize is currently broken in the sense that: TypeSize::Fixed(4) + TypeSize::Scalable(4) => TypeSize::Fixed(8) without failing its assert that explicitly tests for this case: assert(LHS.Scalable == RHS.Scalable && ...); The reason this fails is that `Scalable` is a static method of class TypeSize, and LHS and RHS are both objects of class TypeSize. So this is evaluating if the pointer to the function Scalable == the pointer to the function Scalable, which is always true because LHS and RHS have the same class. This patch fixes the issue by renaming `TypeSize::Scalable` -> `TypeSize::getScalable`, as well as `TypeSize::Fixed` to `TypeSize::getFixed`, so that it no longer clashes with the variable in FixedOrScalableQuantity. The new methods now also better match the coding standard, which specifies that: * Variable names should be nouns (as they represent state) * Function names should be verb phrases (as they represent actions)	2023-11-22 08:52:53 +00:00
Yeting Kuo	a756a6b97e	[TargetLowering][RISCV] Introduce shouldFoldSelectWithSingleBitTest and RISC-V implement. (#72978 ) DAGCombiner folds (select_cc seteq (and x, y), 0, 0, A) to (and (sra (shl x)) A) where y has a single bit set. Previously, DAGCombiner relies on `shouldAvoidTransformToShift` to decide when to do the combine, but `shouldAvoidTransformToShift` is only about shift cost. This patch introuduces a specific hook to decide when to do the combine and disable the combine when Zicond enabled and AndMask <= 1024.	2023-11-22 08:22:14 +08:00
Jeremy Morse	4495485eb6	[DebugInfo][RemoveDIs] Interpret DPValue objects in SelectionDAG (#72253 ) DPValues are the non-intrinsic replacements for dbg.values, and when an IR function is converted by SelectionDAG we need to convert the variable location information in the same way. Happily all the information is in the same format, it's just stored in a slightly different object, therefore this patch refactors a few things to store the set of {Variable,Expr,DILocation,Location} instead of just a pointer to a DbgValueInst. This also adds a hook in llc that's much like the one I've added to opt in PR #71937, allowing tests to optionally ask for the use RemoveDIs mode if support for it is built into the compiler. I've added that flag to a variety of SelectionDAG debug-info tests to ensure that we get some coverage on the RemoveDIs / debug-info-iterator buildbot.	2023-11-21 17:10:25 +00:00
Min-Yih Hsu	0e24179797	[SelectionDAG] Add support to filter SelectionDAG dumps during ISel by function names (#72696 ) `-debug-only=isel-dump` is the new debug type for printing SelectionDAG after each ISel phase. This can be furthered filter by `-filter-print-funcs=<function names>`. Note that the existing `-debug-only=isel` will take precedence over the new behavior and print SelectionDAG dumps of every single function regardless of `-filter-print-funcs`'s values.	2023-11-20 14:00:47 -08:00
Simon Pilgrim	761a963dfc	[DAG] narrowExtractedVectorBinOp - ensure we limit late node creation to LegalOperations only (#72130 ) Avoids infinite issues in some upcoming patches to help D152928 - x86 sees a number of regressions that are addressed by extending SimplifyDemandedVectorEltsForTargetNode to cover more binop opcodes	2023-11-20 10:56:41 +00:00
Noah Goldstein	ed7c97e0ad	Recommit "[DAGCombiner] Transform `(icmp eq/ne (and X,C0),(shift X,C1))` to use rotate or to getter constants." (2nd Try) Added missing check that the mask and shift amount added up to correct bitwidth as well as test cases for the bug. Closes #71729	2023-11-19 12:15:04 -06:00
Simon Pilgrim	98efa8f9aa	[DAG] Fix ShrinkDemandedOp doxygen description to match behaviour. NFC. ShrinkDemandedOp checks for both isTruncateFree AND isZExtFree but extends with ANY_EXTEND.	2023-11-18 22:44:08 +00:00
David Stenberg	c093383ffa	[SelectionDAG] Fix crash for salvaging with indirect debug values (#72645 ) This is a follow-up to #68981, and fix for #72630, #72447. We may end up in SelectionDAG::salvageDebugInfo() with indirect debug values, and attempting to salvage ADD nodes with non-constant RHS would lead us to try to turn those indirect debug values variadic, which is not allowed. This triggered the following assert in the SDDbgValue constructor: Assertion `!(IsVariadic && IsIndirect)' failed. This also adds a lit test for salvaging when having an indirect debug value and constant RHS, as there seems like there was no such lit test. However, I am not sure if the use of the stack_value operation is correct in that case (which is existing behavior before #68981), but that at least documents the current behavior.	2023-11-18 17:01:27 +01:00
HaohaiWen	394bba766d	[CodeGen][DebugInfo] Add missing debug info for jump table BB (#71021 ) visitJumpTable is called on FinishBasicBlock. At that time, getCurSDLoc will always return SDLoc without DebugLoc since CurInst was set to nullptr after visiting each instruction. This patch passes SDLoc to buildJumpTable when visiting SwitchInst so that visitJumpTable can use it later.	2023-11-18 19:17:51 +08:00
Simon Pilgrim	de41396895	[DAG] foldABSToABD - add support for abs(sub(sign_extend_inreg(),sign_extend_inreg())) patterns Partial fix for ABDS regressions on D152928	2023-11-15 15:49:30 +00:00
Simon Pilgrim	9180b9f2be	[DAG] foldABSToABD - rename operand value types. NFC. Match operand variable naming.	2023-11-15 15:44:35 +00:00

1 2 3 4 5 ...

13222 Commits