clang-p2996

Author	SHA1	Message	Date
Craig Topper	1153313c33	[LocalStackSlotAllocation] Minor simplifications. NFC Instead of maintaining a separate valid flag for BaseReg, Use BaseReg.isValid(). I think this is left over from an older implementation that maintained a vector of base registers. The other change is not do a speculative assignment to BaseOffset that needs to be reverted. Only commit it after we do the check. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D141153	2023-01-09 09:45:22 -08:00
Tim Northover	5b24d42106	TailDuplication: do not remove trivial PHIs from addr-taken blocks. Unlike an anonymous block, it will not be removed even though we've resolved all valid paths to get here. So removing a PHI can leave vregs with no definition, violating SSA. Instead, this converts it to an IMPLICIT_DEF.	2023-01-09 11:12:33 +00:00
Dmitri Gribenko	05d722a11d	[llvm] Fix an "unused variable" warning when assertions are disabled	2023-01-09 11:33:05 +01:00
Stephen Tozer	da0faa0594	[DebugInfo] Produce variadic DBG_INSTR_REFs from ISel This patch modifies SelectionDAG and FastISel to produce DBG_INSTR_REFs with variadic expressions, and produce DBG_INSTR_REFs for debug values with variadic location expressions. The former essentially means just prepending DW_OP_LLVM_arg, 0 to the existing expression. The latter is achieved in MachineFunction::finalizeDebugInstrRefs and InstrEmitter::EmitDbgInstrRef. Reviewed By: jmorse, Orlando Differential Revision: https://reviews.llvm.org/D133929	2023-01-09 08:58:33 +00:00
Serguei Katkov	fd64bd94ed	[Inline Spiller] Extend the snippet by statepoint uses Snippet is a tiny live interval which has copy or fill like def and copy or spill like use at the end (any of them might abcent). Snippet has only one use/def inside interval and interval is located in one basic block. When inline spiller spills some reg around uses it also forces the spilling of connected snippets those which got by splitting the same original reg and its def is a full copy of our reg or its last use is a full copy to our reg. The definition of snippet is extended to allow not only one use/def but more. However all other uses are statepoint instructions which will fold fill into its operand. That way we do not introduce new fills/spills. Reviewed By: qcolombet, dantrushin Differential Revision: https://reviews.llvm.org/D138093	2023-01-09 13:30:57 +07:00
Kazu Hirata	e0e48187e6	[CodeGen] Fix a warning This patch fixes: llvm/lib/CodeGen/AssignmentTrackingAnalysis.cpp:1220:13: error: unused function 'locStr' [-Werror,-Wunused-function]	2023-01-08 16:31:45 -08:00
Ayke van Laethem	0408b131eb	[SelectionDAG][AVR] Add support for lrint and lround intrinsics Integer legalization already supported splitting the output integer of llround and llrint, but did not support this for lround and lrint yet. This is not a problem for 32-bit architectures, but for 8/16-bit architectures like AVR it results in a crash like this: ExpandIntegerResult #0: t7: i32 = lround t6 LLVM ERROR: Do not know how to expand the result of this operator! This patch simply add lrint/lround to the list of ISD opcodes to expand. Fixes https://github.com/llvm/llvm-project/issues/59573. Differential Revision: https://reviews.llvm.org/D140822	2023-01-08 18:56:07 +01:00
Benjamin Kramer	b6942a2880	[NFC] Hide implementation details in anonymous namespaces	2023-01-08 17:37:02 +01:00
Stephen Tozer	a344c9073c	[DebugInfo] Add support for variadic DBG_INSTR_REFs in LiveDebugValues Following support from the previous patches in this stack being added for variadic DBG_INSTR_REFs to exist, this patch modifies LiveDebugValues to handle those instructions. Support already exists for DBG_VALUE_LISTs, which covers most of the work needed to handle these instructions; this patch only modifies the transferDebugInstrRef function to correctly track them. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D133927	2023-01-06 23:01:25 +00:00
Stephen Tozer	c383f4d655	[DebugInfo] Allow non-stack_value variadic expressions and use in DBG_INSTR_REF Prior to this patch, variadic DIExpressions (i.e. ones that contain DW_OP_LLVM_arg) could only be created by salvaging debug values to create stack value expressions, resulting in a DBG_VALUE_LIST being created. As of the previous patch in this patch stack, DBG_INSTR_REF's syntax has been changed to match DBG_VALUE_LIST in preparation for supporting variadic expressions. This patch adds some minor changes needed to allow variadic expressions that aren't stack values to exist, and allows variadic expressions that are trivially reduceable to non-variadic expressions to be handled similarly to non-variadic expressions. Reviewed by: jmorse Differential Revision: https://reviews.llvm.org/D133926	2023-01-06 19:31:10 +00:00
James Y Knight	648ce3d358	Cleanup unwind table emission code a bit. This change removes the `tidyLandingPads` function, which previously had a few responsibilities: 1. Dealing with the deletion of an invoke, after MachineFunction lowering. 2. Dealing with the deletion of a landing pad BB, after MachineFunction lowering. 3. Cleaning up the type-id list generated by `MachineFunction::addLandingPad`. Case 3 has been fixed in the generator, and the others are now handled during table emission. This change also removes `MachineFunction`'s `addCatchTypeInfo`, `addFilterTypeInfo`, and `addCleanup` helper fns, as they had a single caller, and being outlined didn't make it simpler. Finally, as calling `tidyLandingPads` was effectively the only thing `DwarfCFIExceptionBase` did, that class has been eliminated.	2023-01-06 13:53:10 -05:00
James Y Knight	1ae36b1387	Remove special cases for invoke of non-throwing inline-asm. Non-throwing inline asm infers the nounwind attribute in instcombine. Thus, it can be handled in the same manner as non-throwing target functions are generally. Further special casing is unnecessary complexity.	2023-01-06 13:53:10 -05:00
Stephen Tozer	e10e936315	[DebugInfo][NFC] Add new MachineOperand type and change DBG_INSTR_REF syntax This patch makes two notable changes to the MIR debug info representation, which result in different MIR output but identical final DWARF output (NFC w.r.t. the full compilation). The two changes are: * The introduction of a new MachineOperand type, MO_DbgInstrRef, which consists of two unsigned numbers that are used to index an instruction and an output operand within that instruction, having a meaning identical to first two operands of the current DBG_INSTR_REF instruction. This operand is only used in DBG_INSTR_REF (see below). * A change in syntax for the DBG_INSTR_REF instruction, shuffling the operands to make it resemble DBG_VALUE_LIST instead of DBG_VALUE, and replacing the first two operands with a single MO_DbgInstrRef-type operand. This patch is the first of a set that will allow DBG_INSTR_REF instructions to refer to multiple machine locations in the same manner as DBG_VALUE_LIST. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D129372	2023-01-06 18:03:48 +00:00
Luke Lau	275658d1af	[SelectionDAG] Implicitly truncate known bits in SPLAT_VECTOR Now that D139525 fixes the Hexagon infinite loop, the stopgap can be removed to provide more information about known bits in SPLAT_VECTOR whose operands are smaller than the bit width (which is most of the time) Reviewed By: reames Differential Revision: https://reviews.llvm.org/D141075	2023-01-06 15:43:47 +00:00
Guillaume Chatelet	87b6b347fc	Revert D141134 "[NFC] Only expose getXXXSize functions in TypeSize" The patch should be discussed further. This reverts commit `dd56e1c92b`.	2023-01-06 15:27:50 +00:00
Guillaume Chatelet	dd56e1c92b	[NFC] Only expose getXXXSize functions in TypeSize Currently 'TypeSize' exposes two functions that serve the same purpose: - getFixedSize / getFixedValue - getKnownMinSize / getKnownMinValue source : `bf82070ea4/llvm/include/llvm/Support/TypeSize.h (L337-L338)` This patch offers to remove one of the two and stick to a single function in the code base. Differential Revision: https://reviews.llvm.org/D141134	2023-01-06 15:24:52 +00:00
Sanjay Patel	bf82070ea4	[SDAG] try to avoid multiply for X*Y==0 Forking this off from D140850 - https://alive2.llvm.org/ce/z/TgBeK_ https://alive2.llvm.org/ce/z/STVD7d We could almost justify doing this in IR, but consideration for "minsize" requires that we only try it in codegen -- the transform is not reversible. In all other cases, avoiding multiply should be a win because a mul is more expensive than simple/parallelizable compares. AArch even has a trick to keep instruction count even for some types. Differential Revision: https://reviews.llvm.org/D141086	2023-01-06 09:06:11 -05:00
OCHyams	042107494d	[DebugInfo][NFC] Rename is/setUndef to is/setKilllocation These names better reflect the semantics and also the implementation, since it's not just "undef" operands that are sentinels used to signal that the debug intrinsic terminates dominating locations definitions. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D140903	2023-01-06 09:15:02 +00:00
Yeting Kuo	5a57ebcc43	[VP][RISCV] Add vp.abs and RISC-V support. RISC-V uses ISD::ABS lower method (abs x) -> (smax_vl x (sub_vl 0, x)) for ISD::VP_ABS. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D141033	2023-01-06 15:18:12 +08:00
Josh Stone	87f57f459e	[RegAllocFast] Handle new debug values for spills These new debug values get inserted after the place where the spill happens, which means they won't be reached by the reverse traversal of basic block instructions. This would crash or fail assertions if they contained any virtual registers to be replaced. We can manually handle the new debug values right away to resolve this. Fixes https://github.com/llvm/llvm-project/issues/59172 Reviewed By: StephenTozer Differential Revision: https://reviews.llvm.org/D139590	2023-01-05 20:41:11 -08:00
Fangrui Song	2aedfdd9b8	[CodeGen] Default TargetOptions::RelaxELFRelocations to true MC and lld/ELF defaults were flipped in 2016. For Clang: CMake ENABLE_X86_RELAX_RELOCATIONS defaults to on in 2020. It makes sense for the TargetOptions default to be true now. R_X86_64_GOTPCRELX/R_X86_64_REX_GOTPCRELX require GNU ld newer than 2015-10 (subsumed by the current requirement of -fbinutils-version=). This should fix `rustc -Z plt=no` PIC relocatable files with GNU ld. (See https://github.com/rust-lang/rust/pull/106380)	2023-01-05 13:28:48 -08:00
Luke Drummond	108766fc7e	Fix typos I found one typo of "implemnt", then some more. s/implemnt/implement/g	2023-01-05 18:49:23 +00:00
Craig Topper	11e92bd61f	[SelectionDAG] Improve codegen for udiv by constant if any divisors are 1. If the divisor is 1, the magic algorithm does not return a correct result and we end up using a select to pick the numerator for those elements at the end. Therefore we can use undef for that element of the earlier operations when the divisor is 1. We sometimes get this through SimplifyDemandedVectorElts, but not always. Definitely seems like we don't if the NPQ fixup is used. Unfortunately, DAGCombiner is unable to fold srl X, <0, undef> to X so I had to add flags to avoid emitting the srl unless one of the shift amounts is non-zero. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D141022	2023-01-05 08:41:44 -08:00
OCHyams	eebfee8f9e	[DebugInfo][SelectionDAGISel] Do not drop all dbg.declares if one with empty metadata is found This error was introduced in `1d1de7467c` (by me) about 1 month ago. Found while testing the D140901 patch stack. Reviewed By: jryans Differential Revision: https://reviews.llvm.org/D141052	2023-01-05 15:36:50 +00:00
serge-sans-paille	38818b60c5	Move from llvm::makeArrayRef to ArrayRef deduction guides - llvm/ part Use deduction guides instead of helper functions. The only non-automatic changes have been: 1. ArrayRef(some_uint8_pointer, 0) needs to be changed into ArrayRef(some_uint8_pointer, (size_t)0) to avoid an ambiguous call with ArrayRef((uint8_t), (uint8_t)) 2. CVSymbol sym(makeArrayRef(symStorage)); needed to be rewritten as CVSymbol sym{ArrayRef(symStorage)}; otherwise the compiler is confused and thinks we have a (bad) function prototype. There was a few similar situation across the codebase. 3. ADL doesn't seem to work the same for deduction-guides and functions, so at some point the llvm namespace must be explicitly stated. 4. The "reference mode" of makeArrayRef(ArrayRef<T> &) that acts as no-op is not supported (a constructor cannot achieve that). Per reviewers' comment, some useless makeArrayRef have been removed in the process. This is a follow-up to https://reviews.llvm.org/D140896 that introduced the deduction guides. Differential Revision: https://reviews.llvm.org/D140955	2023-01-05 14:11:08 +01:00
Diana Picus	6ee4f253b2	[GlobalISel] Add G_BUILD_VECTOR[_TRUNC] to CSE Add G_BUILD_VECTOR and G_BUILD_VECTOR_TRUNC to the list of opcodes in `shouldCSEOpc`. This simplifies the code generated for vector splats. Differential Revision: https://reviews.llvm.org/D140965	2023-01-05 10:15:31 +01:00
Diana Picus	22924bd48d	[GlobalISel] Don't switch opcodes in MIRBuilder::buildInstr At the moment, `MachineIRBuilder::buildInstr` may build an instruction with a different opcode than the one passed in as parameter. This may cause confusion for its consumers, such as `CSEMIRBuilder`, which will memoize the instruction based on the new opcode, but will search through the memoized instructions based on the original one (resulting in missed CSE opportunities). This is all the more unpleasant since buildInstr is virtual and may call itself recursively both directly and via buildCast, so it's not always easy to follow what's going on. This patch simplifies the API of `MachineIRBuilder` so that the `buildInstr` method does the least surprising thing (i.e. builds an instruction with the specified opcode) and only the convenience `buildX` methods (`buildMerge` etc) are allowed freedom over which opcode to use. This can still be confusing (e.g. one might write a unit test using `buildBuildVectorTrunc` but instead get a plain `G_BUILD_VECTOR`), but at least it's explained in the comments. In practice, this boils down to 3 changes: * `buildInstr(G_MERGE_VALUES)` will no longer call itself with `G_BUILD_VECTOR` or `G_CONCAT_VECTORS`; this functionality is moved to `buildMerge` and replaced with an assert; * `buildInstr(G_BUILD_VECTOR_TRUNC)` will no longer call itself with `G_BUILD_VECTOR`; this functionality is moved to `buildBuildVectorTrunc` and replaced with an assert; * `buildInstr(G_MERGE_VALUES)` will no longer call `buildCast` and will instead assert if we're trying to merge a single value; no change is needed in `buildMerge` since it was already asserting more than one source operand. This change is NFC for users of the `buildX` methods, but users that call `buildInstr` with relaxed parameters will have to update their code (such instances will hopefully be easy to find thanks to the asserts). Differential Revision: https://reviews.llvm.org/D140964	2023-01-05 10:02:39 +01:00
Craig Topper	f8751b8ee6	[TargetLowering] Remove stale FIXME. NFC This was implemented for scalars in D140750.	2023-01-04 18:40:42 -08:00
Craig Topper	3f749a5d9d	[Support][SelectionDAG][GlobalISel] Hoist PostShift adjustment for IsAdd into UnsignedDivideUsingMagic. Instead of doing the adjustment in 3 different places in the code base, do it inside UnsignedDivideUsingMagic::get. Differential Revision: https://reviews.llvm.org/D141014	2023-01-04 15:18:12 -08:00
Roman Lebedev	2a43a4478c	[NFCI][DAGCombiner] `foldExtendVectorInregToExtendOfSubvector()`: just build new VT Changing element type seems to not play well with non-simple types, even though we are operating on EVT's here.	2023-01-05 01:33:24 +03:00
Roman Lebedev	41005b7ab2	[DAGCombiner] Do try to combine `ISD::ANY_EXTEND_VECTOR_INREG` nodes These weren't previously getting combined at all here, only in target-specific combines.	2023-01-05 01:12:31 +03:00
Roman Lebedev	317a1adfe4	[DAGCombiner] Fold _EXTEND_INREG of one of CONCAT_VECTORS operands into _EXTEND of operand This appears to be the root problematic pattern for AArch64 regression in D140677. We already do this, and many more, as target-specific X86 combines, so this isn't causing much of an impact.	2023-01-05 01:12:31 +03:00
Roman Lebedev	846d06c707	[DAG] `tryToFoldExtendOfConstant()`: `sext undef` is not `undef` https://alive2.llvm.org/ce/z/cLGpWV, but https://alive2.llvm.org/ce/z/TGNH4P	2023-01-04 22:42:43 +03:00
Philip Reames	9560ac3a25	[MachineCombine] Reorganize code for readability and tracing [nfc]	2023-01-04 10:47:39 -08:00
Craig Topper	8bca60fb0a	[SelectionDAG][GlobalISel] Don't use UnsignedDivisionByConstantInfo for divisor of 1. The magic algorithm sets IsAdd indication for division by 1 that the caller had to ignore. I considered folding the ignore into UnsignedDivisionByConstantInfo, but we only allow 1 for vectors of mixed visiors. And really what we want to end up with is undef. Currently, we get to undef via DemandedElts optimizations using the select instruction. We could directly emit undef. Differential Revision: https://reviews.llvm.org/D140940	2023-01-04 10:01:15 -08:00
Jay Foad	6f7ff9b933	[MC] Consistently use MCInstrDesc::getImplicitUses and getImplicitDefs. NFC.	2023-01-04 13:16:12 +00:00
Yeting Kuo	1e9e1b9cf8	[VP][RISCV] Add vp.ctlz/cttz and RISC-V support. The patch also adds expandVPCTLZ and expandVPCTTZ to expand vp.ctlz/cttz nodes and the cost model of vp.ctlz/cttz. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140370	2023-01-04 15:15:01 +08:00
Craig Topper	84daed7fd4	[SelectionDAG][GlobalISel] Move even divisor optimization for division by constant into UnsignedDivideUsingMagic implementation. NFC I've added a bool to UnsignedDivideUsingMagic so we can continue testing it in the unit test with and without this optimization in the unit test. This is a step towards supporting "uncooperative" odd divisors. See https://ridiculousfish.com/blog/posts/labor-of-division-episode-iii.html Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D140924	2023-01-03 16:34:13 -08:00
Samuel Parker	615333bc09	[TypePromotion] NewPM support. Differential Revision: https://reviews.llvm.org/D140893	2023-01-03 15:09:29 +00:00
chenglin.bi	a0b470c984	[TypePromotion] Add truncate in ConvertTruncs when the original truncate type is not extend type If the src type is not extend type, after convert the truncate to and we need to truncate the and also to make sure the all user is legal. The old fix D137613 doesn't work when the truncate convert to and have the other users. So this time I try to add the truncate after and to avoid all these potential issues. Fix: #59554 Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D140869	2023-01-03 18:13:20 +08:00
Roman Lebedev	4fc417ec37	[DAGCombiner] `convertBuildVecZextToBuildVecWithZeros()`: rework split factor calculation The original computation was both making assumptions that do not hold in practice, and being overly pessimistic. We should just check every possible split factor, and pick the best one. Fixes https://github.com/llvm/llvm-project/issues/59781	2023-01-02 18:34:35 +03:00
Roman Lebedev	1337821f11	[DAGCombiner][X86] Fold a CONCAT_VECTORS of SHUFFLE_VECTOR and it's operand into wider SHUFFLE_VECTOR This was showing as a source of many regressions with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.	2023-01-01 23:18:42 +03:00
Roman Lebedev	16facf1ca6	[DAGCombiner][TLI] Do not fuse bitcast to <1 x ?> into a load/store of a vector Single-element vectors are legalized by splitting, so the the memory operations would also get scalarized. While we do have some support to reconstruct scalarized loads, we clearly don't catch everything. The comment for the affected AArch64 store suggests that having two stores was the desired outcome in the first place. This was showing as a source of many regressions with more aggressive ZERO_EXTEND_VECTOR_INREG recognition.	2022-12-31 03:49:43 +03:00
Roman Lebedev	603e849072	[NFC][TLI] Move `isLoadBitCastBeneficial()` implementation into source file ... so any change to it does not cause 700 source files to be recompiled.	2022-12-31 02:07:50 +03:00
Roman Lebedev	e4d25a9c23	[DAG] BUILD_VECTOR: absorb ZERO_EXTEND of a single first operand if all other ops are zeros This kind of pattern seems to come up as regressions with better ZERO_EXTEND_VECTOR_INREG recognition. For initial implementation, this is quite restricted to the minimal viable transform, otherwise there are too many regressions to be dealt with.	2022-12-31 00:58:11 +03:00
Vitaly Buka	6f3400e380	Revert "[CodeGen] Temporarily disable-lsr in HWASAN build" We can do the same with cmake on the bot. This reverts commit `8f70b848d3`.	2022-12-30 10:57:49 -08:00
Filipp Zhinkin	98265db84c	[ScheduleDAG] Support REQ_SEQUENCE unscheduling REG_SEQUENCE node requires special treatment during the unscheduling because the node is untyped and neither its class, nor cost could be retrieved the same way as for typed nodes. Related issue: https://github.com/llvm/llvm-project/issues/58911 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D138837	2022-12-30 15:17:11 +04:00
Vitaly Buka	8f70b848d3	[CodeGen] Temporarily disable-lsr in HWASAN build HWASAN exposes some non-determinism in the pass and triggers: ScalarEvolution.cpp:11540: bool llvm::ScalarEvolution::isLoopEntryGuardedByCond(const Loop , ICmpInst::Predicate, const SCEV , const SCEV *): Assertion `isAvailableAtLoopEntry(LHS, L) && "LHS is not available at Loop Entry"' failed. E.g. https://lab.llvm.org/buildbot/#/builders/236/builds/1629/steps/16/logs/stdio is broken after D137838. I tried to split D137838 into smaller patches and the one which reproduced was just a move of cpp from one dir to another. Maybe it has something do to with comparison of tagged pointeres and PtrSets used in pass. Issues is hard to reproduce, even slight changes in path, or preprocessing cpp file hide it.	2022-12-29 23:37:49 -08:00
Dmitry Borisenkov	0ec51a460a	DAG: Prevent store value forwarding to distinct addrspace load DAGCombiner replaces (load const_addr1) directly chained with (store (val, const_addr2)) with val if address space stripped const_addr1 == const_addr2. The patch fixes the issue by checking address spaces as well. However, it might makes sense to not to chain together side effects that belong to different address spaces in the first place and make SelectionDAG::root address space aware.	2022-12-29 18:19:55 -05:00
Roman Lebedev	248567a327	[DAGCombiner] Try to partition ISD::EXTRACT_VECTOR_ELT to accomodate it's ISD::BUILD_VECTOR users This mainly cleans up a few patterns that are legalized by scalarization from a wide-element vector, but then are further split apart to build a more narrow-sized-element vector. In particular this happens in some cases for illegal ISD::ZERO_EXTEND_VECTOR_INREG. Given a ISD::EXTRACT_VECTOR_ELT, which is a glorified bit sequence extract, recursively analyse all of it's users. and try to model themselves as bit sequence extractions. If all of them agree on the new, narrower element type, and all of them can be modelled as ISD::EXTRACT_VECTOR_ELT's of that new element type, do that, but only if unmodelled users are ISD::BUILD_VECTOR.	2022-12-30 01:15:53 +03:00

1 2 3 4 5 ...

33455 Commits