clang-p2996

Author	SHA1	Message	Date
OCHyams	4ece50737d	[Assignment Tracking][NFC] Replace LLVM command line option with a module flag Remove LLVM flag -experimental-assignment-tracking. Assignment tracking is still enabled from Clang with the command line -Xclang -fexperimental-assignment-tracking which tells Clang to ask LLVM to run the pass declare-to-assign. That pass converts conventional debug intrinsics to assignment tracking metadata. With this patch it now also sets a module flag debug-info-assignment-tracking with the value `i1 true` (using the flag conflict rule `Max` since enabling assignment tracking on IR that contains only conventional debug intrinsics should cause no issues). Update the docs and tests too. Reviewed By: CarlosAlbertoEnciso Differential Revision: https://reviews.llvm.org/D142027	2023-01-20 14:24:15 +00:00
Paul Kirth	af9a452e57	[llvm][codegen] Fix non-determinism in StackFrameLayoutAnalysisPass output We were iterating over a SmallPtrSet when outputting slot variables. This is still correct but made the test fail under reverse iteration. This patch replaces the SmallPtrSet with a SmallVector. Also remove the "Stack Frame Layout" lines from arm64-opt-remarks-lazy-bfi test, since those also break under reverse iteration. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D142127	2023-01-19 20:04:14 +00:00
Amaury Séchet	7e5681cf29	[DAG] Peek through ZEXT/TRUNC in foldAddSubMasked1 Fix a regression in D141883 Depends on D141883 Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D141884	2023-01-19 13:23:42 +00:00
Amaury Séchet	2826869d7b	[DAG] Do not combine any_ext when we combine and into zext. This transofrm loses information that can be useful for other transforms. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D141883	2023-01-19 12:37:05 +00:00
Paul Kirth	557a5bc336	[codegen] Add StackFrameLayoutAnalysisPass Issue #58168 describes the difficulty diagnosing stack size issues identified by -Wframe-larger-than. For simple code, its easy to understand the stack layout and where space is being allocated, but in more complex programs, where code may be heavily inlined, unrolled, and have duplicated code paths, it is no longer easy to manually inspect the source program and understand where stack space can be attributed. This patch implements a machine function pass that emits remarks with a textual representation of stack slots, and also outputs any available debug information to map source variables to those slots. The new behavior can be used by adding `-Rpass-analysis=stack-frame-layout` to the compiler invocation. Like other remarks the diagnostic information can be saved to a file in a machine readable format by adding -fsave-optimzation-record. Fixes: #58168 Reviewed By: nickdesaulniers, thegameg Differential Revision: https://reviews.llvm.org/D135488	2023-01-19 01:51:14 +00:00
Krzysztof Drewniak	5d98dc7124	[llvm][GenericUniformity] Hack around strict is_invocable() checks With recent (> 15, as far as I can tell, possibly > 16) clang, c++17, and GNU's libstdc++ (versions 9 and 10 and maybe others), LLVM fails to compile due to an is_invocable() check in unique_ptr::reset(). To resolve this issue, add a template argument to ImplDeleter to make things work. Differential Revision: https://reviews.llvm.org/D141865	2023-01-18 19:56:42 +00:00
Roman Lebedev	7460842fb2	[DAGCombiner] `combineShuffleOfSplatVal()`: don't assert that shuffle is non-undef As per the test case from Steven Johnson in https://reviews.llvm.org/rGf8d9097168b7#1165311 we can indeed encounter such shuffles, that produce all-undef after folding, before something else manages to optimize them away.	2023-01-18 18:45:08 +03:00
Denis Antrushin	f845076738	[StatepointLowering] Move statepoint correctness checks to Verifier. NFC. Since D140504, GCStrategy is available for use in opt. Now we can move statepoint correctness checks from StatepointLowering.cpp to Verifier. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D141948	2023-01-18 18:14:58 +03:00
Haojian Wu	9936064d66	Remove an unused variable, NFC	2023-01-18 13:08:21 +01:00
Simon Pilgrim	73cdbbea02	[DAG] combineInsertEltToShuffle - split off mergeInsertEltWithShuffle fold. NFC. combineInsertEltToShuffle was performing 2 very different folds in the same call, merging "(insert_vector_elt (vector_shuffle X, Y), (extract_vector_elt X, N), IdxC) --> (vector_shuffle X, Y)" and "(insert_vector_elt V, (bitcast X from vector type), IdxC) --> bitcast(shuffle (bitcast V), (extended X), Mask)" The folds are currently still attempted in the same order as before (just as 2 seperate calls) so there should be no change in behaviour. First step towards some adjustments to mergeInsertEltWithShuffle for D127115.	2023-01-18 11:56:36 +00:00
David Green	21df504399	[DAG][ARM][AArch64] Transform max(a,b) - min(a,b) -> abd(a,b) This adds both signed and unsigned transforms for max(a, b) - min(a, b) -> abd(a, b). unsigned: https://alive2.llvm.org/ce/z/RF4jGQ signed: https://alive2.llvm.org/ce/z/Cjr2zE Fixes: #59894 Differential Revision: https://reviews.llvm.org/D141706	2023-01-18 11:44:26 +00:00
David Green	e26ec330c4	[DAG][AArch64][ARM] Combine abd(sub(x, y)) to abd if the sub is nsw This implements the fold (abs (sub nsw x, y)) -> abds(x, y). Providing the sub is nsw this appears to be valid without the extensions that are usually used for abds. https://alive2.llvm.org/ce/z/XHVaB3. The equivalent abdu combine seems to not be valid. Differential Revision: https://reviews.llvm.org/D141665	2023-01-18 10:10:52 +00:00
Diana Picus	0a85c531fc	[GlobalISel] Detect splats built with G_CONCAT_VECTORS Add support to the MI matching of vector splats for patterns that consist of `G_CONCAT_VECTORS` of smaller splats with the same constant value. With this, we would consider the following pseudo-MIR to be a splat: ``` %0 = G_[F]CONSTANT [...] %1 = G_BUILD_VECTOR %0, %0, ..., %0 %2 = G_CONCAT_VECTORS %1, %1, ..., %1 ``` Since it uses recursion for matching splats, it could match pretty complicated patterns with all sorts of combinations of `G_BUILD_VECTOR` and `G_CONCAT_VECTORS` (e.g. a `G_CONCAT_VECTORS` with a `G_BUILD_VECTOR_TRUNC` and another `G_CONCAT_VECTORS` as operands), and it should also look through copies etc. This should make it easier to match complex immediates for certain instructions on AMDGPU, where for instance a <8 x s16> will be split before instruction selection into a `G_CONCAT_VECTORS` of <2 x s16> splats. Differential Revision: https://reviews.llvm.org/D141902	2023-01-18 10:56:04 +01:00
Diana Picus	544e02906a	Fix typo. NFC	2023-01-18 10:56:04 +01:00
Pierre van Houtryve	fd3300123d	[CodeGen] Prevent overlapping subregs in getCoveringSubRegIndexes If `getCoveringSubRegIndexes` returns a set of subregister indexes where some subregisters overlap others, it can create unsatisfiable copy bundles that eventually cause VirtRegRewriter to error out due to "cycles in copy bundle". We can simply prevent this by making the algorithm skip over subregisters indexes that would cause an overlap with already-covered lanes. Note that in the case of AMDGPU, this problem is caused by the lack of subregisters indexes for 13/14/15-register tuples. We have everything up until 12, then we have 16 and 32 but nothing between 12 and 16. This means that the best candidate to do the least amount of copies when splitting a 29-register tuple was to copy (e.g.) 0-15 and 14-29, causing an overlap. With this change, getCoveringSubRegIndexes will now prefer using something like 0-15, 16-28 and 1 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D141576	2023-01-18 03:50:17 -05:00
Anshil Gandhi	5073a622a7	[MachineBasicBlock] Explicit FT branching param Introduce a parameter in getFallThrough() to optionally allow returning the fall through basic block in spite of an explicit branch instruction to it. This parameter is set to false by default. Introduce getLogicalFallThrough() which calls getFallThrough(false) to obtain the block while avoiding insertion of a jump instruction to its immediate successor. This patch also reverts the changes made by D134557 and solves the case where a jump is inserted after another jump (branch-relax-no-terminators.mir). Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D140790	2023-01-17 17:12:08 -07:00
Rahman Lavaee	3d6841b2b1	[Propeller] Use Fixed MBB ID instead of volatile MachineBasicBlock::Number. Let Propeller use specialized IDs for basic blocks, instead of MBB number. This allows optimizations not just prior to asm-printer, but throughout the entire codegen. This patch only implements the functionality under the new `LLVM_BB_ADDR_MAP` version, but the old version is still being used. A later patch will change the used version. ####Background Today Propeller uses machine basic block (MBB) numbers, which already exist, to map native assembly to machine IR. This is done as follows. - Basic block addresses are captured and dumped into the `LLVM_BB_ADDR_MAP` section just before the AsmPrinter pass which writes out object files. This ensures that we have a mapping that is close to assembly. - Profiling mapping works by taking a virtual address of an instruction and looking up the `LLVM_BB_ADDR_MAP` section to find the MBB number it corresponds to. - While this works well today, we need to do better when we scale Propeller to target other Machine IR optimizations like spill code optimization. Register allocation happens earlier in the Machine IR pipeline and we need an annotation mechanism that is valid at that point. - The current scheme will not work in this scenario because the MBB number of a particular basic block is not fixed and changes over the course of codegen (via renumbering, adding, and removing the basic blocks). - In other words, the volatile MBB numbers do not provide a one-to-one correspondence throughout the lifetime of Machine IR. Profile annotation using MBB numbers is restricted to a fixed point; only valid at the exact point where it was dumped. - Further, the object file can only be dumped before AsmPrinter and cannot be dumped at an arbitrary point in the Machine IR pass pipeline. Hence, MBB numbers are not suitable and we need something else. ####Solution We propose using fixed unique incremental MBB IDs for basic blocks instead of volatile MBB numbers. These IDs are assigned upon the creation of machine basic blocks. We modify `MachineFunction::CreateMachineBasicBlock` to assign the fixed ID to every newly created basic block. It assigns `MachineFunction::NextMBBID` to the MBB ID and then increments it, which ensures having unique IDs. To ensure correct profile attribution, multiple equivalent compilations must generate the same Propeller IDs. This is guaranteed as long as the MachineFunction passes run in the same order. Since the `NextBBID` variable is scoped to `MachineFunction`, interleaving of codegen for different functions won't cause any inconsistencies. The new encoding is generated under the new version number 2 and we keep backward-compatibility with older versions. ####Impact on Size of the `LLVM_BB_ADDR_MAP` Section Emitting the Propeller ID results in a 23% increase in the size of the `LLVM_BB_ADDR_MAP` section for the clang binary. Reviewed By: tmsriram Differential Revision: https://reviews.llvm.org/D100808	2023-01-17 15:25:29 -08:00
Francesco Petrogalli	229162d4d7	[MIScheduler] Print top/down cycle in the SUnit dump. Add an extra command line option to `llc` that allows checking at what cycle an instruction has been scheduled by the machine scheduler. Differential Revision: https://reviews.llvm.org/D141289	2023-01-17 15:55:43 +01:00
James Y Knight	52f6ed099c	Move Personalities array from MachineModuleInfo to DwarfCFIException. It was only ever used there, already. The previous location seems left-over from when the personality function was specified on a per-landingpad basis, instead of per-function.	2023-01-16 18:15:01 -05:00
James Y Knight	a1551fdd48	FastISel: remove EH_LABEL skipping code. This was intended to skip past the EH_LABEL which is added at the top of a landingpad block. But, it is unnecessary because `LastLocalValue` is already set to point past the EH_LABEL in that case. Thus, currently, this is dead-code. I am removing it because it _also_ attempts to skip over EH_LABELs emitted around a call. Currently, this situation never arises, but it becomes harmful after a future in-progress commit.	2023-01-16 18:15:00 -05:00
Alexey Lapshin	2216ee4909	This patch allows llvm-dwarfutil to utilize accelerator tables generation code from DWARFLinker. It adds command line option: --build-accelerator [none,DWARF] Build accelerator tables(default: none) =none - Do not build accelerators =DWARF - Build accelerator tables according to the resulting DWARF version DWARFv4: .debug_pubnames and .debug_pubtypes DWARFv5: .debug_names Differential Revision: https://reviews.llvm.org/D139638	2023-01-16 14:42:30 +01:00
Amaury Séchet	396ad408fd	[DAG] Recombine (binop (shift x y)) This helps address regressions in D127115 . Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D141809	2023-01-16 02:20:29 +00:00
Roman Lebedev	edf004e691	[NFC][TargetLowering] `isSplatValueForTargetNode()`: add `DAG` operand Without it we can't recurse further.	2023-01-16 00:02:20 +03:00
Roman Lebedev	f8d9097168	[DAGCombiner] `combineShuffleOfSplatVal()`: try to canonicalize to a splat shuffle As noted in https://reviews.llvm.org/D141778#inline-1369900, we fail to produce splat shuffles from certain sequences of shuffles, that may have non-shuffles in the middle of seq. There is a big pitfail to avoid here: just because `isSplatValue()` says that all demanded elements are splat, we can't pick any random one of them, because some of them could be undef! We must ignore those!	2023-01-15 21:11:33 +03:00
Matt Arsenault	ab6b48b711	DAG: Avoid stack lowering if bitcast has an illegal vector result type A bitcast of <10 x i32> to <5 x i64> was ending up on the stack. Instead of doing that, handle the case where the new type doesn't evenly divide but the elements do. Extract the individual elements and pad with undef. Avoids stack usage for bitcasts involving <5 x i64>. In some of these cases, later optimizations actually eliminated the stack objects but left behind the unused temporary stack object to final emission. Fixes: SWDEV-377548	2023-01-15 12:37:14 -05:00
Matt Arsenault	6ee5a1a090	GlobalISel: Enable CSE for G_SEXT_INREG	2023-01-15 11:38:30 -05:00
Sergei Barannikov	dca40e3288	[CodeGen] Replace CCValAssign::Loc with std::variant (NFCI) The motivation behind this change is as follows. Targets with stack growing up (there are no such in-tree targets) pass arguments at negative offsets relative to the stack pointer. This makes it hard to use the generic value assigner because CCValAssign stores the offset as an unsigned integer, which is then zero-extended when converted to int64_t, e.g. when passing to `CreateFixedObject`. This results in conversion of, for example, -4 into 4294967292, which is not desired. While it is possible to insert a cast to `int` before passing the result of `getLocMemOffset` into `CreateFixedObject` in backend code, this is error-prone, and some uses of `getLocMemOffset` are located in places common to all backends (e.g. `CallLowering::handleAssignments`). That said, I wanted to change the type of the memory offset from `unsigned` to `int64_t` (this would be consistent with other places where stack offsets are used). However, the `Loc` field which stores the offset is shared between three different kinds of the location: register, memory, and "pending". Storing a register number as `int64_t` does not seem right (there are `Register` and `MCRegister` for this), so I did the most straightforward change - replaced the `Loc` field with std::variant. The main change that changes the type of the memory offset from `unsigned` to `int64_t` will be in a follow-up patch to simplify the review. Reviewed By: MaskRay, nikic Differential Revision: https://reviews.llvm.org/D136043	2023-01-15 10:31:42 +03:00
Roman Lebedev	cc39c3b17f	[Codegen][LegalizeIntegerTypes] New legalization strategy for scalar shifts: shift through stack https://reviews.llvm.org/D140493 is going to teach SROA how to promote allocas that have variably-indexed loads. That does bring up questions of cost model, since that requires creating wide shifts. Indeed, our legalization for them is not optimal. We either split it into parts, or lower it into a libcall. But if the shift amount is by a multiple of CHAR_BIT, we can also legalize it throught stack. The basic idea is very simple: 1. Get a stack slot 2x the width of the shift type 2. store the value we are shifting into one half of the slot 3. pad the other half of the slot. for logical shifts, with zero, for arithmetic shift with signbit 4. index into the slot (starting from the base half into which we spilled, either upwards or downwards) 5. load 6. split loaded integer This works for both little-endian and big-endian machines: https://alive2.llvm.org/ce/z/YNVwd5 And better yet, if the original shift amount was not a multiple of CHAR_BIT, we can just shift by that remainder afterwards: https://alive2.llvm.org/ce/z/pz5G-K I think, if we are going perform shift->shift-by-parts expansion more than once, we should instead go through stack, which is what this patch does. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D140638	2023-01-14 19:12:18 +03:00
Matt Arsenault	e70ae0f46b	DAG/GlobalISel: Fix broken/redundant setting of MODereferenceable This was incorrectly setting dereferenceable on unaligned operands. getLoadMemOperandFlags does the alignment dereferenceabilty check without alignment, and then both paths went on to check isDereferenceableAndAlignedPointer. Make getLoadMemOperandFlags check isDereferenceableAndAlignedPointer, and remove the second call.	2023-01-13 20:30:30 -05:00
NAKAMURA Takumi	292019e931	MachineScheduler.cpp: Fixup D141707, suppress `MISchedDumpReservedCycles` conditionally. It is used in `LLVM_ENABLE_DUMP` regardless of `NDEBUG`.	2023-01-14 10:04:23 +09:00
Paul Kirth	fdc0bf6adc	Revert "[codegen] Add StackFrameLayoutAnalysisPass" This breaks on some AArch64 bots This reverts commit `0a652c5405`.	2023-01-13 22:59:36 +00:00
Craig Topper	e72ca520bb	[CodeGen] Remove uses of Register::isPhysicalRegister/isVirtualRegister. NFC Use isPhysical/isVirtual methods. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D141715	2023-01-13 14:38:08 -08:00
Paul Kirth	0a652c5405	[codegen] Add StackFrameLayoutAnalysisPass Issue #58168 describes the difficulty diagnosing stack size issues identified by -Wframe-larger-than. For simple code, its easy to understand the stack layout and where space is being allocated, but in more complex programs, where code may be heavily inlined, unrolled, and have duplicated code paths, it is no longer easy to manually inspect the source program and understand where stack space can be attributed. This patch implements a machine function pass that emits remarks with a textual representation of stack slots, and also outputs any available debug information to map source variables to those slots. The new behavior can be used by adding `-Rpass-analysis=stack-frame-layout` to the compiler invocation. Like other remarks the diagnostic information can be saved to a file in a machine readable format by adding -fsave-optimzation-record. Fixes: #58168 Reviewed By: nickdesaulniers, thegameg Differential Revision: https://reviews.llvm.org/D135488	2023-01-13 20:52:48 +00:00
HanSheng Zhang	95414345d5	[DAGCombiner] `visitFREEZE()`: gracefully handle node invalidation When we freeze operands of an operation that we are trying to freeze, doing so may invalidate the original SDValue. We should just re-fetch it from the ISD::FREEZE node, because if we bail, we'd hopefully just revisit the node and do that again. Fixes https://github.com/llvm/llvm-project/issues/59891 Differential Revision: https://reviews.llvm.org/D141256	2023-01-13 21:53:29 +03:00
Roman Lebedev	5ffa8c4019	[NFC][DAGCombiner] Fix typo in `visitFREEZE()`	2023-01-13 21:52:47 +03:00
Francesco Petrogalli	c3c6d47c45	[CodeGen] Fix build failure due to missing declaration. The failure was reported in https://github.com/llvm/llvm-project/issues/60011 FAILED: lib/CodeGen/CMakeFiles/LLVMCodeGen.dir/MachineScheduler.cpp.o "/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/build-llvm/./bin/clang++" -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -I"/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/build-llvm/tools/clang/stage2-bins/lib/CodeGen" -I"/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/llvm/lib/CodeGen" -I"/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/build-llvm/tools/clang/stage2-bins/include" -I"/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/llvm/include" -fstack-protector-strong -Wformat -Werror=format-security -Wno-unused-command-line-argument -Wdate-time -D_FORTIFY_SOURCE=2 -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wc++98-compat-extra-semi -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -Wctad-maybe-unsupported -fdiagnostics-color -ffunction-sections -fdata-sections -ffile-prefix-map=/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/build-llvm/tools/clang/stage2-bins=build-llvm/tools/clang/stage2-bins -ffile-prefix-map=/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/= -no-canonical-prefixes -O2 -DNDEBUG -g1 -fno-exceptions -std=c++17 -MD -MT lib/CodeGen/CMakeFiles/LLVMCodeGen.dir/MachineScheduler.cpp.o -MF lib/CodeGen/CMakeFiles/LLVMCodeGen.dir/MachineScheduler.cpp.o.d -o lib/CodeGen/CMakeFiles/LLVMCodeGen.dir/MachineScheduler.cpp.o -c '/build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/llvm/lib/CodeGen/MachineScheduler.cpp' /build/llvm-toolchain-snapshot-16~++20230113111109+aba8983c9d86/llvm/lib/CodeGen/MachineScheduler.cpp:2639:7: error: use of undeclared identifier 'MISchedDumpReservedCycles' if (MISchedDumpReservedCycles) ^ 1 error generated. Fixes #60011 Differential Revision: https://reviews.llvm.org/D141707	2023-01-13 19:43:56 +01:00
Craig Topper	c8bd5343df	[DAGCombiner][RISCV] Pre-promote (zext (abs X)) to (abs (sext X)) when X has an illegal type. Type legalization will insert a sign extend anyway. By doing it early we can remove the zext. ComputeNumSignBits can't spot it after type legalization because type legalization may expand the abs to sra+xor+sub. If the zext result type is larger than the type to be promoted to, we'll promote to a legal type and then zext the rest of the way. If the legal type is larger than the destination type we can promote and then truncate. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D140509	2023-01-13 10:40:25 -08:00
Florian Hahn	20ecc07991	[MachineCombiner] Lift same-bb restriction for reassociable ops. This patch relaxes the restriction that both reassociate operands must be in the same block as the root instruction. The comment indicates that the reason for this restriction was that the operands not in the same block won't have a depth in the trace. I believe this is outdated; if the operand is in a different block, it must dominate the current block (otherwise it would need to be phi), which in turn means the operand's block must be included in the current rance, and depths must be available. There's a test case (no_reassociate_different_block) added in `70520e2f1c` which shows that we have accurate depths for operands defined in other blocks. This allows reassociation of code that computes the final reduction value after vectorization, among other things. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D141302	2023-01-13 15:32:44 +00:00
Guillaume Chatelet	26bd6476c6	Deprecate DataLayout::getPrefTypeAlignment	2023-01-13 15:05:24 +00:00
Francesco Petrogalli	aba8983c9d	Recommit [SchedBoundary] Add dump method for resource usage. Summary: As supporting information, I have added an example that describes how the indexes of the vector of resources SchedBoundary::ReservedCycles are tracked by the field SchedBoundary::ReservedCyclesIndex. This has a minor rework of `b39a9a94f4` which was reverted in `df6ae1779f` becasue the llc invocation of the test was missing the argument `-mtriple`. See for example the failure at https://lab.llvm.org/buildbot#builders/231/builds/7245 that reported the following when targeting a non-aarch64 native build: 'cortex-a55' is not a recognized processor for this target (ignoring processor) Reviewers: jroelofs Subscribers: Differential Revision: https://reviews.llvm.org/D141367	2023-01-13 11:42:05 +01:00
Francesco Petrogalli	df6ae1779f	Revert "[SchedBoundary] Add dump method for resource usage." Reverting because of https://lab.llvm.org/buildbot#builders/16/builds/41860 When building on x86, I need to specify also -mtriple in the invocation of llc otherwise the folllowing error shows up: 'cortex-a55' is not a recognized processor for this target (ignoring processor) This reverts commit `b39a9a94f4`.	2023-01-13 11:14:20 +01:00
Francesco Petrogalli	b39a9a94f4	[SchedBoundary] Add dump method for resource usage. As supporting information, I have added an example that describes how the indexes of the vector of resources SchedBoundary::ReservedCycles are tracked by the field SchedBoundary::ReservedCyclesIndex. Reviewed By: jroelofs Differential Revision: https://reviews.llvm.org/D141367	2023-01-13 10:38:43 +01:00
Diana Picus	2004ab422a	MachineIRBuilder: Add buildMergeValues. NFC Add a `buildMergeValues` method that unconditionally builds a G_MERGE_VALUES instruction, as opposed to `buildMergeLikeInstr` which may decide on a different opcode based on the input types. I haven't audited all the uses of `buildMergeLikeInstr` to see if they can be replaced with `buildMergeValues`, but I did find a couple of obvious ones where we check that we're merging scalars right before calling `buildMerge`. This is a follow-up suggested in https://reviews.llvm.org/D140964 Differential Revision: https://reviews.llvm.org/D141373	2023-01-13 09:32:58 +01:00
Diana Picus	f95a5fbe7c	MachineIRBuilder: Rename buildMerge. NFC `buildMerge` may build a G_MERGE_VALUES, G_BUILD_VECTOR or G_CONCAT_VECTORS. Rename it to `buildMergeLikeInstr`. This is a follow-up suggested in https://reviews.llvm.org/D140964 Differential Revision: https://reviews.llvm.org/D141372	2023-01-13 09:32:58 +01:00
Diana Picus	0f831aa805	GlobalISel: s/Op/Instr in some places. NFC This patch replaces `GMergeLikeOp` with `GMergeLikeInstr` and `MachineIRBuilder::buildAssertOp` with `buildAssertInstr` in order to remove ambiguity. Discussed in: https://reviews.llvm.org/D141372	2023-01-13 09:32:58 +01:00
Xiang1 Zhang	593e17648e	Add -disable-check-noreturn-call for stack protect Differential Revision: https://reviews.llvm.org/D141556 Reviewed By: lanza	2023-01-13 08:28:38 +08:00
OCHyams	12ece76815	[DebugInfo] Replace UndefValue with PoisonValue in AssignmentTrackingAnalysis This helps towards the effort to remove UndefValue from LLVM. Related to https://discourse.llvm.org/t/auto-undef-debug-uses-of-a-deleted-value Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D140906	2023-01-12 09:51:45 +00:00
OCHyams	83f7f86e7d	[NFC][Assignment Tracking] Add is/setKillAddress Unlike D140903 this patch folds in treating an empty metadata address component of a dbg.assign the same as undef because it was already being treated that way in the AssignmentTrackingAnalysis pass. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D141125	2023-01-12 09:46:01 +00:00
Jannik Silvanus	df1a74ac3c	[IR] Support importing modules with invalid data layouts. Use the existing mechanism to change the data layout using callbacks. Before this patch, we had a callback type DataLayoutCallbackTy that receives a single StringRef specifying the target triple, and optionally returns the data layout string to be used. Module loaders (both IR and BC) then apply the callback to potentially override the module's data layout, after first having imported and parsed the data layout from the file. We can't do the same to fix invalid data layouts, because the import will already fail, before the callback has a chance to fix it. Instead, module loaders now tentatively parse the data layout into a string, wait until the target triple has been parsed, apply the override callback to the imported string and only then parse the tentative string as a data layout. Moreover, add the old data layout string S as second argument to the callback, in addition to the already existing target triple argument. S is either the default data layout string in case none is specified, or the data layout string specified in the module, possibly after auto-upgrades (for the BitcodeReader). This allows callbacks to inspect the old data layout string, and fix it instead of setting a fixed data layout. Also allow to pass data layout override callbacks to lazy bitcode module loader functions. Differential Revision: https://reviews.llvm.org/D140985	2023-01-12 10:10:45 +01:00
esmeyi	5ce0a26bd1	[XCOFF] handle the toc-data for object file generation. Summary: The toc-data feature has been supported for assembly file generation. This patch handles the toc-data for object file generation. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D139516	2023-01-11 23:27:47 -05:00

1 2 3 4 5 ...

33513 Commits