clang-p2996

Author	SHA1	Message	Date
Nikita Popov	0d81093742	[llvm] Remove calls to supportsTypedPointers() (NFC) Always returns false now.	2023-07-14 09:34:46 +02:00
Eli Friedman	60712732ea	[IndVars] Teach replaceCongruentIVs to avoid scrambling induction variables replaceCongruentIVs analysis is based on ScalarEvolution; this makes comparing different PHIs and performing the replacement straightforward. However, it can have some side-effects: it isn't aware whether an induction variable is in canonical form, so it can perform replacements which obscure the meaning of the IR. In test22 in widen-loop-comp.ll, the resulting loop can't be analyzed by ScalarEvolution at all. My attempted solution is to restrict the transform: don't try to replace induction variables using PHI nodes that don't represent simple induction variables. I'm not sure if this is the best solution; suggestions welcome. Differential Revision: https://reviews.llvm.org/D121950	2023-07-12 12:27:39 -07:00
Nikita Popov	b75254270e	[SCEVExpander] Remove GEP add rec splitting code (NFCI) I don't believe this is relevant anymore with opaque pointers, where we always expand the entire offset, without splitting it into parts.	2023-07-12 16:53:51 +02:00
Jie Fu	67f1e8d737	[Transforms] Remove FactorOutConstant to fix -Wunneeded-internal-declaration (NFC) /Users/jiefu/llvm-project/llvm/lib/Transforms/Utils/ScalarEvolutionExpander.cpp:293:13: error: function 'FactorOutConstant' is not needed and will not be emitted [-Werror,-Wunneeded-internal-declaration] static bool FactorOutConstant(const SCEV &S, const SCEV &Remainder, ^ 1 error generated.	2023-07-12 21:57:12 +08:00
Nikita Popov	02ba40593d	[SCEVExpander] Remove typed pointer support (NFC)	2023-07-12 15:34:58 +02:00
Nikita Popov	d69033d245	[SCEVExpander] Fix GEP IV inc reuse logic for opaque pointers Instead of checking the pointer type, check the element type of the GEP. Previously we ended up reusing GEP increments that were not in expanded form, thus not respecting LSR's choice of representation. The change in 2011-10-06-ReusePhi.ll recovers a regression that appeared when converting that test to opaque pointers. Changes in various Thumb tests now compute the step outside the loop instead of using add.w inside the loop, which is LSR's preferred representation for this target.	2023-07-12 11:32:13 +02:00
Matt Arsenault	3701ebe76b	AtomicExpand: Fix expanding atomics into unconstrained FP in strictfp functions Ideally the normal fadd/fmin/fmax this was creating would fail the verifier. It's probably also necessary to force off FP exception handlers in the cmpxchg loop but we don't have a generic way to do that now. Note strictfp builder is broken in the minnum/maxnum case https://reviews.llvm.org/D154993	2023-07-11 18:51:15 -04:00
Johannes Doerfert	8542d8f3cd	[Attributor][FIX] Ensure the function manager cache is updated When a function is removed we need to clear cached analysis from the function manager cache.	2023-07-09 20:12:31 -07:00
Elliot Goodrich	39d8e6e22c	Add missing StringExtras.h includes In preparation for removing the `#include "llvm/ADT/StringExtras.h"` from the header to source file of `llvm/Support/Error.h`, first add in all the missing includes that were previously included transitively through this header. This is fixing all files missed in `b0abd4893f`. Differential Revision: https://reviews.llvm.org/D154543	2023-07-08 10:19:07 +01:00
spupyrev	a7e13a99c2	A new code layout algorithm for function reordering [1/3] We are bringing a new algorithm for function layout (reordering) based on the call graph (extracted from profile data). The algorithm is an improvement on top of a known heuristic, C^3. It tries to co-locate hot and frequently executed together functions in the resulting ordering. Unlike C^3, it explores a larger search space and has an objective closely tied to the performance of instruction and i-TLB caches. Hence, the name CDS = Cache-Directed Sort. The algorithm can be used at the linking or post-linking (e.g., BOLT) stage. This diff modifies the existing data structures to facilitate the implementation (down the stack). This is a no-op change. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D152833	2023-07-07 07:25:25 -07:00
Serguei Katkov	1614805eeb	Register new assumption in a cache When new assumption is created it should be registered in assumption cache or cache should be invalidated. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D154601	2023-07-07 10:53:03 +07:00
Jim Lin	893cc97007	[LibCallsShrinkWrap] Set IsFPConstrained is true for creating quiet floating comparision if function has strictfp attribute Create a quiet floating-point comparison if the function has the strictfp attribute. This avoids an unexpected FP exception being raised during libcall domain error checking. It raises an FP exception only in the case where an input is a signaling NaN. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D152776	2023-07-06 13:23:34 +08:00
Florian Hahn	69ca5c9d62	[SCEV] Add flag to control invertible check for normalization. When normalizing a SCEV expression during expansion, there should be no need for it to be invertible, as it will only be used for code generation. This fixes a crash after `7f5b15ad15`. Fixes https://github.com/llvm/llvm-project/issues/63678.	2023-07-05 18:11:44 +01:00
Nikita Popov	b9808e5660	[LoopUnroll] Fold add chains during unrolling Loop unrolling tends to produce chains of `%x1 = add %x0, 1; %x2 = add %x1, 1; ...` with one add per unrolled iteration. This patch simplifies these adds to `%xN = add %x0, N` directly during unrolling, rather than waiting for InstCombine to do so. The motivation for this is that having a single add (rather than an add chain) on the induction variable makes it a simple recurrence, which we specially recognize in a number of places. This allows InstCombine to directly perform folds with that knowledge, instead of first folding the add chains, and then doing other folds in another InstCombine iteration. Due to the reduced number of InstCombine iterations, this also results in a small compile-time improvement. Differential Revision: https://reviews.llvm.org/D153540	2023-07-05 09:54:28 +02:00
Anna Thomas	da59873354	[MetaRenamer] Rename only unnamed instructions in mode renaming instructions `6f9e743b91` introduced a mode which renames only instructions in the function. This change updates that mode to skip instructions that are already named. This serves the original purpose of the mode (rename-only-inst) which is: 1. Modify IR without failing verifier with serially ordered number requirement (%1, %2, %3 required in order). 2. Give meaningful names to instructions.	2023-07-04 11:11:14 -04:00
Nikita Popov	bb3763e497	Revert "[SimplifyCFG] Allow dropping block that only contains ephemeral values" This reverts commit `20f0c68fd8`. https://reviews.llvm.org/D153966#4464594 reports an optimization regression in Rust. Additionally this change has caused an unexpected 0.3% compile-time regression.	2023-06-30 21:24:05 +02:00
Anna Thomas	6f9e743b91	[Metarenamer] Introduce option to only change inst names This is useful when needing to modify IR and test some optimizations on them, while keeping BB names and function names intact. If the IR uses ordered number naming (%1, %2, %3 etc), then we cannot just remove or reorder specific instructions since the verifier expects the numbers to be in order.	2023-06-30 11:13:15 -04:00
Nikita Popov	20f0c68fd8	[SimplifyCFG] Allow dropping block that only contains ephemeral values Perform the TryToSimplifyUncondBranchFromEmptyBlock() transform if the block is empty except for ephemeral values. The ephemeral values will be dropped in that case. This makes sure that assumes don't block this transform, as reported in https://discourse.llvm.org/t/llvm-assume-blocks-optimization/71609. Differential Revision: https://reviews.llvm.org/D153966	2023-06-30 15:24:01 +02:00
Carlos Alberto Enciso	8763d79949	[loop-deletion] Overly defensive with undef-ing dbg.values. Explicitly inserting undef is overly defensive. Any values computed inside the loop that are referenced by dbg.values should naturally become undef when the loop is deleted, and all other values that are loop invariant must be preserved. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D153539	2023-06-30 06:24:56 +01:00
Han Shen	80155cbf0b	[Analysis] Refactor MBB hotness/coldness into templated PSI functions. Currently, to use PSI->isFunctionHotInCallGraph, we first need to calculate BPI->BFI, which is expensive. Instead, we can implement this directly with MBFI. Also as @wenlei mentioned in another patch review, that MachineSizeOpts already has isFunctionColdInCallGraph, isFunctionHotInCallGraphNthPercentile, etc implemented. These can be refactored and so they can be reused across MachineFunctionSplitting and MachineSizeOpts passes. This CL does this - it refactors out those internal static functions into PSI as templated functions, so they can be accessed easily. Differential Revision: https://reviews.llvm.org/D153927	2023-06-28 22:32:52 -07:00
Youngsuk Kim	243f0566dc	[llvm] Replace uses of Type::getPointerTo (NFC) Partial progress towards removing in-tree uses of `Type::getPointerTo`, before we can deprecate the API. If the API is used solely to support an unnecessary bitcast, get rid of the bitcast as well. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D153933	2023-06-28 09:21:34 -04:00
Fangrui Song	4bb3d0e531	Revert D153927 "Resubmit with fix: [NFC] Refactor MBB hotness/coldness into templated PSI functions." This reverts commit `4d8cf2ae68`. There is a library layering violation. LLVMAnalysis cannot depend on LLVMCodeGen. ``` llvm/include/llvm/Analysis/ProfileSummaryInfo.h:19:10: fatal error: 'llvm/CodeGen/MachineFunction.h' file not found 19 \| #include "llvm/CodeGen/MachineFunction.h" \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ```	2023-06-28 00:30:52 -07:00
Han Shen	4d8cf2ae68	Resubmit with fix: [NFC] Refactor MBB hotness/coldness into templated PSI functions. In D152399, we calculate BPI->BFI in MachineFunctionSplit pass just to use PSI->isFunctionHotInCallGraph, which is expensive. Instead, we can implement this directly with MBFI. Reviewer mentioned in the comment, that machine_size_opts already has isFunctionColdInCallGraph, isFunctionHotInCallGraphNthPercentile, etc implemented. These can be refactored and reused across MFS and machine size opts. This CL does this - it refactors out those internal static functions into PSI as templated functions, so they can be accessed easily. Differential Revision: https://reviews.llvm.org/D153927	2023-06-27 21:24:34 -07:00
Elliot Goodrich	f0fa2d7c29	[llvm] Move AttributeMask to a separate header Move `AttributeMask` out of `llvm/IR/Attributes.h` to a new file `llvm/IR/AttributeMask.h`. After doing this we can remove the `#include <bitset>` and `#include <set>` directives from `Attributes.h`. Since there are many headers including `Attributes.h`, but not needing the definition of `AttributeMask`, this causes unnecessary bloating of the translation units and slows down compilation. This commit adds in the include directive for `llvm/IR/AttributeMask.h` to the handful of source files that need to see the definition. This reduces the total number of preprocessing tokens across the LLVM source files in lib from (roughly) 1,917,509,187 to 1,902,982,273 - a reduction of ~0.76%. This should result in a small improvement in compilation time. Differential Revision: https://reviews.llvm.org/D153728	2023-06-27 15:26:17 +01:00
Aiden Grossman	bfa1057b81	Revert "[NFC] Refactor MBB hotness/coldness into templated PSI functions." This reverts commit `c3e3372040`. This has broken quite a few buildbots.	2023-06-27 06:05:27 +00:00
Jim Lin	f6cb9b82e2	[LibCallsShrinkWrap][NFC] Reuse createCond and createOrCond Add two new functions `createCond` and `createOrCond` that accept extra arguments Arg and Arg/Arg2 respectively. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D153253	2023-06-27 14:02:19 +08:00
Han Shen	c3e3372040	[NFC] Refactor MBB hotness/coldness into templated PSI functions. In D152399, we calculate BPI->BFI in MachineFunctionSplit pass just to use PSI->isFunctionHotInCallGraph, which is expensive. Instead, we can implement this directly with MBFI. Reviewer mentioned in the comment, that machine_size_opts already has isFunctionColdInCallGraph, isFunctionHotInCallGraphNthPercentile, etc implemented. These can be refactored and reused across MFS and machine size opts. This CL does this - it refactors out those internal static functions into PSI as templated functions, so they can be accessed easily. Differential Revision: https://reviews.llvm.org/D152758	2023-06-26 21:56:40 -07:00
Nikita Popov	0e34b6a504	[LCSSA] Compute SCEV of LCSSA phi if original instruction had SCEV The backstory is that the LCSSA invalidation we perform here is not really necessary from a SCEV perspective. However, other code may rely on the fact that invalidating only LCSSA phi nodes is sufficient for transforms like loop peeling (see https://reviews.llvm.org/D149331#4398582 for more details). However, performing invalidation during LCSSA construction also means that SCEV expansion (which may need to construct LCSSA) can invalidate SCEV, which is somewhat unexpected and code may not be prepared to deal with it (see the added test case, reported at https://reviews.llvm.org/D149435#4428219). Instead of invalidating SCEV, ensure that the LCSSA phi node also has cached SCEV if the original instruction did. This means that later invalidation of LCSSA phi nodes will work as expected. This should avoid both the above issues and be more efficient. Differential Revision: https://reviews.llvm.org/D153145	2023-06-26 14:43:31 +02:00
Jie Fu	86f564edc0	[SimplifyCFG] Remove unused variable 'Inc' (NFC) /data/llvm-project/llvm/lib/Transforms/Utils/SimplifyCFG.cpp:6051:10: error: unused variable 'Inc' [-Werror,-Wunused-variable] bool Inc, Wrapped = false; ^ 1 error generated.	2023-06-26 08:59:37 +08:00
khei4	28d13a6297	[SimplifyCFG] add nsw on BuildLookuptable LinearMap calculation Differential Revision: https://reviews.llvm.org/D150943	2023-06-26 08:30:23 +09:00
Elliot Goodrich	b0abd4893f	[llvm] Add missing StringExtras.h includes In preparation for removing the `#include "llvm/ADT/StringExtras.h"` from the header to source file of `llvm/Support/Error.h`, first add in all the missing includes that were previously included transitively through this header.	2023-06-25 15:42:22 +01:00
Kazu Hirata	6b31e63bc3	[LegacyPM] Remove RewriteSymbolsLegacyPass Differential Revision: https://reviews.llvm.org/D153679	2023-06-24 07:33:50 -07:00
Youngsuk Kim	d22a236ae7	[llvm] Replace use of Type::getPointerTo() (NFC) Partial progress towards replacing in-tree uses of `Type::getPointerTo()`. If `getPointerTo()` is used solely to support an unnecessary bitcast, remove the bitcast. Reviewed By: barannikov88, nikic Differential Revision: https://reviews.llvm.org/D153307	2023-06-23 22:32:29 -04:00
Tamás Danyluk	248b85344b	[SCCPSolver] Speed up SCCPSolver by avoiding repeated work list elements If a value is already the last element of the worklist, then I think that we don't have to add it again, it is not needed to process it repeatedly. For some long Triton-generated LLVM IR, this can cause a ~100x speedup. Differential Revision: https://reviews.llvm.org/D153561	2023-06-23 10:23:53 +02:00
Arthur Eubanks	d49984fa4f	[SimplifyCFG] Add option to not speculate blocks Required for phase ordering changes to not regress Rust code with D145265. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D153391	2023-06-22 08:51:40 -07:00
Anna Thomas	ec146cb7c0	[LV] Add support for minimum/maximum intrinsics {mini\|maxi}mum intrinsics are different from {min\|max}num intrinsics in the propagation of NaN and signed zero. Also, the minnum/maxnum intrinsics require the presence of nsz flags to be valid reductions in vectorizer. In this regard, we introduce a new recurrence kind and also add support for identifying reduction patterns using these intrinsics. The reduction intrinsics and lowering was introduced here: `26bfbec5d2`. There are tests added which show how this interacts across chains of min/max patterns. Differential Revision: https://reviews.llvm.org/D151482	2023-06-20 13:17:28 -04:00
Dmitry Makogon	d46d9689f7	[BBUtils] Don't add 'then' block to a loop if it's terminated with unreachable SplitBlockAndInsertIfThen utility creates two new blocks, they're called ThenBlock and Tail (true and false destinations of a conditional branch correspondingly). The function has a bool parameter Unreachable, and if it's set, then ThenBlock is terminated with an unreachable. At the end of the function the new blocks are added to the loop of the split block. However, in case ThenBlock is terminated with an unreachable, it cannot belong to any loop. Differential Revision: https://reviews.llvm.org/D152434	2023-06-19 20:24:30 +07:00
Nikita Popov	664b7a4cd5	[SCCP] Fix conversion of range to constant for vectors (PR63380) The ConstantRange specifies the range of the scalar elements in the vector. When converting into a Constant, we need to create a vector splat with the correct type. For that purpose, pass in the expected type for the constant. Fixes https://github.com/llvm/llvm-project/issues/63380.	2023-06-19 12:29:44 +02:00
Yevgeny Rouban	1ebbbf1614	[LoopUnrollRuntime] Allow indirect transition to deopt non-latch exit blocks Relax condition on runtime trip count unrolling loops with 1 non-latch exit that leads to a deopt block. There are cases when the deopt blocks are common exits for different loops. LoopSimplify pass splits such edges to the common deopting blocks to make sure that all exit nodes of the loop only have predecessors that are inside of the loop (See simplifyOneLoop()). This breaks the current condition for unrolling. This patch allows the split transitive blocks that still lead to the deopting blocks. Differential Revision: https://reviews.llvm.org/D152639	2023-06-19 11:10:01 +07:00
Arthur Eubanks	3e39cfe5b4	Revert "Revert "InstSimplify: Require instruction be parented"" This reverts commit `0c03f48480`. Going to fix forward size regression instead due to more dependent patches needing to be reverted otherwise.	2023-06-16 13:53:31 -07:00
Arthur Eubanks	0c03f48480	Revert "InstSimplify: Require instruction be parented" This reverts commit `1536e299e6`. Causes large binary size regressions, see comments on https://reviews.llvm.org/rG1536e299e63d7788f38117b0212ca50eb76d7a3b.	2023-06-16 11:24:29 -07:00
Alan Zhao	d6b4f6786b	Revert "Revert "InstSimplify: Require instruction be parented"" This reverts commit `00264eac4d`. Reason: caused a bunch of bots to break	2023-06-16 10:58:54 -07:00
Alan Zhao	00264eac4d	Revert "InstSimplify: Require instruction be parented" This reverts commit `1536e299e6`. Reason: causes a regression in the inliner (see https://crbug.com/1454531 and https://reviews.llvm.org/rG1536e299e63d7788f38117b0212ca50eb76d7a3b#1217141)	2023-06-16 10:36:49 -07:00
Arthur Eubanks	405f91475b	[SimplifyCFG] Check optforfuzzing attribute during in the pass implementation Instead of setting the SimplifyCFGOptions options at the beginning of the pass. Otherwise it always gets overridden by the pass and the value in SimplifyCFGOptions is ignored.	2023-06-15 13:57:51 -07:00
Matt Arsenault	0696240384	LowerMemIntrinsics: Check address space aliasing for memmove expansion For cases where we cannot insert an addrspacecast, we can still expand like a memcpy if we know the address spaces cannot alias. Normally non-aliasing memmoves are optimized to memcpy, but we cannot rely on that for lowering. If a target has aliasing address spaces that cannot be casted between, we still have to give up lowering this.	2023-06-14 07:56:58 -04:00
Noah Goldstein	3391bdc255	Revert "[FunctionAttrs] Propagate some func/arg/ret attributes from caller to callsite (WIP)" Accidental commit/push! This reverts commit `4fa971ff62`.	2023-06-13 00:53:31 -05:00
Noah Goldstein	4fa971ff62	[FunctionAttrs] Propagate some func/arg/ret attributes from caller to callsite (WIP) This is the consolidation of D151644 and D151943 moved from InstCombine to FunctionAttrs. This is based on discussion in the above patches as well as D152081 (Attributor). This patch was written in a way so it can have an immediate impact in currently active passes (FunctionAttrs), but should be easy to port elsewhere (Attributor or Inliner) if that makes more sense later on. Some function attributes imply the attribute for all/some instructions in the function. These attributes can be safely propagated to callsites within the function that are missing the attribute. This can be useful when 1) analyzing individual instructions in a function and 2) if the original caller is later inlined, as if the attributes are not propagated, they will be lost. This patch implements propagation in a new class/file `InferCallsiteAttrs` which can hypothetically be included elsewhere. At the moment this patch infers the following: Function Attributes: - mustprogress - nofree - willreturn - All memory attributes (readnone, readonly, writeonly, argmem, etc...) - The memory attributes are only propagated IFF the set of pointers available to the callsite is the same as the set available outside the caller (i.e., no local memory arguments from alloca or local malloc like functions). Argument Attributes: - noundef - nonnull - nofree - readnone - readonly - writeonly - nocapture - nocapture is only propagated IFF the set of pointers available to the callsite is the same as the set available outside the caller and it's guaranteed that between the callsite and function return, the state of any capture pointers will not change (so the nocaptured guarantee of the caller has been met by the instruction preceding the callsite and will not change). Argument attributes are only propagated to callsite arguments that are also function arguments, but not derived values.
Return Attributes: - noundef - nonnull Return attributes are only propagated if the callsite's return value is used as the caller's return and execution is guaranteed to pass from callsite to return. The compile time hit of this for -O3 and -O3+thinLTO is ~[.02, .37]% regression. Proper LTO, however, has more significant regressions (up to 3.92%): https://llvm-compile-time-tracker.com/compare.php?from=94407e1bba9807193afde61c56b6125c0fc0b1d1&to=79feb6e78b818e33ec69abdc58c5f713d691554f&stat=instructions:u Differential Revision: https://reviews.llvm.org/D152226	2023-06-13 00:47:43 -05:00
Matt Arsenault	ee19fabc98	LowerMemIntrinsics: Handle inserting addrspacecast for memmove lowering We're missing a trivial non-AA way to check for non-aliasing address spaces.	2023-06-12 21:10:30 -04:00
Vitaly Buka	23ea58f374	Revert "[SCCP] Replace new value's value state with removed value's" Breaks all sanitizers bootstrap bots: https://lab.llvm.org/buildbot/#/waterfall?tags=sanitizer This reverts commit `cf79773a90`.	2023-06-12 11:07:46 -07:00
luxufan	cf79773a90	[SCCP] Replace new value's value state with removed value's In replaceSignedInst, if a signed instruction can be replaced with an unsigned instruction, we created a new instruction and removed the old instruction's value state. If the following instructions have this new instruction as a use operand, transformations like replaceSignedInst and refineInstruction would be blocked. The reason is that there is no value state for the new instruction. This patch sets the new instruction's value state to the removed instruction's value state. I believe it is correct because when we replace a signed instruction with an unsigned instruction, the value state is not changed. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D152337	2023-06-12 11:40:47 +08:00

6951 Commits