clang-p2996

Files

Sameer Sahasrabuddhe 475ce4c200 RFC: Uniformity Analysis for Irreducible Control Flow

Uniformity analysis is a generalization of divergence analysis to
include irreducible control flow:

  1. The proposed spec presents a notion of "maximal convergence" that
     captures the existing convention of converging threads at the
     headers of natural loops.

  2. Maximal convergence is then extended to irreducible cycles. The
     identity of irreducible cycles is determined by the choices made
     in a depth-first traversal of the control flow graph. Uniformity
     analysis uses criteria that depend only on closed paths and not
     cycles, to determine maximal convergence. This makes it a
     conservative analysis that is independent of the effect of DFS on
     CycleInfo.

  3. The analysis is implemented as a template that can be
     instantiated for both LLVM IR and Machine IR.

Validation:
  - passes existing tests for divergence analysis
  - passes new tests with irreducible control flow
  - passes equivalent tests in MIR and GMIR

Based on concepts originally outlined by
Nicolai Haehnle <nicolai.haehnle@amd.com>

With contributions from Ruiling Song <ruiling.song@amd.com> and
Jay Foad <jay.foad@amd.com>.

Support for GMIR and lit tests for GMIR/MIR added by
Yashwant Singh <yashwant.singh@amd.com>.

Differential Revision: https://reviews.llvm.org/D130746

2022-12-20 07:22:24 +05:30

AsmPrinter

[DebugInfo] Add function to test debug values for equivalence

2022-12-19 17:14:25 +00:00

GlobalISel

[Support] llvm::Optional => std::optional

2022-12-16 08:49:10 +00:00

LiveDebugValues

[DebugInfo] Add function to test debug values for equivalence

2022-12-19 17:14:25 +00:00

MIRParser

[CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:41:36 +00:00

SelectionDAG

[SDAG] neg x with only low bit demanded is x

2022-12-19 15:25:43 -08:00

AggressiveAntiDepBreaker.cpp

…

AggressiveAntiDepBreaker.h

…

AllocationOrder.cpp

Cleanup includes: final pass

2022-03-29 09:00:21 +02:00

AllocationOrder.h

…

Analysis.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

AssignmentTrackingAnalysis.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

AtomicExpandPass.cpp

[CodeGen] Use poison instead of undef as placeholder in AtomicExpandPass [NFC]

2022-11-24 08:42:28 +00:00

BasicBlockSections.cpp

Revert "[Propeller] Use Fixed MBB ID instead of volatile MachineBasicBlock::Number."

2022-12-13 11:13:57 -08:00

BasicBlockSectionsProfileReader.cpp

Revert "[Propeller] Use Fixed MBB ID instead of volatile MachineBasicBlock::Number."

2022-12-13 11:13:57 -08:00

BasicTargetTransformInfo.cpp

…

BranchFolding.cpp

Revert "[Windows SEH]: HARDWARE EXCEPTION HANDLING (MSVC -EHa) - Part 2"

2022-12-02 02:44:18 -08:00

BranchFolding.h

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

BranchRelaxation.cpp

[CodeGen] Fix restore blocks' BasicBlock information in branch relaxation

2022-12-02 02:42:22 +08:00

BreakFalseDeps.cpp

[NFC] Add checks for potential null returns

2022-12-13 22:30:31 +08:00

CalcSpillWeights.cpp

RegAlloc: Use SmallSet instead of std::set

2022-09-12 07:55:10 -04:00

CallingConvLower.cpp

Use CTAD on llvm::SaveAndRestore

2022-12-02 15:36:12 -08:00

CFGuardLongjmp.cpp

…

CFIFixup.cpp

[CodeGen] Async unwind - add a pass to fix CFI information

2022-04-11 13:27:26 +01:00

CFIInstrInserter.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

CMakeLists.txt

RFC: Uniformity Analysis for Irreducible Control Flow

2022-12-20 07:22:24 +05:30

CodeGen.cpp

RFC: Uniformity Analysis for Irreducible Control Flow

2022-12-20 07:22:24 +05:30

CodeGenCommonISel.cpp

[GlobalISel][DebugInfo] salvageDebugInfo analogue for gMIR

2022-08-01 11:14:53 +02:00

CodeGenPassBuilder.cpp

…

CodeGenPrepare.cpp

Correct typos (NFC)

2022-12-16 10:51:26 -08:00

CommandFlags.cpp

Iterate over StringMaps using structured bindings. NFCI.

2022-12-04 18:36:41 +01:00

ComplexDeinterleavingPass.cpp

Ensure newlines at the end of files (NFC)

2022-12-16 23:36:51 -08:00

CriticalAntiDepBreaker.cpp

…

CriticalAntiDepBreaker.h

…

DeadMachineInstructionElim.cpp

DeadMachineInstructionElim: Don't repeat per-function init

2022-09-13 08:19:54 -04:00

DetectDeadLanes.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

DFAPacketizer.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

DwarfEHPrepare.cpp

[DwarfEhPrepare] Assign dummy debug location for inserted _Unwind_Resume calls (PR57469)

2022-09-01 16:35:49 +02:00

EarlyIfConversion.cpp

[EarlyIfConversion] Add target hook to allow for multiple ifcvt iterations.

2022-12-14 13:36:20 -08:00

EdgeBundles.cpp

…

EHContGuardCatchret.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

ExecutionDomainFix.cpp

…

ExpandLargeDivRem.cpp

Move TargetTransformInfo::maxLegalDivRemBitWidth -> TargetLowering::maxSupportedDivRemBitWidth

2022-09-12 17:06:16 +01:00

ExpandLargeFpConvert.cpp

[X86] Add ExpandLargeFpConvert Pass and enable for X86

2022-12-01 13:47:43 +08:00

ExpandMemCmp.cpp

[CodeGen] Use std::optional in ExpandMemCmp.cpp (NFC)

2022-11-26 14:29:56 -08:00

ExpandPostRAPseudos.cpp

[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments

2022-03-16 20:25:42 +08:00

ExpandReductions.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

ExpandVectorPredication.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

FaultMaps.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

FEntryInserter.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

FinalizeISel.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

FixupStatepointCallerSaved.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

FuncletLayout.cpp

…

GCMetadata.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

GCMetadataPrinter.cpp

…

GCRootLowering.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

GlobalMerge.cpp

[llvm] Use range-based for loops (NFC)

2022-09-03 11:17:40 -07:00

HardwareLoops.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

IfConversion.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

ImplicitNullChecks.cpp

Don't include Optional.h

2022-12-14 21:16:22 -08:00

IndirectBrExpandPass.cpp

[CodeGen] Use std::optional in IndirectBrExpandPass.cpp (NFC)

2022-11-26 14:50:12 -08:00

InlineSpiller.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

InterferenceCache.cpp

…

InterferenceCache.h

…

InterleavedAccessPass.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

InterleavedLoadCombinePass.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

IntrinsicLowering.cpp

[Intrinsic] Rename flt.rounds intrinsic to get.rounding

2022-12-19 15:22:39 +08:00

JMCInstrumenter.cpp

[JMCInstrument] rename ELF section name from ".just.my.code" to ".data.just.my.code"

2022-10-19 10:49:54 -07:00

LatencyPriorityQueue.cpp

…

LazyMachineBlockFrequencyInfo.cpp

[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)

2022-03-20 23:11:06 -07:00

LexicalScopes.cpp

…

LiveDebugVariables.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

LiveDebugVariables.h

…

LiveInterval.cpp

[LiveInterval] Simplify with partition_point. NFC

2022-06-27 19:25:26 -07:00

LiveIntervalCalc.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

LiveIntervals.cpp

[CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit

2022-12-17 11:54:33 +05:30

LiveIntervalUnion.cpp

…

LivePhysRegs.cpp

…

LiveRangeCalc.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

LiveRangeEdit.cpp

[CodeGen] Use cloneVirtualRegister in LiveIntervals and LiveRangeEdit

2022-12-17 11:54:33 +05:30

LiveRangeShrink.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

LiveRangeUtils.h

…

LiveRegMatrix.cpp

…

LiveRegUnits.cpp

LiveRegUnits: Break register loop when a clobber is encountered

2022-09-13 10:15:08 -04:00

LiveStacks.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

LiveVariables.cpp

[CodeGen] Use range-based for loops (NFC)

2022-07-23 16:10:46 -07:00

LLVMTargetMachine.cpp

[MC][re-land] Omit DWARF unwind info if compact unwind is present where eligible

2022-06-12 17:24:19 -04:00

LocalStackSlotAllocation.cpp

[iwyu] Handle regressions in libLLVM header include

2022-05-04 08:32:38 +02:00

LoopTraversal.cpp

…

LowerEmuTLS.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

LowLevelType.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineBasicBlock.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

MachineBlockFrequencyInfo.cpp

Don't include None.h (NFC)

2022-12-10 11:24:26 -08:00

MachineBlockPlacement.cpp

[NFC][BlockPlacement]Add an option to renumber blocks based on function layout order.

2022-11-07 07:52:45 -08:00

MachineBranchProbabilityInfo.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineCFGPrinter.cpp

-dot-machine-cfg for printing MachineFunction to a dot file

2022-09-22 12:48:33 +05:30

MachineCheckDebugify.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineCombiner.cpp

[MachineCombiner][RISCV] Add fmadd/fmsub/fnmsub instructions patterns

2022-11-17 13:24:04 +03:00

MachineCopyPropagation.cpp

[Target] llvm::Optional => std::optional

2022-12-04 22:43:14 +00:00

MachineCSE.cpp

[MachineCSE] Allow CSE for instructions with ignorable operands

2022-11-14 19:34:59 +00:00

MachineCycleAnalysis.cpp

RFC: Uniformity Analysis for Irreducible Control Flow

2022-12-20 07:22:24 +05:30

MachineDebugify.cpp

[Debugify] Accumulate the number of variables in debugify metadata

2022-11-25 10:53:55 +03:00

MachineDominanceFrontier.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineDominators.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineFrameInfo.cpp

[MachineFrameInfo][RISCV] Call ensureStackAlignment for objects created with scalable vector stack id.

2022-10-20 14:05:46 -07:00

MachineFunction.cpp

Revert "[Propeller] Use Fixed MBB ID instead of volatile MachineBasicBlock::Number."

2022-12-13 11:13:57 -08:00

MachineFunctionPass.cpp

[NFC][MachineFunctionPass] Only lookup pass name if we request printing

2022-09-07 21:38:00 -07:00

MachineFunctionPrinterPass.cpp

…

MachineFunctionSplitter.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

MachineInstr.cpp

[DebugInfo] Add function to test debug values for equivalence

2022-12-19 17:14:25 +00:00

MachineInstrBundle.cpp

[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)

2022-03-20 23:11:06 -07:00

MachineLateInstrsCleanup.cpp

Reapply "[CodeGen] Add new pass for late cleanup of redundant definitions."

2022-12-05 12:53:50 -06:00

MachineLICM.cpp

CodeGen: Remove AliasAnalysis from regalloc

2022-07-18 17:23:41 -04:00

MachineLoopInfo.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineLoopUtils.cpp

Fix an unused-variable warning in release build, NFC.

2022-06-19 20:52:00 +02:00

MachineModuleInfo.cpp

[MC][re-land] Omit DWARF unwind info if compact unwind is present where eligible

2022-06-12 17:24:19 -04:00

MachineModuleInfoImpls.cpp

…

MachineModuleSlotTracker.cpp

…

MachineOperand.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

MachineOptimizationRemarkEmitter.cpp

[YAML] Convert Optional to std::optional

2022-12-06 12:49:32 -08:00

MachineOutliner.cpp

[CodeGen] Use std::nullopt instead of None (NFC)

2022-12-02 20:36:08 -08:00

MachinePassManager.cpp

Revert "[llvm] Replace llvm::Any with std::any"

2022-12-08 12:07:30 +01:00

MachinePipeliner.cpp

[SWP] Recognize mem carried dep with different base

2022-11-07 09:53:41 +00:00

MachinePostDominators.cpp

…

MachineRegionInfo.cpp

…

MachineRegisterInfo.cpp

[CodeGen] Use delegate to notify targets when virtual registers are created

2022-12-17 11:53:34 +05:30

MachineScheduler.cpp

[CodeGen] Fixed undeclared MISchedCutoff in case of NDEBUG and LLVM_ENABLE_ABI_BREAKING_CHECKS

2022-07-30 18:24:50 +02:00

MachineSink.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

MachineSizeOpts.cpp

…

MachineSSAContext.cpp

RFC: Uniformity Analysis for Irreducible Control Flow

2022-12-20 07:22:24 +05:30

MachineSSAUpdater.cpp

Revert "[MachineSSAUpdater] compile time improvement in GetValueInMiddleOfBlock"

2022-06-14 20:27:21 +07:00

MachineStableHash.cpp

Address feedback in https://reviews.llvm.org/D133637

2022-09-13 16:12:41 -07:00

MachineStripDebug.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MachineTraceMetrics.cpp

[Analysis] llvm::Optional => std::optional

2022-12-14 07:32:24 +00:00

MachineUniformityAnalysis.cpp

RFC: Uniformity Analysis for Irreducible Control Flow

2022-12-20 07:22:24 +05:30

MachineVerifier.cpp

[GlobalISel] Add a new G_INVOKE_REGION_START instruction to fix an EH bug.

2022-12-07 10:28:51 -08:00

MacroFusion.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MBFIWrapper.cpp

[YAML] Convert Optional to std::optional

2022-12-06 12:49:32 -08:00

MIRCanonicalizerPass.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

MIRFSDiscriminator.cpp

Fix warnings about variables that are set but only used in debug mode

2022-04-06 10:01:46 +03:00

MIRNamerPass.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MIRPrinter.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

MIRPrintingPass.cpp

…

MIRSampleProfile.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

MIRVRegNamerUtils.cpp

[MIRVRegNamer] Avoid opcode hash collision

2022-11-02 13:53:12 +00:00

MIRVRegNamerUtils.h

…

MIRYamlMapping.cpp

…

MLRegallocEvictAdvisor.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

MLRegallocEvictAdvisor.h

[MLGO] Add per-instruction MBB frequencies to regalloc dev features

2022-09-28 18:45:04 +00:00

MLRegallocPriorityAdvisor.cpp

[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API in C++ code (NFC)

2022-12-12 11:28:40 -08:00

ModuloSchedule.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

MultiHazardRecognizer.cpp

…

NonRelocatableStringpool.cpp

[Debuginfo][DWARF][NFC] Refactor DwarfStringPoolEntryRef - remove isIndexed().

2022-06-05 21:18:31 +03:00

OptimizePHIs.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

ParallelCG.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

PatchableFunction.cpp

[CodeGen][X86] Crash fixes for "patchable-function" pass

2022-11-30 07:29:54 -05:00

PeepholeOptimizer.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

PHIElimination.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

PHIEliminationUtils.cpp

…

PHIEliminationUtils.h

…

PostRAHazardRecognizer.cpp

[CodeGen] Apply clang-tidy fixes for readability-redundant-smartptr-get (NFC)

2022-03-20 23:11:06 -07:00

PostRASchedulerList.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

PreISelIntrinsicLowering.cpp

[WinEH] Apply funclet operand bundles to nounwind intrinsics that lower to function calls in the course of IR transforms

2022-07-26 17:52:43 +02:00

ProcessImplicitDefs.cpp

[llvm] Remove redundaunt virtual specifiers (NFC)

2022-07-24 21:50:35 -07:00

PrologEpilogInserter.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

PseudoProbeInserter.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

PseudoSourceValue.cpp

CodeGen: Move getAddressSpaceForPseudoSourceKind into TargetMachine

2022-06-01 09:45:40 -04:00

RDFGraph.cpp

Recommit [RDF] Remove explicit template arguments from Print

2022-08-08 07:28:45 -07:00

RDFLiveness.cpp

Recommit [RDF] Remove explicit template arguments from Print

2022-08-08 07:28:45 -07:00

RDFRegisters.cpp

…

ReachingDefAnalysis.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

README.txt

…

RegAllocBase.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

RegAllocBase.h

…

RegAllocBasic.cpp

CodeGen: Remove AliasAnalysis from regalloc

2022-07-18 17:23:41 -04:00

RegAllocEvictionAdvisor.cpp

[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API in C++ code (NFC)

2022-12-12 11:28:40 -08:00

RegAllocEvictionAdvisor.h

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

RegAllocFast.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

RegAllocGreedy.cpp

[CodeGen] llvm::Optional => std::optional

2022-12-13 09:06:36 +00:00

RegAllocGreedy.h

[ADT] Alias llvm::Optional to std::optional

2022-12-20 01:01:46 +01:00

RegAllocPBQP.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

RegAllocPriorityAdvisor.cpp

[mlgo] Use LLVM_HAVE_TFLITE instead of LLVM_HAVE_TF_API in C++ code (NFC)

2022-12-12 11:28:40 -08:00

RegAllocPriorityAdvisor.h

[nfc][mlgo] Lazily compute the regalloc reward

2022-09-26 15:34:29 -07:00

RegAllocScore.cpp

[llvm] Don't include STLForwardCompat.h (NFC)

2022-12-06 20:09:56 -08:00

RegAllocScore.h

CodeGen: Remove AliasAnalysis from regalloc

2022-07-18 17:23:41 -04:00

RegisterBank.cpp

[nfc][codegen] Move RegisterBank[Info].h under CodeGen

2022-03-01 21:53:25 -08:00

RegisterBankInfo.cpp

[globalisel] Select register bank for DBG_VALUE

2022-08-09 13:11:51 +08:00

RegisterClassInfo.cpp

Fix CSR update check

2022-08-24 18:09:49 -07:00

RegisterCoalescer.cpp

[RegisterCoalescer] fix dst subreg replacement during remat copy trick

2022-09-23 18:52:29 +00:00

RegisterCoalescer.h

…

RegisterPressure.cpp

[CodeGen] Qualify auto variables in for loops (NFC)

2022-07-17 01:33:28 -07:00

RegisterScavenging.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

RegisterUsageInfo.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

RegUsageInfoCollector.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

RegUsageInfoPropagate.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

RemoveRedundantDebugValues.cpp

[CodeGen] Use std::nullopt instead of None (NFC)

2022-12-02 20:36:08 -08:00

RenameIndependentSubregs.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

ReplaceWithVeclib.cpp

[LV][SLP] Mark fptosi_sat as vectorizable

2022-05-03 09:32:34 +01:00

ResetMachineFunctionPass.cpp

…

SafeStack.cpp

[CodeGen] Use std::optional in SafeStack.cpp (NFC)

2022-11-26 14:57:44 -08:00

SafeStackLayout.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

SafeStackLayout.h

[NFC][Alignment] Use Align in SafeStack

2022-06-14 10:56:36 +00:00

SanitizerBinaryMetadata.cpp

Use-after-return sanitizer binary metadata

2022-12-05 14:40:31 +01:00

ScheduleDAG.cpp

…

ScheduleDAGInstrs.cpp

[NFC][ScheduleDAGInstrs] Use structure bindings and emplace_back

2022-09-13 12:49:04 +03:00

ScheduleDAGPrinter.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

ScoreboardHazardRecognizer.cpp

[llvm] LLVM_FALLTHROUGH => [[fallthrough]]. NFC

2022-08-08 11:24:15 -07:00

SelectOptimize.cpp

[Transforms,CodeGen] std::optional::value => operator*/operator->

2022-12-16 23:21:27 +00:00

ShadowStackGCLowering.cpp

[CodeGen] Use std::optional in ShadowStackGCLowering.cpp (NFC)

2022-11-26 15:09:25 -08:00

ShrinkWrap.cpp

…

SjLjEHPrepare.cpp

Use poison instead of undef where its used as a placeholder [NFC]

2022-12-11 17:18:00 +00:00

SlotIndexes.cpp

[LiveIntervals] Find better anchoring end points when repairing ranges

2022-07-18 19:34:43 +01:00

SpillPlacement.cpp

…

SpillPlacement.h

…

SplitKit.cpp

Don't include None.h (NFC)

2022-12-10 11:24:26 -08:00

SplitKit.h

Correct typos (NFC)

2022-12-16 10:51:26 -08:00

StackColoring.cpp

[StackColoring] Don't merge slots with differing StackIDs

2022-05-17 08:28:49 +01:00

StackMapLivenessAnalysis.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

StackMaps.cpp

[Alignment][NFC] Use Align in MCStreamer::emitValueToAlignment

2022-11-24 16:09:44 +00:00

StackProtector.cpp

Attributes: Add function getter to parse integer string attributes

2022-12-14 13:12:35 -05:00

StackSlotColoring.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

SwiftErrorValueTracking.cpp

Use any_of (NFC)

2022-07-30 10:35:56 -07:00

SwitchLoweringUtils.cpp

…

TailDuplication.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

TailDuplicator.cpp

[llvm][TailDuplicator] don't taildup isInlineAsmBrIndirectTargets

2022-08-31 13:07:10 -07:00

TargetFrameLoweringImpl.cpp

[iwyu] Handle regressions in libLLVM header include

2022-04-13 20:53:19 +02:00

TargetInstrInfo.cpp

[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot

2022-12-17 11:55:34 +05:30

TargetLoweringBase.cpp

[X86] Add ExpandLargeFpConvert Pass and enable for X86

2022-12-01 13:47:43 +08:00

TargetLoweringObjectFileImpl.cpp

[IR] llvm::Optional => std::optional

2022-12-05 04:13:11 +00:00

TargetOptionsImpl.cpp

Cleanup codegen includes

2022-03-16 08:43:00 +01:00

TargetPassConfig.cpp

[AA] Remove CFL AA passes

2022-12-12 09:34:20 +01:00

TargetRegisterInfo.cpp

…

TargetSchedule.cpp

[CodeGen] Use std::lcm (NFC)

2022-09-03 11:17:33 -07:00

TargetSubtargetInfo.cpp

[regalloc] Remove -consider-local-interval-cost

2022-03-14 10:49:16 -07:00

TwoAddressInstructionPass.cpp

[TwoAddressInstruction] Fix stale LiveVariables info in processStatepoint

2022-10-21 14:57:03 +01:00

TypePromotion.cpp

[TypePromotion] Replace Zext to Truncate for the case src bitwidth is larger

2022-11-09 05:08:01 +08:00

UnreachableBlockElim.cpp

[NFC][CodeGen] Rename some functions in MachineInstr.h and remove duplicated comments

2022-03-16 20:25:42 +08:00

ValueTypes.cpp

Add new vector types for LLVM

2022-11-29 17:02:04 +01:00

VirtRegMap.cpp

[NFC] Use Register instead of unsigned for variables that receive a Register object

2022-12-07 00:23:34 +00:00

VLIWMachineScheduler.cpp

[llvm] Fix comment typos (NFC)

2022-08-07 00:16:14 -07:00

WasmEHPrepare.cpp

[NFC] Cleanup: Replaces BB->getInstList().erase() with BB->erase().

2022-12-01 18:19:23 -08:00

WinEHPrepare.cpp

[NFC] Rename Instruction::insertAt() to Instruction::insertInto(), to be consistent with BasicBlock::insertInto()

2022-12-15 12:27:45 -08:00

XRayInstrumentation.cpp

Attributes: Add function getter to parse integer string attributes

2022-12-14 13:12:35 -05:00

README.txt

//===---------------------------------------------------------------------===//

Common register allocation / spilling problem:

        mul lr, r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        ldr r4, [sp, #+52]
        mla r4, r3, lr, r4

can be:

        mul lr, r4, lr
        mov r4, lr
        str lr, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

and then "merge" mul and mov:

        mul r4, r4, lr
        str r4, [sp, #+52]
        ldr lr, [r1, #+32]
        sxth r3, r3
        mla r4, r3, lr, r4

It also increases the likelihood that the store may become dead.

//===---------------------------------------------------------------------===//

bb27 ...
        ...
        %reg1037 = ADDri %reg1039, 1
        %reg1038 = ADDrs %reg1032, %reg1039, %noreg, 10
    Successors according to CFG: 0x8b03bf0 (#5)

bb76 (0x8b03bf0, LLVM BB @0x8b032d0, ID#5):
    Predecessors according to CFG: 0x8b0c5f0 (#3) 0x8b0a7c0 (#4)
        %reg1039 = PHI %reg1070, mbb<bb76.outer,0x8b0c5f0>, %reg1037, mbb<bb27,0x8b0a7c0>

Note ADDri is not a two-address instruction. However, its result %reg1037 is an
operand of the PHI node in bb76 and its operand %reg1039 is the result of the
PHI node. We should treat it as a two-address code and make sure the ADDri is
scheduled after any node that reads %reg1039.

//===---------------------------------------------------------------------===//

Use local info (i.e. register scavenger) to assign it a free register to allow
reuse:
        ldr r3, [sp, #+4]
        add r3, r3, #3
        ldr r2, [sp, #+8]
        add r2, r2, #2
        ldr r1, [sp, #+4]  <==
        add r1, r1, #1
        ldr r0, [sp, #+4]
        add r0, r0, #2

//===---------------------------------------------------------------------===//

LLVM aggressively lifts CSE out of loops. Sometimes this can have negative
side effects:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
load [i + R1]
...
load [i + R2]
...
load [i + R3]

Suppose there is high register pressure; R1, R2, and R3 can be spilled. We need
to implement proper re-materialization to handle this:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
R1 = X + 4  @ re-materialized
load [i + R1]
...
R2 = X + 7 @ re-materialized
load [i + R2]
...
R3 = X + 15 @ re-materialized
load [i + R3]

Furthermore, with re-association, we can enable sharing:

R1 = X + 4
R2 = X + 7
R3 = X + 15

loop:
T = i + X
load [T + 4]
...
load [T + 7]
...
load [T + 15]

//===---------------------------------------------------------------------===//

It's not always a good idea to choose rematerialization over spilling. If all
the load / store instructions would be folded then spilling is cheaper because
it won't require new live intervals / registers. See 2003-05-31-LongShifts for
an example.

//===---------------------------------------------------------------------===//

With a copying garbage collector, derived pointers must not be retained across
collector safe points; the collector could move the objects and invalidate the
derived pointer. This is bad enough in the first place, but safe points can
crop up unpredictably. Consider:

        %array = load { i32, [0 x %obj] }** %array_addr
        %nth_el = getelementptr { i32, [0 x %obj] }* %array, i32 0, i32 %n
        %old = load %obj** %nth_el
        %z = div i64 %x, %y
        store %obj* %new, %obj** %nth_el

If the i64 division is lowered to a libcall, then a safe point will (must)
appear for the call site. If a collection occurs, %array and %nth_el no longer
point into the correct object.

The fix for this is to copy address calculations so that dependent pointers
are never live across safe point boundaries. But the loads cannot be copied
like this if there was an intervening store, so this may be hard to get right.

Only a concurrent mutator can trigger a collection at the libcall safe point.
So single-threaded programs do not have this requirement, even with a copying
collector. Still, LLVM optimizations would probably undo a front-end's careful
work.

//===---------------------------------------------------------------------===//

The ocaml frametable structure supports liveness information. It would be good
to support it.

//===---------------------------------------------------------------------===//

The FIXME in ComputeCommonTailLength in BranchFolding.cpp needs to be
revisited. The check is there to work around a misuse of directives in inline
assembly.

//===---------------------------------------------------------------------===//

It would be good to detect collector/target compatibility instead of silently
doing the wrong thing.

//===---------------------------------------------------------------------===//

It would be really nice to be able to write patterns in .td files for copies,
which would eliminate a bunch of explicit predicates on them (e.g. no side
effects).  Once this is in place, it would be even better to have tblgen
synthesize the various copy insertion/inspection methods in TargetInstrInfo.

//===---------------------------------------------------------------------===//

Stack coloring improvements:

1. Do proper LiveStacks analysis on all stack objects including those which are
   not spill slots.
2. Reorder objects to fill in gaps between objects.
   e.g. 4, 1, <gap>, 4, 1, 1, 1, <gap>, 4 => 4, 1, 1, 1, 1, 4, 4
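
A minimal sketch of item 2, not the StackColoring implementation. It assumes
each object's alignment equals its size and that sizes are powers of two; the
names layoutSize and reorderBySize are hypothetical. Under that assumption,
placing the largest objects first leaves no interior gaps. This is a different
order from the one shown above, but it reaches the same total size.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical model: a stack object is described only by its size in bytes,
// and its alignment is assumed to equal its size (a power of two).
// Returns the number of bytes used when objects are laid out in the given order.
inline uint64_t layoutSize(const std::vector<uint64_t> &Sizes) {
  uint64_t Offset = 0;
  for (uint64_t Size : Sizes) {
    assert(Size != 0 && (Size & (Size - 1)) == 0 && "size must be a power of two");
    Offset = (Offset + Size - 1) & ~(Size - 1); // round up to the alignment
    Offset += Size;
  }
  return Offset;
}

// One simple gap-free ordering under the assumption above: largest first.
inline std::vector<uint64_t> reorderBySize(std::vector<uint64_t> Sizes) {
  std::stable_sort(Sizes.begin(), Sizes.end(), std::greater<uint64_t>());
  return Sizes;
}
```

With this model, the order 4, 1, 4, 1, 1, 1, 4 occupies 20 bytes, while both
4, 1, 1, 1, 1, 4, 4 and the largest-first order occupy 16 bytes.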

//===---------------------------------------------------------------------===//

The scheduler should be able to sort nearby instructions by their address. For
example, in an expanded memset sequence it's not uncommon to see code like this:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

Each of the stores is independent, and the scheduler is currently making an
arbitrary decision about the order.
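
A rough sketch of the ordering the scheduler could apply, assuming the stores
in a run have already been proven independent. The Store struct and the
function names are hypothetical stand-ins, not LLVM data structures or APIs.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a store instruction: base register id, byte
// offset from that base, and the immediate being stored.
struct Store {
  unsigned BaseReg;
  int64_t Offset;
  int64_t Imm;
};

// Orders stores by base register first, then by ascending offset.
inline bool addressLess(const Store &A, const Store &B) {
  if (A.BaseReg != B.BaseReg)
    return A.BaseReg < B.BaseReg;
  return A.Offset < B.Offset;
}

// Sorts a run of mutually independent stores by address. The caller is
// assumed to have checked that no other instruction depends on their order.
inline void sortIndependentStores(std::vector<Store> &Run) {
  std::stable_sort(Run.begin(), Run.end(), addressLess);
  assert(std::is_sorted(Run.begin(), Run.end(), addressLess));
}
```

Applied to the four stores above, this yields offsets 0, 4, 8, 12 off %rdi.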

//===---------------------------------------------------------------------===//

Another opportunity in this code is that the $0 could be moved to a register:

  movl $0, 4(%rdi)
  movl $0, 8(%rdi)
  movl $0, 12(%rdi)
  movl $0, 0(%rdi)

This would save substantial code size, especially for longer sequences like
this. It would be easy to have a rule telling isel to avoid matching MOV32mi
if the immediate has more than some fixed number of uses. It's more involved
to teach the register allocator how to do late folding to recover from
excessive register pressure.
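
A back-of-the-envelope sketch of the code size argument and of the use-count
rule. The byte counts are assumed x86 encoding sizes for 32-bit stores with an
8-bit displacement, and the zero is assumed to be materialized with an xorl.
MaxImmUses is a hypothetical threshold, and none of these names exist in LLVM.
The model ignores the cost of tying up a register.

```cpp
#include <cassert>
#include <cstdint>

// Assumed encoding sizes (illustrative inputs, not values queried from LLVM).
constexpr unsigned StoreImmBytes = 7; // movl $imm32, disp8(%reg)
constexpr unsigned StoreRegBytes = 3; // movl %reg32, disp8(%reg)
constexpr unsigned ZeroRegBytes = 2;  // xorl %reg32, %reg32

// Bytes saved by zeroing a register once and storing from it NumStores times,
// compared with NumStores store-immediate instructions.
inline int64_t bytesSaved(unsigned NumStores) {
  assert(NumStores > 0 && "expected at least one store in the sequence");
  int64_t ImmForm = int64_t(StoreImmBytes) * NumStores;
  int64_t RegForm = ZeroRegBytes + int64_t(StoreRegBytes) * NumStores;
  return ImmForm - RegForm;
}

// Hypothetical isel rule: avoid the store-immediate form once the immediate
// has more than MaxImmUses uses.
inline bool preferRegister(unsigned NumUses, unsigned MaxImmUses) {
  return NumUses > MaxImmUses;
}
```

Under these assumptions the four-store sequence above would shrink by 14
bytes. The threshold itself would need tuning against register pressure.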