clang-p2996/llvm/lib/Target/AMDGPU/AMDGPULateCodeGenPrepare.cpp at 3ccacc4e44afa66f20dd6430bc7ff966cc670708

Files

choikwa 45759fe5b4 [AMDGPU] Filter candidates of LiveRegOptimizer for profitable cases (#124624 )

It is known that for vector whose element fits in i16 will be split and
scalarized in SelectionDag's type legalizer
(see SIISelLowering::getPreferredVectorAction).

LRO attempts to undo the scalarizing of vectors across basic block
boundary and shoehorn Values in VGPRs. LRO is beneficial for operations
that natively work on illegal vector types to prevent flip-flopping
between unpacked and packed. If we know that operations on vector will
be split and scalarized, then we don't want to shoehorn them back to
packed VGPR.

Operations that we know to work natively on illegal vector types usually
come in the form of intrinsics (MFMA, DOT8), buffer store, shuffle, phi
nodes to name a few.

2025-03-05 18:44:48 -05:00

24 KiB

Raw Blame History

View Raw

24 KiB Raw Blame History

24 KiB

Raw Blame History