clang-p2996/llvm/test/CodeGen/AMDGPU/llvm.amdgcn.s.buffer.load.ll at d661b4b57504b965a37dc30b79bdd5ac36fca9ec

Files

Jay Foad eb7491769a [AMDGPU] Reimplement the GFX11 early release VGPRs optimization

Implement this optimization in SIInsertWaitcnts, where we already have
information about whether there might be outstanding VMEM store
instructions. This has the following advantages:
- Correctly handles atomics-with-return.
- Correctly handles call instructions.
- Should be faster because it does not require running a separate pass.

Differential Revision: https://reviews.llvm.org/D153279

2023-06-19 17:12:54 +01:00

48 KiB

Raw Blame History

View Raw

48 KiB Raw Blame History

48 KiB

Raw Blame History