clang-p2996/llvm/test/CodeGen/AMDGPU/readcyclecounter.ll
Joseph Huber f956e7fbf1 [AMDGPU] Prefer s_memtime for readcyclecounter on GFX10 (#80211)
Summary:
The old `s_memtime` instruction was supported up through the GFX10
architecture. Although this instruction has a higher latency than the
new shader counter, it's much more usable as a processor clock as it is
a full 64-bit counter. The new shader counter is only a 20-bit counter,
which makes it difficult to use as a standard cycle counter as it will
overflow in a few milliseconds. This patch suggests preferring
`s_memtime` for this intrinsic if it is still available.
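
To put the overflow concern in numbers, here is a rough sketch of the wrap-around arithmetic. The 100 MHz tick rate is an assumption chosen only for illustration; the commit message does not state the rate at which either counter advances. The helper names `wrap_time_ms` and `elapsed_ticks` are hypothetical as well.

```python
# Rough wrap-around arithmetic for a free-running hardware counter.
# ASSUMED_CLOCK_HZ is an illustrative assumption, not a documented value.
ASSUMED_CLOCK_HZ = 100e6


def wrap_time_ms(counter_bits: int, clock_hz: float) -> float:
    """Milliseconds until a counter of the given width wraps to zero."""
    return (2 ** counter_bits) / clock_hz * 1e3


def elapsed_ticks(start: int, end: int, counter_bits: int) -> int:
    """Tick delta that tolerates a single wrap of a narrow counter.

    The result is only meaningful if fewer than 2**counter_bits ticks
    passed between the two reads; longer intervals alias silently.
    """
    return (end - start) & ((1 << counter_bits) - 1)


narrow_ms = wrap_time_ms(20, ASSUMED_CLOCK_HZ)
wide_years = wrap_time_ms(64, ASSUMED_CLOCK_HZ) / 1e3 / (365.25 * 24 * 3600)

print(f"20-bit counter wraps after about {narrow_ms:.2f} ms")
print(f"64-bit counter wraps after about {wide_years:.0f} years")
```

Under that assumed rate the 20-bit counter wraps after roughly ten milliseconds, so any interval longer than that cannot be measured from two reads alone, while a 64-bit counter is effectively wrap-free.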
2024-02-01 07:19:57 -06:00

4.0 KiB