clang-p2996/mlir/lib/Dialect/AMDGPU/IR/AMDGPUDialect.cpp at da69eb75cbc634a56886e94de3e546c63c17567e

Files

Krzysztof Drewniak 25622aa745 [mlir][AMDGPU] Add gfx950 MFMAs to the amdgpu.mfma op (#133553 )

This commit extends the lowering of amdgpu.mfma to handle the new
double-rate MFMAs in gfx950 and adds tests for these operations.

It also adds support for MFMAs on small floats (f6 and f4), which are
implented using the "scaled" MFMA intrinsic with a scale value of 0 in
order to have an unscaled MFMA.

This commit does not add a `amdgpu.scaled_mfma` operation, as that is
future work.

---------

Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>

2025-04-01 11:59:09 -05:00

16 KiB

Raw Blame History

View Raw

16 KiB Raw Blame History

16 KiB

Raw Blame History