clang-p2996/mlir/lib/Conversion/AMDGPUToROCDL/AMDGPUToROCDL.cpp at e0ad34e56590fa2e6ffdf617e044de7eadee2139


Krzysztof Drewniak 6292ea6879 [mlir][AMDGPU] Remove an old bf16 workaround (#108409)

The AMDGPU backend now implements LLVM's `bfloat` type. Therefore, we no
longer need to type convert MLIR's `bf16` to `i16` during lowerings to
ROCDL.

As a result of this change, we discovered that, while the code for MFMA
and WMMA intrinsics was mainly prepared for this change, we were failing
to bitcast the bf16 results of WMMA operations out from the i16 they're
natively represented as. This commit also fixes that issue.
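
To illustrate why the missing bitcast matters, here is a minimal standalone sketch. It does not use MLIR or ROCDL APIs; `BF16`, `bitcastI16ToBF16`, and `bf16ToFloat` are hypothetical names introduced only for this example. It assumes bf16 is stored as the upper 16 bits of an IEEE-754 float32, so an i16 result has to be reinterpreted bit-for-bit rather than numerically converted.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical stand-in for a bf16 value: 16 raw bits that hold the
// upper half of an IEEE-754 float32 (not an MLIR or LLVM type).
struct BF16 {
  uint16_t bits;
};

// Reinterpret i16 storage as bf16 without changing any bits.
// This mirrors what a bitcast does; it is not a numeric conversion.
inline BF16 bitcastI16ToBF16(int16_t raw) {
  BF16 out;
  std::memcpy(&out.bits, &raw, sizeof(raw));
  return out;
}

// Widen bf16 to float by placing its bits in the upper 16 bits of a
// 32-bit word and zero-filling the lower 16 bits.
inline float bf16ToFloat(BF16 value) {
  uint32_t wide = static_cast<uint32_t>(value.bits) << 16;
  float result;
  std::memcpy(&result, &wide, sizeof(result));
  return result;
}
```

Under these assumptions, the i16 value -16384 (bit pattern 0xC000) reinterprets to the bf16 value -2.0, whereas treating the same result as an integer would yield -16384.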

---------

Co-authored-by: Jakub Kuderski <kubakuderski@gmail.com>

2024-09-12 17:45:39 -05:00

44 KiB
