clang-p2996/mlir/test/python/dialects/nvvm.py at fd3907ccb583df99e9c19d2fe84e4e7c52d75de9

Files

Guray Ozen 12c241b365 [MLIR][NVVM] Explicit Data Type for Output in wgmma.mma_async (#78713 )

The current implementation of `nvvm.wgmma.mma_async` Op deduces the data
type of the output matrix from the data type of struct member, which can be
non-intuitive, especially in cases where types like `2xf16` are packed
into `i32`.

This PR addresses this issue by improving the Op to include an explicit
data type for the output matrix.

The modified Op now includes an explicit data type for Matrix-D (<f16>),
and looks as follows:

```
%result = llvm.mlir.undef : !llvm.struct<(struct<(i32, i32, ...
nvvm.wgmma.mma_async
    %descA, %descB, %result,
    #nvvm.shape<m = 64, n = 32, k = 16>,
    D [<f16>, #nvvm.wgmma_scale_out<zero>],
    A [<f16>, #nvvm.wgmma_scale_in<neg>, <col>],
    B [<f16>, #nvvm.wgmma_scale_in<neg>, <col>]
```

2024-01-22 08:37:20 +01:00

1.8 KiB

Raw Blame History

View Raw

1.8 KiB Raw Blame History

1.8 KiB

Raw Blame History