clang-p2996

Files

xiaoleis-nv d03f35f9b6 [MLIR][NVVM] Fix the datatype error for nvvm.mma.sync when the operand is bf16 (#122664 )

The PR fixes the datatype error for `nvvm.mma.sync` when the operand is
`bf16`. This operation originally requires the A/B type to be `f16x2`
for the `bf16` MMA. However, it violates the NVVM intrinsic
[[here](372044ee09/llvm/include/llvm/IR/IntrinsicsNVVM.td (L119))],
where the A/B operand type should be `i32`. This is a bug, and there are
no tests in MLIR that cover this datatype.

```
    // mma bf16 -> s32 @ m16n8k16/m16n8k8
    !eq(gft,"m16n8k16:a:bf16") : !listsplat(llvm_i32_ty, 4),
    !eq(gft,"m16n8k16:b:bf16") : !listsplat(llvm_i32_ty, 2),
    !eq(gft,"m16n8k8:a:bf16") : !listsplat(llvm_i32_ty, 2),
    !eq(gft,"m16n8k8:b:bf16") : [llvm_i32_ty],
```

This PR addresses this bug and adds tests to guarantee correctness.

Co-authored-by: Xiaolei Shi <xiaoleis@nvidia.com>

2025-01-13 15:03:05 +05:30

mlir

[MLIR][NVVM] Fix the datatype error for nvvm.mma.sync when the operand is bf16 (#122664 )

2025-01-13 15:03:05 +05:30

mlir-c

[MLIR][CAPI] export LLVMFunctionType param getter and setters (#121888 )

2025-01-07 02:39:44 -05:00