clang-p2996

Files

lorenzo chelini f381768a8d [MLIR][Linalg] introduce batch-reduce GEMM

The batch-reduce GEMM kernel essentially multiplies a sequence of input tensor
blocks (which form a batch) and the partial multiplication results are reduced
into a single output tensor block.

See: https://ieeexplore.ieee.org/document/9139809 for more details.

Reviewed By: nicolasvasilache

Differential Revision: https://reviews.llvm.org/D134163

2022-09-19 12:11:54 +02:00

mlir

[MLIR][Linalg] introduce batch-reduce GEMM

2022-09-19 12:11:54 +02:00

.style.yapf

…

CMakeLists.txt

[MLIR] Fix checks for native arch

2022-08-04 11:10:08 +02:00

requirements.txt

Upstream MLIR PyTACO implementation.

2022-01-21 08:38:36 -08:00