Files
clang-p2996/mlir/lib/Dialect/Linalg/TransformOps/GPUHeuristics.cpp
Nicolas Vasilache 171a5a761d [mlir][Linalg] Add a greedy transform to map copies to threads efficiently.
This revision adds a new transformation to map a copy operation to a gpu grid of threads.
It implements a first heuristic that allows trading off coalesced accesses vs predication and occupancy.

Differential Revision: https://reviews.llvm.org/D154836
2023-07-10 16:11:04 +00:00

11 KiB