clang-p2996/mlir/lib/Dialect/NVGPU/Transforms/Utils.cpp
Matthias Springer db393288ff [mlir][NVGPU][transform] Add create_async_groups transform op
This transform looks for suitable vector transfers from global memory to shared memory and converts them to async device copies.

Differential Revision: https://reviews.llvm.org/D155569
2023-07-18 14:36:41 +02:00

3.4 KiB