This revision adds a new transformation that maps a copy operation to a GPU grid of threads. It implements a first heuristic that trades off coalesced accesses against predication and occupancy.

Differential Revision: https://reviews.llvm.org/D154836
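
To make the trade-off named in the summary concrete, here is a minimal sketch of how such a heuristic could be structured for a 2-D copy. It is not the code from this revision: `CopyMapping`, `mapCopyToThreads`, and all parameters are hypothetical names chosen for illustration, and the selection rules are assumptions. The sketch picks the widest vector access that divides the contiguous dimension (favoring coalescing), spreads threads along that dimension, and reports whether some threads would have to be masked off (predication).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical result type; not the API introduced by the revision.
struct CopyMapping {
  int64_t vectorElems;   // elements moved per thread per access
  int64_t threadsX;      // threads along the contiguous (innermost) dimension
  int64_t threadsY;      // threads along the outer dimension
  bool needsPredication; // true if some threads must be masked off
};

// Assumed heuristic: prefer wide, evenly dividing accesses on the contiguous
// dimension, then use every available thread and flag uneven splits.
CopyMapping mapCopyToThreads(int64_t numRows, int64_t rowElems,
                             int64_t totalThreads, int64_t maxVectorElems) {
  assert(numRows > 0 && rowElems > 0 && totalThreads > 0 &&
         maxVectorElems > 0);

  // Halve the vector width until it divides the row, so no access straddles
  // the end of a row.
  int64_t vec = maxVectorElems;
  while (vec > 1 && rowElems % vec != 0)
    vec /= 2;

  // Number of vector-sized chunks per row.
  int64_t chunks = rowElems / vec;

  // Largest divisor of totalThreads that does not exceed the chunk count.
  int64_t threadsX = 1;
  for (int64_t d = 1; d <= totalThreads && d <= chunks; ++d)
    if (totalThreads % d == 0)
      threadsX = d;
  int64_t threadsY = totalThreads / threadsX;

  // Predication is needed when either dimension is not an exact multiple of
  // the number of threads assigned to it.
  bool pred = (chunks % threadsX != 0) || (numRows % threadsY != 0);
  return {vec, threadsX, threadsY, pred};
}
```

For example, under these assumptions a 16x64 copy with 128 threads and a maximum vector width of 4 maps to a 16x8 thread arrangement with no predication, whereas a 4x6 copy with 32 threads falls back to a vector width of 2 and requires predication. A production heuristic would also need to weigh occupancy, for instance by preferring to use fewer threads over masking most of them.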