clang-p2996

Files

Matthias Springer 1e1a3112f1 [mlir][bufferization] Privatize buffers for parallel regions

One-Shot Bufferize correctly handles RaW conflicts around repetitive regions (loops). Specical handling is needed for parallel regions. These are a special kind of repetitive regions that can have additional RaW conflicts that would not be present if the regions would be executed sequentially.

Example:
```
%0 = bufferization.alloc_tensor()
scf.forall ... {
  %1 = linalg.fill ins(...) outs(%0)
  ...
  scf.forall.in_parallel {
    tensor.parallel_insert_slice %1 into ...
  }
}
```

A separate (private) buffer must be allocated for each iteration of the `scf.forall` loop.

This change adds a new interface method to `BufferizableOpInterface` to detect parallel regions. By default, regions are assumed to be sequential.

A buffer is privatized if an OpOperand bufferizes to a memory read inside a parallel region that is different from the parallel region where operand's value is defined.

Differential Revision: https://reviews.llvm.org/D159286

2023-09-06 14:28:43 +02:00

[mlir] Move FunctionInterfaces to Interfaces directory and inherit from CallableOpInterface

2023-08-31 11:28:23 +00:00

TransformOps

[SCF][Transform] Add transform.loop.fuse_sibling

2023-08-19 15:24:23 +05:30

Transforms

[mlir][bufferization] Privatize buffers for parallel regions

2023-09-06 14:28:43 +02:00

Utils

[SCF][Transform] Add transform.loop.fuse_sibling

2023-08-19 15:24:23 +05:30

CMakeLists.txt

…