With this PR I am trying to address: https://github.com/llvm/llvm-project/issues/63230. What changed: - While merging identical blocks, don't add a block argument if it is "identical" to another block argument. I.e., if the two block arguments refer to the same `Value`. The operations operands in the block will point to the argument we already inserted. This needs to happen to all the arguments we pass to the different successors of the parent block - After merged the blocks, get rid of "unnecessary" arguments. I.e., if all the predecessors pass the same block argument, there is no need to pass it as an argument. - This last simplification clashed with `BufferDeallocationSimplification`. The reason, I think, is that the two simplifications are clashing. I.e., `BufferDeallocationSimplification` contains an analysis based on the block structure. If we simplify the block structure (by merging and/or dropping block arguments) the analysis is invalid . The solution I found is to do a more prudent simplification when running that pass. **Note-1**: I ran all the integration tests (`-DMLIR_INCLUDE_INTEGRATION_TESTS=ON`) and they passed. **Note-2**: I fixed a bug found by @Dinistro in #97697 . The issue was that, when looking for redundant arguments, I was not considering that the block might have already some arguments. So the index (in the block args list) of the i-th `newArgument` is `i+numOfOldArguments`.
22 lines
871 B
MLIR
22 lines
871 B
MLIR
// RUN: mlir-opt %s -allow-unregistered-dialect -pass-pipeline="builtin.module(func.func(linalg-detensorize))" | FileCheck %s
|
|
|
|
#map = affine_map<() -> ()>
|
|
func.func @main(%arg0: tensor<f32>) -> tensor<f32> {
|
|
%0 = tensor.empty() : tensor<f32>
|
|
%1 = linalg.generic {indexing_maps = [#map, #map], iterator_types = []} ins(%arg0 : tensor<f32>) outs(%0 : tensor<f32>) {
|
|
^bb0(%in: f32, %out: f32):
|
|
linalg.yield %in : f32
|
|
} -> tensor<f32>
|
|
cf.br ^bb1(%1 : tensor<f32>)
|
|
^bb1(%2: tensor<f32>): // pred: ^bb0
|
|
return %2 : tensor<f32>
|
|
}
|
|
|
|
// CHECK-LABEL: @main
|
|
// CHECK-SAME: (%[[ARG0:.+]]: tensor<f32>) -> tensor<f32>
|
|
// CHECK: %[[EXTRACTED:.+]] = tensor.extract %[[ARG0]][] : tensor<f32>
|
|
// CHECK: cf.br ^{{.*}}
|
|
// CHECK: ^{{.*}}:
|
|
// CHECK: %[[ELEMENTS:.+]] = tensor.from_elements %[[EXTRACTED]] : tensor<f32>
|
|
// CHECK: return %[[ELEMENTS]] : tensor<f32>
|