clang-p2996

Author	SHA1	Message	Date
Matthias Springer	c4457e10fe	[mlir][IR] Change block/region walkers to enumerate `this` block/region (#75020 ) This change makes block/region walkers consistent with operation walkers. An operation walk enumerates the current operation. Similarly, block/region walks should enumerate the current block/region. Example: ``` // Current behavior: op1->walk([](Operation op2) { / op1 is enumerated / }); block1->walk([](Block block2) { /* block1 is NOT enumerated / }); region1->walk([](Block block) { /* blocks of region1 are NOT enumerated / }); region1->walk([](Region region2) { /* region1 is NOT enumerated }); // New behavior: op1->walk([](Operation op2) { / op1 is enumerated / }); block1->walk([](Block block2) { /* block1 IS enumerated / }); region1->walk([](Block block) { /* blocks of region1 ARE enumerated / }); region1->walk([](Region region2) { /* region1 IS enumerated }); ```	2023-12-20 14:51:45 +09:00
Matthias Springer	a43641c9db	[mlir][bufferization] Fix `regionOperatesOnMemrefValues` (#75016 ) `Region::walk([](Block *b) {...})` does not enumerate blocks that are direct children of the region. These blocks must be checked manually.	2023-12-12 08:56:23 +09:00
Matthias Springer	3dae97cc01	[mlir][bufferization] Fix op dominance bug in rewrite pattern (#74159 ) Fixes a bug in `SplitDeallocWhenNotAliasingAnyOther`. This pattern used to generate invalid IR (op dominance error). We never noticed this bug in existing test cases because other patterns and/or foldings were applied afterwards and those rewrites "fixed up" the IR again. (The bug is visible when running `mlir-opt -debug`.) Also add additional comments to the implementation and simplify the code a bit. Apart from the fixed dominance error, this change is NFC. Without this change, buffer deallocation tests will fail when running with #74270.	2023-12-05 11:46:30 +09:00
Matthias Springer	1abd8d1a8d	[mlir][Interfaces] Add `SubsetOpInterface` and `SubsetExtractionOpInterface` (#70617 ) There is currently an op interface for subset insertion ops (`SubsetInsertionOpInterface`), but not for subset extraction ops. This commit adds `SubsetExtractionOpInterface` to `mlir/Interfaces`, as well as a common dependent op interface: `SubsetOpInterface`. - `SubsetOpInterface` is for ops that operate on tensor subsets. It provides interface methods to check if two subset ops operate on equivalent or disjoint subsets. Ops that implement this interface must implement either `SubsetExtractionOpInterface` or `SubsetInsertionOpInterface`. - `SubsetExtractionOpInterface` is for ops that extract from a tensor at a subset. E.g., `tensor.extract_slice`, `tensor.gather`, `vector.transfer_read`. Current implemented only on `tensor.extract_slice`. - `SubsetInsertionOpInterface` is for ops that insert into a destination tensor at a subset. E.g., `tensor.insert_slice`, `tensor.parallel_insert_slice`, `tensor.scatter`, `vector.transfer_write`. Currently only implemented on `tensor.insert_slice`, `tensor.parallel_insert_slice`. Other changes: - Rename `SubsetInsertionOpInterface.td` to `SubsetOpInterface.td`. - Add helper functions to `ValueBoundsOpInterface.cpp` for checking whether two slices are disjoint. The new interfaces will be utilized by a new "loop-invariant subset hoisting" transformation. (This new transform is roughly what `Linalg/Transforms/SubsetHoisting.cpp` is doing, but in a generic and interface-driven way.)	2023-11-01 10:26:31 +09:00
Matthias Springer	4fbbb7ad7c	[mlir][bufferization] Fix ownership computation of unknown ops (#70773 ) No ownership is assumed for memref results of ops that implement none of the relevant interfaces and have no memref operands. This fixes #68948.	2023-11-01 09:26:47 +09:00
Matthias Springer	a8d0c86174	[mlir][Interfaces][NFC] Move `SubsetInsertionOpInterface` to `mlir/Interfaces` (#70615 ) `SubsetInsertionOpInterface` is an interface for ops that insert into a destination tensor at a subset. It is currently used by the bufferization framework to support efficient `tensor.extract_slice/insert_slice` bufferization and to drive "empty tensor elimination". This commit moves the interface to `mlir/Interfaces`. This is in preparation of adding a new "loop-invariant subset hoisting" transformation to `mlir/Transforms/Utils/LoopInvariantCodeMotionUtils.cpp`, which will utilize `SubsetInsertionOpInterface`. (This new transform is roughly what `Linalg/Transforms/SubsetHoisting.cpp` is doing, but in a generic and interface-driven way.)	2023-10-30 13:42:44 +09:00
Matthias Springer	4d80eff861	[mlir][bufferization] Ownership-based deallocation: Allow manual (de)allocs (#68648 ) Add a new attribute `bufferization.manual_deallocation` that can be attached to allocation and deallocation ops. Buffers that are allocated with this attribute are assigned an ownership of "false". Such buffers can be deallocated manually (e.g., with `memref.dealloc`) if the deallocation op also has the attribute set. Previously, the ownership-based buffer deallocation pass used to reject IR with existing deallocation ops. This is no longer the case if such ops have this new attribute. This change is useful for the sparse compiler, which currently deallocates the sparse tensor buffers by itself.	2023-10-23 09:45:33 +09:00
Matthias Springer	9d34c05222	[mlir][bufferization][NFC] Simplify `bufferizeOp` function signature (#68625 ) Remove the `opFilter` and `copyBeforeWrite` function arguments. These options can already be configured in the `options` object.	2023-10-09 17:52:52 -07:00
Matthias Springer	3d0ca2cfe3	[mlir][bufferization] Allow cyclic function graphs without tensors (#68632 ) Cyclic function call graphs are generally not supported by One-Shot Bufferize. However, they can be allowed when a function does not have tensor arguments or results. This is because it is then no longer necessary that the callee will be bufferized before the caller.	2023-10-09 17:52:04 -07:00
Matthias Springer	876334321f	[mlir][bufferization] Update empty_tensor_elimination transform op (#68497 ) The empty tensor elimination pass semantics have changed recently: when applied to a module, the One-Shot Module Analysis is run. Otherwise, the regular One-Shot Analysis is run. The latter one is slightly different because it ignores function boundaries and treats function block arguments as "read-only". This commit updates the transform dialect op to behave in the same way.	2023-10-08 08:46:43 -07:00
Matthias Springer	8ee38f3b32	[mlir][bufferization] Follow up for #68074 (#68488 ) Address additional comments in #68074. This should have been part of #68074.	2023-10-07 10:07:17 -07:00
long.chen	5979e1dfb1	[mlir] Fix `empty-tensor-elimination` around self-copies (#68129 ) * Fixes #67977, a crash in `empty-tensor-elimination`. * Also improves `linalg.copy` canonicalization. * Also improves indentation indentation in `mlir-linalg-ods-yaml-gen.cpp`.	2023-10-05 12:04:20 +02:00
Matthias Springer	c95fcd343d	[mlir][bufferization] Remove `resolveUsesInRepetitiveRegions` (#67927 ) The bufferization analysis has been improved over the last months and this workaround is no longer needed.	2023-10-02 16:04:27 +02:00
Matthias Springer	43198b0aa2	[mlir][bufferization] Better analysis around allocs and block arguments (#67923 ) Values that are the result of buffer allocation ops are guaranteed to not be the same allocation as block arguments of containing blocks. This fact can be used to allow for more aggressive simplification of `bufferization.dealloc` ops.	2023-10-02 11:01:12 +02:00
Martin Erhart	6a651c7f44	Revert "[mlir][bufferization] Don't clone on unknown ownership and verify function boundary ABI (#66626 )" This reverts commit `aa9eb47da2`. It introduced a double free in a test case. Reverting to have some time for fixing this and relanding later.	2023-09-28 09:14:46 +00:00
Martin Erhart	aa9eb47da2	[mlir][bufferization] Don't clone on unknown ownership and verify function boundary ABI (#66626 ) Inserting clones requires a lot of assumptions to hold on the input IR, e.g., all writes to a buffer need to dominate all reads. This is not guaranteed by one-shot bufferization and isn't easy to verify, thus it could quickly lead to incorrect results that are hard to debug. This commit changes the mechanism of how an ownership indicator is materialized when there is not already a unique ownership present. Additionally, we don't create copies of returned memrefs anymore when we don't have ownership. Instead, we insert assert operations to make sure we have ownership at runtime, or otherwise report to the user that correctness could not be guaranteed.	2023-09-28 10:45:35 +02:00
Matthias Springer	5109cb28fd	[mlir][bufferization] Make buffer deallocation pipeline op type independent (#67546 ) The buffer deallocation pipeline now works on modules and functions. Also add extra test cases that run the buffer deallocation pipeline on modules and functions. (Test cases that insert a helper function.)	2023-09-27 15:06:25 +02:00
Matthias Springer	913286baed	[mlir][linalg] Add `SubsetInsertionOpInterface` to `linalg.copy` (#67524 ) This commit enables empty tensor elimination on `linalg.copy` ops.	2023-09-27 10:04:37 +02:00
Martin Erhart	77813b088e	[mlir][bufferization] OwnershipBasedBufferDeallocation fixes (#67418 ) * Properly handle the case where an op is deleted and thus no other interfaces should be processed anymore. * Don't add ownership indicator arguments and results to function declarations	2023-09-27 09:16:43 +02:00
Martin Erhart	5c6eefb252	[mlir][bufferization] LowerDeallocation: declare helper function private (#67408 )	2023-09-26 12:35:42 +02:00
Matthias Springer	1a3abc254a	[mlir][bufferization][NFC] Remove yielded tensor analysis (#67126 ) Remove the yielded tensor analysis. This analysis was used to detect cases where One-Shot Bufferize cannot deallocate buffers. Deallocation has recently been removed from One-Shot Bufferize. Buffers are now deallocated by the buffer deallocation pass. This analysis is no longer needed.	2023-09-22 15:13:55 +02:00
Martin Erhart	65341b09b0	[mlir][bufferization][NFC] Move memref specific implementation of AllocationOpInterface to memref dialect directory (#66637 ) Follow-up on #65578	2023-09-20 14:49:52 +02:00
Martin Erhart	522c1d0eea	[mlir][gpu][bufferization] Implement BufferDeallocationOpInterface for gpu.terminator (#66880 ) This is necessary to support deallocation of IR with gpu.launch operations because it does not implement the RegionBranchOpInterface. Implementing the interface would require it to support regions with unstructured control flow and produced arguments/results.	2023-09-20 12:28:28 +02:00
Matthias Springer	695a5a6a66	[mlir][IR] Trigger `notifyOperationRemoved` callback for nested ops (#66771 ) When cloning an op, the `notifyOperationInserted` callback is triggered for all nested ops. Similarly, the `notifyOperationRemoved` callback should be triggered for all nested ops when removing an op. Listeners may inspect the IR during a `notifyOperationRemoved` callback. Therefore, when multiple ops are removed in a single `RewriterBase::eraseOp` call, the notifications must be triggered in an order in which the ops could have been removed one-by-one: * Op removals must be interleaved with `notifyOperationRemoved` callbacks. A callback is triggered right before the respective op is removed. * Ops are removed post-order and in reverse order. Other traversal orders could delete an op that still has uses. (This is not avoidable in graph regions and with cyclic block graphs.) Differential Revision: Imported from https://reviews.llvm.org/D144193.	2023-09-20 08:45:46 +02:00
Matthias Springer	6923a31542	[mlir][IR] Change `MutableArrayRange` to enumerate `OpOperand &` (#66622 ) In line with #66515, change `MutableArrayRange::begin`/`end` to enumerate `OpOperand &` instead of `Value`. Also remove `ForOp::getIterOpOperands`/`setIterArg`, which are now redundant. Note: `MutableOperandRange` cannot be made a derived class of `indexed_accessor_range_base` (like `OperandRange`), because `MutableOperandRange::assign` can change the number of operands in the range.	2023-09-19 09:09:21 +02:00
Martin Erhart	6bf043e743	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute (#66619 ) This commit removes the deallocation capabilities of one-shot-bufferization. One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the ownership-based-buffer-deallocation pass going forward. This means the `allow-return-allocs` pass option will default to true now, `create-deallocs` defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. A new `allow-return-allocs-from-loops` option is added as a temporary workaround for some bufferization limitations.	2023-09-18 16:44:48 +02:00
Martin Erhart	08b7a71bcc	[mlir][bufferization] Define a pipeline for buffer deallocation (#66352 ) Since ownership based buffer deallocation requires a few passes to be run in a somewhat fixed sequence, it makes sense to have a pipeline for convenience (and to reduce the number of transform ops to represent default deallocation).	2023-09-15 09:39:17 +02:00
Martin Erhart	942ce31985	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic (#66350 ) Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation.	2023-09-14 14:34:04 +02:00
Martin Erhart	8160bce969	[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface (#66349 ) This new interface allows operations to implement custom handling of ownership values and insertion of dealloc operations which is useful when an op cannot implement the interfaces supported by default by the buffer deallocation pass (e.g., because they are not exactly compatible or because there are some additional semantics to it that would render the default implementations in buffer deallocation invalid, or because no interfaces exist for this kind of behavior and it's not worth introducing one plus a default implementation in buffer deallocation). Additionally, it can also be used to provide more efficient handling for a specific op than the interface based default implementations can.	2023-09-14 13:58:30 +02:00
Martin Erhart	01334d1abb	[mlir][bufferization] Add an ownership based buffer deallocation pass (#66337 ) Add a new Buffer Deallocation pass with the intend to replace the old one. For now it is added as a separate pass alongside in order to allow downstream users to migrate over gradually. This new pass has the goal of inserting fewer clone operations and supporting additional use-cases. Please refer to the Buffer Deallocation section in the updated Bufferization.md file for more information on how this new pass works.	2023-09-14 12:13:37 +02:00
Matthias Springer	a1ef5a9437	[mlir][bufferization] Empty tensor elimination based on SubsetOpInterface (#65766 ) This commit generalizes empty tensor elimination to operate on subset ops. No new test cases are added because all current subset ops were already supported previously. From this perspective, this change is NFC. A new interface method (and a helper method) are added to `SubsetInsertionOpInterface` to build the subset of the destination tensor.	2023-09-14 09:45:22 +02:00
Martin Erhart	c199f7dc62	Revert "[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute" This reverts commit `6a91dfedeb`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:48 +00:00
Martin Erhart	520407a7c8	Revert "[mlir][bufferization] Improve buffer deallocation pass" This reverts commit `1bebb60a75`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:48 +00:00
Martin Erhart	792caac0f8	Revert "[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface" This reverts commit `29d86175e6`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	9782232ec7	Revert "[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic" This reverts commit `89117f1807`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	7995a4701d	Revert "[mlir][bufferization] Define a pipeline for buffer deallocation" This reverts commit `f0c4663942`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Matthias Springer	8143307b33	[mlir][bufferization] Generalize tensor slice rules to subset ops (#65619 ) This commit generalizes the special tensor.extract_slice/tensor.insert_slice bufferization rules to tensor subset ops. Ops that insert a tensor into a tensor at a specified subset (e.g., tensor.insert_slice, tensor.scatter) can implement the `SubsetInsertionOpInterface`. Apart from adding a new op interface (extending the API), this change is NFC. The only ops that currently implement the new interface are tensor.insert_slice and tensor.parallel_insert_slice, and those ops were are supported by One-Shot Bufferize.	2023-09-13 12:27:19 +02:00
Martin Erhart	f0c4663942	[mlir][bufferization] Define a pipeline for buffer deallocation Since buffer deallocation requires a few passes to be run in a somewhat fixed sequence, it makes sense to have a pipeline for convenience (and to reduce the number of transform ops to represent default deallocation). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D159432	2023-09-13 09:30:24 +00:00
Martin Erhart	89117f1807	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158828	2023-09-13 09:30:23 +00:00
Martin Erhart	29d86175e6	[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface This new interface allows operations to implement custom handling of ownership values and insertion of dealloc operations which is useful when an op cannot implement the interfaces supported by default by the buffer deallocation pass (e.g., because they are not exactly compatible or because there are some additional semantics to it that would render the default implementations in buffer deallocation invalid, or because no interfaces exist for this kind of behavior and it's not worth introducing one plus a default implementation in buffer deallocation). Additionally, it can also be used to provide more efficient handling for a specific op than the interface based default implementations can. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158756	2023-09-13 09:30:23 +00:00
Martin Erhart	1bebb60a75	[mlir][bufferization] Improve buffer deallocation pass Add a new Buffer Deallocation pass replacing the old one with the goal of inserting fewer clone operations and supporting additional use-cases. Please refer to the Buffer Deallocation section in the updated Bufferization.md file for more information on how this new pass works. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158421	2023-09-13 09:30:23 +00:00
Martin Erhart	6a91dfedeb	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute This is the first commit in a series with the goal to rework the BufferDeallocation pass. Currently, this pass heavily relies on copies to perform correct deallocations, which leads to very slow code and potentially high memory usage. Additionally, there are unsupported cases such as returning memrefs which this series of commits aims to add support for as well. This first commit removes the deallocation capabilities of one-shot-bufferization.One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the buffer-deallocation pass going forward. This means the allow-return-allocs pass option will default to true now, create-deallocs defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. The documentation should w.r.t. these pass option changes should also be updated in this commit. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D156662	2023-09-13 09:30:22 +00:00
Martin Erhart	7c6419bc3c	[NFC][mlir][bufferization] Move AllocationOpInterface implementations (#65578 ) The new Buffer Deallocation pass introduced in D158421 will not need the AllocationOpInterface anymore, thus it is better to move those default implementations to a place where they will still be used.	2023-09-07 09:59:51 +02:00
Matthias Springer	1e1a3112f1	[mlir][bufferization] Privatize buffers for parallel regions One-Shot Bufferize correctly handles RaW conflicts around repetitive regions (loops). Specical handling is needed for parallel regions. These are a special kind of repetitive regions that can have additional RaW conflicts that would not be present if the regions would be executed sequentially. Example: ``` %0 = bufferization.alloc_tensor() scf.forall ... { %1 = linalg.fill ins(...) outs(%0) ... scf.forall.in_parallel { tensor.parallel_insert_slice %1 into ... } } ``` A separate (private) buffer must be allocated for each iteration of the `scf.forall` loop. This change adds a new interface method to `BufferizableOpInterface` to detect parallel regions. By default, regions are assumed to be sequential. A buffer is privatized if an OpOperand bufferizes to a memory read inside a parallel region that is different from the parallel region where operand's value is defined. Differential Revision: https://reviews.llvm.org/D159286	2023-09-06 14:28:43 +02:00
Martin Erhart	34a35a8b24	[mlir] Move FunctionInterfaces to Interfaces directory and inherit from CallableOpInterface Functions are always callable operations and thus every operation implementing the `FunctionOpInterface` also implements the `CallableOpInterface`. The only exception was the FuncOp in the toy example. To make implementation of the `FunctionOpInterface` easier, this commit lets `FunctionOpInterface` inherit from `CallableOpInterface` and merges some of their methods. More precisely, the `CallableOpInterface` has methods to get the argument and result attributes and a method to get the result types of the callable region. These methods are always implemented the same way as their analogues in `FunctionOpInterface` and thus this commit moves all the argument and result attribute handling methods to the callable interface as well as the methods to get the argument and result types. The `FuntionOpInterface` then does not have to declare them as well, but just inherits them from the `CallableOpInterface`. Adding the inheritance relation also required to move the `FunctionOpInterface` from the IR directory to the Interfaces directory since IR should not depend on Interfaces. Reviewed By: jpienaar, springerm Differential Revision: https://reviews.llvm.org/D157988	2023-08-31 11:28:23 +00:00
Matthias Springer	6ecebb496c	[mlir][bufferization] Support unstructured control flow This revision adds support for unstructured control flow to the bufferization infrastructure. In particular: regions with multiple blocks, `cf.br`, `cf.cond_br`. Two helper templates are added to `BufferizableOpInterface.h`, which can be implemented by ops that supported unstructured control flow in their regions (e.g., `func.func`) and ops that branch to another block (e.g., `cf.br`). A block signature is always bufferized together with the op that owns the block. Differential Revision: https://reviews.llvm.org/D158094	2023-08-31 12:55:53 +02:00
Martin Erhart	950f0944c9	[mlir][bufferization] Factor out bufferization.dealloc lowering into separate pass Moves the lowering of `bufferization.dealloc` to memref into a separate pass, but still registers the pattern in the conversion pass. This is helpful when some tensor values (and thus `to_memref` or `to_tensor` operations) still remain, e.g., when the function boundaries are not converted, or when constant tensors are converted to memref.get_global at a later point. However, it is still recommended to perform all bufferization before deallocation to avoid memory leaks as all memref allocations inserted after the deallocation pass was applied, have to be handled manually. Note: The buffer deallocation pass assumes that memref values defined by `bufferization.to_memref` don't return ownership and don't have to be deallocated. `bufferization.to_tensor` operations are handled similarly to `bufferization.clone` operations with the exception that the result value is not handled because it's a tensor (not a memref). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D159180	2023-08-31 07:10:31 +00:00
Mikhail Goncharov	0a0aff2d24	fix unused variable warnings in conditionals warning was updated in `92023b1509`	2023-08-30 19:09:27 +02:00
Markus Böck	4dd744ac9c	Reland "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `b26bb30b46`.	2023-08-30 09:31:54 +02:00
Markus Böck	b26bb30b46	Revert "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `024f562da6`. Forgot to update flang	2023-08-29 20:17:50 +02:00

1 2 3 4 5 ...

267 Commits