clang-p2996

Author	SHA1	Message	Date
Jakub Kuderski	971b852546	[mlir][NFC] Simplify type checks with isa predicates (#87183 ) For more context on isa predicates, see: https://github.com/llvm/llvm-project/pull/83753.	2024-04-01 11:40:09 -04:00
Jianbang Yang	4bb9f918ff	[mlir][tensor] fix out-of-bound index in tensor.dim (#85901 ) fix a crash when fold tensor.dim with out-of-bound index. Fixes: https://github.com/llvm/llvm-project/issues/70183	2024-03-25 21:08:18 +08:00
Matthias Springer	dbfc38ed6b	[mlir][bufferization] Add `BufferOriginAnalysis` (#86461 ) This commit adds the `BufferOriginAnalysis`, which can be queried to check if two buffer SSA values originate from the same allocation. This new analysis is used in the buffer deallocation pass to fold away or simplify `bufferization.dealloc` ops more aggressively. The `BufferOriginAnalysis` is based on the `BufferViewFlowAnalysis`, which collects buffer SSA value "same buffer" dependencies. E.g., given IR such as: ``` %0 = memref.alloc() %1 = memref.subview %0 %2 = memref.subview %1 ``` The `BufferViewFlowAnalysis` will report the following "reverse" dependencies (`resolveReverse`) for `%2`: {`%2`, `%1`, `%0`}. I.e., all buffer SSA values in the reverse use-def chain that originate from the same allocation as `%2`. The `BufferOriginAnalysis` is built on top of that. It handles only simple cases at the moment and may conservatively return "unknown" around certain IR with branches, memref globals and function arguments. This analysis enables additional simplifications during `-buffer-deallocation-simplification`. In particular, "regular" scf.for loop nests, that yield buffers (or reallocations thereof) in the same order as they appear in the iter_args, are now handled much more efficiently. Such IR patterns are generated by the sparse compiler.	2024-03-25 18:57:53 +09:00
Matthias Springer	a45e58af1b	[mlir][bufferization] Add `BufferViewFlowOpInterface` (#78718 ) This commit adds the `BufferViewFlowOpInterface` to the bufferization dialect. This interface can be implemented by ops that operate on buffers to indicate that a buffer op result and/or region entry block argument may be the same buffer as a buffer operand (or a view thereof). This interface is queried by the `BufferViewFlowAnalysis`. The new interface has two interface methods: * `populateDependencies`: Implementations use the provided callback to declare dependencies between operands and op results/region entry block arguments. E.g., for `%r = arith.select %c, %m1, %m2 : memref<5xf32>`, the interface implementation should declare two dependencies: %m1 -> %r and %m2 -> %r. * `mayBeTerminalBuffer`: An SSA value is a terminal buffer if the buffer view flow analysis stops at the specified value. E.g., because the value is a newly allocated buffer or because no further information is available about the origin of the buffer. Ops that implement the `RegionBranchOpInterface` or `BranchOpInterface` do not have to implement the `BufferViewFlowOpInterface`. The buffer dependencies can be inferred from those two interfaces. This commit makes the `BufferViewFlowAnalysis` more accurate. For unknown ops, it conservatively used to declare all combinations of operands and op results/region entry block arguments as dependencies (false positives). This is no longer the case. While the analysis is still a "maybe" analysis with false positives (e.g., when analyzing ops such as `arith.select` or `scf.if` where the taken branch is not known at compile time), results and region entry block arguments of unknown ops are now marked as terminal buffers. This commit addresses a TODO in `BufferViewFlowAnalysis.cpp`: ``` // TODO: We should have an op interface instead of a hard-coded list of // interfaces/ops. ``` It is no longer needed to hard-code ops.	2024-03-24 12:48:19 +09:00
Matthias Springer	35d3b3430e	[mlir][bufferization] Add "bottom-up from terminators" analysis heuristic (#83964 ) One-Shot Bufferize currently does not support loops where a yielded value bufferizes to a buffer that is different from the buffer of the region iter_arg. In such a case, the bufferization fails with an error such as: ``` Yield operand #0 is not equivalent to the corresponding iter bbArg scf.yield %0 : tensor<5xf32> ``` One common reason for non-equivalent buffers is that an op on the path from the region iter_arg to the terminator bufferizes out-of-place. Ops that are analyzed earlier are more likely to bufferize in-place. This commit adds a new heuristic that gives preference to ops that are reachable on the reverse SSA use-def chain from a region terminator and are within the parent region of the terminator. This is expected to work better than the existing heuristics for loops where an iter_arg is written to multiple times within a loop, but only one write is fed into the terminator. Current users of One-Shot Bufferize are not affected by this change. "Bottom-up" is still the default heuristic. Users can switch to the new heuristic manually. This commit also turns the "fuzzer" pass option into a heuristic, cleaning up the code a bit.	2024-03-21 14:16:02 +09:00
Benjamin Kramer	db60491127	[mlir][bufferization] Check OpFilter before casting to BufferizableOpInterface (#85690 ) This doesn't change functionality, but lets us avoid attaching all the interfaces after `513cdb8222` turned casting without loading into an error.	2024-03-19 10:31:25 +01:00
Aart Bik	f3a8af07fa	[mlir][sparse] best effort finalization of escaping empty sparse tensors (#85482 ) This change lifts the restriction that purely allocated empty sparse tensors cannot escape the method. Instead it makes a best effort to add a finalizing operation before the escape. This assumes that (1) we never build sparse tensors across method boundaries (e.g. allocate in one, insert in other method) (2) if we have other uses of the empty allocation in the same method, we assume that either that op will fail or will do the finalization for us. This is best-effort, but fixes some very obvious missing cases.	2024-03-15 16:43:09 -07:00
Matthias Gehre	e6048b728d	[MLIR][Bufferization] BufferResultsToOutParams: Add option to add attribute to output arguments (#84320 ) Adds a new pass option `add-result-attr` that will make the pass add the attribute `{bufferize.result}` to each argument that was converted from a result. This is important e.g. when later using the python bindings / execution engine to understand which arguments are actually results. To be able to test this, the pass option was added to the tablegen. To avoid collisions with the existing, manually defined option struct `BufferResultsToOutParamsOptions`, that one was renamed to `BufferResultsToOutParamsOpts`.	2024-03-14 07:50:16 +01:00
Matthias Gehre	afac64cef4	[MLIR] BufferResultsToOutParams: Allow to configure memCpyFn This allows us to configure the pass to emit linalg.copy instead of memref.copy. This is consistent with one-shot-bufferize, which also allows to configure the `memCpyFn`, see https://discord.com/channels/636084430946959380/642426447167881246/1211698722438783087	2024-03-07 11:30:08 +01:00
Thomas Preud'homme	a9304edf20	Fix remaining build failures with GCC 8.3 (#83266 ) When compiling for GCC 8.x (< 8.4), SFINAE is disabled for iterator_range constructor causing ambiguous resolution to construct an OperandRange from a MutableOperatorRange, even in the presence of a static_cast<OperatorRange>. This adds an explicit conversion method to lift the ambiguity. Tested with a full MLIR build with GCC 8.3.	2024-03-05 19:32:27 +00:00
Matthias Springer	914e607487	[mlir][IR][NFC] Rename `notifyRemoved` to `notifyErased` (#82253 ) Rename listener callback names: * `notifyOperationRemoved` -> `notifyOperationErased` * `notifyBlockRemoved` -> `notifyBlockErased` The current callback names are misnomers. The callbacks are triggered when an operation/block is erased, not when it is removed (unlinked). E.g.: ```c++ /// Notify the listener that the specified operation is about to be erased. /// At this point, the operation has zero uses. /// /// Note: This notification is not triggered when unlinking an operation. virtual void notifyOperationErased(Operation *op) {} ``` This change is in preparation of adding listener support to the dialect conversion. The dialect conversion internally unlinks IR before erasing it at a later point of time. There is an important difference between "remove" and "erase". Lister callback names should be accurate to avoid confusion.	2024-02-20 09:08:19 +01:00
ian Bearman	067d2779fc	[MLIR] Setting MemorySpace During Bufferization (#78484 ) Collection of changes with the goal of being able to convert `encoding` to `memorySpace` during bufferization - new API for encoder to allow implementation to select destination memory space - update existing bufferization implementations to support the new interface	2024-02-08 16:59:37 +01:00
Matthias Springer	c08d972a00	[mlir][bufferization][NFC] Pass `DeallocationOptions` instead of flags (#80675 ) Pass `DeallocationOptions` instead of `privateFuncDynamicOwnership`. This will make it easier to add new options in the future.	2024-02-05 17:20:55 +01:00
Matthias Springer	0940be1581	[mlir][bufferization] Never pass ownership to functions (#80655 ) Even when `private-function-dynamic-ownership` is set, ownership should never be passed to the callee. This can lead to double deallocs (#77096) or use-after-free in the caller because ownership is currently passed regardless of whether there are any further uses of the buffer in the caller or not. Note: This is consistent with the fact that ownership is never passed to nested regions. This commit fixes #77096.	2024-02-05 12:11:49 +01:00
Ryan Holt	fa10121415	[mlir][MLProgram] Add MLProgram to MemRef bufferization pass (#75103 ) There is currently no lowering out of `ml_program` in the LLVM repository. This change adds a lowering to `memref` so that it can be lowered all the way to LLVM. This lowering was taken from the [reference backend in torch-mlir](`f416953600` ). I had tried implementing the `BufferizableOpInterface` for `ml_program` instead of adding a new pass but that did not work because `OneShotBufferize` does not visit module-level ops like `ml_program.global`.	2024-01-30 16:34:33 +01:00
Kohei Yamaguchi	bcd14b099d	[mlir][bufferization] Fix SimplifyClones with dealloc before cloneOp (#79098 ) The SimplifyClones pass relies on the assumption that the deallocOp follows the cloneOp. However, a crash occurs when there is a redundantDealloc preceding the cloneOp. This PR addresses the issue by ensuring the presence of deallocOp after cloneOp. The verification is performed by checking if the loop of the sub sequent node of cloneOp reaches the tail of the list. Fix #74306	2024-01-25 11:06:46 +01:00
Matthias Springer	5cc0f76d34	[mlir][IR] Add rewriter API for moving operations (#78988 ) The pattern rewriter documentation states that "all IR mutations [...] are required to be performed via the `PatternRewriter`." This commit adds two functions that were missing from the rewriter API: `moveOpBefore` and `moveOpAfter`. After an operation was moved, the `notifyOperationInserted` callback is triggered. This allows listeners such as the greedy pattern rewrite driver to react to IR changes. This commit narrows the discrepancy between the kind of IR modification that can be performed and the kind of IR modifications that can be listened to.	2024-01-25 11:01:28 +01:00
Matthias Springer	fbb62d449c	[mlir][bufferization] Buffer deallocation: Make op preconditions stricter (#75127 ) The buffer deallocation pass checks the IR ("operation preconditions") to make sure that there is no IR that is unsupported. In such a case, the pass signals a failure. The pass now rejects all ops with unknown memory effects. We do not know whether such an op allocates memory or not. Therefore, the buffer deallocation pass does not know whether a deallocation op should be inserted or not. Memory effects are queried from the `MemoryEffectOpInterface` interface. Ops that do not implement this interface but have the `RecursiveMemoryEffects` trait do not have any side effects (apart from the ones that their nested ops may have). Unregistered ops are now rejected by the pass because they do not implement the `MemoryEffectOpInterface` and neither do we know if they have `RecursiveMemoryEffects` or not. All test cases that currently have unregistered ops are updated to use registered ops.	2024-01-21 11:10:09 +01:00
Matthias Springer	b4f24be7ef	[mlir][bufferization] Simplify helper `potentiallyAliasesMemref` (#78690 ) This commit simplifies a helper function in the ownership-based buffer deallocation pass. Fixes a potential double-free (depending on the scheduling of patterns).	2024-01-19 13:22:02 +01:00
Matthias Springer	4fc128f817	[mlir][bufferization][NFC] Clean up code (#78594 ) Clean up code and remove dead code.	2024-01-19 10:20:41 +01:00
Matthias Springer	5fcf907b34	[mlir][IR] Rename "update root" to "modify op" in rewriter API (#78260 ) This commit renames 4 pattern rewriter API functions: * `updateRootInPlace` -> `modifyOpInPlace` * `startRootUpdate` -> `startOpModification` * `finalizeRootUpdate` -> `finalizeOpModification` * `cancelRootUpdate` -> `cancelOpModification` The term "root" is a misnomer. The root is the op that a rewrite pattern matches against (https://mlir.llvm.org/docs/PatternRewriter/#root-operation-name-optional). A rewriter must be notified of all in-place op modifications, not just in-place modifications of the root (https://mlir.llvm.org/docs/PatternRewriter/#pattern-rewriter). The old function names were confusing and have contributed to various broken rewrite patterns. Note: The new function names use the term "modify" instead of "update" for consistency with the `RewriterBase::Listener` terminology (`notifyOperationModified`).	2024-01-17 11:08:59 +01:00
Matthias Springer	8f2d83da26	[mlir][bufferization] Add `BufferizableOpInterface::hasTensorSemantics` (#75273 ) Add a new interface method to `BufferizableOpInterface`: `hasTensorSemantics`. This method returns "true" if the op has tensor semantics and should be bufferized. Until now, we assumed that an op has tensor semantics if it has tensor operands and/or tensor op results. However, there are ops like `ml_program.global` that do not have any results/operands but must still be bufferized (#75103). The new interface method can return "true" for such ops. This change also decouples `bufferization::bufferizeOp` a bit from the func dialect.	2024-01-16 10:07:34 +01:00
Kazu Hirata	8e8bbbd48e	[mlir] Use llvm::is_contained (NFC)	2024-01-12 22:08:29 -08:00
donald chen	eaa4b6cf29	[mlir][bufferization] Clone simplify fails when input and result type not cast compatiable (#71310 ) The simplify of bufferization.clone generates a memref.cast op, but the checks in simplify do not verify whether the operand types and return types of clone op is compatiable, leading to errors. This patch addresses this issue.	2024-01-12 16:11:00 +01:00
Javed Absar	0ba868db70	[MLIR][Bufferizer][NFC] Simplify some codes. (#77254 ) NFC. clean up.	2024-01-08 09:37:57 +00:00
Matthias Springer	dd450f08cf	[mlir][Interfaces][NFC] Move region loop detection to `RegionBranchOpInterface` (#77090 ) `BufferPlacementTransformationBase::isLoop` checks if there a loop in the region branching graph of an operation. This algorithm is similar to `isRegionReachable` in the `RegionBranchOpInterface`. To avoid duplicate code, `isRegionReachable` is generalized, so that it can be used to detect region loops. A helper function `RegionBranchOpInterface::hasLoop` is added. This change also turns a recursive implementation into an iterative one, which is the preferred implementation strategy in LLVM. Also move the `isLoop` to `BufferOptimizations.cpp`, so that we can gradually retire `BufferPlacementTransformationBase`. (This is so that proper error handling can be added to `BufferViewFlowAnalysis`.)	2024-01-07 13:49:29 +01:00
Matthias Springer	b662c9aa0e	[mlir][bufferization][NFC] Buffer deallocation: Add comment to `handleInterface` (#76956 ) This is a follow-up for #68648.	2024-01-05 09:30:52 +01:00
Jacques Pienaar	6ae7f66ff5	[mlir] Add config for PDL (#69927 ) Make it so that PDL in pattern rewrites can be optionally disabled. PDL is still enabled by default and not optional bazel. So this should be a NOP for most folks, while enabling other to disable. This only works with tests disabled. With tests enabled this still compiles but tests fail as there is no lit config to disable tests that depend on PDL rewrites yet.	2024-01-03 20:37:20 -08:00
max	b49e0ebedf	Revert "[mlir] Add config for PDL (#69927 )" This reverts commit `5930725c89`.	2024-01-03 12:16:19 -06:00
Jacques Pienaar	5930725c89	[mlir] Add config for PDL (#69927 ) Make it so that PDL in pattern rewrites can be optionally disabled. PDL is still enabled by default and not optional bazel. So this should be a NOP for most folks, while enabling other to disable. This is piped through mlir-tblgen invocation and that could be changed/avoided by splitting up the passes file instead. This only works with tests disabled. With tests enabled this still compiles but tests fail as there is no lit config to disable tests that depend on PDL rewrites yet.	2024-01-03 09:43:22 -08:00
Matthias Springer	c4457e10fe	[mlir][IR] Change block/region walkers to enumerate `this` block/region (#75020 ) This change makes block/region walkers consistent with operation walkers. An operation walk enumerates the current operation. Similarly, block/region walks should enumerate the current block/region. Example: ``` // Current behavior: op1->walk([](Operation op2) { / op1 is enumerated / }); block1->walk([](Block block2) { /* block1 is NOT enumerated / }); region1->walk([](Block block) { /* blocks of region1 are NOT enumerated / }); region1->walk([](Region region2) { /* region1 is NOT enumerated }); // New behavior: op1->walk([](Operation op2) { / op1 is enumerated / }); block1->walk([](Block block2) { /* block1 IS enumerated / }); region1->walk([](Block block) { /* blocks of region1 ARE enumerated / }); region1->walk([](Region region2) { /* region1 IS enumerated }); ```	2023-12-20 14:51:45 +09:00
Matthias Springer	a43641c9db	[mlir][bufferization] Fix `regionOperatesOnMemrefValues` (#75016 ) `Region::walk([](Block *b) {...})` does not enumerate blocks that are direct children of the region. These blocks must be checked manually.	2023-12-12 08:56:23 +09:00
Matthias Springer	68f91cd257	[mlir][bufferization] Fix invalid IR in `SimplifyClones` canonicalization (#74417 ) `SimplifyClones` used to generate an invalid op: ``` error: 'memref.cast' op operand type 'memref<xf32>' and result type 'memref<xf32>' are cast incompatible %2 = bufferization.clone %1 : memref<xf32> to memref<xf32 ``` This commit fixes tests such as `mlir/test/Dialect/Bufferization/canonicalize.mlir` when verifying the IR after each pattern (#74270).	2023-12-06 09:41:57 +09:00
Matthias Springer	3dae97cc01	[mlir][bufferization] Fix op dominance bug in rewrite pattern (#74159 ) Fixes a bug in `SplitDeallocWhenNotAliasingAnyOther`. This pattern used to generate invalid IR (op dominance error). We never noticed this bug in existing test cases because other patterns and/or foldings were applied afterwards and those rewrites "fixed up" the IR again. (The bug is visible when running `mlir-opt -debug`.) Also add additional comments to the implementation and simplify the code a bit. Apart from the fixed dominance error, this change is NFC. Without this change, buffer deallocation tests will fail when running with #74270.	2023-12-05 11:46:30 +09:00
Matthias Springer	437c62178c	[mlir][memref] Remove redundant `memref.tensor_store` op (#71010 ) `bufferization.materialize_in_destination` should be used instead. Both ops bufferize to a memcpy. This change also conceptually cleans up the memref dialect a bit: the memref dialect no longer contains ops that operate on tensor values.	2023-11-05 12:47:18 +09:00
Mehdi Amini	5a71f7a4b7	[mlir] Fix bufferization.alloc_tensor canonicalization crash (#70891 ) This make sure that an invalid negative dimension is ignored and stays dynamic instead of crashing the compiler. Fixes #70887	2023-10-31 20:58:29 -07:00
Matthias Springer	1abd8d1a8d	[mlir][Interfaces] Add `SubsetOpInterface` and `SubsetExtractionOpInterface` (#70617 ) There is currently an op interface for subset insertion ops (`SubsetInsertionOpInterface`), but not for subset extraction ops. This commit adds `SubsetExtractionOpInterface` to `mlir/Interfaces`, as well as a common dependent op interface: `SubsetOpInterface`. - `SubsetOpInterface` is for ops that operate on tensor subsets. It provides interface methods to check if two subset ops operate on equivalent or disjoint subsets. Ops that implement this interface must implement either `SubsetExtractionOpInterface` or `SubsetInsertionOpInterface`. - `SubsetExtractionOpInterface` is for ops that extract from a tensor at a subset. E.g., `tensor.extract_slice`, `tensor.gather`, `vector.transfer_read`. Current implemented only on `tensor.extract_slice`. - `SubsetInsertionOpInterface` is for ops that insert into a destination tensor at a subset. E.g., `tensor.insert_slice`, `tensor.parallel_insert_slice`, `tensor.scatter`, `vector.transfer_write`. Currently only implemented on `tensor.insert_slice`, `tensor.parallel_insert_slice`. Other changes: - Rename `SubsetInsertionOpInterface.td` to `SubsetOpInterface.td`. - Add helper functions to `ValueBoundsOpInterface.cpp` for checking whether two slices are disjoint. The new interfaces will be utilized by a new "loop-invariant subset hoisting" transformation. (This new transform is roughly what `Linalg/Transforms/SubsetHoisting.cpp` is doing, but in a generic and interface-driven way.)	2023-11-01 10:26:31 +09:00
Matthias Springer	4fbbb7ad7c	[mlir][bufferization] Fix ownership computation of unknown ops (#70773 ) No ownership is assumed for memref results of ops that implement none of the relevant interfaces and have no memref operands. This fixes #68948.	2023-11-01 09:26:47 +09:00
Matthias Springer	a8d0c86174	[mlir][Interfaces][NFC] Move `SubsetInsertionOpInterface` to `mlir/Interfaces` (#70615 ) `SubsetInsertionOpInterface` is an interface for ops that insert into a destination tensor at a subset. It is currently used by the bufferization framework to support efficient `tensor.extract_slice/insert_slice` bufferization and to drive "empty tensor elimination". This commit moves the interface to `mlir/Interfaces`. This is in preparation of adding a new "loop-invariant subset hoisting" transformation to `mlir/Transforms/Utils/LoopInvariantCodeMotionUtils.cpp`, which will utilize `SubsetInsertionOpInterface`. (This new transform is roughly what `Linalg/Transforms/SubsetHoisting.cpp` is doing, but in a generic and interface-driven way.)	2023-10-30 13:42:44 +09:00
Matthias Springer	5558504374	[mlir][IR] Make `OpOperand` comparable (#70410 ) Two `OpOperand`s are the same if they belong to the same owner and have the same operand number. There are currently no comparison operators defined on `OpOperand` and we work around this in multiple places by comparing pointers. Note: `OpOperand`s are stored in an op, so it is valid to compare their pointers to determine if they are the same operand. E.g., `getOperandNumber` is also implemented via pointer arithmetics.	2023-10-27 15:51:45 +09:00
Kazu Hirata	f9306f6de3	[ADT] Rename llvm::erase_value to llvm::erase (NFC) (#70156 ) C++20 comes with std::erase to erase a value from std::vector. This patch renames llvm::erase_value to llvm::erase for consistency with C++20. We could make llvm::erase more similar to std::erase by having it return the number of elements removed, but I'm not doing that for now because nobody seems to care about that in our code base. Since there are only 50 occurrences of erase_value in our code base, this patch replaces all of them with llvm::erase and deprecates llvm::erase_value.	2023-10-24 23:03:13 -07:00
Matthias Springer	4d80eff861	[mlir][bufferization] Ownership-based deallocation: Allow manual (de)allocs (#68648 ) Add a new attribute `bufferization.manual_deallocation` that can be attached to allocation and deallocation ops. Buffers that are allocated with this attribute are assigned an ownership of "false". Such buffers can be deallocated manually (e.g., with `memref.dealloc`) if the deallocation op also has the attribute set. Previously, the ownership-based buffer deallocation pass used to reject IR with existing deallocation ops. This is no longer the case if such ops have this new attribute. This change is useful for the sparse compiler, which currently deallocates the sparse tensor buffers by itself.	2023-10-23 09:45:33 +09:00
Kazu Hirata	195f6236c7	[mlir] Use llvm::erase_value (NFC)	2023-10-19 23:35:11 -07:00
Matthias Springer	6d88ac11d7	[mlir][bufferization] Transfer `restrict` during empty tensor elimination (#68729 ) Empty tensor elimination is looking for `bufferization.materialize_in_destination` ops with a `tensor.empty` source. It replaces the `tensor.empty` with a `bufferization.to_tensor restrict` of the memref destination. As part of this rewrite, the `restrict` keyword should be removed, so that no second `to_tensor restrict` op will be inserted. Such IR would be invalid. `bufferization.materialize_in_destination` with memref destination and without the `restrict` attribute are ignored by empty tensor elimination. Also relax the verifier of `materialize_in_destination`. The `restrict` keyword is not generally needed because the op does not expose the buffer as a tensor.	2023-10-11 08:51:16 -07:00
Matthias Springer	9d34c05222	[mlir][bufferization][NFC] Simplify `bufferizeOp` function signature (#68625 ) Remove the `opFilter` and `copyBeforeWrite` function arguments. These options can already be configured in the `options` object.	2023-10-09 17:52:52 -07:00
Matthias Springer	3d0ca2cfe3	[mlir][bufferization] Allow cyclic function graphs without tensors (#68632 ) Cyclic function call graphs are generally not supported by One-Shot Bufferize. However, they can be allowed when a function does not have tensor arguments or results. This is because it is then no longer necessary that the callee will be bufferized before the caller.	2023-10-09 17:52:04 -07:00
Matthias Springer	876334321f	[mlir][bufferization] Update empty_tensor_elimination transform op (#68497 ) The empty tensor elimination pass semantics have changed recently: when applied to a module, the One-Shot Module Analysis is run. Otherwise, the regular One-Shot Analysis is run. The latter one is slightly different because it ignores function boundaries and treats function block arguments as "read-only". This commit updates the transform dialect op to behave in the same way.	2023-10-08 08:46:43 -07:00
Matthias Springer	8ee38f3b32	[mlir][bufferization] Follow up for #68074 (#68488 ) Address additional comments in #68074. This should have been part of #68074.	2023-10-07 10:07:17 -07:00
Matthias Springer	0fcaca2fea	[mlir][bufferization] `MaterializeInDestinationOp`: Support memref destinations (#68074 ) Extend `bufferization.materialize_in_destination` to support memref destinations. This op can now be used to indicate that a tensor computation should materialize in a given buffer (that may have been allocated by another component/runtime). The op still participates in "empty tensor elimination". Example: ```mlir func.func @test(%out: memref<10xf32>) { %t = tensor.empty() : tensor<10xf32> %c = linalg.generic ... outs(%t: tensor<10xf32>) -> tensor<10xf32> bufferization.materialize_in_destination %c in restrict writable %out : (tensor<10xf32>, memref<10xf32>) -> () return } ``` After "empty tensor elimination", the above IR can bufferize without an allocation: ```mlir func.func @test(%out: memref<10xf32>) { linalg.generic ... outs(%out: memref<10xf32>) return } ``` This change also clarifies the meaning of the `restrict` unit attribute on `bufferization.to_tensor` ops.	2023-10-06 11:57:10 +02:00
Matthias Springer	5958043e2d	[mlir][bufferization] Add `dump_alias_sets` option to transform op (#68289 ) Add `dump_alias_sets` to `transform.bufferization.one_shot_bufferize`. This option is useful for debugging. Also improve the verifier to ensure that `test_analysis_only` is set when other debugging flags are enabled.	2023-10-05 14:05:45 +02:00

1 2 3 4 5 ...

393 Commits