clang-p2996

Author	SHA1	Message	Date
Martin Erhart	6bf043e743	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute (#66619 ) This commit removes the deallocation capabilities of one-shot-bufferization. One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the ownership-based-buffer-deallocation pass going forward. This means the `allow-return-allocs` pass option will default to true now, `create-deallocs` defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. A new `allow-return-allocs-from-loops` option is added as a temporary workaround for some bufferization limitations.	2023-09-18 16:44:48 +02:00
Martin Erhart	08b7a71bcc	[mlir][bufferization] Define a pipeline for buffer deallocation (#66352 ) Since ownership based buffer deallocation requires a few passes to be run in a somewhat fixed sequence, it makes sense to have a pipeline for convenience (and to reduce the number of transform ops to represent default deallocation).	2023-09-15 09:39:17 +02:00
Martin Erhart	942ce31985	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic (#66350 ) Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation.	2023-09-14 14:34:04 +02:00
Martin Erhart	8160bce969	[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface (#66349 ) This new interface allows operations to implement custom handling of ownership values and insertion of dealloc operations which is useful when an op cannot implement the interfaces supported by default by the buffer deallocation pass (e.g., because they are not exactly compatible or because there are some additional semantics to it that would render the default implementations in buffer deallocation invalid, or because no interfaces exist for this kind of behavior and it's not worth introducing one plus a default implementation in buffer deallocation). Additionally, it can also be used to provide more efficient handling for a specific op than the interface based default implementations can.	2023-09-14 13:58:30 +02:00
Martin Erhart	01334d1abb	[mlir][bufferization] Add an ownership based buffer deallocation pass (#66337 ) Add a new Buffer Deallocation pass with the intend to replace the old one. For now it is added as a separate pass alongside in order to allow downstream users to migrate over gradually. This new pass has the goal of inserting fewer clone operations and supporting additional use-cases. Please refer to the Buffer Deallocation section in the updated Bufferization.md file for more information on how this new pass works.	2023-09-14 12:13:37 +02:00
Matthias Springer	a1ef5a9437	[mlir][bufferization] Empty tensor elimination based on SubsetOpInterface (#65766 ) This commit generalizes empty tensor elimination to operate on subset ops. No new test cases are added because all current subset ops were already supported previously. From this perspective, this change is NFC. A new interface method (and a helper method) are added to `SubsetInsertionOpInterface` to build the subset of the destination tensor.	2023-09-14 09:45:22 +02:00
Martin Erhart	c199f7dc62	Revert "[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute" This reverts commit `6a91dfedeb`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:48 +00:00
Martin Erhart	520407a7c8	Revert "[mlir][bufferization] Improve buffer deallocation pass" This reverts commit `1bebb60a75`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:48 +00:00
Martin Erhart	792caac0f8	Revert "[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface" This reverts commit `29d86175e6`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	9782232ec7	Revert "[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic" This reverts commit `89117f1807`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	7995a4701d	Revert "[mlir][bufferization] Define a pipeline for buffer deallocation" This reverts commit `f0c4663942`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Matthias Springer	8143307b33	[mlir][bufferization] Generalize tensor slice rules to subset ops (#65619 ) This commit generalizes the special tensor.extract_slice/tensor.insert_slice bufferization rules to tensor subset ops. Ops that insert a tensor into a tensor at a specified subset (e.g., tensor.insert_slice, tensor.scatter) can implement the `SubsetInsertionOpInterface`. Apart from adding a new op interface (extending the API), this change is NFC. The only ops that currently implement the new interface are tensor.insert_slice and tensor.parallel_insert_slice, and those ops were are supported by One-Shot Bufferize.	2023-09-13 12:27:19 +02:00
Martin Erhart	f0c4663942	[mlir][bufferization] Define a pipeline for buffer deallocation Since buffer deallocation requires a few passes to be run in a somewhat fixed sequence, it makes sense to have a pipeline for convenience (and to reduce the number of transform ops to represent default deallocation). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D159432	2023-09-13 09:30:24 +00:00
Martin Erhart	89117f1807	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158828	2023-09-13 09:30:23 +00:00
Martin Erhart	29d86175e6	[mlir][bufferization][NFC] Introduce BufferDeallocationOpInterface This new interface allows operations to implement custom handling of ownership values and insertion of dealloc operations which is useful when an op cannot implement the interfaces supported by default by the buffer deallocation pass (e.g., because they are not exactly compatible or because there are some additional semantics to it that would render the default implementations in buffer deallocation invalid, or because no interfaces exist for this kind of behavior and it's not worth introducing one plus a default implementation in buffer deallocation). Additionally, it can also be used to provide more efficient handling for a specific op than the interface based default implementations can. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158756	2023-09-13 09:30:23 +00:00
Martin Erhart	1bebb60a75	[mlir][bufferization] Improve buffer deallocation pass Add a new Buffer Deallocation pass replacing the old one with the goal of inserting fewer clone operations and supporting additional use-cases. Please refer to the Buffer Deallocation section in the updated Bufferization.md file for more information on how this new pass works. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158421	2023-09-13 09:30:23 +00:00
Martin Erhart	6a91dfedeb	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute This is the first commit in a series with the goal to rework the BufferDeallocation pass. Currently, this pass heavily relies on copies to perform correct deallocations, which leads to very slow code and potentially high memory usage. Additionally, there are unsupported cases such as returning memrefs which this series of commits aims to add support for as well. This first commit removes the deallocation capabilities of one-shot-bufferization.One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the buffer-deallocation pass going forward. This means the allow-return-allocs pass option will default to true now, create-deallocs defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. The documentation should w.r.t. these pass option changes should also be updated in this commit. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D156662	2023-09-13 09:30:22 +00:00
Martin Erhart	7c6419bc3c	[NFC][mlir][bufferization] Move AllocationOpInterface implementations (#65578 ) The new Buffer Deallocation pass introduced in D158421 will not need the AllocationOpInterface anymore, thus it is better to move those default implementations to a place where they will still be used.	2023-09-07 09:59:51 +02:00
Matthias Springer	1e1a3112f1	[mlir][bufferization] Privatize buffers for parallel regions One-Shot Bufferize correctly handles RaW conflicts around repetitive regions (loops). Specical handling is needed for parallel regions. These are a special kind of repetitive regions that can have additional RaW conflicts that would not be present if the regions would be executed sequentially. Example: ``` %0 = bufferization.alloc_tensor() scf.forall ... { %1 = linalg.fill ins(...) outs(%0) ... scf.forall.in_parallel { tensor.parallel_insert_slice %1 into ... } } ``` A separate (private) buffer must be allocated for each iteration of the `scf.forall` loop. This change adds a new interface method to `BufferizableOpInterface` to detect parallel regions. By default, regions are assumed to be sequential. A buffer is privatized if an OpOperand bufferizes to a memory read inside a parallel region that is different from the parallel region where operand's value is defined. Differential Revision: https://reviews.llvm.org/D159286	2023-09-06 14:28:43 +02:00
Martin Erhart	34a35a8b24	[mlir] Move FunctionInterfaces to Interfaces directory and inherit from CallableOpInterface Functions are always callable operations and thus every operation implementing the `FunctionOpInterface` also implements the `CallableOpInterface`. The only exception was the FuncOp in the toy example. To make implementation of the `FunctionOpInterface` easier, this commit lets `FunctionOpInterface` inherit from `CallableOpInterface` and merges some of their methods. More precisely, the `CallableOpInterface` has methods to get the argument and result attributes and a method to get the result types of the callable region. These methods are always implemented the same way as their analogues in `FunctionOpInterface` and thus this commit moves all the argument and result attribute handling methods to the callable interface as well as the methods to get the argument and result types. The `FuntionOpInterface` then does not have to declare them as well, but just inherits them from the `CallableOpInterface`. Adding the inheritance relation also required to move the `FunctionOpInterface` from the IR directory to the Interfaces directory since IR should not depend on Interfaces. Reviewed By: jpienaar, springerm Differential Revision: https://reviews.llvm.org/D157988	2023-08-31 11:28:23 +00:00
Matthias Springer	6ecebb496c	[mlir][bufferization] Support unstructured control flow This revision adds support for unstructured control flow to the bufferization infrastructure. In particular: regions with multiple blocks, `cf.br`, `cf.cond_br`. Two helper templates are added to `BufferizableOpInterface.h`, which can be implemented by ops that supported unstructured control flow in their regions (e.g., `func.func`) and ops that branch to another block (e.g., `cf.br`). A block signature is always bufferized together with the op that owns the block. Differential Revision: https://reviews.llvm.org/D158094	2023-08-31 12:55:53 +02:00
Martin Erhart	950f0944c9	[mlir][bufferization] Factor out bufferization.dealloc lowering into separate pass Moves the lowering of `bufferization.dealloc` to memref into a separate pass, but still registers the pattern in the conversion pass. This is helpful when some tensor values (and thus `to_memref` or `to_tensor` operations) still remain, e.g., when the function boundaries are not converted, or when constant tensors are converted to memref.get_global at a later point. However, it is still recommended to perform all bufferization before deallocation to avoid memory leaks as all memref allocations inserted after the deallocation pass was applied, have to be handled manually. Note: The buffer deallocation pass assumes that memref values defined by `bufferization.to_memref` don't return ownership and don't have to be deallocated. `bufferization.to_tensor` operations are handled similarly to `bufferization.clone` operations with the exception that the result value is not handled because it's a tensor (not a memref). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D159180	2023-08-31 07:10:31 +00:00
Mikhail Goncharov	0a0aff2d24	fix unused variable warnings in conditionals warning was updated in `92023b1509`	2023-08-30 19:09:27 +02:00
Markus Böck	4dd744ac9c	Reland "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `b26bb30b46`.	2023-08-30 09:31:54 +02:00
Markus Böck	b26bb30b46	Revert "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `024f562da6`. Forgot to update flang	2023-08-29 20:17:50 +02:00
Markus Böck	024f562da6	[mlir] Use a type for representing branch points in `RegionBranchOpInterface` The current implementation is not very ergonomic or descriptive: It uses `std::optional<unsigned>` where `std::nullopt` represents the parent op and `unsigned` is the region number. This doesn't give us any useful methods specific to region control flow and makes the code fragile to changes due to now taking the region number into account. This patch introduces a new type called `RegionBranchPoint`, replacing all uses of `std::optional<unsigned>` in the interface. It can be implicitly constructed from a region or a `RegionSuccessor`, can be compared with a region to check whether the branch point is branching from the parent, adds `isParent` to check whether we are coming from a parent op and adds `RegionSuccessor::parent` as a descriptive way to indicate branching from the parent. Differential Revision: https://reviews.llvm.org/D159116	2023-08-29 20:02:23 +02:00
Martin Erhart	fff183050a	[mlir][bufferization] Run the simple dealloc canonicalization patterns as part of BufferDeallocationSimplification Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158744	2023-08-28 08:04:03 +00:00
Martin Erhart	fea185d70d	[mlir][bufferization] Add pattern to BufferDeallocationSimplification pass This new pattern allows us to simplify the dealloc result value (by replacing it with a constant 'true') and to trim the 'memref' operand list when we know that all retained memrefs alias with one in the 'memref' list that has a constant 'true' condition. Because the conditions of aliasing memrefs are combined by disjunction, we know that once a single constant 'true' value is in the disjunction the remaining elements don't matter anymore. This complements the RemoveDeallocMemrefsContainedInRetained pattern which removes values from the 'memref' list when static information is available for all retained values by also allowing to remove values in the presence of may-aliases, but under above mentioned condition instead. The BufferDeallocation pass often adds dealloc operations where the memref and retain lists are the same and all conditions are 'true'. If the operands are all function arguments, for example, they are always determined to may-alias which renders the other patterns invalid, but the op could still be trivially optimized away. It would even be enough to directly compare the two operand lists and check the conditions are all constant 'true' (plus checking for the extract_strided_metadata operation), but this pattern is a bit more general and still works when there are additional memrefs in the 'memref' list that actually have to be deallocated (e.g., see regression test). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158518	2023-08-23 10:41:05 +00:00
Xiaolei Shi	bcabaa5590	Add LLVM_MARK_AS_BITMASK_ENUM to HoistingKind enum This revision adds LLVM_MARK_AS_BITMASK_ENUM to HoistingKind to avoid static_cast when performing bitwise operations. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D158580	2023-08-22 23:22:32 -07:00
Xiaolei Shi	55e3857931	Make buffer hoisting/promotion passes use AllocationOpInterface This update implements the usage of AllocationOpInterface in the buffer hoisting/promotion passes. Two interface methods, namely `getHoistingKind` and `buildPromotedAlloc`, have been added. The former indicates which kind of hoisting (loop, block) an allocation operation supports, while the latter builds a stack allocation operation for promotable allocations used by the promote-buffers-to-stack pass. This update makes these passes be functional for user customized allocation operation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158398	2023-08-22 16:51:04 -07:00
Matthias Springer	f36e19347f	[mlir][bufferization] Improve `bufferizesToElementwiseAccess` The operands for which elementwise access is relevant can now be specified. All other operands are ignored. This is useful because only two particular operands participate in a RaW conflict. Furthermore, the two tensors no longer must be equivalent to rule out conflicts due to elementwise access. Equivalent tensor sets may be formed after an inplace bufferization decision is made. The two tensors are actually not required to be equivalent. The only important thing is that they have "equivalent" indexing into the same base buffer. Differential Revision: https://reviews.llvm.org/D158428	2023-08-22 09:00:17 +02:00
Martin Erhart	765e82eeb6	[mlir][bufferization] Generalize dealloc simplification pattern We are allowed to remove any values from the `memref` list for which there is no memref in the `retained` list with a may-alias relation. Before removing, we just have to make sure that the corresponding op results for all retained memrefs with must-alias relation are updated accordingly. This means, the the condition operand has to become part of the disjunction the result value is computed with. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158395	2023-08-21 13:08:00 +00:00
Matthias Springer	a623689878	[mlir][bufferization] Check for invalid pass flags instead of failed assertions Check for invalid pass flags when running `-one-shot-bufferize` instead of crashing with a failed assertion. This fixes #64267. Differential Revision: https://reviews.llvm.org/D158162	2023-08-18 15:30:32 +02:00
Matthias Springer	a88732d98b	[mlir][bufferization][NFC] Extract block signature bufferization into separate function When bufferizing "func.func", the entry block signature is bufferized. (Only functions with a single block are supported at the moment.) This functionality is moved into a separate function, so that it can be used for bufferizing unstructured control flow in the future. Differential Revision: https://reviews.llvm.org/D158154	2023-08-17 11:16:49 +02:00
Matthias Springer	878950b82c	[mlir][bufferization] Simplify `getBufferType` `getBufferType` computes the bufferized type of an SSA value without bufferizing any IR. This is useful for predicting the bufferized type of iter_args of a loop. To avoid endless recursion (e.g., in the case of "scf.for", the type of the iter_arg depends on the type of init_arg and the type of the yielded value; the type of the yielded value depends on the type of the iter_arg again), `fixedTypes` was used to fall back to "fixed" type. A simpler way is to maintain an "invocation stack". `getBufferType` implementations can then inspect the invocation stack to detect repetitive computations (typically when computing the bufferized type of a block argument). Also improve error messages in case of inconsistent memory spaces inside of a loop. Differential Revision: https://reviews.llvm.org/D158060	2023-08-16 15:02:07 +02:00
Matthias Springer	a02ad6c177	[mlir][bufferization] Generalize getAliasingOpResults to getAliasingValues This revision is needed to support bufferization of `cf.br`/`cf.cond_br`. It will also be useful for better analysis of loop ops. This revision generalizes `getAliasingOpResults` to `getAliasingValues`. An OpOperand can now not only alias with OpResults but also with BlockArguments. In the case of `cf.br` (will be added in a later revision): a `cf.br` operand will alias with the corresponding argument of the destination block. If an op does not implement the `BufferizableOpInterface`, the analysis in conservative. It previously assumed that an OpOperand may alias with each OpResult. It now assumes that an OpOperand may alias with each OpResult and each BlockArgument of the entry block. Differential Revision: https://reviews.llvm.org/D157957	2023-08-15 15:02:47 +02:00
Martin Erhart	0bcae5e763	[mlir][bufferization] Add pattern to BufferDeallocationSimplification pass Add a pattern that splits one dealloc operation into multiple dealloc operation depending on static aliasing information of the values in the `memref` operand list. This reduces the total number of aliasing checks required at runtime and can enable futher canonicalizations of the new and simplified dealloc operations. Depends on D157407 Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D157508	2023-08-15 12:39:57 +00:00
Matthias Springer	06dacf5ea7	[mlir][func][bufferization][NFC] Simplify implementation The bufferization implementation of `func.func` and `func.call` can be simplified. It still contained code that was necessary when One-Shot Bufferize removed return values. This functionality has been extracted into a separate pass a while ago. Differential Revision: https://reviews.llvm.org/D157893	2023-08-15 12:00:12 +02:00
Matthias Springer	061aa2e3ba	[mlir][bufferization] Better error checking for ops with unstructured control flow Report an error when trying to bufferize an op that contains unstructured control flow but for ops for which the bufferization implementation does not support unstructured control flow. At the moment, there are no ops for which unstructured control flow is supported. Differential Revision: https://reviews.llvm.org/D157893	2023-08-15 11:39:49 +02:00
Martin Erhart	c5e8fbbf71	[mlir][bufferization] Add pattern to BufferDeallocationSimplification pass that removes unnecessary retain values Adds a pattern that removes memrefs from the `retained` list which are guaranteed to not alias any memref in the `memrefs` list. The corresponding result value can be replaced with `false` in that case according to the operation description. When applied after BufferDeallocation, this can considerably reduce the overhead that needs to be added during the lowering of the dealloc operation to check for aliasing (especially when there is only one element in the `memref` list and all `retained` values can be removed). Depends on D157398 Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D157407	2023-08-10 13:46:52 +00:00
Martin Erhart	87f2dee423	[mlir][bufferization] Add DeallocationSimplification pass Adds a pass that can be run after buffer deallocation to simplify the deallocation operations. In particular, there are patterns that need alias information and thus cannot be added as a regular canonicalization pattern. This initial commit moves an incorrect canonicalization pattern from over to this new pass and fixes it by querying the alias analysis for the additional information it needs to be correct (there must not by any potential aliasing memref in the retain list other than the currently mached one). Also, improves this pattern by considering the `extract_strided_metadata` operation which is inserted by the deallocation pass by default. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D157398	2023-08-10 12:45:38 +00:00
Markus Böck	5b29f86b42	[mlir] Fix verifier of `RegionBranchOpInterface` The verifier incorrectly passed the region number of the predecessor region instead of the successor region to `getSuccessorOperands`. This went unnoticed since all upstream `RegionBranchTerminatorOpInterface` implementations did not make use of the `index` parameter. Adding an assert to e.g. `scf.condition` to make sure the index is valid or adding a region terminator that passes different operands to different successors immediately causes the verifier to fail as it suddenly gets incorrect types. This patch fixes the implementation to correctly pass the successor region index. Differential Revision: https://reviews.llvm.org/D157507	2023-08-10 12:38:54 +02:00
Markus Böck	138df29820	[mlir] Revamp `RegionBranchOpInterface` successor mechanism The `RegionBranchOpInterface` had a few fundamental issues caused by the API design of `getSuccessorRegions`. It always required passing values for the `operands` parameter. This is problematic as the operands parameter actually changes meaning depending on which predecessor `index` is referring to. If coming from a region, you'd have to find a `RegionBranchTerminatorOpInterface` in that region, get its operand count, and then create a `SmallVector` of that size. This is not only inconvenient, but also error-prone, which has lead to a bug in the implementation of a previously existing `getSuccessorRegions` overload. Additionally, this made the method dual-use, trying to serve two different use-cases: 1) Trying to determine possible control flow edges between regions and 2) Trying to determine the region being branched to based on constant operands. This patch fixes these issues by changing the interface methods and adding new ones: * The `operands` argument of `getSuccessorRegions` has been removed. The method is now only responsible for returning possible control flow edges between regions. * An optional `getEntrySuccessorRegions` method has been added. This is used to determine which regions are branched to from the parent op based on constant operands of the parent op. By default, it calls `getSuccessorRegions`. This is analogous to `getSuccessorForOperands` from `BranchOpInterface`. * Add `getSuccessorRegions` to `RegionBranchTerminatorOpInterface`. This is used to get the possible successors of the terminator based on constant operands. By default, it calls the containing `RegionBranchOpInterface`s `getSuccessorRegions` method. * `getSuccessorEntryOperands` was renamed to `getEntrySuccessorOperands` for consistency. Differential Revision: https://reviews.llvm.org/D157506	2023-08-10 10:27:27 +02:00
Markus Böck	10ae8ae837	[mlir][NFC] Make `ReturnLike` trait imply `RegionBranchTerminatorOpInterface` This implication was already done de-facto and there were plenty of users and wrapper functions specifically used to handle the "return-like or RegionBranchTerminatorOpInterface" case. These simply existed due to up until recently missing features in ODS. With the new capabilities of traits, we can make `ReturnLike` imply `RegionBranchTerminatorOpInterface` and auto generate proper definitions for its methods. Various occurrences and wrapper methods used for `isa<RegionBranchTerminatorOpInterface>() \|\| hasTrait<ReturnLike>()` have all been removed. Differential Revision: https://reviews.llvm.org/D157402	2023-08-08 22:11:39 +02:00
Matthias Springer	5468340553	[mlir][bufferization] Improve analysis for element-wise operations Before this change, two equivalent operands that bufferize to a memory read and write, respectively, were always conflicting. This change improves the analysis for ops that bufferize to element-wise access. Such ops can bufferize in-place, because an original element value is not needed anymore after computing and writing an updated element value. This change allows ops such as the following one to bufferize in-place: ``` %0 = linalg.elemwise_binary {fun = #linalg.binary_fn<add>} ins(%a, %b : tensor<5xf32>, tensor<5xf32>) outs(%a : tensor<5xf32>) -> tensor<5xf32> ``` Differential Revision: https://reviews.llvm.org/D156887	2023-08-03 16:35:55 +02:00
Martin Erhart	b9982b203b	[mlir][bufferization] Add rename function to BufferViewFlowAnalysis This new function to replace a Value with another Value saves us from re-running the entire alias analysis when an operation has to be re-build because additional result values have to be added (e.g., when adding more iter_args to an scf.for). Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D156665	2023-08-02 10:59:19 +00:00
Matthias Springer	aba0ef7059	[mlir][bufferization] Support casts in EmptyTensorElimination EmptyTensorElimination is a pre-bufferization transformation that replaces "tensor.empty" ops with "tensor.extract_slice" ops. This revision adds support for cases where the input IR contains "tensor.cast" ops. Differential Revision: https://reviews.llvm.org/D156167	2023-07-31 15:20:00 +02:00
Matthias Springer	ba745eea40	[mlir][bufferization] Remove cleanup pipeline from bufferization pass To keep the pass simple, users should apply cleanup passes manually when necessary. In particular, `-cse -canonicalize` are often desireable to fold away self-copies that are created by the bufferization. This addresses a comment in D120191. Differential Revision: https://reviews.llvm.org/D155923	2023-07-21 12:11:25 +02:00
Alex Zinenko	8dbddb1718	[mlir] allow region branch spec from parent op to itself RegionBranchOpInterface did not allow the operation with regions to specify itself as successors. Therefore, this implied that the control is always transferred to a region before being transferred back to the parent op. Since the region can only transfer the control back to the parent op from a terminator, this transitively implied that the first block of any region with a RegionBranchOpInterface is always executed until the terminator can transfer the control flow back. This is trivially false for any conditional-like operation that may or may not execute the region, as well as for loop-like operations that may not execute the body. Remove the restriction from the interface description and update the only transform that relied on it. See https://discourse.llvm.org/t/rfc-region-control-flow-interfaces-should-encode-region-not-executed-correctly/72103. Depends On: https://reviews.llvm.org/D155757 Reviewed By: Mogball, springerm Differential Revision: https://reviews.llvm.org/D155822	2023-07-21 09:16:56 +00:00
Matthias Springer	98770ecd76	[mlir][bufferization] Add `buffer_loop_hoisting` transform op This op hoists buffer allocation from loops. Differential Revision: https://reviews.llvm.org/D155289	2023-07-14 17:09:38 +02:00

1 2 3 4 5

242 Commits