One-Shot Bufferize correctly handles RaW conflicts around repetitive regions (loops). Special handling is needed for parallel regions: these are a special kind of repetitive region that can exhibit additional RaW conflicts that would not be present if the regions were executed sequentially.
Example:
```
%0 = bufferization.alloc_tensor()
scf.forall ... {
  %1 = linalg.fill ins(...) outs(%0)
  ...
  scf.forall.in_parallel {
    tensor.parallel_insert_slice %1 into ...
  }
}
```
A separate (private) buffer must be allocated for each iteration of the `scf.forall` loop.
This change adds a new interface method to `BufferizableOpInterface` to detect parallel regions. By default, regions are assumed to be sequential.
A buffer is privatized if an OpOperand bufferizes to a memory read inside a parallel region that is different from the parallel region in which the operand's value is defined.
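For illustration, here is a hedged sketch of a possible bufferized form of the example above (op names, shapes, and the copy-back are illustrative, not the exact output of the pass):
```
%0 = memref.alloc() : memref<64xf32>
scf.forall (%i) in (4) {
  // A private buffer per iteration: the fill of one iteration cannot
  // interfere with the reads and writes of another iteration.
  %private = memref.alloc() : memref<64xf32>
  linalg.fill ins(...) outs(%private : memref<64xf32>)
  ...
  // Copy the private result back into the corresponding part of %0.
  memref.copy %private, ...
}
```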
Differential Revision: https://reviews.llvm.org/D159286
This revision adds support for unstructured control flow to the bufferization infrastructure. In particular: regions with multiple blocks, `cf.br` and `cf.cond_br`.
Two helper templates are added to `BufferizableOpInterface.h`; they can be implemented by ops that support unstructured control flow in their regions (e.g., `func.func`) and by ops that branch to another block (e.g., `cf.br`).
A block signature is always bufferized together with the op that owns the block.
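A minimal sketch of what this enables (assumed IR; the actual bufferized types may carry layout maps): a `func.func` with two blocks whose tensor block argument is bufferized together with the function, and whose `cf.br` operand is rewritten accordingly.
```
// Before bufferization:
func.func @br_example(%t: tensor<5xf32>) -> tensor<5xf32> {
  cf.br ^bb1(%t : tensor<5xf32>)
^bb1(%arg0: tensor<5xf32>):
  return %arg0 : tensor<5xf32>
}

// After bufferization (sketch):
func.func @br_example(%m: memref<5xf32>) -> memref<5xf32> {
  cf.br ^bb1(%m : memref<5xf32>)
^bb1(%arg0: memref<5xf32>):
  return %arg0 : memref<5xf32>
}
```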
Differential Revision: https://reviews.llvm.org/D158094
The operands for which elementwise access is relevant can now be specified; all other operands are ignored. This is useful because only two particular operands participate in a RaW conflict. Furthermore, the two tensors no longer have to be equivalent to rule out conflicts due to elementwise access: equivalent tensor sets may be formed only after an in-place bufferization decision has been made, and the tensors are not actually required to be equivalent. The only important thing is that they have "equivalent" indexing into the same base buffer.
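A hedged sketch of the kind of IR this is meant to cover (values and shapes are illustrative): the in- and out-operands below are not known to be equivalent tensors at analysis time, but they index the same elements of the same base buffer, which suffices to rule out a conflict under elementwise access.
```
%s0 = tensor.extract_slice %t[0] [5] [1] : tensor<10xf32> to tensor<5xf32>
%s1 = tensor.extract_slice %t[0] [5] [1] : tensor<10xf32> to tensor<5xf32>
// %s0 and %s1 have "equivalent" indexing into the same base buffer, so
// reading %s0 while writing %s1 in-place is conflict-free for an
// elementwise op.
%0 = linalg.elemwise_binary {fun = #linalg.binary_fn<add>}
       ins(%s0, %b : tensor<5xf32>, tensor<5xf32>)
       outs(%s1 : tensor<5xf32>) -> tensor<5xf32>
```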
Differential Revision: https://reviews.llvm.org/D158428
This revision is needed to support bufferization of `cf.br`/`cf.cond_br`. It will also be useful for better analysis of loop ops.
This revision generalizes `getAliasingOpResults` to `getAliasingValues`. An OpOperand can now alias not only with OpResults but also with BlockArguments. In the case of `cf.br` (support will be added in a later revision), a `cf.br` operand aliases with the corresponding argument of the destination block.
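In the notation used elsewhere in these notes, and assuming the `cf.br` support added later, the new aliasing information would look roughly like this:
```
// aliasingValues(%t) = {%arg0}  (the branch operand aliases the
// corresponding block argument of the destination block)
cf.br ^bb1(%t : tensor<5xf32>)
^bb1(%arg0: tensor<5xf32>):
  ...
```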
If an op does not implement the `BufferizableOpInterface`, the analysis is conservative. It previously assumed that an OpOperand may alias with each OpResult. It now assumes that an OpOperand may alias with each OpResult and each BlockArgument of the entry block.
Differential Revision: https://reviews.llvm.org/D157957
Report an error when trying to bufferize an op that contains unstructured control flow but whose bufferization implementation does not support it. At the moment, there are no ops for which unstructured control flow is supported.
Differential Revision: https://reviews.llvm.org/D157893
This implication already existed de facto, and there were plenty of users and wrapper functions specifically used to handle the "return-like or RegionBranchTerminatorOpInterface" case. These existed only because of features that were, until recently, missing from ODS.
With the new capabilities of traits, we can make `ReturnLike` imply `RegionBranchTerminatorOpInterface` and auto generate proper definitions for its methods.
Various occurrences and wrapper methods used for `isa<RegionBranchTerminatorOpInterface>() || hasTrait<ReturnLike>()` have all been removed.
Differential Revision: https://reviews.llvm.org/D157402
Before this change, two equivalent operands that bufferize to a memory read and a memory write, respectively, were always considered conflicting. This change improves the analysis for ops that bufferize to elementwise access. Such ops can bufferize in-place because an original element value is no longer needed once the updated element value has been computed and written.
This change allows ops such as the following one to bufferize in-place:
```
%0 = linalg.elemwise_binary {fun = #linalg.binary_fn<add>}
       ins(%a, %b : tensor<5xf32>, tensor<5xf32>)
       outs(%a : tensor<5xf32>) -> tensor<5xf32>
```
Differential Revision: https://reviews.llvm.org/D156887
The `getEnclosingRepetitiveRegion` functions walk the ancestor regions every time, which can be expensive, especially when there are multiple regions in between. This commit adds a cache to the bufferization analysis to remember the result of the walk.
Reviewed By: springerm
Differential Revision: https://reviews.llvm.org/D154710
Until now, only `tensor.pad` ops could be bufferized to an allocation. This revision adds support for all bufferizable ops that do not already bufferize to an allocation. (Ops that already bufferize to an allocation still need special handling.)
Differential Revision: https://reviews.llvm.org/D153971
`bufferization.to_memref` ops are allowed in One-Shot Bufferize, but they are treated conservatively: in the absence of a memref analysis, we have to assume that the result buffer is both read and written.
Note: `to_memref` cannot introduce any future aliases that would have to be considered during One-Shot Bufferize, because only `to_tensor` ops with the `restrict` attribute are supported. Such tensors are guaranteed to not alias with any other buffer after bufferization.
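A small sketch of the conservative treatment (the user op is hypothetical):
```
%m = bufferization.to_memref %t : memref<5xf32>
// Without a memref analysis, One-Shot Bufferize must assume that %m is
// both read and written by subsequent ops.
"some.opaque_user"(%m) : (memref<5xf32>) -> ()
```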
Differential Revision: https://reviews.llvm.org/D153365
The MLIR classes Type/Attribute/Operation/Op/Value support
cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast
functionality in addition to defining methods with the same name.
This change begins the migration of uses of the method to the
corresponding function call as has been decided as more consistent.
Note that there still exist classes that only define methods directly,
such as AffineExpr, and this does not include work currently to support
a functional cast/isa call.
Context:
- https://mlir.llvm.org/deprecation/ at "Use the free function variants
for dyn_cast/cast/isa/…"
- Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443
Implementation:
This patch updates all remaining uses of the deprecated functionality in
mlir/. This was done with clang-tidy as described below and further
modifications to GPUBase.td and OpenMPOpsInterfaces.td.
Steps are described per line, as comments are removed by git:
0. Retrieve the change from the following to build clang-tidy with an
additional check:
https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check
1. Build clang-tidy
2. Run clang-tidy over your entire codebase while disabling all checks
and enabling the one relevant one. Run on all header files also.
3. Delete .inc files that were also modified, so the next build rebuilds
them to a pure state.
```
ninja -C $BUILD_DIR clang-tidy
run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-*,misc-cast-functions'\
-header-filter=mlir/ mlir/* -fix
rm -rf $BUILD_DIR/tools/mlir/**/*.inc
```
Differential Revision: https://reviews.llvm.org/D151542
Instead of passing traversal options as a long list of arguments, store them in a TraversalConfig object and pass that object.
Differential Revision: https://reviews.llvm.org/D143927
The MLIR classes Type/Attribute/Operation/Op/Value support
cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast
functionality in addition to defining methods with the same name.
This change begins the migration of uses of the method to the
corresponding function call as has been decided as more consistent.
Note that there still exist classes that only define methods directly,
such as AffineExpr, and this does not include work currently to support
a functional cast/isa call.
Caveats include:
- This clang-tidy script probably has more problems.
- This only touches C++ code, so nothing that is being generated.
Context:
- https://mlir.llvm.org/deprecation/ at "Use the free function variants
for dyn_cast/cast/isa/…"
- Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443
Implementation:
This first patch was created with the following steps. The intention is
to only do automated changes at first, so I waste less time if it's
reverted, and so the first mass change is more clear as an example to
other teams that will need to follow similar steps.
Steps are described per line, as comments are removed by git:
0. Retrieve the change from the following to build clang-tidy with an
additional check:
https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check
1. Build clang-tidy
2. Run clang-tidy over your entire codebase while disabling all checks
and enabling the one relevant one. Run on all header files also.
3. Delete .inc files that were also modified, so the next build rebuilds
them to a pure state.
4. Some changes have been deleted for the following reasons:
- Some files had a variable also named cast
- Some files had not included a header file that defines the cast
functions
- Some files are definitions of the classes that have the casting
methods, so the code still refers to the method instead of the
function without adding a prefix or removing the method declaration
at the same time.
```
ninja -C $BUILD_DIR clang-tidy
run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-*,misc-cast-functions'\
-header-filter=mlir/ mlir/* -fix
rm -rf $BUILD_DIR/tools/mlir/**/*.inc
git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\
mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\
mlir/lib/**/IR/\
mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\
mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\
mlir/test/lib/Dialect/Test/TestTypes.cpp\
mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\
mlir/test/lib/Dialect/Test/TestAttributes.cpp\
mlir/unittests/TableGen/EnumsGenTest.cpp\
mlir/test/python/lib/PythonTestCAPI.cpp\
mlir/include/mlir/IR/
```
Differential Revision: https://reviews.llvm.org/D150123
`restrict` is similar to the C++ restrict keyword. Results of `to_tensor` that have the `restrict` attribute are guaranteed to not alias any other `to_tensor` result (after bufferization).
Note: Since `to_memref` ops are not supported by One-Shot Bufferize and all bufferizable ops follow DPS rules (i.e., the buffer of the result is the buffer of an operand or an alias thereof), the buffer of a `to_tensor` op that has the `restrict` attribute is always an entirely "new" buffer that does not alias with the future buffer of any tensor value in the entire program. This makes such `to_tensor` ops "safe" from a bufferization perspective; they cannot cause RaW conflicts.
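A minimal sketch (assumed printed form of the `restrict` unit attribute):
```
// %t is guaranteed to not alias with the future buffer of any other
// tensor value in the program, so it cannot be the source of a RaW
// conflict.
%t = bufferization.to_tensor %m restrict : memref<5xf32>
```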
Differential Revision: https://reviews.llvm.org/D144021
`getAliasingOpOperands`/`getAliasingOpResults` now encodes OpOperand/OpResult, buffer relation and a degree of certainty. E.g.:
```
// aliasingOpOperands(%r) = {(%t, EQUIV, DEFINITE)}
// aliasingOpResults(%t) = {(%r, EQUIV, DEFINITE)}
%r = tensor.insert %f into %t[%idx] : tensor<?xf32>
// aliasingOpOperands(%r) = {(%t0, EQUIV, MAYBE), (%t1, EQUIV, MAYBE)}
// aliasingOpResults(%t0) = {(%r, EQUIV, MAYBE)}
// aliasingOpResults(%t1) = {(%r, EQUIV, MAYBE)}
%r = arith.select %c, %t0, %t1 : tensor<?xf32>
```
`BufferizableOpInterface::bufferRelation` is removed, as it is now part of `getAliasingOpOperands`/`getAliasingOpResults`.
This change allows for better analysis, in particular w.r.t. equivalence. This enables additional optimizations and improves error checking, which is currently sometimes overly conservative. Examples:
* EmptyTensorElimination can eliminate `tensor.empty` inside `scf.if` blocks. This requires a modeling of equivalence: it is no longer a per-OpResult property. Instead, it can be specified for each OpOperand and OpResult. This is important because `tensor.empty` may be eliminated only if all values on the SSA use-def chain to the final consumer (`tensor.insert_slice`) are equivalent.
* The detection of "returning allocs from a block" can be improved. (Addresses a TODO in `assertNoAllocsReturned`.) This allows us to bufferize IR such as "yielding a `tensor.extract_slice` result from an `scf.if` branch", which currently fails to bufferize because the alloc detection is too conservative.
* Better bufferization of loops. Aliases of the iter_arg can be yielded (even if they are not equivalent) without having to realloc and copy the entire buffer on each iteration.
The above-mentioned examples are not yet implemented with this change. This change just improves the BufferizableOpInterface, its implementations and related helper functions, so that better aliasing information is available for each op.
Differential Revision: https://reviews.llvm.org/D142129
This is in preparation of reusing the same AnalysisState for tensor.empty elimination and One-Shot Bufferize (to address performance bottlenecks).
Differential Revision: https://reviews.llvm.org/D143379
There is no longer a need to keep the two separate. This is in preparation of reusing the same AnalysisState for tensor.empty elimination and One-Shot Bufferize (to address performance bottlenecks).
Differential Revision: https://reviews.llvm.org/D143313
The previous strategy was too complex and faulty. Op dominance cannot be used to rule out RaW conflicts due to op ordering if the reading op and the conflicting writing op are inside a repetitive region that is nested within the closest enclosing repetitive region of the read value's definition.
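A hedged sketch of the kind of IR where dominance is insufficient (ops, bounds, and types are illustrative):
```
%0 = "test.def"() : () -> tensor<5xf32>
scf.for %iv = %lb to %ub step %c1 {
  // The read dominates the write within one iteration, but the loop body
  // repeats: the write of iteration i is a conflicting write for the
  // read of iteration i+1.
  %r = tensor.extract %0[%idx] : tensor<5xf32>
  %w = tensor.insert %f into %0[%idx] : tensor<5xf32>
  ...
}
```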
Differential Revision: https://reviews.llvm.org/D143087
Reading from `tensor.empty` or `bufferization.alloc_tensor` (without `copy`) cannot cause a conflict because these ops do not specify the contents of their result tensors.
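For example (a sketch, with illustrative values), an in-place write into a `tensor.empty` result cannot conflict with an earlier read of it:
```
%0 = tensor.empty() : tensor<5xf32>
// A "read" of %0 observes unspecified contents, so the subsequent
// in-place write cannot break any program semantics.
%r = tensor.extract %0[%idx] : tensor<5xf32>
%1 = tensor.insert %f into %0[%idx] : tensor<5xf32>
```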
Differential Revision: https://reviews.llvm.org/D143183
* `getAliasingOpOperand` => `getAliasingOpOperands`
* `getAliasingOpResult` => `getAliasingOpResults`
Also a few minor code cleanups and better documentation.
Differential Revision: https://reviews.llvm.org/D142979
The previous lingo was confusing. There are no writes on tensors. There are only definitions.
Also some minor cleanup and better documentation.
Differential Revision: https://reviews.llvm.org/D141790
The name of the method was confusing: it is essentially `bufferizesToMemoryWrite`, but from the perspective of OpResults.
`bufferizesToMemoryWrite(OpResult)` now supports ops with regions that do not have aliasing OpOperands (such as `scf.if`). These ops no longer need to implement `isMemoryWrite`.
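For example, the result of the following `scf.if` (a sketch; the yielded values are defined inside the regions) has no aliasing OpOperand; whether it bufferizes to a memory write is now derived via `bufferizesToMemoryWrite(OpResult)` instead of a separate `isMemoryWrite` implementation:
```
%0 = scf.if %cond -> (tensor<5xf32>) {
  scf.yield %t0 : tensor<5xf32>
} else {
  scf.yield %t1 : tensor<5xf32>
}
```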
Differential Revision: https://reviews.llvm.org/D141684
These functions are generally useful and not specific to One-Shot Analysis. Move them to `BufferizableOpInterface.h` and make them public.
Differential Revision: https://reviews.llvm.org/D141685
`tensor.empty` is not bufferizable but should be analyzable (for `EliminateEmptyTensors`, which uses the bufferization infrastructure).
Also improve debugging functionality and error messages.
Also adds a missing pass to the sparse pipeline. (`tensor.empty` should be replaced with `bufferization.alloc_tensor`; depending on how the `tensor.empty` is used, bufferization sometimes used to work without the replacement. Now we always fail explicitly.)
This patch mechanically replaces None with std::nullopt where the
compiler would warn if None were deprecated. The intent is to reduce
the amount of manual work required in migrating from Optional to
std::optional.
This is part of an effort to migrate from llvm::Optional to
std::optional:
https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716
The previous error message was confusing. Also improve code documentation and perform some minor code cleanups.
Differential Revision: https://reviews.llvm.org/D138902
`DialectAnalysisState` is now `OneShotAnalysisState::Extension`.
This state extension mechanism is needed only for One-Shot Analysis, so it is moved from `BufferizableOpInterface.h` to `OneShotAnalysis.h`.
Extensions are now identified via TypeIDs instead of StringRefs. The API of state extensions is cleaned up and follows the same pattern as other extension mechanisms in MLIR (e.g., `transform::TransformState::Extension`).
Also delete some dead code.
Differential Revision: https://reviews.llvm.org/D135051
This fixes an issue in One-Shot Bufferize that could lead to missing buffer copies in the future. This bug can currently not be triggered because of the order in which ops are analyzed (always bottom-to-top). However, if we consider different traversal orders for the analysis in the future, this bug can cause subtle issues that are difficult to debug.
Example:
```
%0 = ...
%1 = tensor.insert ... into %0
%2 = tensor.extract_slice %0
tensor.extract %2[...]
```
In case of a top-to-bottom analysis of the above IR, the `tensor.insert` is analyzed before the `tensor.extract_slice`. In that case, the `tensor.insert` bufferizes in-place because %2 is not yet known to become an alias of %0 (and therefore to cause a conflict).
With this change, the `tensor.insert` will bufferize out-of-place, regardless of the traversal order.
Differential Revision: https://reviews.llvm.org/D135049
This fixes a bug where a required buffer copy was not inserted.
Not only written aliases but also read aliases should be taken into account when computing common enclosing repetitive regions. Furthermore, for writing ops, what matters is not where the destination tensor is defined but where the op itself is located.
Differential Revision: https://reviews.llvm.org/D135420
This method allows regions to be declared "repetitive" even if the parent op does not implement the RegionBranchOpInterface.
This is needed to support loop-like ops that have parallel semantics but do not branch between regions.
Differential Revision: https://reviews.llvm.org/D133113
`bufferization.to_memref` ops are not supported in One-Shot Analysis. They often trigger a failed assertion, which can be confusing. Instead, scan for `to_memref` ops before running the analysis and immediately abort with a proper error message.
Differential Revision: https://reviews.llvm.org/D132027
This change reworks the bufferization so that it utilizes the new TensorCopyInsertion pass. One-Shot Bufferize no longer calls the One-Shot Analysis. Instead, it relies on the TensorCopyInsertion pass to make the entire IR fully inplaceable. The `bufferize` implementations of all ops are simplified; they no longer have to account for out-of-place bufferization decisions, as these were already materialized in the IR in the form of `bufferization.alloc_tensor` ops during the TensorCopyInsertion pass.
Differential Revision: https://reviews.llvm.org/D127652
Find writability conflicts (writes to buffers that are not allowed to be written to) by checking SSA use-def chains. This is better than the current writability analysis, which is too conservative and produces false positives.
Differential Revision: https://reviews.llvm.org/D127256
Now that analysis and bufferization are better separated, post-analysis steps are no longer needed. Users can directly interleave analysis and bufferization as needed.
Differential Revision: https://reviews.llvm.org/D126571
This change adds a new op `alloc_tensor` to the bufferization dialect. During bufferization, this op is always lowered to a buffer allocation (unless it is "eliminated" by a pre-processing pass). It is useful to have such an op in tensor land, because it allows users to model tensor SSA use-def chains (which drive bufferization decisions) and because tensor SSA use-def chains can be analyzed by One-Shot Bufferize, while memref values cannot.
This change also replaces all uses of linalg.init_tensor in bufferization-related code with bufferization.alloc_tensor.
linalg.init_tensor and bufferization.alloc_tensor are similar, but the former's only purpose is to carry a shape; it does not indicate a memory allocation.
linalg.init_tensor is not suitable for modeling SSA use-def chains for bufferization purposes, because it is marked as having no side effects (in contrast to alloc_tensor). As such, it is legal to move linalg.init_tensor ops around, CSE them, etc. This is not desirable for alloc_tensor: it represents an explicit buffer allocation while still in tensor land, and such allocations should not suddenly disappear or get moved around when running the canonicalizer, CSE, etc.
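A minimal sketch of the op in use (shapes and values are illustrative): the allocation is explicit in tensor land, survives canonicalization, and One-Shot Bufferize can analyze the SSA use-def chain through it.
```
%0 = bufferization.alloc_tensor() : tensor<5xf32>
// The insert can bufferize in-place into the new allocation; no other
// value aliases %0 yet.
%1 = tensor.insert %f into %0[%idx] : tensor<5xf32>
```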
Differential Revision: https://reviews.llvm.org/D126003