clang-p2996

Author	SHA1	Message	Date
Jakub Kuderski	a0a76804c4	[ADT] Allow `llvm::enumerate` to enumerate over multiple ranges This does not work by a mere composition of `enumerate` and `zip_equal`, because C++17 does not allow for recursive expansion of structured bindings. This implementation uses `zippy` to manage the iteratees and adds the stream of indices as the first zipped range. Because we have an upfront assertion that all input ranges are of the same length, we only need to check if the second range has ended during iteration. As a consequence of using `zippy`, `enumerate` will now follow the reference and lifetime semantics of the `zip*` family of functions. The main difference is that `enumerate` exposes each tuple of references through a new tuple-like type `enumerate_result`, with the familiar `.index()` and `.value()` member functions. Because the `enumerate_result` returned on dereference is a temporary, enumeration result can no longer be used through an lvalue ref. Reviewed By: dblaikie, zero9178 Differential Revision: https://reviews.llvm.org/D144503	2023-03-15 19:34:22 -04:00
Matthias Springer	9fa6b3504b	[mlir][bufferization] Improve aliasing OpOperand/OpResult property `getAliasingOpOperands`/`getAliasingOpResults` now encodes OpOperand/OpResult, buffer relation and a degree of certainty. E.g.: ``` // aliasingOpOperands(%r) = {(%t, EQUIV, DEFINITE)} // aliasingOpResults(%t) = {(%r, EQUIV, DEFINITE)} %r = tensor.insert %f into %t[%idx] : tensor<?xf32> // aliasingOpOperands(%r) = {(%t0, EQUIV, MAYBE), (%t1, EQUIV, MAYBE)} // aliasingOpResults(%t0) = {(%r, EQUIV, MAYBE)} // aliasingOpResults(%t1) = {(%r, EQUIV, MAYBE)} %r = arith.select %c, %t0, %t1 : tensor<?xf32> ``` `BufferizableOpInterface::bufferRelation` is removed, as it is now part of `getAliasingOpOperands`/`getAliasingOpResults`. This change allows for better analysis, in particular wrt. equivalence. This allows additional optimizations and better error checking (which is sometimes overly conservative). Examples: * EmptyTensorElimination can eliminate `tensor.empty` inside `scf.if` blocks. This requires a modeling of equivalence: It is not a per-OpResult property anymore. Instead, it can be specified for each OpOperand and OpResult. This is important because `tensor.empty` may be eliminated only if all values on the SSA use-def chain to the final consumer (`tensor.insert_slice`) are equivalent. * The detection of "returning allocs from a block" can be improved. (Addresses a TODO in `assertNoAllocsReturned`.) This allows us to bufferize IR such as "yielding a `tensor.extract_slice` result from an `scf.if` branch", which currently fails to bufferize because the alloc detection is too conservative. * Better bufferization of loops. Aliases of the iter_arg can be yielded (even if they are not equivalent) without having to realloc and copy the entire buffer on each iteration. The above-mentioned examples are not yet implemented with this change. This change just improves the BufferizableOpInterface, its implementations and related helper functions, so that better aliasing information is available for each op. Differential Revision: https://reviews.llvm.org/D142129	2023-02-09 11:35:03 +01:00
Matthias Springer	330372f2c5	[mlir][tensor][bufferize] tensor.empty does not define the result tensor contents This is encoded in the `BufferizableOpInterface` via `resultBufferizesToMemoryWrite = false`. Differential Revision: https://reviews.llvm.org/D143181	2023-02-06 10:26:38 +01:00
Matthias Springer	b6ae3f8873	[mlir][tensor][bufferize] Implement getBufferType for CastOp This interface method is used to compute the buffer type of a value during bufferization. It was missing. This is interface method is used during loop bufferization. Also fix a bug where a cast from an unranked tensor to a ranked tensor type did not always apply a fully dynamic layout map on the result memref. Differential Revision: https://reviews.llvm.org/D143063	2023-02-01 14:24:10 +01:00
Matthias Springer	1ac248e485	[mlir][bufferization][NFC] Rename getAliasingOpOperand/getAliasingOpResult * `getAliasingOpOperand` => `getAliasingOpOperands` * `getAliasingOpResult` => `getAliasingOpResults` Also a few minor code cleanups and better documentation. Differential Revision: https://reviews.llvm.org/D142979	2023-02-01 10:07:41 +01:00
Matthias Springer	148432ea84	[mlir][bufferization][NFC] Rename BufferRelation::None to BufferRelation::Unknown The previous name was incorrect. `None` does not mean that there is no buffer relation between two buffers (seems to imply that they do not alias for sure); instead it means that there is no further information available. Differential Revision: https://reviews.llvm.org/D142870	2023-01-30 11:09:28 +01:00
Matthias Springer	1840d18a10	[mlir][bufferization][NFC] Rename: "last-write" -> "definition" The previous lingo was confusing. There are no writes on tensors. There are only definitions. Also some minor cleanup and better documentation. Differential Revision: https://reviews.llvm.org/D141790	2023-01-30 09:51:53 +01:00
Mehdi Amini	ab32f5b7ef	Apply clang-tidy fixes for readability-simplify-boolean-expr in BufferizableOpInterfaceImpl.cpp (NFC)	2022-12-28 22:42:39 +00:00
Matthias Springer	e5dc99e642	[mlir][tensor][bufferize] Improve bufferization of DimOp/RankOp The tensor operands do not bufferize to a memory read. Differential Revision: https://reviews.llvm.org/D140007	2022-12-14 12:47:46 +01:00
Matthias Springer	be630f07de	[mlir][bufferize] Implement BufferizableOpInterface for tensor.empty The op is not bufferizable but should be analyzable (for `EliminateEmptyTensors`, which uses the bufferization infrastructure). Also improve debugging functionality and error messages. Also adds a missing pass to the sparse pipeline. (tensor.empty should be replaced with bufferization.alloc_tensor, but it sometimes used to work without depending on how the tensor.empty is used. Now we always fail explicitly.)	2022-12-12 14:19:38 +01:00
Matthias Springer	13593dc9dc	[mlir][tensor][bufferize] Fix tensor.insert_slice regression This reverts D132662 (apart from overall cleanups), which introduced a too aggressive optimization for tensor.insert_slice bufferization. Instead, bufferizesToMemoryRead is improved to handle some of these cases. The remaining cases can still bufferize efficiently when running the canonicalizer before the bufferization. Differential Revision: https://reviews.llvm.org/D138745	2022-11-26 19:14:33 +01:00
Lei Zhang	9bb633741a	[mlir][bufferization] Support general Attribute as memory space MemRef has been accepting a general Attribute as memory space for a long time. This commits updates bufferization side to catch up, which allows downstream users to plugin customized symbolic memory space. This also eliminates quite a few `getMemorySpaceAsInt` calls, which is deprecated. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D138330	2022-11-21 09:40:50 -05:00
Matthias Springer	09dfb44193	[mlir][tensor][bufferize] Support memory_space for tensor.pad This change adds memory space support to tensor.pad. (tensor.generate and tensor.from_elements do not support memory spaces yet.) The memory space is inferred from the buffer of the source tensor. Instead of lowering tensor.pad to tensor.generate + tensor.insert_slice, it is now lowered to bufferization.alloc_tensor (with the correct memory space) + linalg.map + tensor.insert_slice. Memory space support for the remaining two tensor ops is left for a later point, as this requires some more design discussions. Differential Revision: https://reviews.llvm.org/D136265	2022-10-27 12:29:57 +02:00
Matthias Springer	c1f0a15c65	[mlir][tensor][bufferize] Lower tensor.generate to linalg.map There is no memref equivalent of tensor.generate. The purpose of this change is to avoid creating scf.parallel loops during bufferization. Differential Revision: https://reviews.llvm.org/D136767	2022-10-27 12:03:13 +02:00
Matthias Springer	2d5edc644d	[mlir][bufferize] Provide default BufferizableOpInterface impl for destination style ops tensor.insert and tensor.insert_slice (as destination style ops) do no longer need to implement the entire BufferizableOpInterface. Differential Revision: https://reviews.llvm.org/D136347	2022-10-27 10:52:47 +02:00
Matthias Springer	6cdd34b973	[mlir][tensor][bufferize] Bufferize inserts into equivalent tensors in-place Inserting a tensor into an equivalent tensor is a no-op after bufferization. No alloc is needed. Differential Revision: https://reviews.llvm.org/D132662	2022-10-06 15:06:33 +09:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Matthias Springer	04ff6009fc	[mlir][tensor][bufferize] Implement getBufferType for Expand/CollapseShapeOp This function must be implemented for all ops, where the result memref type is different from the input memref type. Differential Revision: https://reviews.llvm.org/D134331	2022-09-21 18:31:59 +09:00
Alex Zinenko	46b90a7b5d	[mlir] make remaining memref dialect ops produce strided layouts The three following ops in the memref dialect: transpose, expand_shape, collapse_shape, have been originally designed to operate on memrefs with strided layouts but had to go through the affine map representation as the type did not support anything else. Make these ops produce memref values with StridedLayoutAttr instead now that it is available. Depends On D133938 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133947	2022-09-16 10:56:48 +02:00
Matthias Springer	4cd7362083	[mlir][SCF] foreach_thread: Capture shared output tensors explicitly This change refines the semantics of scf.foreach_thread. Tensors that are inserted into in the terminator must now be passed to the region explicitly via `shared_outs`. Inside of the body of the op, those tensors are then accessed via block arguments. The body of a scf.foreach_thread is now treated as a repetitive region. I.e., op dominance can no longer be used in conflict detection when using a value that is defined outside of the body. Such uses may now be considered as conflicts (if there is at least one read and one write in the body), effectively privatizing the tensor. Shared outputs are not privatized when they are used via their corresponding block arguments. As part of this change, it was also necessary to update the "tiling to scf.foreach_thread", such that the generated tensor.extract_slice ops use the scf.foreach_thread's block arguments. This is implemented by cloning the TilingInterface op inside the scf.foreach_thread, rewriting all of its outputs with block arguments and then calling the tiling implementation. Afterwards, the cloned op is deleted again. Differential Revision: https://reviews.llvm.org/D133114	2022-09-02 14:54:04 +02:00
Matthias Springer	123c4b0251	[mlir][SCF][bufferize] Support different iter_arg/init_arg types (scf.for) Even though iter_arg and init_arg of an scf.for loop may have the same tensor type, their bufferized memref types are not necessarily equal. It is sometimes necessary to insert a cast in case of differing layout maps. Differential Revision: https://reviews.llvm.org/D132860	2022-08-30 16:35:32 +02:00
Matthias Springer	111c919665	[mlir][bufferization] Generalize getBufferType This change generalizes getBufferType. This function can be used to predict the buffer type of any tensor value (not just BlockArguments) without changing any IR. It also subsumes getMemorySpace. This is useful for loop bufferization, where the precise buffer type of an iter_arg cannot be known without examining the loop body. Differential Revision: https://reviews.llvm.org/D132859	2022-08-30 16:26:44 +02:00
Matthias Springer	ba95bf765d	[mlir][tensor] Add getMixedSizes helper This helper function computes the dimensions of a tensor value as OpFoldResults. Differential Revision: https://reviews.llvm.org/D132475	2022-08-25 10:25:41 +02:00
Matthias Springer	c37ed7762e	[tensor][bufferize] Use affine.apply instead of arith.addi in PadOp lowering Affine exprs compose better than arith ops. Differential Revision: https://reviews.llvm.org/D132456	2022-08-23 11:46:11 +02:00
Matthias Springer	9ee12f4778	[mlir][tensor][bufferize] Bufferize tensor.pad tensor.pad is lowered to tensor.generate + tensor.insert_slice during bufferization. For best performance with constant padding values, users should vectorize the IR before bufferizing it. This change also relaxes tje restriction that no new ops that bufferize to a memory write should be added during bufferization. Since bufferization has been split into two steps a while ago (tensor copy insertion + bufferization), it is reasonable to allow this now. Differential Revision: https://reviews.llvm.org/D132355	2022-08-22 17:00:33 +02:00
Matthias Springer	1defec8730	[mlir][tensor][bufferize][NFC] Remove duplicate code InsertSliceOp and ParallelInsertSliceOp are very similar and can share some of the bufferization analysis code. Differential Revision: https://reviews.llvm.org/D130465	2022-07-25 12:34:16 +02:00
Matthias Springer	664ffa46bb	[mlir][tensor][bufferize] Fix deallocation of GenerateOp/FromElementsOp Both ops allocate a buffer. There were cases in which the buffer was not deallocated. Differential Revision: https://reviews.llvm.org/D130469	2022-07-25 12:25:06 +02:00
Matthias Springer	5f5f71e737	[mlir][tensor][bufferize] Load dependent dialects Load dialects that will be generated by the extension. (Except for BufferizationDialect and MemrefDialect which are loaded already.) Differential Revision: https://reviews.llvm.org/D130463	2022-07-25 11:36:10 +02:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Matthias Springer	606f7c8f7a	[mlir][bufferization][NFC] Move more unknown type conversion logic into BufferizationOptions The `unknownTypeConversion` bufferization option (enum) is now a type converter function option. Some logic of `getMemRefType` is now handled by that function. This change makes type conversion more controllable. Previously, there were only two options when generating memref types for non-bufferizable ops: Static identity layout or fully dynamic layout. With this change, users of One-Shot Bufferize can provide a function with custom logic. Differential Revision: https://reviews.llvm.org/D129273	2022-07-07 13:36:28 +02:00
Matthias Springer	6c3c5f8069	[mlir][memref] Improve type inference for rank-reducing subviews The result shape of a rank-reducing subview cannot be inferred in the general case. Just the result rank is not enough. The only thing that we can infer is the layout map. This change also improves the bufferization patterns of tensor.extract_slice and tensor.insert_slice to fully support rank-reducing operations. Differential Revision: https://reviews.llvm.org/D129144	2022-07-05 16:49:07 +02:00
Nicolas Vasilache	7fbf55c927	[mlir][Tensor] Move ParallelInsertSlice to the tensor dialect This is moslty NFC and will allow tensor.parallel_insert_slice to gain rank-reducing semantics by reusing the vast majority of the tensor.insert_slice impl. Depends on D128857 Differential Revision: https://reviews.llvm.org/D128920	2022-07-04 01:53:12 -07:00
Matthias Springer	c0b0b6a00a	[mlir][bufferize] Infer memory space in all bufferization patterns This change updates all remaining bufferization patterns (except for scf.while) and the remaining bufferization infrastructure to infer the memory space whenever possible instead of falling back to "0". (If a default memory space is set in the bufferization options, we still fall back to that value if the memory space could not be inferred.) Differential Revision: https://reviews.llvm.org/D128423	2022-06-27 16:32:52 +02:00
Matthias Springer	45b995cda4	[mlir][bufferize][NFC] Change signature of allocateTensorForShapedValue Add a failure return value and bufferization options argument. This is to keep a subsequent change smaller. Differential Revision: https://reviews.llvm.org/D128278	2022-06-27 16:00:06 +02:00
Matthias Springer	5d50f51c97	[mlir][bufferization][NFC] Add error handling to getBuffer This is in preparation of adding memory space support. Differential Revision: https://reviews.llvm.org/D128277	2022-06-27 13:48:01 +02:00
Matthias Springer	b06614e2e8	[mlir][bufferization][NFC] Change signature of getMemRefType These functions now accep unsigned attributes for address spaces instead of Attributes. Differential Revision: https://reviews.llvm.org/D128275	2022-06-27 10:41:40 +02:00
Alex Zinenko	8b68da2c7d	[mlir] move SCF headers to SCF/{IR,Transforms} respectively This aligns the SCF dialect file layout with the majority of the dialects. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D128049	2022-06-20 10:18:01 +02:00
Jacques Pienaar	8df54a6a03	[mlir] Update accessors to prefixed form (NFC) Follow up from flipping dialects to both, flip accessor used to prefixed variant ahead to flipping from _Both to _Prefixed. This just flips to the accessors introduced in the preceding change which are just prefixed forms of the existing accessor changed from. Mechanical change using helper script https://github.com/jpienaar/llvm-project/blob/main/clang-tools-extra/clang-tidy/misc/AddGetterCheck.cpp and clang-format.	2022-06-18 17:53:22 -07:00
Jacques Pienaar	eca86cb2ed	[mlir] Start migrating more dialects to prefixed form Marked all dialects that could be (reasonably) easily flipped to _Both prefix. Updating the accessors to prefixed form will happen in follow up, this was to flush out conflicts and to mark all dialects explicitly as I plan to flip OpBase default to _Prefixed to avoid needing to migrate new dialects. Except for Standalone example which got flipped to _Prefixed. Differential Revision: https://reviews.llvm.org/D128027	2022-06-18 10:10:31 -07:00
Matthias Springer	b55d55ecd9	[mlir][bufferize][NFC] Remove BufferizationState With the recent refactorings, this class is no longer needed. We can use BufferizationOptions in all places were BufferizationState was used. Differential Revision: https://reviews.llvm.org/D127653	2022-06-17 14:04:11 +02:00
Matthias Springer	b3ebe3beed	[mlir][bufferize] Bufferize after TensorCopyInsertion This change changes the bufferization so that it utilizes the new TensorCopyInsertion pass. One-Shot Bufferize no longer calls the One-Shot Analysis. Instead, it relies on the TensorCopyInsertion pass to make the entire IR fully inplacable. The `bufferize` implementations of all ops are simplified; they no longer have to account for out-of-place bufferization decisions. These were already materialized in the IR in the form of `bufferization.alloc_tensor` ops during the TensorCopyInsertion pass. Differential Revision: https://reviews.llvm.org/D127652	2022-06-17 13:29:52 +02:00
Matthias Springer	996834e681	[mlir][SCF] Fix scf.while bufferization Before this fix, the bufferization implementation made the incorrect assumption that the values yielded from the "before" region must match with the values yielded from the "after" region. Differential Revision: https://reviews.llvm.org/D125835	2022-05-18 00:35:50 +02:00
Matthias Springer	8f42939a07	[mlir][bufferize][NFC] Make getContiguousMemRefType a static function No need to expose this as public API anymore. Differential Revision: https://reviews.llvm.org/D125361	2022-05-13 11:27:43 +02:00
Matthias Springer	248e113e9f	[mlir][bufferize][NFC] Move helper functions to BufferizationOptions Move helper functions for creating allocs/deallocs/memcpys to BufferizationOptions. Differential Revision: https://reviews.llvm.org/D125375	2022-05-11 16:23:22 +02:00
Ashay Rane	53ff0daa7e	[mlir] Fail early if AnalysisState::getBuffer() returns failure This patch updates calls to AnalysisState::getBuffer() so that we return early with a failure if the call does not succeed. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125251	2022-05-10 08:08:38 -07:00
Ashay Rane	e287d647c6	[mlir] Add translation from tensor.reshape to memref.reshape This patch augments the `tensor-bufferize` pass by adding a conversion rule to translate ReshapeOp from the `tensor` dialect to the `memref` dialect, in addition to adding a unit test to validate the translation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D125031	2022-05-09 17:45:07 +02:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
Adrian Kuegel	a74e5a89b9	[mlir] Move isGuaranteedCollapsible to CollapseShapeOp (NFC). It seems more natural than to have it as a static method of ExpandShapeOp. Also fix a typo ("the the" -> "the"). Differential Revision: https://reviews.llvm.org/D124234	2022-04-22 10:31:25 +02:00
Matthias Springer	d7a9bf9143	[mlir][tensor] Fix verifier and bufferization of collapse_shape Insert a buffer copy unless the dims are guaranteed to be collapsible. In the verifier, accept collapses unless they are guaranteed to be non-collapsible. Differential Revision: https://reviews.llvm.org/D123316	2022-04-08 18:20:40 +09:00
Matthias Springer	73c0333dee	[mlir][tensor][bufferize] Support 0-d collapse_shape with offset Differential Revision: https://reviews.llvm.org/D122901	2022-04-01 22:30:37 +09:00

1 2

62 Commits