clang-p2996

Author	SHA1	Message	Date
Alexander Belyaev	f6fb0a4f35	[mlir] Make patterns for folding tensor.empty optional. At the moment, they are a part of EmptyOp::getCanonicalizationPatterns. When extract_slice(tensor.empty) is rewritten as a new tensor.empty, it could happen that we end up with two tensor.empty ops, since the original tensor.empty can have two users. After bufferization such cases result in two allocations. Differential Revision: https://reviews.llvm.org/D139308	2022-12-07 23:01:34 +01:00
Javier Setoain	da291bab81	[mlir] Add hoisting of transfer ops in affine loops The only way to do this with the current hoisting strategy is by lowering Affine to Scf first, but that prevents further passes on Affine. Differential Revision: https://reviews.llvm.org/D137600	2022-12-07 20:08:07 +00:00
Thomas Raoux	f7fda6ba4a	[mlir][linalg] Add extra parameter to tiling reduction to foreach_thread This adds a tile_size parameter, when it is used the tiles are cyclically distributed onto the threads of the scf.foreach_thread op. Differential Revision: https://reviews.llvm.org/D139474	2022-12-07 18:37:05 +00:00
Hanhan Wang	0f297cad4d	[mlir][tensor][linalg] Introduce DataLayoutPropagation pass. It introduces a pattern that swaps `linalg.generic + tensor.pack` to `tensor.pack + linalg.generic`. It requires all the iteration types being parallel; the indexing map of output operand is identiy. They can all be relaxed in the future. The user can decide whether the propagation should be applied or not by passing a control function. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D138882	2022-12-06 15:00:07 -08:00
Mitch Phillips	969f0cba7e	Revert "[mlir] Add hoisting of transfer ops in affine loops" This reverts commit `825da072a8`. Reason: Broke the sanitizer buildbots. See original review for more details: https://reviews.llvm.org/D137600	2022-12-06 09:44:59 -08:00
Javier Setoain	825da072a8	[mlir] Add hoisting of transfer ops in affine loops The only way to do this with the current hoisting strategy is by lowering Affine to Scf first, but that prevents further passes on Affine. Differential Revision: https://reviews.llvm.org/D137600	2022-12-06 10:07:21 +00:00
Guray Ozen	12cc8e7310	[mlir] Fix infinite loop in collapse Incrementing `counter` variable is inside the if statement. If the code does not enter there, the while loop will iterate infinitely. This revision moves the codes outside of if statement. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D139005	2022-12-05 12:57:20 +01:00
Adrian Kuegel	215666d983	[mlir][Linalg] Apply ClangTidy fixes (NFC)	2022-12-05 08:18:00 +01:00
Kazu Hirata	192d9dd731	[mlir] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 19:58:32 -08:00
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
River Riddle	b74192b7ae	[mlir] Remove support for non-prefixed accessors This finishes off a year long pursuit to LLVMify the generated operation accessors, prefixing them with get/set. Support for any other accessor naming is fully removed after this commit. https://discourse.llvm.org/t/psa-raw-accessors-are-being-removed/65629 Differential Revision: https://reviews.llvm.org/D136727	2022-12-02 13:32:36 -08:00
Hanhan Wang	b1d3afc93e	[mlir] Factor more common utils to IndexingUtils Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D139159	2022-12-02 13:27:01 -08:00
Nicolas Vasilache	495acf98da	[mlir][Linalg] NFC - Purge OpBuilder uses in favor of RewriterBase in places unrelated to op definitions RewriterBase is the proper builder to use so one can listen to IR modifications (i.e. not just creation). Differential Revision: https://reviews.llvm.org/D137922	2022-12-02 08:06:29 -08:00
Nicolas Vasilache	3a6ae0f8f5	[mlir][Linalg][NFC] Improve debugging during vectorization Make more systematic use of `notifyMatchFailure`.	2022-12-01 02:49:52 -08:00
Christian Sigg	be065c41d8	[mlir] Change scf::LoopNest to store 'results'. This fixes the case where scf::LoopNest::loops is empty. Change LoopVector and ValueVector to SmallVector. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D136926	2022-12-01 06:51:45 +01:00
Murali Vijayaraghavan	2d2cdf4176	[mlir][linalg] Changing the positions of introduced parallel loop in SplitReduction to be consistent with IREE's downstream passes IREE's passes depend on the behavior of SplitReduction's introduced parallel loop being the same as the introduced dimension in the intermediate tensor (the order of loops was changed in https://reviews.llvm.org/D137478). Differential Revision: https://reviews.llvm.org/D138972	2022-11-30 04:01:07 +00:00
Ivan Butygin	e3f75c1cb7	[mlir][linalg] Allow some fusion on mixed generics Relax linalg elementwise fusion check to allow mixed consumers. Producer is still required to be fully tensor to avoid potential memref aliasing. Differential Revision: https://reviews.llvm.org/D138759	2022-11-29 15:35:02 +01:00
Hanhan Wang	9b16d9d271	[mlir][linalg] Add a new pattern to handle folding unit reduction dims. The output operands will be added to input operands if the generic op (on tensors) becomes an elementwise operation. The outputs of the generic op is still the same. They will be cleaned up by ReplaceWithEmptyTensorIfUnused pattern. This is https://reviews.llvm.org/D138251, plus a cmake dep fix. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D138843	2022-11-28 14:14:43 -08:00
Guray Ozen	135977c92a	[mlir] Export `collapseGenericOpIterationDims` (NFC) This revision exports `collapseGenericOpIterationDims` to a header so it can be used outside of the pattern. We have use-case where we want to call this function directly. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D138697	2022-11-28 13:54:26 +01:00
Ramkumar Ramachandra	537137ece1	mlir/linalg: use std::optional This is part of an effort to migrate from llvm::Optional to std::optional: See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com>	2022-11-27 13:32:20 -08:00
Lorenzo Chelini	a9733b8a5e	[MLIR] Adopt `DenseI64ArrayAttr` in tensor, memref and linalg transform This commit is a first step toward removing inconsistencies between dynamic and static attributes (i64 v. index) by dropping `I64ArrayAttr` and using `DenseI64ArrayAttr` in Tensor, Memref and Linalg Transform ops. In Linalg Transform ops only `TileToScfForOp` and `TileOp` have been updated. See related discussion: https://discourse.llvm.org/t/rfc-inconsistency-between-dynamic-and-static-attributes-i64-v-index/66612/1 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138567	2022-11-25 09:43:30 +01:00
Hanhan Wang	a827c5c7ab	Revert "[mlir][linalg] Add a new pattern to handle folding unit reduction dims." This reverts commit `6eee66d12a`. It breaks builds, see https://lab.llvm.org/buildbot/#/builders/61/builds/35742 Differential Revision: https://reviews.llvm.org/D138633	2022-11-23 19:07:01 -08:00
Hanhan Wang	6eee66d12a	[mlir][linalg] Add a new pattern to handle folding unit reduction dims. The output operands will be added to input operands if the generic op (on tensors) becomes an elementwise operation. The outputs of the generic op is still the same. They will be cleaned up by ReplaceWithEmptyTensorIfUnused pattern. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D138251	2022-11-23 10:47:10 -08:00
Alexander Belyaev	f286af29d8	[mlir] Remove clone methods from DPS interface. Differential Revision: https://reviews.llvm.org/D138586	2022-11-23 19:25:26 +01:00
Mahesh Ravishankar	2d4b998697	[mlir][Linalg] Avoid unnecessary propagating producer result to fused op result. Elementwise op fusion conserves the result of the producer in the fused op, relying on later clean up patterns to drop unused results of the fused op. Instead, if the producer result has no other use apart from the consumer op, avoid making the producer result available in the fused node. This saves some unnecessary IR manipulations. Differential Revision: https://reviews.llvm.org/D138096	2022-11-22 07:08:17 +00:00
Aliia Khasanova	399638f98c	Merge kDynamicSize and kDynamicSentinel into one constant. resolve conflicts Differential Revision: https://reviews.llvm.org/D138282	2022-11-21 13:01:26 +00:00
Murali Vijayaraghavan	dddf6ab272	Simplifying the SplitReduction logic that uses the control to get the dimension where the extra parallel dimension is inserted Currently, the innerParallel and non innerParallel strategies use two different ways to fix for where the extra loop is inserted and where the extra dimension for the intermediate result is inserted - innerParallel adds the extra (parallel) loop right after the pre-existing reduction loop, whereas non innerParallel adds the reduction loop in the successor to the index supplied by control, and the parallel loop in the index supplied by the control. The semantics of the index supplied by the control is supposed to only control where the extra tensor dimension is inserted in the intermediate tensor. Conflating this index with where the reduction (and parallel) loops are inserted leads to more complex (and confusing) logic overall. This differential removes conflating the two uses of the index, and keeps the reduction and parallel loops in the same vicinity and uses the supplied index to only determine the position of the extra tensor dimension. It also simplifies the code by merging the two strategies in a lot more places. Differential Revision: https://reviews.llvm.org/D137478	2022-11-17 22:26:02 +00:00
Mahesh Ravishankar	da8a8e9280	[mlir][Linalg] Move patterns to remove dead arguments and results out of canonicalization. The patterns to remove dead arguments and results of `linalg.generic` operations are not necessarily canonicalizations. Instead a new entry point `populateEraseUnusedOperandsAndResults` is added to allow using these patterns when needed. The transformations that rely on this pattern for cleanup now include these patterns explicitly. Differential Revision: https://reviews.llvm.org/D138085	2022-11-16 16:00:43 +00:00
Thomas Raoux	99833cd818	[mlir][linalg] Add reduction tiling using scf.foreachthread This adds a transformation to tile reduction operations to partial reduction using scf.foreachthread. This uses PartialReductionOpInterface to create a merge operation of the partial tiles. Differential Revision: https://reviews.llvm.org/D137912	2022-11-14 18:05:40 +00:00
Oleg Shyshkov	e6598b053d	Revert "Revert "[mlir][linalg] Replace "string" iterator_types attr with enums in LinalgInterface."" With python code fixed. This reverts commit `41280908e4`.	2022-11-11 10:54:08 +01:00
Guray Ozen	6663f34704	[mlir] Introduce device mapper attribute for `thread_dim_map` and `mapped to dims` `scf.foreach_thread` defines mapping its loops to processors via an integer array, see an example below. A lowering can use this mapping. However, expressing mapping as an integer array is very confusing, especially when there are multiple levels of parallelism. In addition, the op does not verify the integer array. This change introduces device mapping attribute to make mapping descriptive and verifiable. Then it makes GPU transform dialect use it. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [0, 1]} } { thread_dim_mapping = [0, 1]} ``` It first introduces a `DeviceMappingInterface` which is an attribute interface. `scf.foreach_thread` defines its mapping via this interface. A lowering must define its attributes and implement this interface as well. This way gives us a clear validation. The change also introduces two new attributes (`#gpu.thread<x/y/z>` and `#gpu.block<x,y,z>` ). After this change, the above code prints as below, as seen here, this way clarifies the loop mappings. The change also implements consuming of these two new attribute by the transform dialect. Transform dialect binds the outermost loops to the thread blocks and innermost loops to threads. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [#gpu.thread<x>, #gpu.thread<y>]} } { thread_dim_mapping = [#gpu.block<x>, #gpu.block<y>]} ``` Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D137413	2022-11-11 08:44:57 +01:00
Prashant Kumar	04b449e147	The fillOp's value needs to casted During elementwise fusion the fillOp's value was directly referred without casting which can create mismatching dtypes. Reviewed By: mravishankar, ThomasRaoux Differential Revision: https://reviews.llvm.org/D137447	2022-11-10 03:43:22 +00:00
Oleg Shyshkov	41280908e4	Revert "[mlir][linalg] Replace "string" iterator_types attr with enums in LinalgInterface." Breaks linalg python tests. Would need to also update python/mlir/dialects/linalg/opdsl. This reverts commit `b809d73973`.	2022-11-09 15:59:54 +01:00
Oleg Shyshkov	b809d73973	[mlir][linalg] Replace "string" iterator_types attr with enums in LinalgInterface. [RFC: EnumAttr for iterator types in Linalg](https://discourse.llvm.org/t/rfc-enumattr-for-iterator-types-in-linalg/64535) This affect touches and probably breaks most of the code that creates `linalg.generic`. A fix would be to replace calls to `getParallelIteratorTypeName/getReductionIteratorTypeName` with `mlir::utils::IteratorType::parallel/reduction` and types from `StringRef` to `mlir::utils::IteratorType`. Due to limitations of tablegen, shared C++ definition of IteratorType enum lives in StructuredOpsUtils.td, but each dialect should have it's own EnumAttr wrapper. To avoid conflict, all enums in a dialect are put into a separate file with a separate tablegen rule. Test dialect td files are refactored a bit. Printed format of `linalg.generic` temporarily remains unchanged to avoid breaking code and tests in the same change. Differential Revision: https://reviews.llvm.org/D137658	2022-11-09 15:47:29 +01:00
Rob Suderman	9c923f4e58	[mlir][linalg] Fix vectorization of linalg depthwise conv for int types Vectorization of Linalg's depthwise convolution only supports floating point types. Previous version assumed floating point operations would work. This version checks whether the computation is integer or floating point and adjust the inner loop computation. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D137595	2022-11-08 16:21:05 -08:00
Oleg Shyshkov	bada35390a	[mlir][NFC] Remove unnecessary attr name getters from StructuredOpsUtils.h. Those methods were added long time ago. Now we get the same methods generated by tablegen, so there is no need for duplicates. Differential Revision: https://reviews.llvm.org/D137544	2022-11-07 14:40:56 +01:00
Thomas Raoux	3310fe55d9	[mlir][linalg] Add reduction tiling transformation Add a transformation to tile reduction ops into a parallel operation followed by a merge operation. This is equivalent to the existing reduction spliting transformation but using loops instead of using higher dimensions linalg. Differential Revision: https://reviews.llvm.org/D136586	2022-11-03 23:07:12 +00:00
Hanhan Wang	c050dd4717	[mlir][linalg] Add support for vectorizing convs that have different types. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D137208	2022-11-02 11:03:14 -07:00
Nicolas Vasilache	d4c4e49196	[mlir][Linalg] Drop usage of tileWithLinalgTilingOptions in the structured.tile transform This is on a path to deprecation. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 As the interface-based transformation is more generic, some additional folding of AffineMin/MaxOp and some extra canonicalizations are needed. This can be further evolved. Differential Revision: https://reviews.llvm.org/D137195	2022-11-01 14:36:24 -07:00
Alexander Belyaev	b4db15a949	[mlir] Rename getInputs->getDpsInputs and getOutputs->getDpsInits in DPS interface. https://discourse.llvm.org/t/rfc-interface-for-destination-style-ops/64056 Differential Revision: https://reviews.llvm.org/D136943	2022-10-28 15:41:12 +02:00
Alexander Belyaev	c2403f1e3f	[mlir] Fix asan issue in Vectorization.cpp of Linalg. Differential Revision: https://reviews.llvm.org/D136852	2022-10-27 18:11:08 +02:00
Matthias Springer	b169643f3a	[mlir][interfaces] Remove getDestinationOperands from TilingInterface `getDestinationOperands` was almost a duplicate of `DestinationStyleOpInterface::getOutputOperands`. Now that the interface has been moved to mlir/Interfaces, it is no longer needed. Differential Revision: https://reviews.llvm.org/D136240	2022-10-24 09:26:19 +02:00
Aliia Khasanova	fb4cedcc1e	[mlir][nfc] Clean-up usage of kDynamicSize. This patch prepares MLIR code base to change the value of kDynamicSize. https://discourse.llvm.org/t/rfc-unify-kdynamicsize-and-kdynamicstrideoroffset/64534/4 Differential Revision: https://reviews.llvm.org/D136327	2022-10-20 13:54:57 +00:00
Matthias Springer	cfc9ddaafc	[mlir][interfaces][NFC] Move DestinationStyleOpInterface to mlir/Interfaces This is the second (and final) step of making "destination style" usable without depending on the Linalg dialect. (The first step was D135129.) This change allows us to provide default bufferization implementations for all destination-style ops. It also allows us to simplify `TilingInterface`. (E.g., `getDestinationOperands` can be removed.) Differential Revision: https://reviews.llvm.org/D136179	2022-10-18 17:39:06 +02:00
Che-Yu Wu	d09bef82c0	[MLIR] Vectorize tensor.extract on 1-d tensor This patch implements the vectorization of tensor.extract for the basic 1-d lookup case. It only vectorizes the tensor.extract to a vector.gather when the op extracts value from an 1-d tensor. Related discussion: https://github.com/iree-org/iree/issues/9198 Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D133786	2022-10-18 00:06:02 +00:00
Alexander Belyaev	a7cccb9cbb	[mlir] Simplify DestinationStyleOpInterface. Differential Revision: https://reviews.llvm.org/D135348	2022-10-17 12:43:41 +02:00
Oleg Shyshkov	c38d9cf20e	[mlir] Remove iterator_types() method from LinalgStructuredInterface. `getIteratorTypesArray` should be used instead. It's a better substitute for all the current usages of the interface. The current `ArrayAttr iterator_types()` has a few problems: * It creates an assumption operation has iterators types as an attribute, but it's not always the case. Sometime iterator types can be inferred from other attribute, or they're just static. * ArrayAttr is an obscure contained and required extracting values in the client code. * Makes it hard to migrate iterator types from strings to enums ([RFC](https://discourse.llvm.org/t/rfc-enumattr-for-iterator-types-in-linalg/64535/9)). Concrete ops, like `linalg.generic` will still have iterator types as an attribute if needed. As a side effect, this change helps a bit with migration to prefixed accessors. Differential Revision: https://reviews.llvm.org/D135765	2022-10-13 07:52:43 +00:00
Nicolas Vasilache	f4ad1b6f69	[mlir][Linalg] Quarantine usage of LinalgTransformationFilter to TestTilingInterface. This revision also retires code that has now become dead. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 Differential Revision: https://reviews.llvm.org/D135771	2022-10-12 08:36:51 -07:00
Nicolas Vasilache	e0cea169f7	[mlir][Linalg] Drop filter-based splitReduction This transformation is available and tested via the transform dialect. Differential Revision: https://reviews.llvm.org/D135767	2022-10-12 07:27:35 -07:00
Nicolas Vasilache	bcfbf8cc41	[mlir][Linalg] NFC - Drop filter from LinalgGeneralizationPattern Differential Revision: https://reviews.llvm.org/D135761	2022-10-12 04:47:12 -07:00

1 2 3 4 5 ...

1195 Commits