clang-p2996

Author	SHA1	Message	Date
MaheshRavishankar	76ead96c1d	[mlir][TilingInterface] Use `LoopLikeOpInterface` in tiling using SCF to unify tiling with `scf.for` and `scf.forall`. (#77874) Using `LoopLikeOpInterface` as the basis for the implementation unifies all the tiling logic for both `scf.for` and `scf.forall`. The only difference is the actual loop generation. This is a follow up to https://github.com/llvm/llvm-project/pull/72178 Instead of many entry points for each loop type, the loop type is now passed as part of the options passed to the tiling method. This is a breaking change with the following changes: 1) The `scf::tileUsingSCFForOp` is renamed to `scf::tileUsingSCF`. 2) The `scf::tileUsingSCFForallOp` is deprecated. The same functionality is obtained by using `scf::tileUsingSCF` and setting the loop type in `scf::SCFTilingOptions` passed into this method to `scf::SCFTilingOptions::LoopType::ForallOp` (using the `setLoopType` method). 3) The `scf::tileConsumerAndFusedProducerGreedilyUsingSCFForOp` is renamed to `scf::tileConsumerAndFuseProducerUsingSCF`. The use of the `controlFn` in `scf::SCFTileAndFuseOptions` allows implementing any strategy, with the default callback implementing the greedy fusion. 4) The `scf::SCFTilingResult` and `scf::SCFTileAndFuseResult` now use `SmallVector<LoopLikeOpInterface>`. 5) To make `scf::ForallOp` implement the parts of `LoopLikeOpInterface` needed, the `getOutputBlockArguments()` method is replaced with `getRegionIterArgs()`. These changes now bring the tiling and fusion capabilities using `scf.forall` on par with what was already supported by `scf.for`.	2024-01-25 21:26:23 -08:00
Matthias Springer	5fcf907b34	[mlir][IR] Rename "update root" to "modify op" in rewriter API (#78260 ) This commit renames 4 pattern rewriter API functions: * `updateRootInPlace` -> `modifyOpInPlace` * `startRootUpdate` -> `startOpModification` * `finalizeRootUpdate` -> `finalizeOpModification` * `cancelRootUpdate` -> `cancelOpModification` The term "root" is a misnomer. The root is the op that a rewrite pattern matches against (https://mlir.llvm.org/docs/PatternRewriter/#root-operation-name-optional). A rewriter must be notified of all in-place op modifications, not just in-place modifications of the root (https://mlir.llvm.org/docs/PatternRewriter/#pattern-rewriter). The old function names were confusing and have contributed to various broken rewrite patterns. Note: The new function names use the term "modify" instead of "update" for consistency with the `RewriterBase::Listener` terminology (`notifyOperationModified`).	2024-01-17 11:08:59 +01:00
MaheshRavishankar	aa2a96a24a	[mlir][TilingInterface] Move TilingInterface tests to use transform dialect ops. (#77204) In the process a couple of test transform dialect ops are added just for testing. These operations are not intended to be used as fully fleshed-out transformation ops; they are added only for testing. A separate operation is added to `LinalgTransformOps.td` to convert a `TilingInterface` operation to loops using the `generateScalarImplementation` method implemented by the operation. Eventually this and other operations related to tiling using the `TilingInterface` need to move to a better place (i.e. out of the `Linalg` dialect).	2024-01-11 21:31:03 -08:00
MaheshRavishankar	4435ced949	[mlir][TilingInterface] Allow controlling what fusion is done within tile and fuse (#76871) Currently the `tileConsumerAndFuseProducerGreedilyUsingSCFFor` method greedily fuses through all slices that are generated during the tile and fuse flow. That is not the normal use case. Ideally the caller would like to control which slices get fused and which don't. This patch introduces a new field to the `SCFTileAndFuseOptions` to specify this control. The control function also allows the caller to specify if the replacement for the fused producer needs to be yielded from within the tiled computation. This allows replacing the fused producers in case they have other uses. Without this the original producers still survive, negating the utility of the fusion. The change here also means that the name of the function `tileConsumerAndFuseProducerGreedily...` can be updated. Deferring that to a later stage to reduce the churn of API changes.	2024-01-08 13:26:10 -08:00
Han-Chung Wang	899c2bed9e	[mlir][TilingInterface] Early return cloned ops if tile sizes are zeros. (#75410) It is a trivial early-return case. If the cloned ops are not returned, it will generate an `extract_slice` op that extracts the whole slice. However, it is not folded away. Early-return to avoid the case. E.g., ```mlir func.func @matmul_tensors( %arg0: tensor<?x?xf32>, %arg1: tensor<?x?xf32>, %arg2: tensor<?x?xf32>) -> tensor<?x?xf32> { %0 = linalg.matmul ins(%arg0, %arg1: tensor<?x?xf32>, tensor<?x?xf32>) outs(%arg2: tensor<?x?xf32>) -> tensor<?x?xf32> return %0 : tensor<?x?xf32> } module attributes {transform.with_named_sequence} { transform.named_sequence @__transform_main(%arg1: !transform.any_op {transform.readonly}) { %0 = transform.structured.match ops{["linalg.matmul"]} in %arg1 : (!transform.any_op) -> !transform.any_op %1 = transform.structured.tile_using_for %0 [0, 0, 0] : (!transform.any_op) -> (!transform.any_op) transform.yield } } ``` Apply the transforms and canonicalize the IR: ``` mlir-opt --transform-interpreter -canonicalize input.mlir ``` we will get ```mlir module { func.func @matmul_tensors(%arg0: tensor<?x?xf32>, %arg1: tensor<?x?xf32>, %arg2: tensor<?x?xf32>) -> tensor<?x?xf32> { %c1 = arith.constant 1 : index %c0 = arith.constant 0 : index %dim = tensor.dim %arg0, %c0 : tensor<?x?xf32> %dim_0 = tensor.dim %arg0, %c1 : tensor<?x?xf32> %dim_1 = tensor.dim %arg1, %c1 : tensor<?x?xf32> %extracted_slice = tensor.extract_slice %arg0[0, 0] [%dim, %dim_0] [1, 1] : tensor<?x?xf32> to tensor<?x?xf32> %extracted_slice_2 = tensor.extract_slice %arg1[0, 0] [%dim_0, %dim_1] [1, 1] : tensor<?x?xf32> to tensor<?x?xf32> %extracted_slice_3 = tensor.extract_slice %arg2[0, 0] [%dim, %dim_1] [1, 1] : tensor<?x?xf32> to tensor<?x?xf32> %0 = linalg.matmul ins(%extracted_slice, %extracted_slice_2 : tensor<?x?xf32>, tensor<?x?xf32>) outs(%extracted_slice_3 : tensor<?x?xf32>) -> tensor<?x?xf32> return %0 : tensor<?x?xf32> } } ``` This revision returns early in that case, so we get: ```mlir func.func @matmul_tensors(%arg0: tensor<?x?xf32>, %arg1: tensor<?x?xf32>, %arg2: tensor<?x?xf32>) -> tensor<?x?xf32> { %0 = linalg.matmul ins(%arg0, %arg1 : tensor<?x?xf32>, tensor<?x?xf32>) outs(%arg2 : tensor<?x?xf32>) -> tensor<?x?xf32> return %0 : tensor<?x?xf32> } ```	2023-12-19 09:14:43 -08:00
MaheshRavishankar	ec1086f2a0	Fix build error from #72178 (#72905 )	2023-11-20 23:09:59 -08:00
Jie Fu	3e6ae77950	[mlir] Non-void lambda does not return a value in all control paths in yieldReplacementForFusedProducer (NFC) /llvm-project/mlir/lib/Dialect/SCF/Transforms/TileUsingInterface.cpp:703:5: error: non-void lambda does not return a value in all control paths [-Werror,-Wreturn-type] }; ^ 1 error generated.	2023-11-21 09:11:50 +08:00
MaheshRavishankar	4a020018ce	[NFC] Simplify the tiling implementation using cloning. (#72178 ) The current implementation of tiling using `scf.for` is convoluted to make sure that the destination passing style of the untiled program is preserved. The addition of support to tile using `scf.forall` (adapted from the transform operation in Linalg) in https://github.com/llvm/llvm-project/pull/67083 used cloning of the tiled operations to better streamline the implementation. This PR adapts the other tiling methods to use a similar approach, making the transformations (and handling destination passing style semantics) more systematic. --------- Co-authored-by: Abhishek-Varma <avarma094@gmail.com>	2023-11-20 09:05:48 -08:00
Matthias Springer	98a6edd38f	[mlir][Interfaces] `LoopLikeOpInterface`: Expose tied loop results (#70535 ) Expose loop results, which correspond to the region iter_arg values that are returned from the loop when there are no more iterations. Exposing loop results is optional because some loops (e.g., `scf.while`) do not have a 1-to-1 mapping between region iter_args and op results. Also add additional helper functions to query tied results/iter_args/inits.	2023-11-01 08:34:14 +09:00
Matthias Springer	3cd2a0bc1a	[mlir][Interfaces] `LoopLikeOpInterface`: Add helpers to query tied inits/iter_args (#70408 ) The `LoopLikeOpInterface` already has interface methods to query inits and iter_args. This commit adds helper functions to query tied init/iter_arg pairs and removes the corresponding functions for `scf::ForOp`.	2023-10-28 12:10:36 +09:00
Adrian Kuegel	1c27899e24	[mlir][SCF] Pass result of getAsOpFoldResult to getBoundedTileSize. A recent change modified the parameter tileSize from Value to OpFoldResult. Therefore we should call getAsOpFoldResult before passing on the tileSize. Adjust a test regarding this new behavior.	2023-10-20 10:25:32 +00:00
MaheshRavishankar	d871daea81	[mlir][TilingInterface] Add scf::tileUsingSCFForallOp method to tile using the interface to generate `scf::forall`. (#67083 ) Similar to `scf::tileUsingSCFForOp` that is a method that tiles operations that implement the `TilingInterface`, using `scf.for` operations, this method introduces tiling of operations using `scf.forall`. Most of this implementation is derived from `linalg::tileToForallOp` method. Eventually that method will either be deprecated or moved to use the method introduced here.	2023-10-19 23:21:45 -07:00
Matthias Springer	8823e961f6	[mlir][ODS] Change `get...Mutable` to return `OpOperand &` for single operands (#66519 ) The TableGen code generator now generates C++ code that returns a single `OpOperand &` for `get...Mutable` of operands that are not variadic and not optional. `OpOperand::set`/`assign` can be used to set a value (same as `MutableOperandRange::assign`). This is safer than `MutableOperandRange` because only single values (and no longer `ValueRange`) can be assigned. E.g.: ``` // Assignment of multiple values to non-variadic operand. // Before: Compiles, but produces invalid op. // After: Compilation error. extractSliceOp.getSourceMutable().assign({v1, v2}); ```	2023-10-04 08:35:40 +02:00
Adrian Kuegel	d2b7a8e83e	[mlir] Partial revert of `93c42299bd` This part of the change was not NFC.	2023-09-27 06:27:04 +00:00
Matthias Springer	63086d6aa0	[mlir][Interfaces] `LoopLikeOpInterface`: Add `replaceWithAdditionalYields` (#67121 ) `affine::replaceForOpWithNewYields` and `replaceLoopWithNewYields` (for "scf.for") are now interface methods and additional loop-carried variables can now be added to "scf.for"/"affine.for" uniformly. (No more `TypeSwitch` needed.) Note: `scf.while` and other loops with loop-carried variables can implement `replaceWithAdditionalYields`, but to keep this commit small, that is not done in this commit.	2023-09-27 07:53:39 +02:00
MaheshRavishankar	93c42299bd	[mlir][TilingInterface] NFC code changes separated out from introduction of `scf::tileUsingSCFForallop`. (#67081 ) This patch contains NFC changes that are precursor to the introduction of `scf::tileUsingSCFForallOp` method introduced in https://github.com/llvm/llvm-project/pull/67083.	2023-09-26 13:42:27 -07:00
Matthias Springer	0b2197b0cf	[mlir][Interfaces] Clean up `DestinationStyleOpInterface` (#67015 ) * "init" operands are specified with `MutableOperandRange` (which gives access to the underlying `OpOperand `). No more magic numbers. Remove most interface methods and make them helper functions. Only `getInitsMutable` should be implemented. * Provide separate helper functions for accessing mutable/immutable operands (`OpOperand`/`Value`, in line with #66515): `getInitsMutable` and `getInits` (same naming convention as auto-generated op accessors). `getInputOperands` was not renamed because this function cannot return a `MutableOperandRange` (because the operands are not necessarily consecutive). `OpOperandVector` is no longer needed. * The new `getDpsInits`/`getDpsInitsMutable` is more efficient than the old `getDpsInitOperands` because no `SmallVector` is created. The new functions return a range of operands. * Fix a bug in `getDpsInputOperands`: out-of-bounds operands were potentially returned.	2023-09-21 18:04:08 +02:00
Matthias Springer	d69293c1c8	[mlir][SCF] `ForOp`: Remove `getIterArgNumberForOpOperand` (#66629 ) This function was inconsistent with the remaining API because it accepted `OpOperand &` that do not belong to the op. All the other functions assert. This helper function is also not really necessary, as the iter_arg number is identical to the result number.	2023-09-19 17:33:40 +02:00
Matthias Springer	6923a31542	[mlir][IR] Change `MutableArrayRange` to enumerate `OpOperand &` (#66622 ) In line with #66515, change `MutableArrayRange::begin`/`end` to enumerate `OpOperand &` instead of `Value`. Also remove `ForOp::getIterOpOperands`/`setIterArg`, which are now redundant. Note: `MutableOperandRange` cannot be made a derived class of `indexed_accessor_range_base` (like `OperandRange`), because `MutableOperandRange::assign` can change the number of operands in the range.	2023-09-19 09:09:21 +02:00
MaheshRavishankar	170a25a793	[mlir][TilingInterface] Make the tiling set tile sizes function use `OpFoldResult`. (#66566 )	2023-09-18 17:18:51 -07:00
Matthias Springer	0f952cfe24	[mlir][IR] Change `MutableOperandRange::operator[]` to return an `OpOperand &` (#66515 ) `operator[]` returns `OpOperand &` instead of `Value`. * This allows users to get OpOperands by name instead of "magic" number. E.g., `extractSliceOp->getOpOperand(0)` can be written as `extractSliceOp.getSourceMutable()[0]`. * `OperandRange` provides a read-only API to operands: `operator[]` returns `Value`. `MutableOperandRange` now provides a mutable API: `operator[]` returns `OpOperand &`, which can be used to set operands. Note: The TableGen code generator could be changed to return `OpOperand &` (instead of `MutableOperandRange`) for non-variadic and non-optional arguments in a subsequent change. Then the `[0]` part in the above example would no longer be necessary.	2023-09-18 09:43:03 +02:00
Groverkss	2cc5f5d43c	[mlir][Linalg] Implement tileReductionUsingScf for multiple reductions This patch improves the reduction tiling for linalg to support multiple reduction dimensions. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D158005	2023-08-17 02:17:03 +05:30
Matthias Springer	6596b0dde8	[mlir][tensor] Clean up tensor::DimOp usage * Remove duplicate functions. `tensor::getMixedSize` and `tensor::getMixedSizes` should be used. * Use `tensor::getMixedSize` instead of `createOrFold<tensor::DimOp>`. This is more efficient. `createOrFold` will create an op and immediately try to fold it. In case of a static dimension size, an attribute can be used directly. Differential Revision: https://reviews.llvm.org/D153332	2023-06-22 10:56:17 +02:00
Tres Popp	5550c82189	[mlir] Move casting calls from methods to function calls The MLIR classes Type/Attribute/Operation/Op/Value support cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast functionality in addition to defining methods with the same name. This change begins the migration of uses of the method to the corresponding function call as has been decided as more consistent. Note that there still exist classes that only define methods directly, such as AffineExpr, and this does not include work currently to support a functional cast/isa call. Caveats include: - This clang-tidy script probably has more problems. - This only touches C++ code, so nothing that is being generated. Context: - https://mlir.llvm.org/deprecation/ at "Use the free function variants for dyn_cast/cast/isa/…" - Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443 Implementation: This first patch was created with the following steps. The intention is to only do automated changes at first, so I waste less time if it's reverted, and so the first mass change is more clear as an example to other teams that will need to follow similar steps. Steps are described per line, as comments are removed by git: 0. Retrieve the change from the following to build clang-tidy with an additional check: https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check 1. Build clang-tidy 2. Run clang-tidy over your entire codebase while disabling all checks and enabling the one relevant one. Run on all header files also. 3. Delete .inc files that were also modified, so the next build rebuilds them to a pure state. 4. 
Some changes have been deleted for the following reasons: - Some files had a variable also named cast - Some files had not included a header file that defines the cast functions - Some files are definitions of the classes that have the casting methods, so the code still refers to the method instead of the function without adding a prefix or removing the method declaration at the same time. ``` ninja -C $BUILD_DIR clang-tidy run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-,misc-cast-functions'\ -header-filter=mlir/ mlir/ -fix rm -rf $BUILD_DIR/tools/mlir/*/.inc git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\ mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\ mlir/lib/**/IR/\ mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\ mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\ mlir/test/lib/Dialect/Test/TestTypes.cpp\ mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\ mlir/test/lib/Dialect/Test/TestAttributes.cpp\ mlir/unittests/TableGen/EnumsGenTest.cpp\ mlir/test/python/lib/PythonTestCAPI.cpp\ mlir/include/mlir/IR/ ``` Differential Revision: https://reviews.llvm.org/D150123	2023-05-12 11:21:25 +02:00
Matthias Springer	4c48f016ef	[mlir][Affine][NFC] Wrap dialect in "affine" namespace This cleanup aligns the affine dialect with all the other dialects. Differential Revision: https://reviews.llvm.org/D148687	2023-04-20 11:19:21 +09:00
Oleg Shyshkov	f080f1122f	[mlir][scf] Create constants for tiling in parent with isolated region. FuncOp is IsolatedFromAbove, so this change doesn't alter current behaviour, but the current code fails if the tiled op is in an op with the IsolatedFromAbove trait. An alternative would be to create constants in the same region where they're used and rely on CSE to figure out where to move them. Differential Revision: https://reviews.llvm.org/D147273	2023-03-31 18:27:30 +02:00
Mahesh Ravishankar	3af1c48c66	Changes to `SCFFuseProducerOfSliceResult` to also return the operations created during fusion. This is a follow-up to https://reviews.llvm.org/D145133 that allows propagating information about ops that are fused back to the caller. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D146254	2023-03-20 20:55:48 +00:00
Mahesh Ravishankar	809e3d8c98	[mlir][TilingInterface] Modify `TilingInterface` methods to better return the state of the transformed IR. Currently the `getTiledImplementation` and `generateResultTileValue` return just `SmallVector<Operation *>` and `FailureOr<Value>`. - For `getTiledImplementation` returning empty implies tiling wasn't done. There is also an implicit assumption that the tiled operation results correspond to the tiled values of the result of the original operation. This cannot handle cases where the tiled implementation might use multiple operations to compute the tiled value for the results of the untiled operation. Sometimes, the tiled operation might not directly give the tiled values, and might require casts, etc. to get a replacement. - For `generateResultTileValue`, it is assumed that the op defining the returned `Value` is the operation that represents the tiled computation. Again, the presence of casts, etc. violates this. Instead make these methods return ``` struct TilingResult { SmallVector<Operation *> tiledOps; SmallVector<Value> tiledValues; }; ``` The `tiledOps` represent the operations generated that are relevant for subsequent transformations. The `tiledValues` represent the tiled values for the results of the original operation. This better transmits the state of the transformed IR. As a consequence the following methods also return `FailureOr<TilingResult>` - `tensor::replaceExtractSliceWithTiledProducer` - `tensor::bubbleUpPadSlice` Differential Revision: https://reviews.llvm.org/D145133	2023-03-16 14:29:03 +00:00
Jakub Kuderski	a0a76804c4	[ADT] Allow `llvm::enumerate` to enumerate over multiple ranges This does not work by a mere composition of `enumerate` and `zip_equal`, because C++17 does not allow for recursive expansion of structured bindings. This implementation uses `zippy` to manage the iteratees and adds the stream of indices as the first zipped range. Because we have an upfront assertion that all input ranges are of the same length, we only need to check if the second range has ended during iteration. As a consequence of using `zippy`, `enumerate` will now follow the reference and lifetime semantics of the `zip*` family of functions. The main difference is that `enumerate` exposes each tuple of references through a new tuple-like type `enumerate_result`, with the familiar `.index()` and `.value()` member functions. Because the `enumerate_result` returned on dereference is a temporary, enumeration result can no longer be used through an lvalue ref. Reviewed By: dblaikie, zero9178 Differential Revision: https://reviews.llvm.org/D144503	2023-03-15 19:34:22 -04:00
Nicolas Vasilache	1cff4cbda3	[mlir][Transform] NFC - Various API cleanups and use RewriterBase in lieu of PatternRewriter Depends on: D145685 Differential Revision: https://reviews.llvm.org/D145977	2023-03-14 04:23:12 -07:00
Kazu Hirata	5c9013e266	Use std::optional instead of llvm::Optional (NFC)	2023-01-28 00:45:19 -08:00
Mahesh Ravishankar	dbbd907015	[mlir][TilingInterface] Fix use after free error from D141028. The `candidateSliceOp` was replaced and then used in a subsequent call. Instead just replace its uses. The op is dead and will be removed with CSE. Differential Revision: https://reviews.llvm.org/D141869	2023-01-16 20:59:50 +00:00
Mahesh Ravishankar	9db7d4edd8	[mlir][TilingInterface] Add an option to tile and fuse to yield replacement for the fused producer. This patch adds an option to the method that fuses a producer with a tiled consumer, to also yield from the tiled loops a value that can be used to replace the original producer. This is only valid if it can be ascertained that the slice of the producer computed within each iteration of the tiled loop nest does not compute slices of the producer redundantly. The analysis to derive this is very involved, so this is left to the caller to ascertain. A test is added that mimics the `scf::tileConsumerAndFuseProducersGreedilyUsingSCFForOp`, but also yields the values of all fused producers. This can be used as a reference for how a caller could use this functionality. Differential Revision: https://reviews.llvm.org/D141028	2023-01-16 18:30:13 +00:00
Mahesh Ravishankar	ce349ff1a4	[mlir][TilingInterface] NFC: Separate out a utility method to perform one step of tile + fuse. Differential Revision: https://reviews.llvm.org/D141027	2023-01-16 05:03:41 +00:00
Mahesh Ravishankar	94f2a6ddde	[mlir][TilingInterface] NFC: Consolidate yield handling. Add a new utility method to yield the tiled value as well as preserving destination passing style. Differential Revision: https://reviews.llvm.org/D139392	2023-01-16 05:03:41 +00:00
Alex Zinenko	faac898987	[mlir] fix out-of-bounds in reduction tiling A transformation tiling a reduction dimension of a Linalg op needs a tile size for said dimension. When an insufficient number of dimensions was provided, it would segfault due to out-of-bounds access to a vector. Also fix incorrect error reporting in the structured transform op exercising this functionality. Reviewed By: springerm, ThomasRaoux Differential Revision: https://reviews.llvm.org/D141046	2023-01-05 15:20:26 +00:00
Fangrui Song	cbb0981388	[mlir] llvm::Optional::value => operator*/operator-> std::optional::value() has undesired exception checking semantics and is unavailable in older Xcode (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). The call sites block std::optional migration.	2022-12-17 19:07:38 +00:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Hanhan Wang	b1d3afc93e	[mlir] Factor more common utils to IndexingUtils Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D139159	2022-12-02 13:27:01 -08:00
Mehdi Amini	fbfca43e6d	Apply clang-tidy fixes for llvm-qualified-auto in TileUsingInterface.cpp (NFC)	2022-11-15 18:14:01 +00:00
Hanhan Wang	52ffc72818	[mlir][tiling] Relax tiling to accept generating multiple operations. Some operations need to generate multiple operations when implementing the tiling interface. Here is a sound example in IREE, see https://github.com/iree-org/iree/pull/10905 for more details. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D137300	2022-11-04 13:59:24 -07:00
Thomas Raoux	3310fe55d9	[mlir][linalg] Add reduction tiling transformation Add a transformation to tile reduction ops into a parallel operation followed by a merge operation. This is equivalent to the existing reduction splitting transformation but uses loops instead of higher-dimensional linalg ops. Differential Revision: https://reviews.llvm.org/D136586	2022-11-03 23:07:12 +00:00
Nicolas Vasilache	d4c4e49196	[mlir][Linalg] Drop usage of tileWithLinalgTilingOptions in the structured.tile transform This is on a path to deprecation. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 As the interface-based transformation is more generic, some additional folding of AffineMin/MaxOp and some extra canonicalizations are needed. This can be further evolved. Differential Revision: https://reviews.llvm.org/D137195	2022-11-01 14:36:24 -07:00
Hanhan Wang	71cf48a62a	[mlir][scf] Enhance sizes computation in tileUsingSCFForOp. The boundary is always 1 if the tile size is 1. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D136884	2022-10-28 13:03:10 -07:00
Alexander Belyaev	b4db15a949	[mlir] Rename getInputs->getDpsInputs and getOutputs->getDpsInits in DPS interface. https://discourse.llvm.org/t/rfc-interface-for-destination-style-ops/64056 Differential Revision: https://reviews.llvm.org/D136943	2022-10-28 15:41:12 +02:00
Matthias Springer	b169643f3a	[mlir][interfaces] Remove getDestinationOperands from TilingInterface `getDestinationOperands` was almost a duplicate of `DestinationStyleOpInterface::getOutputOperands`. Now that the interface has been moved to mlir/Interfaces, it is no longer needed. Differential Revision: https://reviews.llvm.org/D136240	2022-10-24 09:26:19 +02:00
Mehdi Amini	6d4baa7442	Apply clang-tidy fixes for performance-unnecessary-value-param in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Mehdi Amini	2a6f0fb34a	Apply clang-tidy fixes for performance-for-range-copy in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Nicolas Vasilache	7915027926	[mlir][Linalg] Retire LinalgStrategyTileAndFusePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 In the process, also retire `tileConsumerAndFuseProducers` that is now replaced by `tileConsumerAndFuseProducerGreedilyUsingSCFForOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 When performing this replacement, a change of behavior appeared: the older `tileConsumerAndFuseProducers` would split the parallel and non-parallel dimensions automatically and perform a first level of tile-and-fuse on parallel dimensions only and then introduce a second level of tiling-only on the reduction dimensions. The newer `tileConsumerAndFuseProducerGreedilyUsingSCFForOp` on the other hand does not perform this breakdown. As a consequence, the transform specification is evolved to produce the same output. Additionally, replace some uses of `unsigned` by `int64_t` where possible without pulling in larger interface changes (left for a future PR). Context: https://www.youtube.com/watch?v=Puio5dly9N8 Lastly, tests that were performing tile and fuse and distribute on tensors are retired: the generated IR mixing scf.for, tensors and distributed processor ids was racy at best .. Differential Revision: https://reviews.llvm.org/D135559	2022-10-10 07:04:01 -07:00
Adrian Kuegel	67bcf9825a	[mlir][SCF] Apply ClangTidyPerformance finding (NFC)	2022-09-30 12:47:32 +02:00

72 Commits