clang-p2996

Author	SHA1	Message	Date
Alex Zinenko	faac898987	[mlir] fix out-of-bounds in reduction tiling A transformation tiling a reduction dimension of a Linalg op needs a tile size for said dimension. When an insufficient number of dimensions was provided, it would segfault due to out-of-bounds access to a vector. Also fix incorrect error reporting in the structured transform op exercising this functionality. Reviewed By: springerm, ThomasRaoux Differential Revision: https://reviews.llvm.org/D141046	2023-01-05 15:20:26 +00:00
Matthias Springer	3a5811a337	[mlir][affine][NFC] Extract core functionality of `canonicalizeMinMaxOp` Move code from SCF to Affine: Add a new helper function `simplifyConstrainedMinMaxOp` to Affine/Analysis/Utils.h. `canonicalizeMinMaxOp` was originally designed for loop peeling, but it is not SCF-specific and can be used to simplify any affine.min/max ops. Various functions in SCF/Transforms are simplified by dropping unnecessary parameters. Differential Revision: https://reviews.llvm.org/D140962	2023-01-04 11:25:44 +01:00
Fangrui Song	cbb0981388	[mlir] llvm::Optional::value => operator*/operator-> std::optional::value() has undesired exception checking semantics and is unavailable in older Xcode (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). The call sites block std::optional migration.	2022-12-17 19:07:38 +00:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Kazu Hirata	70c73d1b72	[mlir] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:23:50 -08:00
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
River Riddle	b74192b7ae	[mlir] Remove support for non-prefixed accessors This finishes off a year long pursuit to LLVMify the generated operation accessors, prefixing them with get/set. Support for any other accessor naming is fully removed after this commit. https://discourse.llvm.org/t/psa-raw-accessors-are-being-removed/65629 Differential Revision: https://reviews.llvm.org/D136727	2022-12-02 13:32:36 -08:00
Hanhan Wang	b1d3afc93e	[mlir] Factor more common utils to IndexingUtils Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D139159	2022-12-02 13:27:01 -08:00
Lei Zhang	9bb633741a	[mlir][bufferization] Support general Attribute as memory space MemRef has been accepting a general Attribute as memory space for a long time. This commits updates bufferization side to catch up, which allows downstream users to plugin customized symbolic memory space. This also eliminates quite a few `getMemorySpaceAsInt` calls, which is deprecated. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D138330	2022-11-21 09:40:50 -05:00
Mahesh Ravishankar	fc367dfa67	[mlir] Remove `Transforms/SideEffectUtils.h` and move the methods into `Interface/SideEffectInterfaces.h`. The methods in `SideEffectUtils.h` (and their implementations in `SideEffectUtils.cpp`) seem to have similar intent to methods already existing in `SideEffectInterfaces.h`. Move the decleration (and implementation) from `SideEffectUtils.h` (and `SideEffectUtils.cpp`) into `SideEffectInterfaces.h` (and `SideEffectInterface.cpp`). Also drop the `SideEffectInterface::hasNoEffect` method in favor of `mlir::isMemoryEffectFree` which actually recurses into the operation instead of just relying on the `hasRecursiveMemoryEffectTrait` exclusively. Differential Revision: https://reviews.llvm.org/D137857	2022-11-15 20:07:35 +00:00
Mehdi Amini	fbfca43e6d	Apply clang-tidy fixes for llvm-qualified-auto in TileUsingInterface.cpp (NFC)	2022-11-15 18:14:01 +00:00
Guray Ozen	6663f34704	[mlir] Introduce device mapper attribute for `thread_dim_map` and `mapped to dims` `scf.foreach_thread` defines mapping its loops to processors via an integer array, see an example below. A lowering can use this mapping. However, expressing mapping as an integer array is very confusing, especially when there are multiple levels of parallelism. In addition, the op does not verify the integer array. This change introduces device mapping attribute to make mapping descriptive and verifiable. Then it makes GPU transform dialect use it. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [0, 1]} } { thread_dim_mapping = [0, 1]} ``` It first introduces a `DeviceMappingInterface` which is an attribute interface. `scf.foreach_thread` defines its mapping via this interface. A lowering must define its attributes and implement this interface as well. This way gives us a clear validation. The change also introduces two new attributes (`#gpu.thread<x/y/z>` and `#gpu.block<x,y,z>` ). After this change, the above code prints as below, as seen here, this way clarifies the loop mappings. The change also implements consuming of these two new attribute by the transform dialect. Transform dialect binds the outermost loops to the thread blocks and innermost loops to threads. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [#gpu.thread<x>, #gpu.thread<y>]} } { thread_dim_mapping = [#gpu.block<x>, #gpu.block<y>]} ``` Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D137413	2022-11-11 08:44:57 +01:00
Hanhan Wang	52ffc72818	[mlir][tiling] Relax tiling to accept generating multiple operations. Some operations need to generate multiple operations when implementing the tiling interface. Here is a sound example in IREE, see https://github.com/iree-org/iree/pull/10905 for more details. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D137300	2022-11-04 13:59:24 -07:00
Thomas Raoux	3310fe55d9	[mlir][linalg] Add reduction tiling transformation Add a transformation to tile reduction ops into a parallel operation followed by a merge operation. This is equivalent to the existing reduction spliting transformation but using loops instead of using higher dimensions linalg. Differential Revision: https://reviews.llvm.org/D136586	2022-11-03 23:07:12 +00:00
Peiming Liu	1ca119728e	[mlir][scf] support 1:N type conversion for scf.if/while/condition Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D137100	2022-11-02 16:53:36 +00:00
Peiming Liu	f4cd3674ea	[mlir][scf] refactor scf structuralOpConversion to better support 1:N type conversion This patch moves the 1:N type mapping into its own classes to allow better code reuse in D137100. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D137099	2022-11-02 16:45:39 +00:00
Nicolas Vasilache	d4c4e49196	[mlir][Linalg] Drop usage of tileWithLinalgTilingOptions in the structured.tile transform This is on a path to deprecation. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 As the interface-based transformation is more generic, some additional folding of AffineMin/MaxOp and some extra canonicalizations are needed. This can be further evolved. Differential Revision: https://reviews.llvm.org/D137195	2022-11-01 14:36:24 -07:00
Hanhan Wang	71cf48a62a	[mlir][scf] Enhance sizes computation in tileUsingSCFForOp. The boundary is always 1 if the tile size is 1. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D136884	2022-10-28 13:03:10 -07:00
Alexander Belyaev	b4db15a949	[mlir] Rename getInputs->getDpsInputs and getOutputs->getDpsInits in DPS interface. https://discourse.llvm.org/t/rfc-interface-for-destination-style-ops/64056 Differential Revision: https://reviews.llvm.org/D136943	2022-10-28 15:41:12 +02:00
Matthias Springer	c9b3638126	[mlir][scf][bufferize] Fix bufferizesToMemoryRead with 0 loop iterations There was a bug in scf.for loop bufferization that could lead to a missing buffer copy (alloc was there, but not the copy). Differential Revision: https://reviews.llvm.org/D135053	2022-10-24 14:34:41 +02:00
Matthias Springer	b169643f3a	[mlir][interfaces] Remove getDestinationOperands from TilingInterface `getDestinationOperands` was almost a duplicate of `DestinationStyleOpInterface::getOutputOperands`. Now that the interface has been moved to mlir/Interfaces, it is no longer needed. Differential Revision: https://reviews.llvm.org/D136240	2022-10-24 09:26:19 +02:00
Peiming Liu	d3f5f33067	[mlir][scf] support 1:N type conversion for scf.for. scf.for used to only support 1:1 type conversion, this patch add support for 1:N type conversion. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D136314	2022-10-21 21:11:55 +00:00
Mehdi Amini	6d4baa7442	Apply clang-tidy fixes for performance-unnecessary-value-param in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Mehdi Amini	2a6f0fb34a	Apply clang-tidy fixes for performance-for-range-copy in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Mehdi Amini	23f989a2e3	Apply clang-tidy fixes for readability-simplify-boolean-expr in BufferizableOpInterfaceImpl.cpp (NFC)	2022-10-12 01:16:36 +00:00
Nicolas Vasilache	7915027926	[mlir][Linalg] Retire LinalgStrategyTileAndFusePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 In the process, also retire `tileConsumerAndFuseProducers` that is now replaced by `tileConsumerAndFuseProducerGreedilyUsingSCFForOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 When performing this replacement, a change of behavior appeared: the older `tileConsumerAndFuseProducers` would split the parallel and non-parallel dimensions automatically and perform a first level of tile-and-fuse on parallel dimensions only and then introduce a second level of tiling-only on the reduction dimensions. The newer `tileConsumerAndFuseProducerGreedilyUsingSCFForOp` on the other hand does not perform this breakdown. As a consequence, the transform specification is evolved to produce the same output. Additionally, replace some uses of `unsigned` by `int64_t` where possible without pulling in larger interface changes (left for a future PR). Context: https://www.youtube.com/watch?v=Puio5dly9N8 Lastly, tests that were performing tile and fuse and distribute on tensors are retired: the generated IR mixing scf.for, tensors and distributed processor ids was racy at best .. Differential Revision: https://reviews.llvm.org/D135559	2022-10-10 07:04:01 -07:00
Adrian Kuegel	67bcf9825a	[mlir][SCF] Apply ClangTidyPerformance finding (NFC)	2022-09-30 12:47:32 +02:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Mahesh Ravishankar	97f919820b	[mlir][TilingInterface] NFC Refactor of tile and fuse using `TilingInterface`. This patch refactors the tiling and tile + fuse implementation using `TilingInterface`. Primarily, it exposes the functionality as simple utility functions instead of as a Pattern to allow calling it from a pattern as it is done in the test today or from within the transform dialect (in the future). This is a step towards deprecating similar methods in Linalg dialect. - The utility methods do not erase the root operations. - The return value provides the values to use for replacements. Differential Revision: https://reviews.llvm.org/D134144	2022-09-28 20:25:33 +00:00
Mahesh Ravishankar	7ee34550f5	[mlir][TilingInterface] Fix `iter_args` handling in tile (and fuse). The current approach for handling `iter_args` was to replace all uses of the value that is used as `init` value with the corresponding region block argument within the `scf.for`. This is not always correct. Instead a more deliberate approach needs to be taken to handle these. If the slice being fused represents a slice of the destination operand of the untiled op, then - Make the destination of the fused producer the `init` value of the loop nest - For the tiled and fused producer op created, replace the slice of the destination operand with a slice of the corresponding region iter arg of the innermost loop of the generated loop nest Differential Revision: https://reviews.llvm.org/D134411	2022-09-26 19:09:29 +00:00
Johannes Reifferscheid	eaf20c4fc2	[mlir] Fix a cast that should be a dyn_cast. This fixes a crash for certain IR, see the new test case for an example. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D134424	2022-09-22 13:13:21 +02:00
Christopher Bate	f5fe92f693	[mlir][SCF] Fix loop pipelining unable to handle ops with regions This change allows the SCF LoopPipelining transform to handle ops with nested regions within the pipelined `scf.for` body. The op and nested regions are treated as a single unit from the transform's perspective. This change also makes explicit the requirement that only ops whose parent Block is the loop body Block are allowed to be scheduled by the caller. Reviewed By: ThomasRaoux, nicolasvasilache Differential Revision: https://reviews.llvm.org/D133965	2022-09-20 21:58:53 -06:00
Johannes Reifferscheid	d1536ee48c	Fix clang-format.	2022-09-08 11:05:12 +02:00
Johannes Reifferscheid	6247988e07	One-shot-bufferize: fix for inconsistent while arg types in before/after. Currently, if the `before` and `after` regions of a while op have tensor args in different indices, this leads to a crash. This moves the pass-through check for args to the handling of the condition block, since that is where the results are produced, so it's also where copies must be made. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D133477	2022-09-08 10:24:11 +02:00
Johannes Reifferscheid	fb9fc79809	One-shot-bufferize: allow non-tensor arguments in scg.while/for. Currently, one-shot-bufferize crashes as soon as there's a mixture of tensor and non-tensor arguments. This seems to happen for no good reason. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D133419	2022-09-07 15:54:25 +02:00
Mehdi Amini	b285d708a7	Apply clang-tidy fixes for performance-for-range-copy in TileUsingInterface.cpp (NFC)	2022-09-07 09:40:59 +00:00
Mehdi Amini	8eab900170	Apply clang-tidy fixes for llvm-qualified-auto in Bufferize.cpp (NFC)	2022-09-07 09:40:59 +00:00
Matthias Springer	4cd7362083	[mlir][SCF] foreach_thread: Capture shared output tensors explicitly This change refines the semantics of scf.foreach_thread. Tensors that are inserted into in the terminator must now be passed to the region explicitly via `shared_outs`. Inside of the body of the op, those tensors are then accessed via block arguments. The body of a scf.foreach_thread is now treated as a repetitive region. I.e., op dominance can no longer be used in conflict detection when using a value that is defined outside of the body. Such uses may now be considered as conflicts (if there is at least one read and one write in the body), effectively privatizing the tensor. Shared outputs are not privatized when they are used via their corresponding block arguments. As part of this change, it was also necessary to update the "tiling to scf.foreach_thread", such that the generated tensor.extract_slice ops use the scf.foreach_thread's block arguments. This is implemented by cloning the TilingInterface op inside the scf.foreach_thread, rewriting all of its outputs with block arguments and then calling the tiling implementation. Afterwards, the cloned op is deleted again. Differential Revision: https://reviews.llvm.org/D133114	2022-09-02 14:54:04 +02:00
Matthias Springer	547942841f	[mlir][interfaces] Drop `dest`/`tileDestOperands` from TilingInterface `getTiledImplementation`/`generateResultTileValue` only computes the tiled operation, but does not insert the result into any tensor. Differential Revision: https://reviews.llvm.org/D133015	2022-09-01 08:53:53 +02:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit `2be8af8f0e`.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Matthias Springer	86974e32a4	[mlir][SCF][bufferize] Support different iter_arg/init_arg types (scf.while) This change implements the same functionality as D132860, but for scf.while. Differential Revision: https://reviews.llvm.org/D132927	2022-08-30 16:58:21 +02:00
Matthias Springer	9d6096c56f	[mlir][SCF][bufferize][NFC] Move scf.if buffer type computation to getBufferType A part of the functionality of `bufferize` is extracted into `getBufferType`. Also, bufferized scf.yields inside scf.if are now created with the correct bufferized type from the get-to. Differential Revision: https://reviews.llvm.org/D132862	2022-08-30 16:48:10 +02:00
Matthias Springer	123c4b0251	[mlir][SCF][bufferize] Support different iter_arg/init_arg types (scf.for) Even though iter_arg and init_arg of an scf.for loop may have the same tensor type, their bufferized memref types are not necessarily equal. It is sometimes necessary to insert a cast in case of differing layout maps. Differential Revision: https://reviews.llvm.org/D132860	2022-08-30 16:35:32 +02:00
Matthias Springer	111c919665	[mlir][bufferization] Generalize getBufferType This change generalizes getBufferType. This function can be used to predict the buffer type of any tensor value (not just BlockArguments) without changing any IR. It also subsumes getMemorySpace. This is useful for loop bufferization, where the precise buffer type of an iter_arg cannot be known without examining the loop body. Differential Revision: https://reviews.llvm.org/D132859	2022-08-30 16:26:44 +02:00
Jeff Niu	5b569ed2cd	[mlir] Add `Block::eraseArguments` that erases a subrange This patch adds a an `eraseArguments` function that erases a subrange of a block's arguments. This can be used inplace of the terrible pattern ``` block->eraseArguments(llvm::to_vector(llvm::seq(...))); ``` Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D132890	2022-08-29 15:34:21 -07:00
Benjamin Kramer	9fa59e7643	[mlir] Use C++17 structured bindings instead of std::tie where applicable. NFCI	2022-08-09 13:34:17 +02:00
lorenzo chelini	954de25a92	[MLIR] TilingInterface: Avoid map when tile divides iteration domain Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D131080	2022-08-04 19:43:59 +02:00
Mahesh Ravishankar	6f03a10e4f	[mlir][TilingInterface] Add a method to generate scalar implementation of the op. While The tiling interface provides a mechanism for operations to be tiled into tiled version of the op (or another op at the same level of abstraction), the `generateScalarImplementation` method added here is the "exit point" after all transformations have been done. Ops that implement this method are expected to generate IR that are directly lowerable to backend dialects like LLVM or SPIR-V dialects. Differential Revision: https://reviews.llvm.org/D130612	2022-07-28 16:37:15 +00:00

1 2 3 4 5

202 Commits