clang-p2996

Author	SHA1	Message	Date
Alex Zinenko	c214cee772	[mlir] improve error handling in Linalg op splitting In several cases, the splitting may be known to be a noop, i.e., produce no second part. Thread this information through the transform utilities to the transform dialect, and differentiate it from the error state. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D141138	2023-01-06 18:35:08 +01:00
Alex Zinenko	4b455a71b7	[mlir] adapt TransformEachOpTrait to parameter values Adapt the implementation of TransformEachOpTrait to the existence of parameter values recently introduced into the transform dialect. In particular, allow `applyToOne` hooks to return a list containing a mix of `Operation *` that will be associated with handles and `Attribute` that will be associated with parameter values by the trait implementation of the transform interface's `apply` method. Disentangle the "transposition" of the list of per-payload op partial results to decrease its overall complexity and detemplatize the code that doesn't really need templates. This removes the poorly documented special handling for single-result ops with TransformEachOpTrait that could have assigned null pointer values to handles. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140979	2023-01-06 12:23:41 +00:00
Alex Zinenko	054ec47c91	[mlir] NFC: move DiagnosedSilenceableFailure to Utils in Transform dialect It was originally placed in TransformInterfaces for convenience, but it is really a generic utility. It may also create an include cycle between TransformTypes and TransformInterfaces if the latter needs to include the former because the former uses the failure util. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140978	2023-01-06 12:23:37 +00:00
Alex Zinenko	faac898987	[mlir] fix out-of-bounds in reduction tiling A transformation tiling a reduction dimension of a Linalg op needs a tile size for said dimension. When an insufficient number of dimensions was provided, it would segfault due to out-of-bounds access to a vector. Also fix incorrect error reporting in the structured transform op exercising this functionality. Reviewed By: springerm, ThomasRaoux Differential Revision: https://reviews.llvm.org/D141046	2023-01-05 15:20:26 +00:00
Nicolas Vasilache	83b582d51b	[mlir][Linalg] Properly propagate transform result in ScalarizeOp	2022-12-27 06:16:55 -08:00
Murali Vijayaraghavan	1a151fdc01	[mlir][linalg] Downscale 2D pooling with unit dimensions for height to 1D pooling Differential Revision: https://reviews.llvm.org/D140187	2022-12-19 22:34:43 +00:00
Matthias Springer	411048c1ae	[mlir][transform] Add PackedOrDynamicIndexList helper This custom parser/printer is similar to DynamicIndexList, but has special syntax for the case where one handle represents the entire list. Example: ``` // Regular index list [10, 20, %val] // Packed handle (no square parentheses) %val ``` Differential Revision: https://reviews.llvm.org/D138825	2022-12-19 08:08:04 +01:00
Nicolas Vasilache	6237cd7785	[mlir][Linalg] NFC - Add C++ builder to TileOp	2022-12-18 05:51:15 -08:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Lorenzo Chelini	2e5fe72172	[MLIR][Linalg] Use `DenseI64ArrayAttr` in `InterchangeOp` (NFC) Use op separator to improve code navigation. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D139917	2022-12-16 16:37:33 +01:00
Nicolas Vasilache	f27514800c	[mlir][Linalg] Better builders for transform ops Also adopt DenseI64ArrayAttr in those transform ops. Differential Revision: https://reviews.llvm.org/D140009	2022-12-14 06:22:52 -08:00
Aliia Khasanova	ded75a282a	Remove sentinel argument from dispatchIndexOpFoldResults. Post clean-up after merger of kDynamicSize and kDynamicStrideOrOffset. Differential Revision: https://reviews.llvm.org/D139929	2022-12-13 14:04:46 +01:00
Diego Caballero	72fd36448d	[mlir][Vector] Initial masking support in Linalg vectorizer This patch introduces the initial bits to support vector masking using the `vector.mask` operation. Vectorization changes should be NFC for non-masked cases. We can't test masked cases directly until we extend the Transform dialect to support masking. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D137690	2022-12-13 01:33:06 +00:00
Nicolas Vasilache	93bbcffc7e	[mlir][Transform] Make FuseIntoContainingOp support rank-reducing extract slices This fixes an issue where rank-reducing + fusion would not interop properly. Differential Revision: https://reviews.llvm.org/D139844	2022-12-12 12:55:08 -08:00
Alex Zinenko	7d5bef77e5	[mlir] make DiagnosedSilenceableError(LogicalResult) ctor private Now we have more convenient functions to construct silenceable errors while emitting diagnostics, and the constructor is ambiguous as it doesn't tell whether the logical error is silenceable or definite. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D137257	2022-12-12 12:52:06 +00:00
Nicolas Vasilache	23a057fbc4	[mlir][Transform] NFC - Return omitted loop construct in transform.tile_reduction_xxx ops	2022-12-12 02:14:00 -08:00
Andrzej Warzynski	c181f21ac7	[MLIR] Vectorize tensor.extract on n-D tensor (n >= 2) This patch implements the vectorization of tensor.extract for arbitrary tensors. It basically extends https://reviews.llvm.org/D133786 by adding support for n-D tensors (n >= 2). This is implemented by essentially flattening the indices. When benchmarking the vectorized code, we have observed that it is slower than the scalar code. That's most likely due to sub-optimal (and, in general slow) gather loads. More work is needed to identify an implementation and/or a representation that would lead to better code. In the meantime, the vectorization of n-D tensors (where n >= 2) has to be explicitly enabled. This can be done either via: * transfer dialect's `vectorize_nd_extract` attribute, * dedicated bool argument in the `vectorize` method from "Vectorization.cpp". The second option was added to control the new functionality through means other than the transfer dialect. Related discussion: https://github.com/iree-org/iree/issues/9198 Differential Revision: https://reviews.llvm.org/D137660	2022-12-12 09:32:16 +00:00
Nicolas Vasilache	06ca5c81a4	[mlir][Linalg] Apply fixes to TileReductionUsingForeachThreadOp In the process, numerous insertion point issues were found and fixed. RAII on insertion points is now used more diligently. Differential Revision: https://reviews.llvm.org/D139714	2022-12-09 07:51:12 -08:00
Thomas Raoux	f7fda6ba4a	[mlir][linalg] Add extra parameter to tiling reduction to foreach_thread This adds a tile_size parameter, when it is used the tiles are cyclically distributed onto the threads of the scf.foreach_thread op. Differential Revision: https://reviews.llvm.org/D139474	2022-12-07 18:37:05 +00:00
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
Ramkumar Ramachandra	57c893599d	mlir/linalg: improve debugging in LinalgTransformOps Make use of notifyMatchFailure in one place. Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D139191	2022-12-03 09:55:03 +01:00
Nicolas Vasilache	a8850312c1	[mlir][Transform][NFC] Use a single rewriter instead of duplicating it everywhere Differential Revision: https://reviews.llvm.org/D139094	2022-12-01 03:54:31 -08:00
Matthias Springer	5cb68314f3	[mlir] Fix build breakage introduced by D139026	2022-12-01 09:16:49 +01:00
Matthias Springer	504a7516a1	[mlir][linalg][transform] Add structured.replace op This op is useful for debugging/experiments and allows users to replace ops (without arguments + IsolatedFromAbove) with the given op in the region of transform op. Differential Revision: https://reviews.llvm.org/D139026	2022-12-01 09:04:35 +01:00
Lorenzo Chelini	a9733b8a5e	[MLIR] Adopt `DenseI64ArrayAttr` in tensor, memref and linalg transform This commit is a first step toward removing inconsistencies between dynamic and static attributes (i64 v. index) by dropping `I64ArrayAttr` and using `DenseI64ArrayAttr` in Tensor, Memref and Linalg Transform ops. In Linalg Transform ops only `TileToScfForOp` and `TileOp` have been updated. See related discussion: https://discourse.llvm.org/t/rfc-inconsistency-between-dynamic-and-static-attributes-i64-v-index/66612/1 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138567	2022-11-25 09:43:30 +01:00
Alexander Belyaev	65b72a78cc	[mlir] Clean-up ViewLikeOpInterface w.r.t. kDynamic change. Differential Revision: https://reviews.llvm.org/D138478	2022-11-22 10:51:53 +01:00
Aliia Khasanova	399638f98c	Merge kDynamicSize and kDynamicSentinel into one constant. resolve conflicts Differential Revision: https://reviews.llvm.org/D138282	2022-11-21 13:01:26 +00:00
Mehdi Amini	44601785ee	Apply clang-tidy fixes for bugprone-argument-comment in LinalgTransformOps.cpp (NFC)	2022-11-18 06:22:53 +00:00
Kazu Hirata	eba3fece88	[mlir] Fix warnings This patch fixes: mlir/include/mlir/ExecutionEngine/SparseTensor/Storage.h:955:20: error: unused variable 'sz' [-Werror,-Wunused-variable] mlir/lib/Dialect/Linalg/TransformOps/LinalgTransformOps.cpp:1460:2: error: extra ';' outside of a function is incompatible with C++98 [-Werror,-Wc++98-compat-extra-semi]	2022-11-15 12:01:00 -08:00
Thomas Raoux	99833cd818	[mlir][linalg] Add reduction tiling using scf.foreachthread This adds a transformation to tile reduction operations to partial reduction using scf.foreachthread. This uses PartialReductionOpInterface to create a merge operation of the partial tiles. Differential Revision: https://reviews.llvm.org/D137912	2022-11-14 18:05:40 +00:00
Nicolas Vasilache	6370f75ad7	[mlir][Transform] Add support for dynamically unpacking tile_sizes / num_threads in tile_to_foreach_thread This commit adds automatic unpacking of Value's of type pdl::OperationType to the underlying single-result OpResult. This allows mixing single-value, attribute and multi-value pdl::Operation tile sizes and num threads to TileToForeachThreadOp. Differential Revision: https://reviews.llvm.org/D137896	2022-11-14 04:39:57 -08:00
Guray Ozen	d93be483ea	[mlir][transform] Make `tile_to_foreach_thread_op` builder to use ArrayAttr D137413 clarified `scf_foreach_thread` thread mapping nicely. `tile_to_foreach_thread_op` is one of the ops that generate `scf_foreach_thread`; however, its builders still take an integer array. This is a bug fix for a potential problem. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D137891	2022-11-12 19:27:25 +01:00
Guray Ozen	6663f34704	[mlir] Introduce device mapper attribute for `thread_dim_map` and `mapped to dims` `scf.foreach_thread` defines mapping its loops to processors via an integer array, see an example below. A lowering can use this mapping. However, expressing mapping as an integer array is very confusing, especially when there are multiple levels of parallelism. In addition, the op does not verify the integer array. This change introduces device mapping attribute to make mapping descriptive and verifiable. Then it makes GPU transform dialect use it. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [0, 1]} } { thread_dim_mapping = [0, 1]} ``` It first introduces a `DeviceMappingInterface` which is an attribute interface. `scf.foreach_thread` defines its mapping via this interface. A lowering must define its attributes and implement this interface as well. This way gives us a clear validation. The change also introduces two new attributes (`#gpu.thread<x/y/z>` and `#gpu.block<x,y,z>` ). After this change, the above code prints as below, as seen here, this way clarifies the loop mappings. The change also implements consuming of these two new attribute by the transform dialect. Transform dialect binds the outermost loops to the thread blocks and innermost loops to threads. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [#gpu.thread<x>, #gpu.thread<y>]} } { thread_dim_mapping = [#gpu.block<x>, #gpu.block<y>]} ``` Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D137413	2022-11-11 08:44:57 +01:00
Hanhan Wang	52ffc72818	[mlir][tiling] Relax tiling to accept generating multiple operations. Some operations need to generate multiple operations when implementing the tiling interface. Here is a sound example in IREE, see https://github.com/iree-org/iree/pull/10905 for more details. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D137300	2022-11-04 13:59:24 -07:00
Nicolas Vasilache	c8fab80d64	[mlir][Transform] NFC - Add custom builders for some useful transforms. Differential Revision: https://reviews.llvm.org/D137443	2022-11-04 10:04:28 -07:00
Thomas Raoux	3310fe55d9	[mlir][linalg] Add reduction tiling transformation Add a transformation to tile reduction ops into a parallel operation followed by a merge operation. This is equivalent to the existing reduction splitting transformation but using loops instead of using higher dimensions linalg. Differential Revision: https://reviews.llvm.org/D136586	2022-11-03 23:07:12 +00:00
Nicolas Vasilache	d4c4e49196	[mlir][Linalg] Drop usage of tileWithLinalgTilingOptions in the structured.tile transform This is on a path to deprecation. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 As the interface-based transformation is more generic, some additional folding of AffineMin/MaxOp and some extra canonicalizations are needed. This can be further evolved. Differential Revision: https://reviews.llvm.org/D137195	2022-11-01 14:36:24 -07:00
Matthias Springer	b169643f3a	[mlir][interfaces] Remove getDestinationOperands from TilingInterface `getDestinationOperands` was almost a duplicate of `DestinationStyleOpInterface::getOutputOperands`. Now that the interface has been moved to mlir/Interfaces, it is no longer needed. Differential Revision: https://reviews.llvm.org/D136240	2022-10-24 09:26:19 +02:00
Thomas Raoux	246e8c3502	[mlir][linalg] Add back split reduction tests dropped by previous commit The transition to transform dialect based tests dropped several cases of the split reduction testing. Adding them back. Differential Revision: https://reviews.llvm.org/D136287	2022-10-19 20:42:55 +00:00
Alex Zinenko	b0bf7ffffc	[mlir] add utilities for DiagnosedSilenceableFailure This class adds helper functions similar to `emitError` for the DiagnosedSilenceableFailure class in both the silenceable and definite failure cases. These helpers simplify the use of said class and make transform op application code idiomatic. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D136072	2022-10-17 15:31:28 +00:00
Nicolas Vasilache	4b17710369	[mlir][Linalg] Support multi-output fusion in FuseIntoContainingOp This revision adds the ability to fuse tileable ops with multiple results to the transform.fuse_into_containing_op. Differential Revision: https://reviews.llvm.org/D135955	2022-10-14 03:54:54 -07:00
Nicolas Vasilache	44cfea0279	[mlir][Linalg] Retire LinalgStrategyTilePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 Uses of `LinalgTilingPattern::returningMatchAndRewrite` are replaced by a top-level `tileWithLinalgTilingOptions` function that is marked obsolete and serves as a temporary means to transition away from `LinalgTilingOptions`-based tiling. LinalgTilingOptions supports too many options that have been orthogonalized with the use of the transform dialect. Additionally, the revision introduces a `transform.structured.tile_to_scf_for` structured transform operation that is needed to properly tile `tensor.pad` via the TilingInterface. Uses of `transform.structured.tile` will be deprecated and replaced by this new op. This will achieve the deprecation of `linalg::tileLinalgOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 In the process of transitioning, tests that were performing tile and distribute on tensors are retired: transformations should be orthogonalized better in the future. In particular, tiling to specific loop types and tileAndDistribute behavior are not available via the transform ops. The behavior is still available as part of the `tileWithLinalgTilingOptions` method to allow downstream clients to transition without breakages but is meant to be retired soon. As more tests are ported to the transform dialect, it became necessary to introduce a test-transform-dialect-erase-schedule-pass to discard the transform specification once applied so that e2e lowering and execution is possible. Lastly, a number of redundant tests that were testing composition of patterns are retired as they are available with a better mechanism via the transform dialect. Differential Revision: https://reviews.llvm.org/D135573	2022-10-11 02:42:56 -07:00
Nicolas Vasilache	7915027926	[mlir][Linalg] Retire LinalgStrategyTileAndFusePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 In the process, also retire `tileConsumerAndFuseProducers` that is now replaced by `tileConsumerAndFuseProducerGreedilyUsingSCFForOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 When performing this replacement, a change of behavior appeared: the older `tileConsumerAndFuseProducers` would split the parallel and non-parallel dimensions automatically and perform a first level of tile-and-fuse on parallel dimensions only and then introduce a second level of tiling-only on the reduction dimensions. The newer `tileConsumerAndFuseProducerGreedilyUsingSCFForOp` on the other hand does not perform this breakdown. As a consequence, the transform specification is evolved to produce the same output. Additionally, replace some uses of `unsigned` by `int64_t` where possible without pulling in larger interface changes (left for a future PR). Context: https://www.youtube.com/watch?v=Puio5dly9N8 Lastly, tests that were performing tile and fuse and distribute on tensors are retired: the generated IR mixing scf.for, tensors and distributed processor ids was racy at best .. Differential Revision: https://reviews.llvm.org/D135559	2022-10-10 07:04:01 -07:00
Nicolas Vasilache	af664e4459	[mlir][Transform] Add a transform.split_handles operation and fix general silenceable bugs. The transform.split_handles op is useful for ensuring a statically known number of operations are tracked by the source `handle` and to extract them into individual handles that can be further manipulated in isolation. In the process of making the op robust wrt to silenceable errors and the suppress mode, issues were uncovered and fixed. The main issue was that silenceable errors were short-circuited too early and the payloads were not set. This resulted in suppressed silenceable errors not propagating correctly. Fixing the issue triggered a few test failures: silenceable error returns now must properly set the results state. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D135426	2022-10-07 09:01:34 -07:00
Guray Ozen	89bb0cae46	[mlir][transform] Create GPU transform dialect This revision adds GPU transform dialect. It also introduces a prefix such as "transform.gpu" for all ops related to this dialect. MLIR already had two GPU transform ops in linalg. This revision moves these ops into GPUTransformOps. The Ops are as follows: `transform.structured.map_nested_foreach_thread_to_gpu_blocks` -> `transform.gpu.map_foreach_to_blocks` This op selects the outermost (toplevel) foreach_thread and parallelizes it across GPU blocks. It can also generate `gpu_launch`. `transform.structured.map_nested_foreach_thread_to_gpu_threads` -> `transform.gpu.map_nested_foreach_to_threads` This op parallelizes nested foreach_thread that are inside `gpu_launch` across GPU threads. It doesn't add new functionality, but there is some minor refactoring of the code. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D134800	2022-10-04 13:09:08 +02:00
River Riddle	10c04f4641	[mlir:GPU][NFC] Update GPU API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Murali Vijayaraghavan	146c3ea075	[mlir] Add support for parallel dim after reduction dim in split reduction Previously, splitReduction transformation added the split parallel dimension before the reduction dimension, leading to tiling for reduction. This commit creates an option to create the parallel dimension after the reduction dimension, allowing us to transform the op into vertical reduction with SIMD parallelism. Reviewed By: ThomasRaoux, dcaballe Differential Revision: https://reviews.llvm.org/D134764	2022-09-29 01:24:01 +00:00
Guray Ozen	f8ad6eaf92	[mlir] Refactor transform dialect's gpu block func This revision refactors the gpu block id generator lambda that is used in the transform dialect. It removes the lambda and instead uses a static function named generateGpuBlockIds. It also simplifies the arguments that the function takes. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D134724	2022-09-27 12:27:17 +02:00
Thomas Raoux	e99f437140	[mlir] Plumb missing parameter to gpu transform op rewriteMapNestedForeachThreadToGpuThreads was dropping the parameter to skip inserting barrier Differential Revision: https://reviews.llvm.org/D134500	2022-09-23 16:58:44 +00:00

98 Commits