clang-p2996

Author	SHA1	Message	Date
Alex Zinenko	4b455a71b7	[mlir] adapt TransformEachOpTrait to parameter values Adapt the implementation of TransformEachOpTrait to the existence of parameter values recently introduced into the transform dialect. In particular, allow `applyToOne` hooks to return a list containing a mix of `Operation *` that will be associated with handles and `Attribute` that will be associated with parameter values by the trait implementation of the transform interface's `apply` method. Disentangle the "transposition" of the list of per-payload op partial results to decrease its overall complexity and detemplatize the code that doesn't really need templates. This removes the poorly documented special handling for single-result ops with TransformEachOpTrait that could have assigned null pointer values to handles. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140979	2023-01-06 12:23:41 +00:00
Alex Zinenko	faac898987	[mlir] fix out-of-bounds in reduction tiling A transformation tiling a reduction dimension of a Linalg op needs a tile size for said dimension. When an insufficient number of dimensions was provided, it would segfault due to out-of-bounds access to a vector. Also fix incorrect error reporting in the structured transform op exercising this functionality. Reviewed By: springerm, ThomasRaoux Differential Revision: https://reviews.llvm.org/D141046	2023-01-05 15:20:26 +00:00
Mehdi Amini	495ddf1e4f	Apply clang-tidy fixes for performance-unnecessary-value-param in AffineCanonicalizationUtils.cpp (NFC)	2023-01-05 09:59:35 +00:00
Matthias Springer	3a5811a337	[mlir][affine][NFC] Extract core functionality of `canonicalizeMinMaxOp` Move code from SCF to Affine: Add a new helper function `simplifyConstrainedMinMaxOp` to Affine/Analysis/Utils.h. `canonicalizeMinMaxOp` was originally designed for loop peeling, but it is not SCF-specific and can be used to simplify any affine.min/max ops. Various functions in SCF/Transforms are simplified by dropping unnecessary parameters. Differential Revision: https://reviews.llvm.org/D140962	2023-01-04 11:25:44 +01:00
Nicolas Vasilache	1654f489e2	[mlir] NFC - Expose scf::canonicalizeMinMaxOp	2022-12-27 05:47:07 -08:00
Fangrui Song	cbb0981388	[mlir] llvm::Optional::value => operator*/operator-> std::optional::value() has undesired exception checking semantics and is unavailable in older Xcode (see _LIBCPP_AVAILABILITY_BAD_OPTIONAL_ACCESS). The call sites block std::optional migration.	2022-12-17 19:07:38 +00:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Mehdi Amini	cffd7b144b	[mlir][scf] Fixes IndexSwitchOp verifier crash Fixes #59460	2022-12-13 09:42:34 +00:00
Alex Zinenko	7d5bef77e5	[mlir] make DiagnosedSilenceableError(LogicalResult) ctor private Now we have more convenient functions to construct silenceable errors while emitting diagnostics, and the constructor is ambiguous as it doesn't tell whether the logical error is silencebale or definite. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D137257	2022-12-12 12:52:06 +00:00
Kazu Hirata	70c73d1b72	[mlir] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:23:50 -08:00
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
River Riddle	b74192b7ae	[mlir] Remove support for non-prefixed accessors This finishes off a year long pursuit to LLVMify the generated operation accessors, prefixing them with get/set. Support for any other accessor naming is fully removed after this commit. https://discourse.llvm.org/t/psa-raw-accessors-are-being-removed/65629 Differential Revision: https://reviews.llvm.org/D136727	2022-12-02 13:32:36 -08:00
Hanhan Wang	b1d3afc93e	[mlir] Factor more common utils to IndexingUtils Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D139159	2022-12-02 13:27:01 -08:00
Nicolas Vasilache	a8850312c1	[mlir][Transform][NFC] Use a single rewriter instead of duplicating it everywhere Differential Revision: https://reviews.llvm.org/D139094	2022-12-01 03:54:31 -08:00
Benjamin Kramer	652592c5ca	[MLIR][Transform] Disambiguate ternary operator for MSVC mlir/lib/Dialect/SCF/TransformOps/SCFTransformOps.cpp(42): error C2446: ':': no conversion from 'OpTy' to 'OpTy' with [ OpTy=mlir::scf::ForOp ] and [ OpTy=mlir::AffineForOp ] mlir/lib/Dialect/SCF/TransformOps/SCFTransformOps.cpp(42): note: No user-defined-conversion operator available that can perform this conversion, or the operator cannot be called	2022-12-01 08:59:58 +01:00
Christian Sigg	be065c41d8	[mlir] Change scf::LoopNest to store 'results'. This fixes the case where scf::LoopNest::loops is empty. Change LoopVector and ValueVector to SmallVector. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D136926	2022-12-01 06:51:45 +01:00
Amy Wang	e4e64eaade	[MLIR][Transform] Consolidate the transform ops of get_parent_for and loop unroll from affine and scf dialects. This patch consolidates the two transform ops from the affine dialect and the scf dialect to avoid code duplication. This is to address the review comments from https://reviews.llvm.org/D137997. The transform ops directory / file structure for the affine dialect is kept for the purpose of forth-coming transform ops for affine, but get_parent_for and unroll are removed. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138980	2022-11-30 11:07:44 -05:00
Matthias Springer	0d9761d50e	[mlir][SCF] Add tensor.dim(scf.foreach_thread) folding Dim sizes of `scf.foreach_thread` op results match the dim sizes of their respective tied shared_outs operands. Differential Revision: https://reviews.llvm.org/D138484	2022-11-22 11:28:27 +01:00
Lei Zhang	9bb633741a	[mlir][bufferization] Support general Attribute as memory space MemRef has been accepting a general Attribute as memory space for a long time. This commits updates bufferization side to catch up, which allows downstream users to plugin customized symbolic memory space. This also eliminates quite a few `getMemorySpaceAsInt` calls, which is deprecated. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D138330	2022-11-21 09:40:50 -05:00
Che-Yu Wu	4fdf624b78	Increase the limit of SCF nested tiling loop to 10 Differential Revision: https://reviews.llvm.org/D138156	2022-11-17 08:00:21 +00:00
Mahesh Ravishankar	fc367dfa67	[mlir] Remove `Transforms/SideEffectUtils.h` and move the methods into `Interface/SideEffectInterfaces.h`. The methods in `SideEffectUtils.h` (and their implementations in `SideEffectUtils.cpp`) seem to have similar intent to methods already existing in `SideEffectInterfaces.h`. Move the decleration (and implementation) from `SideEffectUtils.h` (and `SideEffectUtils.cpp`) into `SideEffectInterfaces.h` (and `SideEffectInterface.cpp`). Also drop the `SideEffectInterface::hasNoEffect` method in favor of `mlir::isMemoryEffectFree` which actually recurses into the operation instead of just relying on the `hasRecursiveMemoryEffectTrait` exclusively. Differential Revision: https://reviews.llvm.org/D137857	2022-11-15 20:07:35 +00:00
Mehdi Amini	fbfca43e6d	Apply clang-tidy fixes for llvm-qualified-auto in TileUsingInterface.cpp (NFC)	2022-11-15 18:14:01 +00:00
Mohammed Anany	77533d79f7	[mlir][SCF] Adding custom builder to SCF::WhileOp. This is a similar builder to the one for SCF::IfOp which allows users to pass region builders to it. Refer to the builders for IfOp. Reviewed By: tpopp Differential Revision: https://reviews.llvm.org/D137709	2022-11-15 18:16:49 +01:00
Nicolas Vasilache	f0a411da77	[mlir][Transform]Significantly cleanup scf.foreach_thread and GPU transform permutation handling Previously, the need for a dense permutation leaked into the thread_dim_mapping specification. This revision allows to use a sparse specification of the thread_dim_mapping and the proper completion / sorting is applied automatically. In the process, the sematics of scf.foreach_thread is tightened to require a matching number of thread dimensions and mappings. The relevant negative test is added. Differential Revision: https://reviews.llvm.org/D137906	2022-11-14 09:19:49 -08:00
Alexander Belyaev	07665e78cb	[mlir] Fix forward the fix for incorrect Optional<ArrayAttr> usage.	2022-11-11 10:53:04 +01:00
Alexander Belyaev	b7162e136e	[mlir] Fix incorrect access to the Optional<ArrayAttr> underlying values.	2022-11-11 10:46:04 +01:00
Guray Ozen	6663f34704	[mlir] Introduce device mapper attribute for `thread_dim_map` and `mapped to dims` `scf.foreach_thread` defines mapping its loops to processors via an integer array, see an example below. A lowering can use this mapping. However, expressing mapping as an integer array is very confusing, especially when there are multiple levels of parallelism. In addition, the op does not verify the integer array. This change introduces device mapping attribute to make mapping descriptive and verifiable. Then it makes GPU transform dialect use it. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [0, 1]} } { thread_dim_mapping = [0, 1]} ``` It first introduces a `DeviceMappingInterface` which is an attribute interface. `scf.foreach_thread` defines its mapping via this interface. A lowering must define its attributes and implement this interface as well. This way gives us a clear validation. The change also introduces two new attributes (`#gpu.thread<x/y/z>` and `#gpu.block<x,y,z>` ). After this change, the above code prints as below, as seen here, this way clarifies the loop mappings. The change also implements consuming of these two new attribute by the transform dialect. Transform dialect binds the outermost loops to the thread blocks and innermost loops to threads. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [#gpu.thread<x>, #gpu.thread<y>]} } { thread_dim_mapping = [#gpu.block<x>, #gpu.block<y>]} ``` Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D137413	2022-11-11 08:44:57 +01:00
Mehdi Amini	34233d4995	Apply clang-tidy fixes for readability-redundant-smartptr-get in SCF.cpp (NFC)	2022-11-09 00:41:30 +00:00
Mehdi Amini	f5865c8701	Apply clang-tidy fixes for llvm-qualified-auto in SCF.cpp (NFC)	2022-11-09 00:41:30 +00:00
Kazu Hirata	585e35a998	[mlir] Use llvm::is_contained (NFC)	2022-11-06 19:56:15 -08:00
Hanhan Wang	52ffc72818	[mlir][tiling] Relax tiling to accept generating multiple operations. Some operations need to generate multiple operations when implementing the tiling interface. Here is a sound example in IREE, see https://github.com/iree-org/iree/pull/10905 for more details. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D137300	2022-11-04 13:59:24 -07:00
Thomas Raoux	3310fe55d9	[mlir][linalg] Add reduction tiling transformation Add a transformation to tile reduction ops into a parallel operation followed by a merge operation. This is equivalent to the existing reduction spliting transformation but using loops instead of using higher dimensions linalg. Differential Revision: https://reviews.llvm.org/D136586	2022-11-03 23:07:12 +00:00
Peiming Liu	1ca119728e	[mlir][scf] support 1:N type conversion for scf.if/while/condition Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D137100	2022-11-02 16:53:36 +00:00
Peiming Liu	f4cd3674ea	[mlir][scf] refactor scf structuralOpConversion to better support 1:N type conversion This patch moves the 1:N type mapping into its own classes to allow better code reuse in D137100. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D137099	2022-11-02 16:45:39 +00:00
Nicolas Vasilache	d4c4e49196	[mlir][Linalg] Drop usage of tileWithLinalgTilingOptions in the structured.tile transform This is on a path to deprecation. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 As the interface-based transformation is more generic, some additional folding of AffineMin/MaxOp and some extra canonicalizations are needed. This can be further evolved. Differential Revision: https://reviews.llvm.org/D137195	2022-11-01 14:36:24 -07:00
Sanjoy Das	788390c130	Make scf.for and affine.for conditionally speculatable for (I = Start; I < End; I += 1) always terminates so mark {scf\|affine}.for as RecursivelySpeculatable when step is known to be 1. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D136376	2022-10-30 16:08:42 -07:00
Hanhan Wang	71cf48a62a	[mlir][scf] Enhance sizes computation in tileUsingSCFForOp. The boundary is always 1 if the tile size is 1. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D136884	2022-10-28 13:03:10 -07:00
Alexander Belyaev	b4db15a949	[mlir] Rename getInputs->getDpsInputs and getOutputs->getDpsInits in DPS interface. https://discourse.llvm.org/t/rfc-interface-for-destination-style-ops/64056 Differential Revision: https://reviews.llvm.org/D136943	2022-10-28 15:41:12 +02:00
Matthias Springer	c9b3638126	[mlir][scf][bufferize] Fix bufferizesToMemoryRead with 0 loop iterations There was a bug in scf.for loop bufferization that could lead to a missing buffer copy (alloc was there, but not the copy). Differential Revision: https://reviews.llvm.org/D135053	2022-10-24 14:34:41 +02:00
Matthias Springer	b169643f3a	[mlir][interfaces] Remove getDestinationOperands from TilingInterface `getDestinationOperands` was almost a duplicate of `DestinationStyleOpInterface::getOutputOperands`. Now that the interface has been moved to mlir/Interfaces, it is no longer needed. Differential Revision: https://reviews.llvm.org/D136240	2022-10-24 09:26:19 +02:00
Peiming Liu	d3f5f33067	[mlir][scf] support 1:N type conversion for scf.for. scf.for used to only support 1:1 type conversion, this patch add support for 1:N type conversion. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D136314	2022-10-21 21:11:55 +00:00
Jeff Niu	07d8fe9391	[mlir][scf] Add an IndexSwitchOp The `scf.index_switch` is a control-flow operation that branches to one of the given regions based on the values of the argument and the cases. The argument is always of type `index`. Example: ```mlir %0 = scf.index_switch %arg0 -> i32 case 2 { %1 = arith.constant 10 : i32 scf.yield %1 : i32 } case 5 { %2 = arith.constant 20 : i32 scf.yield %2 : i32 } default { %3 = arith.constant 30 : i32 scf.yield %3 : i32 } ``` Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D136003	2022-10-21 09:21:10 -07:00
Jeff Niu	6005a1d8af	[mlir][scf] Match any constants instead of arith.constant By matching `arith.constant` specifically, SCF canonicalizers/folders are incompatible with other kinds of constants. Use the generic matchers instead. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D135517	2022-10-12 18:01:57 -07:00
Mehdi Amini	6d4baa7442	Apply clang-tidy fixes for performance-unnecessary-value-param in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Mehdi Amini	2a6f0fb34a	Apply clang-tidy fixes for performance-for-range-copy in TileUsingInterface.cpp (NFC)	2022-10-12 05:03:45 +00:00
Mehdi Amini	23f989a2e3	Apply clang-tidy fixes for readability-simplify-boolean-expr in BufferizableOpInterfaceImpl.cpp (NFC)	2022-10-12 01:16:36 +00:00
Alex Zinenko	59bb8af4c3	[mlir] switch the transform loop extension to use types Add types to the Loop (SCF) extension of the transform dialect. See https://discourse.llvm.org/t/rfc-type-system-for-the-transform-dialect/65702 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D135587	2022-10-11 09:55:23 +00:00
Nicolas Vasilache	7915027926	[mlir][Linalg] Retire LinalgStrategyTileAndFusePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 In the process, also retire `tileConsumerAndFuseProducers` that is now replaced by `tileConsumerAndFuseProducerGreedilyUsingSCFForOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 When performing this replacement, a change of behavior appeared: the older `tileConsumerAndFuseProducers` would split the parallel and non-parallel dimensions automatically and perform a first level of tile-and-fuse on parallel dimensions only and then introduce a second level of tiling-only on the reduction dimensions. The newer `tileConsumerAndFuseProducerGreedilyUsingSCFForOp` on the other hand does not perform this breakdown. As a consequence, the transform specification is evolved to produce the same output. Additionally, replace some uses of `unsigned` by `int64_t` where possible without pulling in larger interface changes (left for a future PR). Context: https://www.youtube.com/watch?v=Puio5dly9N8 Lastly, tests that were performing tile and fuse and distribute on tensors are retired: the generated IR mixing scf.for, tensors and distributed processor ids was racy at best .. Differential Revision: https://reviews.llvm.org/D135559	2022-10-10 07:04:01 -07:00
Nicolas Vasilache	af664e4459	[mlir][Transform] Add a transform.split_handles operation and fix general silenceable bugs. The transform.split_handles op is useful for ensuring a statically known number of operations are tracked by the source `handle` and to extract them into individual handles that can be further manipulated in isolation. In the process of making the op robust wrt to silenceable errors and the suppress mode, issues were uncovered and fixed. The main issue was that silenceable errors were short-circuited too early and the payloads were not set. This resulted in suppressed silenceable errors not propagating correctly. Fixing the issue triggered a few test failures: silenceable error returns now must properly set the results state. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D135426	2022-10-07 09:01:34 -07:00
Adrian Kuegel	67bcf9825a	[mlir][SCF] Apply ClangTidyPerformance finding (NFC)	2022-09-30 12:47:32 +02:00

1 2 3 4 5 ...

368 Commits