* Always use the auto-generated `getInitArgs` function. Remove the
hand-written `getInitOperands` duplicate.
* Remove `hasIterOperands` and `getNumIterOperands`. The names were
inconsistent because the "arg" is called `initArgs` in TableGen. Use
`getInitArgs().size()` instead.
* Fix verification around ops with no results.
If the original shape and the distributed shape are the same, we don't
distribute at all; every thread handles the whole vector.
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D158235
This commit starts enabling vector distribution over multiple
dimensions. It requires delinearizing the lane ID to match the
expected rank. shape_cast and transfer_read can now properly
handle multiple dimensions.
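A hypothetical sketch of a 2-D distribution (`"some_def"` is a placeholder):
```
// vector<4x64xf32> is distributed over 32 lanes as vector<1x8xf32>:
// dim 0 is split 4 ways and dim 1 is split 8 ways, so the lane ID is
// delinearized into 2-D coordinates on a 4x8 grid.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1x8xf32>) {
  %v = "some_def"() : () -> (vector<4x64xf32>)
  vector.yield %v : vector<4x64xf32>
}
```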
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D157931
`DenseI64ArrayAttr` provides a better API than `I64ArrayAttr`. E.g., accessors returning `ArrayRef<int64_t>` (instead of `ArrayAttr`) are generated.
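A minimal sketch of the two printed forms, using a hypothetical `test.op` and
attribute names chosen for illustration:
```
// I64ArrayAttr prints as a list of attributes; DenseI64ArrayAttr prints as
// a dense array and its generated accessor returns ArrayRef<int64_t>.
"test.op"() {as_array_attr = [1 : i64, 4 : i64],
             as_dense_array = array<i64: 1, 4>} : () -> ()
```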
Differential Revision: https://reviews.llvm.org/D156684
In the vector distribute patterns, we used to move
`vector.broadcast`s out of `vector.warp_execute_on_lane_0`s
regardless of how they were defined.
This could create broadcast operations with invalid semantics.
E.g.,
```
%r = warpop ...[32] ... -> vector<1x2xf32> {
%val = broadcast %in : vector<64xf32> to vector<1x64xf32>
vector.yield %val : vector<1x64xf32>
}
```
=>
```
%r = warpop ...[32] ... -> vector<64xf32> {
vector.yield %in : vector<64xf32>
}
// Broadcasting to a narrower type!
broadcast %r : vector<64xf32> to vector<1x2xf32>
```
The root issue is that we are trying to broadcast something that is not the
same for each thread, so there is actually nothing to propagate here.
The fix checks that the broadcast we want to create actually makes sense.
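For contrast, a hypothetical case that remains legal to propagate: `%src` is
defined above the warp op, so it is uniform across lanes and broadcasting to
the distributed type is equivalent.
```
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1x2xf32>) {
  %val = vector.broadcast %src : f32 to vector<1x64xf32>
  vector.yield %val : vector<1x64xf32>
}
```
=>
```
%r = vector.broadcast %src : f32 to vector<1x2xf32>
```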
Differential Revision: https://reviews.llvm.org/D152154
In the vector distribute patterns, we used to move
`vector.transfer_read`s out of `vector.warp_execute_on_lane_0`s
regardless of how they were defined.
This could create transfer_read operations that referenced values
defined within the warpOp's body from outside of it.
E.g.,
```
warpop {
%defined_in_body
%read = transfer_read %defined_in_body
vector.yield %read
}
```
=>
```
warpop {
%defined_in_body
vector.yield ...
}
// %defined_in_body is referenced outside of its scope.
%read = transfer_read %defined_in_body
```
The fix consists of checking that all the values feeding the new
`transfer_read` are defined outside of warpOp's body.
Note: We could do this check before creating any operation, but that would
mean knowing what `affine::makeComposedAffineApply` actually does. So the
current fix is a trade-off: it avoids coupling the implementation of this
propagation to `makeComposedAffineApply` at the cost of some compile time.
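For contrast, a hypothetical case where the check passes: `%src`, `%c0`, and
`%pad` are all defined above the warp op, so the read can be hoisted. The
distributed index is shown as `%laneid` for simplicity; in general it is
computed with `makeComposedAffineApply`.
```
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1xf32>) {
  %read = vector.transfer_read %src[%c0], %pad : memref<32xf32>, vector<32xf32>
  vector.yield %read : vector<32xf32>
}
```
=>
```
%r = vector.transfer_read %src[%laneid], %pad : memref<32xf32>, vector<1xf32>
```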
Differential Revision: https://reviews.llvm.org/D152149
The MLIR classes Type/Attribute/Operation/Op/Value support
cast/dyn_cast/isa/dyn_cast_or_null functionality through LLVM's doCast
machinery, in addition to defining methods with the same names.
This change begins the migration from uses of the methods to the
corresponding free function calls, which has been decided to be more
consistent.
Note that there still exist classes that only define the methods directly,
such as AffineExpr, and this change does not currently include work to
support functional cast/isa calls for them.
Caveats include:
- This clang-tidy script probably has more problems.
- This only touches C++ code, so nothing that is being generated.
Context:
- https://mlir.llvm.org/deprecation/ at "Use the free function variants
for dyn_cast/cast/isa/…"
- Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443
Implementation:
This first patch was created with the following steps. The intention is
to only do automated changes at first, so I waste less time if it's
reverted, and so the first mass change is clearer as an example to
other teams that will need to follow similar steps.
Steps are described per line, as comments are removed by git:
0. Retrieve the change from the following to build clang-tidy with an
additional check:
https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check
1. Build clang-tidy
2. Run clang-tidy over your entire codebase while disabling all checks
and enabling the one relevant one. Run on all header files also.
3. Delete .inc files that were also modified, so the next build rebuilds
them to a pure state.
4. Some changes have been deleted for the following reasons:
- Some files had a variable also named cast
- Some files had not included a header file that defines the cast
functions
- Some files are definitions of the classes that have the casting
methods, so the code there still refers to the methods; switching to
the free functions would require adding a prefix or removing the
method declarations at the same time.
```
ninja -C $BUILD_DIR clang-tidy
run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-*,misc-cast-functions'\
-header-filter=mlir/ mlir/* -fix
rm -rf $BUILD_DIR/tools/mlir/**/*.inc
git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\
mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\
mlir/lib/**/IR/\
mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\
mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\
mlir/test/lib/Dialect/Test/TestTypes.cpp\
mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\
mlir/test/lib/Dialect/Test/TestAttributes.cpp\
mlir/unittests/TableGen/EnumsGenTest.cpp\
mlir/test/python/lib/PythonTestCAPI.cpp\
mlir/include/mlir/IR/
```
Differential Revision: https://reviews.llvm.org/D150123
Currently conversions to interfaces may happen implicitly (e.g.
`Attribute -> TypedAttr`), failing a runtime assert if the interface
isn't actually implemented. This change marks the `Interface(ValueT)`
constructor as explicit so that a cast is required.
Where it was straightforward to, I adjusted the code to not require casts;
otherwise I just made them explicit.
Depends on D148491, D148492
Reviewed By: rriddle
Differential Revision: https://reviews.llvm.org/D148493
Replace references to enumerate results with either result_pairs
(reference wrapper type) or structured bindings. I did not use
structured bindings everywhere as it wasn't clear to me it would
improve readability.
This is in preparation for the switch to zip semantics, which won't
support non-const lvalue references to elements:
https://reviews.llvm.org/D144503.
I chose to use values instead of const lvalue-refs because MLIR is
biased towards avoiding `const` local variables. This won't degrade
performance because currently `result_pair` is cheap to copy (size_t
+ iterator), and in the future, the enumerator iterator dereference
will return temporaries anyway.
Reviewed By: dblaikie
Differential Revision: https://reviews.llvm.org/D146006
Plain `getVectorType()` can be quite confusing and error-prone
given that, well, vector ops always work on vector types, and
it can commonly involve both source and result vectors. So this
commit makes various such accessor methods explicit w.r.t.
source or result vectors.
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D144159
We should not distribute ops that have uses other than the yield op, as
distributing them would duplicate those ops.
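A hypothetical sketch of the case that must be left alone (`"some_def"` and
`"some_use"` are placeholders):
```
// %v feeds both the yield and another op inside the region; extracting it
// out of the warp op to distribute it would duplicate %v.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1xf32>) {
  %v = "some_def"() : () -> (vector<32xf32>)
  "some_use"(%v) : (vector<32xf32>) -> ()
  vector.yield %v : vector<32xf32>
}
```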
Differential Revision: https://reviews.llvm.org/D143629
Use the builder and infer the return type based on the inner `yield` ops instead.
Also, fix call sites that do not create the terminator required by the callback builders.
Differential Revision: https://reviews.llvm.org/D142056
Prevent creating a vector of size 0, which would fail the verifier.
A 1-D vector with a single element should be treated like a 0-D vector.
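A hypothetical sketch (`"some_def"` is a placeholder):
```
// Distributing vector<1xf32> over 32 lanes would produce the invalid type
// vector<0xf32>; instead the value is kept whole, like a 0-D vector.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1xf32>) {
  %v = "some_def"() : () -> (vector<1xf32>)
  vector.yield %v : vector<1xf32>
}
```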
Differential Revision: https://reviews.llvm.org/D141452
This patch fixes:
mlir/lib/Dialect/Vector/Transforms/VectorDistribute.cpp:947:13:
error: variable 'distributedDim' set but not used
[-Werror,-Wunused-but-set-variable]
In case the distributed dim of the dest vector is also a dim of the src vector, each lane inserts a smaller part of the source vector. Otherwise, one lane inserts the entire src vector and the other lanes do nothing.
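A hypothetical sketch of the second case (`%src` and `"some_def"` are
placeholders):
```
// Dim 0 of the dest is distributed but is not a dim of the vector<96xf32>
// source, so only the lane owning row 2 inserts the entire source; all
// other lanes forward their part of %dst unchanged.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1x96xf32>) {
  %dst = "some_def"() : () -> (vector<32x96xf32>)
  %ins = vector.insert %src, %dst [2] : vector<96xf32> into vector<32x96xf32>
  vector.yield %ins : vector<32x96xf32>
}
```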
Differential Revision: https://reviews.llvm.org/D137953
In case of a distribution, only one lane inserts the scalar value. In case of a broadcast, every lane inserts the scalar.
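A hypothetical sketch of the distribution case (`%f`, `%pos`, and
`"some_def"` are placeholders):
```
// The 1-D destination is distributed 3 elements per lane, so only the lane
// owning position %pos inserts the scalar %f.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<3xf32>) {
  %v = "some_def"() : () -> (vector<96xf32>)
  %ins = vector.insertelement %f, %v[%pos : index] : vector<96xf32>
  vector.yield %ins : vector<96xf32>
}
```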
Differential Revision: https://reviews.llvm.org/D137929
Ops such as `%1 = vector.extract %0[2] : vector<5x96xf32>`.
Distribute the source vector, then extract. In case of a 1-D extract, rewrite to vector.extractelement.
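A hypothetical before/after sketch (`"some_def"` is a placeholder):
```
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<3xf32>) {
  %v = "some_def"() : () -> (vector<5x96xf32>)
  %e = vector.extract %v[2] : vector<5x96xf32>
  vector.yield %e : vector<96xf32>
}
```
=>
```
// Distribute the source along the trailing dim, then extract from the result.
%r0 = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<5x3xf32>) {
  %v = "some_def"() : () -> (vector<5x96xf32>)
  vector.yield %v : vector<5x96xf32>
}
%r = vector.extract %r0[2] : vector<5x3xf32>
```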
Differential Revision: https://reviews.llvm.org/D137646
Relax an unnecessary restriction when distributing a vector.reduction op.
All float and integer types can be supported by the user's lambda.
Differential Revision: https://reviews.llvm.org/D141094
The methods in `SideEffectUtils.h` (and their implementations in
`SideEffectUtils.cpp`) seem to have similar intent to methods already
existing in `SideEffectInterfaces.h`. Move the declarations (and
implementations) from `SideEffectUtils.h` (and `SideEffectUtils.cpp`)
into `SideEffectInterfaces.h` (and `SideEffectInterfaces.cpp`).
Also drop the `SideEffectInterface::hasNoEffect` method in favor of
`mlir::isMemoryEffectFree`, which actually recurses into the operation
instead of relying exclusively on the `HasRecursiveMemoryEffects` trait.
Differential Revision: https://reviews.llvm.org/D137857
Ops such as `%1 = vector.extractelement %0[%pos : index] : vector<96xf32>`.
In case of an extract from a 1-D vector, the source vector is distributed. The lane into which the requested position falls extracts the element and shuffles it to all other lanes.
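A hypothetical sketch of the input (`%pos` and `"some_def"` are
placeholders). After distribution, each lane holds a vector<3xf32>; the lane
owning position `%pos` extracts the local element and the result is shuffled
to all lanes:
```
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (f32) {
  %v = "some_def"() : () -> (vector<96xf32>)
  %e = vector.extractelement %v[%pos : index] : vector<96xf32>
  vector.yield %e : f32
}
```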
Differential Revision: https://reviews.llvm.org/D137336
Quantization methods are crucial and ubiquitous in accelerating machine
learning workloads. Most of these methods use f16 and i8 types.
This patch relaxes the type constraints on warp reduce distribution to
allow these types. Furthermore, this patch also changes the interface
and moves the initial reduction of data to a single thread into the
distributedReductionFn. This gives developers the flexibility to control
how they obtain the initial lane value, which might differ based on the
input types (e.g., to shuffle over a 32-bit-wide type, we need to reduce
f16 values to 2xf16 rather than to a single element).
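A minimal sketch of the packing this enables, assuming `%h`, `%offset`, and
`%width` are values defined elsewhere:
```
// 32-bit-wide shuffles cannot move a single f16, so two f16 elements are
// packed into one i32 before shuffling.
%packed = vector.bitcast %h : vector<2xf16> to vector<1xi32>
%w = vector.extract %packed[0] : vector<1xi32>
%shfl, %valid = gpu.shuffle xor %w, %offset, %width : i32
```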
Reviewed By: ThomasRaoux
Differential Revision: https://reviews.llvm.org/D137691
When a value used in the forOp is defined outside the region but within
the parent warpOp, we need to return and distribute the value to pass it
to new operations created within the loop.
Also simplify the lambda interface.
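A hypothetical sketch of the situation (`"some_def"`, the loop bounds, and
`%init` are placeholders):
```
// %v is defined in the warp op region but outside the loop; to distribute
// the loop, %v must be yielded from the warp op, distributed, and passed to
// the new scf.for as an extra argument.
%r = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<4xf32>) {
  %v = "some_def"() : () -> (vector<128xf32>)
  %f = scf.for %i = %c0 to %c128 step %c1 iter_args(%acc = %init) -> (vector<128xf32>) {
    %a = arith.addf %acc, %v : vector<128xf32>
    scf.yield %a : vector<128xf32>
  }
  vector.yield %f : vector<128xf32>
}
```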
Differential Revision: https://reviews.llvm.org/D137146
This patch takes the first step towards a more principled modeling of undefined behavior in MLIR as discussed in the following discourse threads:
1. https://discourse.llvm.org/t/semantics-modeling-undefined-behavior-and-side-effects/4812
2. https://discourse.llvm.org/t/rfc-mark-tensor-dim-and-memref-dim-as-side-effecting/65729
This patch in particular does the following:
1. Introduces a ConditionallySpeculatable OpInterface that dynamically determines whether an Operation can be speculated.
2. Re-defines `NoSideEffect` to allow undefined behavior, making it necessary but not sufficient for speculation. Also renames it to `NoMemoryEffect`.
3. Makes LICM respect the above semantics; see the sketch after this list.
4. Changes all ops tagged with `NoSideEffect` today to additionally implement ConditionallySpeculatable and mark themselves as always speculatable. This combined trait is named `Pure`. This makes this change NFC.
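A minimal sketch of why the distinction matters, with placeholder values:
`arith.divsi` has no memory effects, yet hoisting it out of the loop would
introduce undefined behavior when `%d` is zero and the loop never executes.
```
scf.for %i = %lb to %ub step %step {
  %q = arith.divsi %n, %d : i32
  "some_use"(%q) : (i32) -> ()
}
```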
For out of tree dialects:
1. Replace `NoSideEffect` with `Pure` if the operation does not have any memory effects, undefined behavior or infinite loops.
2. Replace `NoSideEffect` with `NoMemoryEffect` otherwise.
The next steps in this process are (I'm proposing to do these in upcoming patches):
1. Update operations like `tensor.dim`, `memref.dim`, `scf.for`, `affine.for` to implement a correct hook for `ConditionallySpeculatable`. I'm also happy to update ops in other dialects if the respective dialect owners would like to and can give me some pointers.
2. Update other passes that speculate operations to consult `ConditionallySpeculatable` in addition to `NoMemoryEffect`. I could not find any other than LICM on a quick skim, but I could have missed some.
3. Add some documentation / FAQs detailing the differences between side effects, undefined behavior, and speculatability.
Reviewed By: rriddle, mehdi_amini
Differential Revision: https://reviews.llvm.org/D135505
Simplify the lowering of warp_execute_on_lane_0 to scf.if by making the
logic more generic. Also remove the assumption that the innermost
dimension is the distributed dimension.
Differential Revision: https://reviews.llvm.org/D133826
This revision significantly improves and tests the broadcast behavior of vector.warp_execute_on_lane_0.
Previously, the implementation of the broadcast behavior of vector.warp_execute_on_lane_0
assumed that the broadcasted value was always of scalar type.
This is not necessarily the case.
Differential Revision: https://reviews.llvm.org/D133767
Running: `mlir-opt -test-vector-warp-distribute=rewrite-warp-ops-to-scf-if -canonicalize -verify-each=0`.
Prior to this revision, IR resembling the following would be produced:
```
%4 = "vector.load"(%3, %arg0) : (memref<1x32xf32, 3>, index) -> vector<1x1xf32>
```
This fails verification since it needs 2 indices to load but only 1 is provided.
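With the fix, the load supplies one index per memref dimension; a plausible
corrected form (the added `%c0` index is an assumption):
```
%4 = vector.load %3[%c0, %arg0] : memref<1x32xf32, 3>, vector<1x1xf32>
```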
Differential Revision: https://reviews.llvm.org/D133106
Prevent creating multiple outputs for the same Value when distributing
operations out of WarpExecuteOnLane0Op. This avoids creating a
combinatorial explosion of outputs.
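A hypothetical sketch of the duplication being avoided (`"some_def"` is a
placeholder):
```
// Before the fix, yielding the same %v for two results created two outputs;
// the patterns now reuse a single distributed result for both.
%r:2 = vector.warp_execute_on_lane_0(%laneid)[32] -> (vector<1xf32>, vector<1xf32>) {
  %v = "some_def"() : () -> (vector<32xf32>)
  vector.yield %v, %v : vector<32xf32>, vector<32xf32>
}
```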
Differential Revision: https://reviews.llvm.org/D129465
This revision avoids a crash in the 0-D case of distributing vector.transfer ops out of
vector.warp_execute_on_lane_0.
Due to the code complexity and lack of documentation, it took untangling the implementation
before realizing that the simple fix was to fail in the 0-D case.
The rewrite is still very useful for understanding this code better.
Differential Revision: https://reviews.llvm.org/D128793
When creating an scf.for without arguments, an scf.yield is automatically
created. Make sure we don't create a second one.
Differential Revision: https://reviews.llvm.org/D128405
This aligns the SCF dialect file layout with the majority of the dialects.
Reviewed By: jpienaar
Differential Revision: https://reviews.llvm.org/D128049
Make the reduction distribution pattern more generic and remove a layering
problem. The new pattern to distribute reductions is now independent of
GPU and takes a lambda to decide how the distributed reduction should be
generated.
Differential Revision: https://reviews.llvm.org/D127867
Add a pattern to do ad hoc lowering of vector.reduction to a sequence of
warp shuffles. This allows distributing reductions across a warp for GPU targets.
Also add an execution test for warp reduction.
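A minimal sketch of one step of such a butterfly reduction, assuming `%x`
holds the lane's partial value:
```
// Each step combines a lane's value with its XOR-partner's; the offset
// halves (16, 8, 4, 2, 1) until every lane holds the full reduction.
%c16 = arith.constant 16 : i32
%c32 = arith.constant 32 : i32
%s, %valid = gpu.shuffle xor %x, %c16, %c32 : f32
%sum = arith.addf %x, %s : f32
```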
co-authored with @springerm
Differential Revision: https://reviews.llvm.org/D127176