clang-p2996

Author	SHA1	Message	Date
Andrzej Warzyński	354adb44c9	[mlir][vector] Extend `CreateMaskFolder` (#75842 ) Extends `CreateMaskFolder` pattern so that the following: ```mlir %c8 = arith.constant 8 : index %c16 = arith.constant 16 : index %0 = vector.vscale %1 = arith.muli %0, %c16 : index %10 = vector.create_mask %c8, %1 : vector<8x[16]xi1> ``` is folded as: ```mlir %0 = vector.constant_mask [8, 16] : vector<8x[16]xi1> ```	2023-12-20 11:08:54 +00:00
Matthias Springer	f10302e3fa	[mlir] Require folders to produce Values of same type (#75887 ) This commit adds extra assertions to `OperationFolder` and `OpBuilder` to ensure that the types of the folded SSA values match with the result types of the op. There used to be checks that discard the folded results if the types do not match. This commit makes these checks stricter and turns them into assertions. Discarding folded results with the wrong type (without failing explicitly) can hide bugs in op folders. Two such bugs became apparent in MLIR (and some more in downstream projects) and are fixed with this change. Note: The existing type checks were introduced in https://reviews.llvm.org/D95991. Migration guide: If you see failing assertions (`folder produced value of incorrect type`; make sure to run with assertions enabled!), run with `-debug` or dump the operation right before the failing assertion. This will point you to the op that has the broken folder. A common mistake is a mismatch between static/dynamic dimensions (e.g., input has a static dimension but folded result has a dynamic dimension).	2023-12-20 14:39:22 +09:00
Jakub Kuderski	560564f51c	[mlir][vector][gpu] Align minf/maxf reduction kind names with arith (#75901 ) This is to avoid confusion when dealing with reduction/combining kinds. For example, see a recent PR comment: https://github.com/llvm/llvm-project/pull/75846#discussion_r1430722175. Previously, they were picked to mostly mirror the names of the llvm vector reduction intrinsics: https://llvm.org/docs/LangRef.html#llvm-vector-reduce-fmin-intrinsic. In isolation, it was not clear if `<maxf>` has `arith.maxnumf` or `arith.maximumf` semantics. The new reduction kind names map 1:1 to arith ops, which makes it easier to tell/look up their semantics. Because both the vector and the gpu dialect depend on the arith dialect, it's more natural to align names with those in arith than with the lowering to llvm intrinsics. Issue: https://github.com/llvm/llvm-project/issues/72354	2023-12-20 00:14:43 -05:00
Jakub Kuderski	a528cee224	[mlir][vector] Improve `makeArithReduction` expansion (#75846 ) Propagate fast math flags. Distinguish `minf`/`maxf` and `minimumf`/`maximumf`. Required for future patterns in https://github.com/llvm/llvm-project/pull/75727.	2023-12-18 17:47:46 -05:00
Rik Huijzer	51e5f677c8	[mlir][vector] Fix crash on invalid `permutation_map` (#74925 ) Without this patch, MLIR crashes with ``` Assertion failed: (getNumDims() == map.getNumResults() && "Number of results mismatch"), function compose, file AffineMap.cpp, line 537. ``` during parsing.	2023-12-11 12:07:41 +01:00
Adam Paszke	34df53739a	Revert "[mlir][Vector] Add fold transpose(shape_cast) -> shape_cast (#73951 )" (#74579 ) This reverts commit `f42b7615b8`. The fold pattern is incorrect, because it does not even look at the permutation of non-unit dims and is happy to replace a pattern such as ``` %22 = vector.shape_cast %21 : vector<1x256x256xf32> to vector<256x256xf32> %23 = vector.transpose %22, [1, 0] : vector<256x256xf32> to vector<256x256xf32> ``` with ``` %22 = vector.shape_cast %21 : vector<1x256x256xf32> to vector<256x256xf32> ``` which is obviously incorrect.	2023-12-06 11:15:47 +01:00
Benjamin Maxwell	f42b7615b8	[mlir][Vector] Add fold transpose(shape_cast) -> shape_cast (#73951 ) This folds transpose(shape_cast) into a new shape_cast, when the transpose just permutes a unit dim from the result of the shape_cast. Example: ``` %0 = vector.shape_cast %vec : vector<[4]xf32> to vector<[4]x1xf32> %1 = vector.transpose %0, [1, 0] : vector<[4]x1xf32> to vector<1x[4]xf32> ``` Folds to: ``` %0 = vector.shape_cast %vec : vector<[4]xf32> to vector<1x[4]xf32> ``` This is an (alternate) fix for lowering matmuls to ArmSME.	2023-12-01 14:24:36 +00:00
Quinn Dawkins	6e8f7d5966	[mlir][vector] Fix patterns for dropping leading unit dims from masks (#73525 ) Previously the pattern only worked when the permutation map was a minor identity. Infer the new mask type from the new transfer map after dropping leading unit dims.	2023-11-27 12:35:32 -05:00
MaheshRavishankar	1f141737c7	Revert "[mlir][vector] Move transpose with unit-dim to shape_cast pattern (#72493 )" (#72918 ) This reverts commit `95acb33b45`.	2023-11-21 06:18:51 -08:00
Matthias Springer	32c3decb77	[mlir][vector] Modernize `vector.transpose` op (#72594 ) * Declare arguments/results with `let` statements. * Rename `transp` to `permutation`. * Change type of `transp` from `I64ArrayAttr` to `DenseI64ArrayAttr` (provides direct access to `ArrayRef<int64_t>` instead of `ArrayAttr`).	2023-11-20 11:25:35 +01:00
Cullen Rhodes	95acb33b45	[mlir][vector] Move transpose with unit-dim to shape_cast pattern (#72493 ) Moved from lowering to canonicalization.	2023-11-17 14:06:03 +00:00
Matthias Springer	8a40fcaf35	[mlir][vector] Clean up VectorTransferOpInterface (#72353 ) - Better documentation. - Rename interface methods: `source` -> `getSource`, `indices` -> `getIndices`, etc. to conform with MLIR naming conventions. A default implementation is not needed. - Turn many interface methods into helper functions. Most of the previous interface methods were not meant to be overridden, and if some were overridden without others, the op would be have been broken.	2023-11-16 10:35:46 +09:00
long.chen	1609f1c2a5	[mlir][affine][nfc] cleanup deprecated T.cast style functions (#71269 ) detail see the docment: https://mlir.llvm.org/deprecation/ Not all changes are made manually, most of them are made through a clang tool I wrote https://github.com/lipracer/cpp-refactor.	2023-11-14 13:01:19 +08:00
Quinn Dawkins	28ace78d0b	[mlir][vector] NFC: Rename helper to check for all ones mask for consistency (#71810 ) Following up on https://github.com/llvm/llvm-project/pull/71676#discussion_r1387561079	2023-11-10 09:32:06 -05:00
Quinn Dawkins	f6d487207d	[mlir][vector] Add folders for full constant transfer masks (#71676 ) When the mask bounds of a `vector.constant_mask` exactly equal the shape of the vector, any transfer op consuming that mask will be unaffected by it. Drop the mask in such cases.	2023-11-08 16:35:28 -05:00
Ivan Butygin	0a22a80c1b	[mlir][vector] Fix extractelement/insertelement folder crash on poison attr (#71333 ) Types of incoming attributes weren't properly checked.	2023-11-06 16:22:56 +03:00
Nicolas Vasilache	8ba5c7a326	[mlir][Vector] Add initial support for inlining in the presence of vector ops (#70942 )	2023-11-01 16:34:04 +01:00
Mehdi Amini	237d5c6469	[mlir] Fix vector::InsertElementOp::fold on invalid input The IR is valid, but UB: there is an out-of-bound index for the position to insert inside the vector. We should just ignore this in the folder. Fixes #70884	2023-10-31 21:16:28 -07:00
Matthias Springer	e6e55e620b	[mlir][vector] Fix off-by-one error in `getTransferChunkAccessed` (#70292 ) If a dimension does not appear in the permutation map of a vector transfer op, the size of the accessed slice in that dimension is `1`. Before this fix, `getTransferChunkAccessed` used to return `0` for such dimensions, which would means that `0` elements in the underlying tensor/memref are accessed. Note: There is no test case that fails due to this bug and because this interface method is currently only used in one place, it is hard to write a regression test. This fix is in preparation of subset hoisting functionality that will be added in subsequent commits.	2023-10-27 17:39:55 +09:00
Andrzej Warzyński	f24c443e82	[mlir][SVE] Add an e2e test for vector.contract (#69845 ) Adds an end-to-end test for `vector.contract` that targets SVE (i.e. scalable vectors). Note that this requires lifting the restriction on `vector.outerproduct` (to which `vector.contract` is lowered to) that would deem the following as invalid by the Op verifier (*): ``` vector.outerproduct %27, %28, %26 {kind = #vector.kind<add>} : vector<3xf32>, vector<[2]xf32> ``` This is indeed valid as the end-to-end test demonstrates (at least when compiling for SVE).	2023-10-26 20:57:49 +01:00
Benjamin Maxwell	c7be2dee2a	[mlir][VectorOps] Add fold `ExtractOp(CreateMask) -> CreateMask` (#69456 ) This allows folding extracts from `vector.create_mask` ops that have a known value. Currently, there's no fold for this, but you get the same effect from the unrolling in LowerVectorMask (part of -convert-vector-to-llvm), then folds after that. However, for a future patch, this simplification needs to be done before lowering to LLVM, hence the need for this fold. E.g.: ``` %0 = vector.create_mask %c1, %dimA, %dimB : vector<1x[4]x[4]xi1> %1 = vector.extract %mask[0] : vector<[4]x[4]xi1> ``` -> ``` %0 = vector.create_mask %dimA, %dimB : vector<[4]x[4]xi1> ```	2023-10-23 12:40:01 +01:00
Benjamin Maxwell	03d1c99d99	[mlir][ODS] Add `OptionalTypesMatchWith` and remove a custom assemblyFormat (#68876 ) This is just a slight specialization of `TypesMatchWith` that returns success if an optional parameter is missing. There may be other places this could help e.g.: `eb21049b4b/mlir/include/mlir/Dialect/X86Vector/X86Vector.td (L58-L59)` ...but I'm leaving those to avoid some churn. This constraint will be handy for us in some later patches, it's a formalization of a short circuiting trick with the `comparator` of the `TypesMatchWith` constraint (devised for #69195). ``` TypesMatchWith< "padding type matches element type of result (if present)", "result", "padding", "::llvm::cast<VectorType>($_self).getElementType()", // This returns true if no padding is present, or it's present with a type that matches the element type of `result`. "!getPadding() \|\| std::equal_to<>()"> ``` This is a little non-obvious, so after this patch you can instead do: ``` OptionalTypesMatchWith< "padding type matches element type of result (if present)", "result", "padding", "::llvm::cast<VectorType>($_self).getElementType()"> ```	2023-10-19 17:00:02 +01:00
Lei Zhang	3049ac44e6	[mlir][vector] Enable transfer op hoisting with dynamic indices (#68500 ) Recent changes (https://github.com/llvm/llvm-project/pull/66930) disabled vector transfer ops hoisting with view-like intermediate ops. The recommended way is to fold subview ops into transfer op indices before invoking hoisting. That would mean now we see transfer op indices involving dynamic values, instead of static constant values before with subview ops. Therefore hoisting won't kick in anymore. This breaks downstream users. To fix it, this commit enables hoisting transfer ops with dynamic indices by using `ValueBoundsConstraintSet` to prove ranges are disjoint in `isDisjointTransferIndices`. Given that utility is used in many places including op folders, right now we introduce a flag to it and only set as true for "heavy" transforms in hoisting and load-store forwarding.	2023-10-15 16:37:54 -07:00
Benjamin Maxwell	469b9cbe5a	[mlir][VectorOps] Don't fold extract chains that include dynamic indices (#68333 ) This is not yet supported and previously led to a confusing crash where an extract op with a kDynamic marker, but no dynamic positions was created. The verifier has also been updated to check for this, and hint at where the problem is likely to be.	2023-10-06 14:32:37 +01:00
Cullen Rhodes	9816edc9f3	[mlir][vector] add result type to vector.extract assembly format (#66499 ) The vector.extract assembly format currently only contains the source type, for example: %1 = vector.extract %0[1] : vector<3x7x8xf32> it's not immediately obvious if this is the source or result type. This patch improves the assembly format to make this clearer, so the above becomes: %1 = vector.extract %0[1] : vector<7x8xf32> from vector<3x7x8xf32>	2023-09-28 11:11:16 +01:00
Diego Caballero	98f6289a34	[mlir][Vector] Add support for Value indices to vector.extract/insert `vector.extract/insert` ops only support constant indices. This PR is extending them so that arbitrary values can be used instead. This work is part of the RFC: https://discourse.llvm.org/t/rfc-psa-remove-vector-extractelement-and-vector-insertelement-ops-in-favor-of-vector-extract-and-vector-insert-ops Differential Revision: https://reviews.llvm.org/D155034	2023-09-22 00:39:32 +00:00
Nicolas Vasilache	1b8b556443	[mlir][Vector] Add fastmath flags to vector.reduction (#66905 ) This revision pipes the fastmath attribute support through the vector.reduction op. This seemingly simple first step already requires quite some genuflexions, file and builder reorganization. In the process, retire the boolean reassoc flag deep in the LLVM dialect builders and just use the fastmath attribute. During conversions, templated builders for predicated intrinsics are partially cleaned up. In the future, to finalize the cleanups, one should consider adding fastmath to the VPIntrinsic ops.	2023-09-20 16:57:20 +02:00
Benjamin Maxwell	2f11ce5579	[mlir][VectorOps] Extend vector.constant_mask to support 'all true' scalable dims (#66638 ) This extends `vector.constant_mask` so that mask dim sizes that correspond to a scalable dimension are treated as if they're implicitly multiplied by vscale. Currently this is limited to mask dim sizes of 0 or the size of the dim/vscale. This allows constant masks to represent all true and all false scalable masks (and some variations): ``` // All true scalable mask %mask = vector.constant_mask [8] : vector<[8]xi1> // All false scalable mask %mask = vector.constant_mask [0] : vector<[8]xi1> // First two scalable rows %mask = vector.constant_mask [2,4] : vector<4x[4]xi1> ```	2023-09-20 14:54:42 +01:00
Daniil Dudkin	4a831250b8	[mlir][vector] Rename vector reductions: `maxf` → `maximumf`, `minf` → `minimumf` This patch is part of a larger initiative aimed at fixing floating-point `max` and `min` operations in MLIR: https://discourse.llvm.org/t/rfc-fix-floating-point-max-and-min-operations-in-mlir/72671. Here, we are addressing task 2.1 from the plan, which involves renaming the vector reductions to align with the semantics of the corresponding LLVM intrinsics. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D158618	2023-09-13 22:49:07 +00:00
Daniil Dudkin	c46a04339a	[mlir][arith] Rename `AtomicRMWKind`'s `maxf` → `maximumf`, `minf` → `minimumf` (#66135 ) This patch is part of a larger initiative aimed at fixing floating-point `max` and `min` operations in MLIR: https://discourse.llvm.org/t/rfc-fix-floating-point-max-and-min-operations-in-mlir/72671. This commit renames `maxf` and `minf` enumerators of `AtomicRMWKind` to better reflect the current naming scheme and the goals of the RFC.	2023-09-14 01:09:37 +03:00
Daniil Dudkin	8a6e54c9b3	[mlir][arith] Rename operations: `maxf` → `maximumf`, `minf` → `minimumf` (#65800 ) This patch is part of a larger initiative aimed at fixing floating-point `max` and `min` operations in MLIR: https://discourse.llvm.org/t/rfc-fix-floating-point-max-and-min-operations-in-mlir/72671. This commit addresses Task 1.2 of the mentioned RFC. By renaming these operations, we align their names with LLVM intrinsics that have corresponding semantics.	2023-09-11 22:02:19 -07:00
Matthias Springer	339753de12	[mlir][vector][NFC] `isDisjointTransferIndices`: Use `getConstantIntValue` (#65931 ) Use `getConstantIntValue` instead of matching for `arith::ConstantOp`.	2023-09-11 13:30:36 +02:00
Andrzej Warzyński	718af88376	[mlir][vector] Extend mask calculation for vector.contract (#65733 ) Make sure that when calculating the expected mask for `vector.contract`, scalable sizes are correctly taken into account. Depends on: #65724	2023-09-11 11:34:47 +01:00
Andrzej Warzyński	7ec8fd4cc7	[mlir][Vector] Make `vector.contract` work with scalable vectors (#65724 ) This is just a small fix that makes sure that `vector.contract` works with scalable vectors. Rather than duplicating all the roundtrip tests for vector.contract, I'm treating scalable vectors as an edge case and just adding a couple to verify that this works.	2023-09-11 09:14:25 +01:00
Fangrui Song	7557530f42	[mlir] Fix duplicate word typos; NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 20:53:08 -07:00
Cullen Rhodes	067bd7d051	[mlir][vector] Use optional for outerproduct accumulator instead of variadic This was introduced before the Optional directive and uses Variadic, but it's really optional. Reviewed By: nicolasvasilache, benmxwl-arm, dcaballe Differential Revision: https://reviews.llvm.org/D159259	2023-09-01 05:50:01 +00:00
Andrzej Warzynski	acc572f490	[mlir][nfc] Avoid implicit casting VectorType -> Type Just a small follow-up for https://reviews.llvm.org/D159122	2023-08-30 18:01:00 +01:00
Benjamin Maxwell	296d5cb60c	[mlir][BuiltinTypes] Return VectorType from VectorType::Builder conversion operator 0-D vectors are now supported, so the special case of returning the just the element type can now be removed. A few callers that relied on the old behaviour have been updated. Reviewed By: awarzynski, nicolasvasilache Differential Revision: https://reviews.llvm.org/D159122	2023-08-30 13:47:06 +00:00
Markus Böck	4dd744ac9c	Reland "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `b26bb30b46`.	2023-08-30 09:31:54 +02:00
Markus Böck	b26bb30b46	Revert "[mlir] Use a type for representing branch points in `RegionBranchOpInterface`" This reverts commit `024f562da6`. Forgot to update flang	2023-08-29 20:17:50 +02:00
Markus Böck	024f562da6	[mlir] Use a type for representing branch points in `RegionBranchOpInterface` The current implementation is not very ergonomic or descriptive: It uses `std::optional<unsigned>` where `std::nullopt` represents the parent op and `unsigned` is the region number. This doesn't give us any useful methods specific to region control flow and makes the code fragile to changes due to now taking the region number into account. This patch introduces a new type called `RegionBranchPoint`, replacing all uses of `std::optional<unsigned>` in the interface. It can be implicitly constructed from a region or a `RegionSuccessor`, can be compared with a region to check whether the branch point is branching from the parent, adds `isParent` to check whether we are coming from a parent op and adds `RegionSuccessor::parent` as a descriptive way to indicate branching from the parent. Differential Revision: https://reviews.llvm.org/D159116	2023-08-29 20:02:23 +02:00
Andrzej Warzynski	1d9dcca9e3	[mlir][vector] Add canonicalzation pattern for shape_cast(create_mask) This is primarily to avoid trailing unit dims: ``` %1 = vector.create_mask %c1, %dim_0, %c1, %c1 : vector<1x4x1x1xi1> %2 = vector.shape_cast %1 : vector<1x4x1x1xi1> to vector<1x4xi1> ``` becomes: ``` %1 = vector.create_mask %c1, %dim_0 : vector<1x4xi1> ``` Differential Revision: https://reviews.llvm.org/D158111	2023-08-24 07:14:42 +00:00
Andrzej Warzynski	f9070b2dfb	[mlir][vector] Enable CastAwayElementwiseLeadingOneDim for scalable vec This patch effectively enables the CastAwayElementwiseLeadingOneDim rewrite pattern for scalable vectors. To this end, `ExtractOp::inferReturnTypes` is updated so that scalable dimensions are correctly recognised. The change to ExtractOp will likely make also other conversion patterns valid for scalable vectors, but this patch focuses on just one case. Other conversion patterns will be enabled in the forthcoming patches. Depends on D157993 Differential Revision: https://reviews.llvm.org/D158335	2023-08-22 11:40:46 +00:00
Lei Zhang	73ddc4474b	[mlir][vector] Enable distribution over multiple dimensions This commit starts enabling vector distruction over multiple dimensions. It requires delinearize the lane ID to match the expected rank. shape_cast and transfer_read now can properly handle multiple dimensions. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D157931	2023-08-16 12:08:43 -07:00
Lei Zhang	7897a944d9	[mlir][vector] Fold extract(shape_cast) for same element count Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D157930	2023-08-15 11:28:35 -07:00
Lei Zhang	9f37c21349	[mlir][vector] Fold shape_cast of broadcast with same element count For such cases we can generate a shape cast to simplify the IR. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D157929	2023-08-15 11:26:15 -07:00
Andrzej Warzynski	51a57074bc	[mlir][vector] Add missing canonicalisation for vector.multi_reduction Make sure that when canonicalising masked `vector.multi_reduction` and creating `arith.select` to replace the mask, scalability of the mask is preserved. Differential Revision: https://reviews.llvm.org/D157732	2023-08-11 19:06:23 +00:00
Andrzej Warzynski	12b4951866	[mlir][vector] Add missing support for scalable vectors This patch adds the missing logic so that the `TransferReadPermutationLowering` can be used for scalable vectors. To this end: * TransferOp custom C++ builder is updated to support scalable vectors, * `TransferOpReduceRank` is also updated to support scalable vectors. This pattern is relevant when lowering `linalg.matmul` via `vector_multi_reduction` for scalable vectors. I've also updated relevant code in `TransferOpReduceRank` not to use `llvm::to_vector` for constructing `SmallVector` from `ArrayRef`. That hook doesn't work for `ArraryRef<bool>` (), so for consistency I switched to an explicit constructor (so that both `newShape` and `newScalableDim` are constructed in a similar fashion). () IIUC, that's due how implicit narrowing conversions between `bool` and `bool` work. Note that these narrowing conversions change when using initializer lists, see https://en.cppreference.com/w/cpp/language/list_initialization. Depends on D157092 Differential Revision: https://reviews.llvm.org/D157268	2023-08-10 09:08:30 +00:00
Markus Böck	138df29820	[mlir] Revamp `RegionBranchOpInterface` successor mechanism The `RegionBranchOpInterface` had a few fundamental issues caused by the API design of `getSuccessorRegions`. It always required passing values for the `operands` parameter. This is problematic as the operands parameter actually changes meaning depending on which predecessor `index` is referring to. If coming from a region, you'd have to find a `RegionBranchTerminatorOpInterface` in that region, get its operand count, and then create a `SmallVector` of that size. This is not only inconvenient, but also error-prone, which has lead to a bug in the implementation of a previously existing `getSuccessorRegions` overload. Additionally, this made the method dual-use, trying to serve two different use-cases: 1) Trying to determine possible control flow edges between regions and 2) Trying to determine the region being branched to based on constant operands. This patch fixes these issues by changing the interface methods and adding new ones: * The `operands` argument of `getSuccessorRegions` has been removed. The method is now only responsible for returning possible control flow edges between regions. * An optional `getEntrySuccessorRegions` method has been added. This is used to determine which regions are branched to from the parent op based on constant operands of the parent op. By default, it calls `getSuccessorRegions`. This is analogous to `getSuccessorForOperands` from `BranchOpInterface`. * Add `getSuccessorRegions` to `RegionBranchTerminatorOpInterface`. This is used to get the possible successors of the terminator based on constant operands. By default, it calls the containing `RegionBranchOpInterface`s `getSuccessorRegions` method. * `getSuccessorEntryOperands` was renamed to `getEntrySuccessorOperands` for consistency. Differential Revision: https://reviews.llvm.org/D157506	2023-08-10 10:27:27 +02:00
Andrzej Warzynski	5c581720b9	[mlir][Vector] Add support for scalable vectors in multi_reduction Support for scalable vectors in vector.multi_reduction is added by simply updating MultiDimReductionOp::verify. Also, the conversion pattern for reducing n-D vector.multi_reduction to 2D vector.multi_reduction is updated. Differential Revision: https://reviews.llvm.org/D157092	2023-08-08 17:01:59 +00:00

1 2 3 4 5

220 Commits