clang-p2996

Author	SHA1	Message	Date
Matthias Springer	1523b72946	[mlir][vector] Distribute vector.insert op In case the distributed dim of the dest vector is also a dim of the src vector, each lane inserts a smaller part of the source vector. Otherwise, one lane inserts the entire src vector and the other lanes do nothing. Differential Revision: https://reviews.llvm.org/D137953	2023-01-09 16:50:28 +01:00
Matthias Springer	73ce971c63	[mlir][vector] Distribute vector.insertelement op In case of a distribution, only one lane inserts the scalar value. In case of a broadcast, every lane inserts the scalar. Differential Revision: https://reviews.llvm.org/D137929	2023-01-09 16:41:08 +01:00
Matthias Springer	9085f00b4d	[mlir][vector] Support vector.extract distribution of >1D vectors Ops such as `%1 = vector.extract %0[2] : vector<5x96xf32>`. Distribute the source vector, then extract. In case of a 1d extract, rewrite to vector.extractelement. Differential Revision: https://reviews.llvm.org/D137646	2023-01-09 16:39:50 +01:00
Matthias Springer	e7790fbed3	[mlir] Add `test-convergence` option to Canonicalizer tests This new option is set to `false` by default. It should be set only in Canonicalizer tests to detect faulty canonicalization patterns. I.e., patterns that prevent the canonicalizer from converging. The canonicalizer should always convergence on such small unit tests that we have in `canonicalize.mlir`. Two faulty canonicalization patterns were detected and fixed with this change. Differential Revision: https://reviews.llvm.org/D140873	2023-01-04 12:02:21 +01:00
Thomas Raoux	1a0453eb44	[mlir][vector] Fix bug in extractOp folding We were missing to check for transpose when folding. Also add a new file to test folding independently of canonicalization as canonicalization was hiding the bug. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140533	2022-12-22 14:51:02 -08:00
Matthias Springer	41e731f2a4	[mlir][vector] Add additional scalar vector transfer foldings * Rewrite vector.transfer_write of vectors with 1 element to memref.store * Rewrite vector.extract(vector.transfer_read) to memref.load Differential Revision: https://reviews.llvm.org/D140391	2022-12-22 14:43:48 +01:00
Matthias Springer	d0766c0861	[mlir][vector] Fold vector.extractelement(vector.broadcast) Differential Revision: https://reviews.llvm.org/D140394	2022-12-22 10:34:58 +01:00
Matthias Springer	2ec98ffbf1	[mlir][vector] Add scalar vector xfer to memref patterns These patterns devectorize scalar transfers such as vector<f32> or vector<1xf32>. Differential Revision: https://reviews.llvm.org/D140215	2022-12-19 10:27:49 +01:00
jacquesguan	0017dc2d0f	[mlir][Vector] Use llvm::zip to avoid assertion failed. This patch fixes the issue https://github.com/llvm/llvm-project/issues/59455. We could omit the un-changed dimensions in offsets and sizes, so llvm::zip_equal would fail in this case. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D139815	2022-12-14 10:26:54 +08:00
Benjamin Chetioui	a6c8f06f55	[mlir] Clean up typos in FileCheck directives in various tests. Reviewed By: tpopp Differential Revision: https://reviews.llvm.org/D139698	2022-12-12 09:29:14 +01:00
Nicolas Vasilache	de13eeda11	[mlir][Vector] Add a Broadcast::createBroadcastOp helper This helper handles non trivial cases of broadcast + optional transpose creation that should not leak to the outside world. Differential Revision: https://reviews.llvm.org/D139003	2022-11-30 05:32:14 -08:00
Diego Caballero	eb7e2998d1	Reland "[mlir][Vector] Re-define masking semantics in vector.transfer ops"" This relands commit `847b5f82a4`. Differential Revision: https://reviews.llvm.org/D138079	2022-11-29 03:36:54 +00:00
Hanhan Wang	0a1569a400	[mlir][NFC] Remove trailing whitespaces from `.td` and `.mlir` files. This is generated by running ``` sed --in-place 's/[[:space:]]\+$//' mlir/*/.td sed --in-place 's/[[:space:]]\+$//' mlir/*/.mlir ``` Reviewed By: rriddle, dcaballe Differential Revision: https://reviews.llvm.org/D138866	2022-11-28 15:26:30 -08:00
Jakub Kuderski	f0fe38035c	[mlir][vector] Add fold pattern to constant-fold InsertStridedSliceOp Fold InsertStridedOp(ConstantOp into ConstantOp) -> ConstantOp. This pattern comes with vector size threshold to make sure we do not introduce too many large constants. This help clean up code created by the Wide Integer Emulation pass. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D138739	2022-11-28 14:25:28 -05:00
Nicolas Vasilache	0650e1bcc0	[mlir][vector] Fix folding of vector.extract from vector.broadcast This revision fixes a bug in the vector.extract folding that was missing handling the "dim-1" broadcasting case in vector.broadcast. Differential Revision: https://reviews.llvm.org/D138804	2022-11-28 07:17:31 -08:00
Jakub Kuderski	5d05d2966f	[mlir][vector] Add fold pattern for InsertOp(Constant into Constant) This pattern comes with vector size threshold to make sure we do not introduce too many large constants. This help clean up code created by the Wide Integer Emulation pass. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D138733	2022-11-25 23:01:29 -05:00
Jakub Kuderski	afba86709f	[mlir][vector] Add fold for ExtractStridedSlice(non-splat ConstantOp) This allows us to better canonicalize/clean-up code created by the Wide Integer Emulation pass. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D138606	2022-11-25 13:42:56 -05:00
Jakub Kuderski	0e72d00d19	[mlir][vector] Constant fold sub-vector extraction This generalizes the existing fold for `ExtractOp(non-splat constant)` to work with vector results. The vector case is handled by extracting the subrange of attribute array. My main use it to clean up code generated by the Wide Integer Emulation pass. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D138690	2022-11-25 13:39:45 -05:00
Nicolas Vasilache	6ff1233304	[mlir][Vector] Add a LowerVectorsOp to VectorTransformOps This op significantly improves transfor dialect usage when using vector abstractions. It also brings us closer to writing simple end-to-end unit tests that guard against subtle regressions in how patterns combine. Differential Revision: https://reviews.llvm.org/D138723	2022-11-25 08:27:36 -08:00
Diego Caballero	847b5f82a4	Revert "[mlir][Vector] Re-define masking semantics in vector.transfer ops" This reverts commit `6c59c5cd08`.	2022-11-18 01:18:11 +00:00
Diego Caballero	6c59c5cd08	[mlir][Vector] Re-define masking semantics in vector.transfer ops Masking hasn't been widely used in vector transfer ops and the semantics of the mask operand were a bit loose. This patch states that the mask operand in a vector transfer op is applied to the read/write part of the operation and, therefore, its shape should match the shape of the elements read/written from/into the memref/tensor regardless of any permutation/broadcasting also applied by the transfer operation. Reviewers: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138079	2022-11-18 01:05:42 +00:00
stanley-nod	dc26c03066	[mlir][vector] Add insertOp src shape check for BubbleUpBitCastForStridedSliceInsert Not all shape of vectors can be casted into other types, we add a check to not fold insertOp into bitcast if the shape does not support it. Examples of unsupported shape castings are f16 vectors to f32 if the shape is not multiple of 2s. or int8 to int32 if shapes are not multiple of 4. Reviewed By: antiagainst, ThomasRaoux Differential Revision: https://reviews.llvm.org/D137802	2022-11-10 16:41:59 -08:00
Matthias Springer	9d51b4e4e7	[mlir][vector] Support vector.extractelement distribution of 1D vectors Ops such as `%1 = vector.extractelement %0[%pos : index] : vector<96xf32>`. In case of an extract from a 1D vector, the source vector is distributed. The lane into which the requested position falls, extracts the element and shuffles it to all other lanes. Differential Revision: https://reviews.llvm.org/D137336	2022-11-10 15:07:56 +01:00
Lei Zhang	39c80656fe	[mlir][vector] Convert extract_strided_slice to extract & insert chain This is useful for breaking down extract_strided_slice and potentially cancel with other extract / insert ops before or after. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D137471	2022-11-09 19:42:07 -05:00
Javier Setoain	aa9647e2d0	[mlir][vector] Add vector.scalable.insert/extract ops These new operations match the semantics of llvm.experimental.vector.insert and llvm.experimental.vector.extract. `vector.scalable.insert` and `vector.scalable.extract` allow, respectively, insert vectors into scalable vectors, and extract vectors from scalable vectors. The discussion about the inclusion of these operations is here: https://discourse.llvm.org/t/rfc-interfacing-between-fixed-length-and-scalable-vectors-for-vls-vector-code-on-scalable-vector-architectures Differential Revision: https://reviews.llvm.org/D127875	2022-11-08 08:51:15 +00:00
rkayaith	13bd410962	[mlir][Pass] Include anchor op in -pass-pipeline In D134622 the printed form of a pass manager is changed to include the name of the op that the pass manager is anchored on. This updates the `-pass-pipeline` argument format to include the anchor op as well, so that the printed form of a pipeline can be directly passed to `-pass-pipeline`. In most cases this requires updating `-pass-pipeline='pipeline'` to `-pass-pipeline='builtin.module(pipeline)'`. This also fixes an outdated assert that prevented running a `PassManager` anchored on `'any'`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D134900	2022-11-03 11:36:12 -04:00
Thomas Raoux	91f62f0e35	[mlir][vector] Fix distribution of scf.for with value coming from above When a value used in the forOp is defined outside the region but within the parent warpOp we need to return and distribute the value to pass it to new operations created within the loop. Also simplify the lambda interface. Differential Revision: https://reviews.llvm.org/D137146	2022-11-02 04:15:18 +00:00
Diego Caballero	c3e09036e8	[mlir][Vector] Introduce the `vector.mask` operation lowering This patch introduces the lowering for xfer ops masked with `vector.mask`. Vector reductions are not lowered yet because new LLVM intrinsics are needed in the LLVM dialect. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D136741	2022-10-27 22:54:20 +00:00
River Riddle	c8496d292e	[mlir] Refactor alias generation to support nested aliases We currently only support one level of aliases, which isn't great in situations where an attribute/type can have multiple duplicated components nested within it(e.g. debuginfo metadata). This commit refactors alias generation to support nested aliases, which requires changing alias grouping to take into account the depth of child aliases, to ensure that attributes/types aren't printed before the aliases they use. The only real user facing change here was that we no longer print 0 as an alias suffix, which would be unnecessarily expensive to keep in the new alias generation method (and isn't that valuable of a behavior to preserve). Differential Revision: https://reviews.llvm.org/D136541	2022-10-23 23:59:55 -07:00
Thomas Raoux	1757164eed	[mlir][vector] Add distribution for extract from 0d vector Differential Revision: https://reviews.llvm.org/D135994	2022-10-14 23:06:42 +00:00
Diego Caballero	2d10f81d46	[mlir][Vector] Introduce 'vector.mask' operation and MaskableOpInterface This patch introduces the `vector.mask` operation and the MaskableOpInterface as described in https://discourse.llvm.org/t/rfc-vector-masking-representation-in-mlir/64964. The `vector.mask` operation is used to predicate the execution of operations implementing the MaskableOpInterface. This interface will be implemented by maskable operations and provides information about its masking constraints and semantics. For now, only vector transfer and reduction ops implement the MaskableOpInterface for illustration and testing purposes. Reviewed By: nicolasvasilache, rriddle Differential Revision: https://reviews.llvm.org/D134939	2022-10-10 21:25:43 +00:00
Lei Zhang	f0c93fd4ca	[mlir][vector] Merge accumulator/result transpose into contract This commit adds a pattern to merge accumulator and result `vector.transpose` ops into `vector.contract`. This kind of pattern can be generated for NCHW convolution vectorization, where we use transposes to convert the 1-D NCW convolution into NWC during vectorization. Merging the transpose would mean we can avoid materialize vector extract/insert for transposes and it makes further vector level transformations easier. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D135496	2022-10-08 00:43:45 +00:00
Murali Vijayaraghavan	9c3d3eeb51	[mlir] vector.multi_reduction canonicalizes to vector.shape_cast (or vector.extract, if the result is a scalar) only if all reduction dimensions are of size 1. Differential Revision: https://reviews.llvm.org/D135333	2022-10-06 00:11:31 +00:00
Murali Vijayaraghavan	617ca92bf1	Revert "Added canonicalization for vector.multi_reduction" This reverts commit `c16f3260a9`. There's a bug in the commit creates a scalar result with `ShapeCastOp`. Reverting till that fix is done.	2022-10-05 21:43:51 +00:00
Murali Vijayaraghavan	c16f3260a9	Added canonicalization for vector.multi_reduction If there are reductions only along unit dimensions, then they are folded Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D134996	2022-10-05 18:43:33 +00:00
Lei Zhang	9e8e4779a2	[mlir][vector] Fix double rank reducing folding bug In https://reviews.llvm.org/D133883, we changed the `FoldExtractSliceIntoTransferRead` pattern from requiring full identity map to minor identity map. This effectively allows rank reducing `vector.transfer_read` ops. However, the logic for checking `tensor.extract_slice` rank reducing still looks at the vector rank, which now could be smaller than the `tensor.extract_slice`'s output tensor rank. It ends up we can have incorrect index cacluation after folding due to this double rank reducing behavior. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D134984	2022-09-30 16:50:48 -04:00
Nicolas Vasilache	435debea69	[mlir][test] NFC - Fix some worst offenders "capture by SSA name" tests Many tests still depend on specific names of SSA values (!!). This commit is a best effort cleanup that will set the stage for adding some pretty SSA result names.	2022-09-30 08:24:13 -07:00
Thomas Raoux	bc13437b15	[mlir][vector] Handle subview correctly in sotre to load opt on memref Make sure we consider other subviews of the same buffer when doing store to load forwarding or dead store elimination. Differential Revision: https://reviews.llvm.org/D134576	2022-09-26 18:28:17 +00:00
Christopher Bate	cb3d0260f0	[mlir][vector] Fix test for vector-warp-distribute Test does a "CHECK-NOT" against the function name when it should check to ensure that the `vector.warp_execute_on_lane_0` is removed. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D134448	2022-09-22 10:51:10 -06:00
Alex Zinenko	83df43f3a2	[mlir] use strided layouts in vector transfer on memrefs One of the vector transformation patterns has been indiscriminately converting layouts to affine maps. Leverage the strided form when possible. Reviewed By: nicolasvasilache, dcaballe Differential Revision: https://reviews.llvm.org/D134047	2022-09-17 08:11:30 +02:00
Alex Zinenko	f3fae035c7	[mlir] use strided layout in structured codegen-related tests All relevant operations have been switched to primarily use the strided layout, but still support the affine map layout. Update the relevant tests to use the strided format instead for compatibility with how ops now print by default. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D134045	2022-09-17 08:11:28 +02:00
Thomas Raoux	54db8cc7b1	[mlir][vector] Remove ExtractMap/InsertMap operations As discussed on discourse: https://discourse.llvm.org/t/vector-vector-distribution-large-vector-to-small-vector/1983/22 removing insert_map/extract_map op as vector distribution now uses warp_execute_on_lane_0 op. Differential Revision: https://reviews.llvm.org/D134000	2022-09-16 17:41:26 +00:00
Alex Zinenko	f096e72ce6	[mlir] switch bufferization to use strided layout attribute Bufferization already makes the assumption that buffers pass function boundaries in the strided form and uses the corresponding affine map layouts. Switch it to use the recently introduced strided layout instead to avoid unnecessary casts when bufferizing further operations to the memref dialect counterparts that now largely rely on the strided layout attribute. Depends On D133947 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133951	2022-09-16 10:56:50 +02:00
Alex Zinenko	2791162b01	[mlir] make memref.subview produce strided layout Memref subview operation has been initially designed to work on memrefs with strided layouts only and has never supported anything else. Port it to use the recently added StridedLayoutAttr instead of extracting the strided from implicitly from affine maps. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D133938	2022-09-16 10:56:46 +02:00
Lei Zhang	8a5cb939e7	[mlir][vector] Check minor identity map in FoldExtractSliceIntoTransferRead vecotr.transfer_read ops with minor identity indexing map is rank reducing, with implicit leading unit dimensions. This should be a natural extension to support in addition to full identity indexing maps. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D133883	2022-09-14 17:24:02 -04:00
Thomas Raoux	4abb9e5d20	[mlir][vector] Clean up and generalize lowering of warp_execute to scf Simplify the lowering of warp_execute_on_lane0 of scf.if by making the logic more generic. Also remove the assumption that the most inner dimension is the dimension distributed. Differential Revision: https://reviews.llvm.org/D133826	2022-09-14 17:36:16 +00:00
Jakub Kuderski	8c2ea14436	[mlir][vector] Fold scalar vector.extract of non-splat n-D constants Add a new pattern to fold `vector.extract` over n-D constants that extract scalars. The previous code handled ND splat constants only. The new pattern is conservative and does handle sub-vector constants. This is to aid the `arith::EmulateWideInt` pass which emits a lot of 2-element vector constants. Reviewed By: Mogball, dcaballe Differential Revision: https://reviews.llvm.org/D133742	2022-09-13 20:30:50 -04:00
Nicolas Vasilache	845dc178c0	[mlir][Vector] Support broadcast vector type in distribution of vector.warp_execute_on_lane_0. This revision significantly improves and tests the broadcast behavior of vector.warp_execute_on_lane_0. Previously, the implementation of the broadcast behavior of vector.warp_execute_on_lane_0 assumed that the broadcasted value was always of scalar type. This is not necessarily the case. Differential Revision: https://reviews.llvm.org/D133767	2022-09-13 08:18:47 -07:00
Thomas Raoux	9e7c97d8ce	[mlir][vector] Fix bug in transfer op flattening The logic to figure out if a transfer op can be flattened wasn't considering the shape being loaded therefore it was incorrectly assuming some transfer ops were reading contigous data. Differential Revision: https://reviews.llvm.org/D133544	2022-09-09 16:02:52 +00:00
Nicolas Vasilache	20df17fd2d	[mlir][vector] Extend WarpExecutionOnLane0 pattern support to allow deduplicating identical yield values. Differential Revision: https://reviews.llvm.org/D133573	2022-09-09 06:53:36 -07:00

1 2 3 4 5 ...

332 Commits