clang-p2996

Author	SHA1	Message	Date
Jacques Pienaar	d2c0572b2e	[mlir] Flip LinAlg dialect to _Both This one required more changes than ideal due to overlapping generated name with different return types. Changed getIndexingMaps to getIndexingMapsArray to move it out of the way/highlight that it returns (more expensively) a SmallVector and uses the prefixed name for the Attribute. Differential Revision: https://reviews.llvm.org/D129919	2022-07-19 14:42:58 -07:00
Benoit Jacob	f0c3fd185e	Don't combine if there would remain no true reduction dim. Differential Revision: https://reviews.llvm.org/D130109	2022-07-19 19:58:53 +00:00
Fangrui Song	3c849d0aef	Modernize Optional::{getValueOr,hasValue}	2022-07-15 01:20:39 -07:00
Thomas Raoux	f48ce52c4c	[mlir][vector] Pattern to clean up vector.extract during distribution This prevents blocking propagation when converting between scalar and vector<1> Differential Revision: https://reviews.llvm.org/D129782	2022-07-14 17:07:32 +00:00
Thomas Raoux	ffa7384f10	[mlir][vector] Support distribution of vector.reduce with accumulator Right now the pattern was ignoring the optional accumulator. Differential Revision: https://reviews.llvm.org/D129719	2022-07-14 14:28:38 +00:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Benoit Jacob	6870a50f43	lowerParallel is also called on unit-size, one-sided reduction dims See: https://gist.github.com/bjacob/d8be8ec7e70ed0be4b3a5794ced2a7e8 Differential Revision: https://reviews.llvm.org/D129096	2022-07-13 16:21:12 +00:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Thomas Raoux	051b36ba28	[mlir][vector] Add accumulator operand to MultiDimReduce op This allows vectorizing linalg reductions without changing the operation order. Therefore this produce a valid vectorization even if operations are not associative. Differential Revision: https://reviews.llvm.org/D129535	2022-07-12 14:28:30 +00:00
Thomas Raoux	0af2680596	[mlir][vector] Add pattern to distribute splat constant Distribute splat constant out of WarpExecuteOnLane0Op region. Differential Revision: https://reviews.llvm.org/D129467	2022-07-11 15:50:26 +00:00
Thomas Raoux	d7d6443d50	[mlir][vector] Avoid creating duplicate output in warpOp Prevent creating multiple output for the same Value when distributing operations out of WarpExecuteOnLane0Op. This avoid creating combinatory explosion of outputs. Differential Revision: https://reviews.llvm.org/D129465	2022-07-11 15:37:50 +00:00
Thomas Raoux	0660f3c5a0	[mlir][vector] Relax reduction distribution pattern Support distributing reductions with vector size multiple of the warp size. Differential Revision: https://reviews.llvm.org/D129387	2022-07-09 18:36:39 +00:00
Matthias Springer	a28ce1a42b	[mlir][vector][bufferize] Fix transfer_write dropping mask operand Differential Revision: https://reviews.llvm.org/D129253	2022-07-07 10:02:13 +02:00
Matthias Springer	6c3c5f8069	[mlir][memref] Improve type inference for rank-reducing subviews The result shape of a rank-reducing subview cannot be inferred in the general case. Just the result rank is not enough. The only thing that we can infer is the layout map. This change also improves the bufferization patterns of tensor.extract_slice and tensor.insert_slice to fully support rank-reducing operations. Differential Revision: https://reviews.llvm.org/D129144	2022-07-05 16:49:07 +02:00
Benoit Jacob	c3839c0b46	CombineContractBroadcast should not create dims unused in LHS+RHS Differential Revision: https://reviews.llvm.org/D129087	2022-07-04 16:52:35 +00:00
Nicolas Vasilache	6a57d8fba5	[mlir][vector] Untangle TransferWriteDistribution and avoid crashing in the 0-D case. This revision avoids a crash in the 0-D case of distributing vector.transfer ops out of vector.warp_execute_on_lane_0. Due to the code complexity and lack of documentation, it took untangling the implementation before realizing that the simple fix was to fail in the 0-D case. The rewrite is still very useful to understand this code better. Differential Revision: https://reviews.llvm.org/D128793	2022-07-01 00:15:34 -07:00
Benoit Jacob	694ad3eaef	Fix CombineContractBroadcast folding reduction iterators. Differential Revision: https://reviews.llvm.org/D128739	2022-06-29 18:01:56 +00:00
Mehdi Amini	08d651d7ba	Apply clang-tidy fixes for performance-unnecessary-value-param in VectorDistribute.cpp (NFC)	2022-06-28 19:52:46 +00:00
Mahesh Ravishankar	fa596c6921	[mlir][Vector] Fix reordering of floating point adds during lower of `vector.contract`. Adding the accumulator value after the `vector.contract` changes the precision of the operation. This makes sure the accumulator is carried through to `vector.reduce` (and down to LLVM). Differential Revision: https://reviews.llvm.org/D128674	2022-06-28 05:26:39 +00:00
Matthias Springer	5d50f51c97	[mlir][bufferization][NFC] Add error handling to getBuffer This is in preparation of adding memory space support. Differential Revision: https://reviews.llvm.org/D128277	2022-06-27 13:48:01 +02:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Thomas Raoux	d343cdd509	[mlir][vector] Fix bug when swapping scf.for and vector warp op When creating a scf.for without argument a scf.yield is automatically created. Make sure we don't create a second one. Differential Revision: https://reviews.llvm.org/D128405	2022-06-24 19:13:02 +00:00
Thomas Raoux	7eba5cdf9c	[mlir][vector] Relax transfer_write vector distribution pattern Small change to relax the pattern to support any vector containing a single element. Differential Revision: https://reviews.llvm.org/D128545	2022-06-24 19:03:14 +00:00
Nicolas Vasilache	f6c79c6ae4	[mlir][Vector]Fix bug where vector::WarpExecuteOnLane0Op are created with 2 blocks in the region Differential Revision: https://reviews.llvm.org/D128534	2022-06-24 07:33:58 -07:00
Kazu Hirata	037f09959a	[mlir] Don't use Optional::hasValue (NFC)	2022-06-20 11:22:37 -07:00
Alex Zinenko	8b68da2c7d	[mlir] move SCF headers to SCF/{IR,Transforms} respectively This aligns the SCF dialect file layout with the majority of the dialects. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D128049	2022-06-20 10:18:01 +02:00
Matthias Springer	b55d55ecd9	[mlir][bufferize][NFC] Remove BufferizationState With the recent refactorings, this class is no longer needed. We can use BufferizationOptions in all places were BufferizationState was used. Differential Revision: https://reviews.llvm.org/D127653	2022-06-17 14:04:11 +02:00
Matthias Springer	b3ebe3beed	[mlir][bufferize] Bufferize after TensorCopyInsertion This change changes the bufferization so that it utilizes the new TensorCopyInsertion pass. One-Shot Bufferize no longer calls the One-Shot Analysis. Instead, it relies on the TensorCopyInsertion pass to make the entire IR fully inplacable. The `bufferize` implementations of all ops are simplified; they no longer have to account for out-of-place bufferization decisions. These were already materialized in the IR in the form of `bufferization.alloc_tensor` ops during the TensorCopyInsertion pass. Differential Revision: https://reviews.llvm.org/D127652	2022-06-17 13:29:52 +02:00
Thomas Raoux	f011d32c3a	[mlir][vector] Fix contraction op lowering with mixed types contraction op can have mixed type, add support for this case to the pattern lowering contraction op to outerproduct. Differential Revision: https://reviews.llvm.org/D127926	2022-06-16 16:40:56 +00:00
Thomas Raoux	6834803c3d	[mlir][vector] NFC remove dependency of VectorTransform to GPU dialect Make the reduction distribution pattern more generic and remove layering problem. The new pattern to distribute reduction is now independent of GPU and takes a lambda to decide how the distributed reduction should be generated. Differential Revision: https://reviews.llvm.org/D127867	2022-06-15 16:08:29 +00:00
Thomas Raoux	087aba4f0f	[mlir][vector] Add pattern to distribute vector reduction to GPU shuffles Add a pattern to do ad hoc lowering of vector.reduction to a sequence of warp shuffles. This allow distributing reduction on a warp for GPU targets. Also add an execution test for warp reduction. co-authored with @springerm Differential Revision: https://reviews.llvm.org/D127176	2022-06-14 05:49:16 +00:00
Thomas Raoux	76cf33dab2	[mlir][vector] Add patterns to propagate vector distribution Add patterns to propagate vector distribution and remove dead arguments. This handles propagation for several vector operations. recommit after minor bug fix. Differential Revision: https://reviews.llvm.org/D127167	2022-06-14 05:26:10 +00:00
Thomas Raoux	2d32dac8bb	Revert "[mlir][vector] Add patterns to propagate vector distribution" This reverts commit `1c84800c42`. This was causing asan crash.	2022-06-13 17:55:31 +00:00
Thomas Raoux	1c84800c42	[mlir][vector] Add patterns to propagate vector distribution Add patterns to propagate vector distribution and remove dead arguments. This handles propagation for several vector operations. Differential Revision: https://reviews.llvm.org/D127167	2022-06-13 16:38:50 +00:00
Mogball	e16d13322b	[mlir] (NFC) Clean up bazel and CMake target names All dialect targets in bazel have been named Dialect and all dialect targets in CMake have been named MLIRDialect.	2022-06-13 16:24:15 +00:00
Thomas Raoux	ed0288f7c4	[mlir][vector] Add patterns for vector distribution Add pattern to hoist scalar code outside of warp distribute region as those cannot be distributed and we would want to execute them on all the lanes. Add patterns to distribute transfer_write ops. Those operations can be distributed in different ways and it is control by user. Differential Revision: https://reviews.llvm.org/D127152	2022-06-10 17:46:51 +00:00
Christopher Bate	9f1221521f	Recommit "[mlir][vector] Allow unroll of contraction in arbitrary order" Fixed issue with vector.contract default unroll permutation. Adds support for vector unroll transformations to unroll in different orders. For example, the vector.contract can be unrolled into a smaller set of contractions. There is a choice of how to unroll the decomposition based on the traversal order of (dim0, dim1, dim2). The choice of traversal order can now be specified by a callback which is given by the caller of the transform. For now, only the vector.contract, vector.transfer_read/transfer_write operations support the callback. Differential Revision: https://reviews.llvm.org/D127004	2022-06-09 14:01:19 -06:00
Christopher Bate	53fe155b3f	Revert "[mlir][vector] Allow unroll of contraction in arbitrary order" Reverts commit `1469ebf838` (original commit) Reverts commit `a392a39f75` (build fix for above commit) The commit broke tests in out-of-tree projects, indicating that some logical error was made in the previous change but not covered by current tests.	2022-06-07 14:54:01 -06:00
Christopher Bate	a392a39f75	[mlir][vector] fix typo in vector unroll transform	2022-06-06 16:09:13 -06:00
Christopher Bate	1469ebf838	[mlir][vector] Allow unroll of contraction in arbitrary order Adds support for vector unroll transformations to unroll in different orders. For example, the `vector.contract` can be unrolled into a smaller set of contractions. There is a choice of how to unroll the decomposition based on the traversal order of (dim0, dim1, dim2). The choice of traversal order can now be specified by a callback which is given by the caller of the transform. For now, only the `vector.contract`, `vector.transfer_read/transfer_write` operations support the callback. Differential Revision: https://reviews.llvm.org/D127004	2022-06-06 14:31:04 -06:00
Matthias Springer	1534177f8f	[mlir][bufferization][NFC] Move OpFilter out of BufferizationOptions Differential Revision: https://reviews.llvm.org/D126568	2022-05-28 01:47:39 +02:00
Thomas Raoux	89aaa2d033	[mlir][vector] Add new lowering mode to vector.contractionOp Add lowering for cases where the reduction dimension is fully unrolled. It is common to unroll the reduction dimension, therefore we would want to lower the contractions to an elementwise vector op in this case. Differential Revision: https://reviews.llvm.org/D126120	2022-05-24 14:19:08 +00:00
Thomas Raoux	4c1b65e7bc	[mlir][vector] Fix crash in DropInnerMostUnitDims pattern Fix number of dimensions when incrementally replacing dimensions in affine map. Differential Revision: https://reviews.llvm.org/D125984	2022-05-19 17:38:04 +00:00
Thomas Raoux	d02f10d96d	[mlir][vector] Add lowering pattern for vector.warp_execute_on_lane_0 op Add lowering of the vector.warp_execute_on_lane_0 into scf.if plus memory transfer for the operands and yield values. This also add an integration test running on GPU warp. The same tests can be later re-used with different comment lines to tests distribution transformations. This is mostly from @springerm contribution. Differential Revision: https://reviews.llvm.org/D125430	2022-05-12 13:27:43 +00:00
Alex Zinenko	4c807f2f57	[mlir][vector] insert `alloca`s outside of loops After https://reviews.llvm.org/D119743 added the `AutomaticAllocationScope` trait to loop-like constructs, the vector transfer full/partial splitting pass started inserting allocations for temporaries within the closest loop rather than the closest function (or other allocation scope such as `async.execute`). While this is correct as long as the lowered code takes care of automatic deallocation at the end of each iteration of the loop, this interferes with downstream optimizations that expect `alloca`s to be at the function level. Step over loops when looking for the closest allocation scope in vector transfer full/partial splitting pass thus restoring the original behavior. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D124366	2022-04-25 10:49:09 +02:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
Lei Zhang	4db65e279b	[mlir][vector] Reorder elementwise(transpose) Similar to the existing pattern for reordering cast(transpose), this makes transpose following transpose and increases the chance of embedding the transposition inside contraction op. Actually cast ops are just special instances of elementwise ops. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123596	2022-04-15 09:05:35 -04:00
Lei Zhang	e54236dfb5	[mlir][vector] Cast away leading one dims for insert ops Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D123621	2022-04-14 08:57:32 -04:00
Mehdi Amini	35f48edb91	Apply clang-tidy fixes for llvm-qualified-auto in VectorTransforms.cpp (NFC)	2022-04-14 09:42:37 +00:00

90 Commits