clang-p2996

Author	SHA1	Message	Date
Matthias Springer	1523b72946	[mlir][vector] Distribute vector.insert op In case the distributed dim of the dest vector is also a dim of the src vector, each lane inserts a smaller part of the source vector. Otherwise, one lane inserts the entire src vector and the other lanes do nothing. Differential Revision: https://reviews.llvm.org/D137953	2023-01-09 16:50:28 +01:00
Matthias Springer	73ce971c63	[mlir][vector] Distribute vector.insertelement op In case of a distribution, only one lane inserts the scalar value. In case of a broadcast, every lane inserts the scalar. Differential Revision: https://reviews.llvm.org/D137929	2023-01-09 16:41:08 +01:00
Matthias Springer	9085f00b4d	[mlir][vector] Support vector.extract distribution of >1D vectors Ops such as `%1 = vector.extract %0[2] : vector<5x96xf32>`. Distribute the source vector, then extract. In case of a 1d extract, rewrite to vector.extractelement. Differential Revision: https://reviews.llvm.org/D137646	2023-01-09 16:39:50 +01:00
Alex Zinenko	984c2c8cb3	[mlir] verify against nullptr payload in transform dialect When establishing the correspondence between transform values and payload operations or parameters, check that the latter are non-null and report errors. This was previously allowed for exotic cases of partially successful transformations with "apply each" trait, but was dangerous. The "apply each" implementation was reworked to remove the need for this functionality, so this can now be hardened to avoid null pointer dereferences. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D141142	2023-01-09 14:03:35 +01:00
Aliia Khasanova	ef545ef62a	[mlir][linalg] Reuploading: Apply shortened printing/parsing form to linalg.reduce. Differential Revision: https://reviews.llvm.org/D141259	2023-01-09 13:32:29 +01:00
Johannes de Fine Licht	758be971dc	[mlir:LLVM] Rudimentary inlining support for LLVM load store. Conservatively only allow inlining for loads and stores that don't carry any attributes that require handling while inlining. This can later be relaxed when proper handling is introduced. Reviewed By: Dinistro, gysit Differential Revision: https://reviews.llvm.org/D141115	2023-01-09 10:28:21 +01:00
Thomas Raoux	493459b6dd	[mlir][spirv] Add folder for LogicalNotEqual Add a folder for LogicalNotEqual when rhs is false. This pattern shows up after lowering to SPIRV. Differential Revision: https://reviews.llvm.org/D141163	2023-01-06 23:13:57 +00:00
Alexander Shaposhnikov	9e1a344155	[MLIR][TOSA] Switch Tosa to DenseArrayAttr This diff completes switching Tosa to DenseArrayAttr. Test plan: ninja check-mlir check-all Differential revision: https://reviews.llvm.org/D141111	2023-01-06 22:57:14 +00:00
Hanhan Wang	ead535b2f9	[mlir][tensor] Add producer fusion for tensor.unpack op. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D141151	2023-01-06 14:13:11 -08:00
Krzysztof Drewniak	e502f4fc2e	[mlir][Arith] Remove expansions of integer min and max ops As of several months ago, both ArithToLLVM and ArithToSPIRV have native support for integer min and max operations. Since these are all the targets available in MLIR core, the need to "expand" arith.minui, arith.minsi, arith.maxsi, and arith.maxui to more primitive operations is no longer present. Therefore, the expanding of integer min and max operations in Arith, while correct, is likely to lead to performance loss by way of misoptimization further down the line, and is no longer needed for anyone's correctness. This change may break downstream tests, but will not affect the semantics of MLIR programs. arith.minf and arith.maxf have a lot of underlying complexity due to the many different possible NaN and signed zero semantics available on various platforms, and so removing their expansion is left to a future commit. Reviewed By: ThomasRaoux, Mogball Differential Revision: https://reviews.llvm.org/D140856	2023-01-06 20:32:29 +00:00
Slava Zakharin	2f66c89130	[mlir] Support TBAA metadata in LLVMIR dialect. This change introduces new LLVMIR dialect operations to represent TBAA root, type descriptor and access tag metadata nodes. For the purpose of importing TBAA metadata from LLVM IR it only supports the current version of TBAA format described in https://llvm.org/docs/LangRef.html#tbaa-metadata (i.e. size-aware representation introduced in D41501 is not supported). TBAA attribute support is only added for LLVM::LoadOp and LLVM::StoreOp. Support for intrinsic operations (e.g. LLVM::MemcpyOp) may be added later. The TBAA attribute is represented as an array of access tags, though LLVM IR supports only a single access tag per memory-accessing instruction. I implemented it as an array anticipating similar support in LLVM IR to combine TBAA graphs with different roots for Flang - one of the options described in https://docs.google.com/document/d/16kKZVmI585wth01VSaJAqZMZpoX68rcdBmgfj0kNAt0/edit#heading=h.jzzheaz9vqac It should be easy to restrict the MLIR operations to a single access tag, if we end up using a different approach for Flang. Differential Revision: https://reviews.llvm.org/D140768	2023-01-06 11:16:31 -08:00
Alex Zinenko	c214cee772	[mlir] improve error handling in Linalg op splitting In several cases, the splitting may be known to be a noop, i.e., produce no second part. Thread this information through the transform utilities to the transform dialect, and differentiate it from the error state. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D141138	2023-01-06 18:35:08 +01:00
Matthias Springer	5eee80ce5e	[mlir][memref] Add runtime verification for memref::CastOp Verify unranked -> ranked casts and casts of dynamic sizes/offset/strides to static ones. Differential Revision: https://reviews.llvm.org/D138671	2023-01-06 14:38:56 +01:00
Alex Zinenko	4b455a71b7	[mlir] adapt TransformEachOpTrait to parameter values Adapt the implementation of TransformEachOpTrait to the existence of parameter values recently introduced into the transform dialect. In particular, allow `applyToOne` hooks to return a list containing a mix of `Operation *` that will be associated with handles and `Attribute` that will be associated with parameter values by the trait implementation of the transform interface's `apply` method. Disentangle the "transposition" of the list of per-payload op partial results to decrease its overall complexity and detemplatize the code that doesn't really need templates. This removes the poorly documented special handling for single-result ops with TransformEachOpTrait that could have assigned null pointer values to handles. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140979	2023-01-06 12:23:41 +00:00
Alex Zinenko	97c05062af	[mlir] NFC: rename TransformTypeInterface to TransformHandleTypeInterface This makes it more consistent with the recently added TransformParamTypeInterface. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140977	2023-01-06 12:23:33 +00:00
Alex Zinenko	ed02fa81fd	[mlir] introduce parameters into the transform dialect Introduce a new kind of values into the transform dialect -- parameter values. These values have a type implementing the new `TransformParamTypeInterface` and are associated with lists of attributes rather than lists of payload operations. This mechanism allows one to wrap numeric calculations, typically heuristics, into transform operations separate from those actually applying the transformation. For example, tile size computation can be now separated from tiling itself, and not hardcoded in the transform dialect. This further improves the separation of concerns between transform choice and implementation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D140976	2023-01-06 12:23:29 +00:00
Matthias Springer	6176d6a93e	[mlir][tensor] Support parallel_insert_slice in MergeConsecutiveInsertExtractSlicePatterns.cpp Differential Revision: https://reviews.llvm.org/D141116	2023-01-06 12:33:45 +01:00
Matthias Springer	bcfd32adc4	[mlir][linalg] Swap extract_slice(fill(x)) ops This pattern is similar to `FoldFillWithTensorReshape`, which performs the same swapping with reshapes. Fill the smaller extracted tensor slice instead of `x`. This allows for additional simplifications in case `x` is the result of another extract_slice. Differential Revision: https://reviews.llvm.org/D141117	2023-01-06 12:28:29 +01:00
Jakub Kuderski	1b82245370	[mlir][spirv] Add smul_extended expansion for WebGPU We need this because WGSL does not support extended multiplication ops. Fixes: https://github.com/llvm/llvm-project/issues/59563 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D141096	2023-01-05 20:11:47 -05:00
Murali Vijayaraghavan	bbe2c16353	[NFC][MLIR] Adding better names to lit test for pooling vectorization Differential Revision: https://reviews.llvm.org/D141097	2023-01-05 23:55:30 +00:00
bixia1	81e3079d0f	[mlir][sparse] Replace sparse_tensor.sort with sparse_tensor.sort_coo for sorting COO tensors. Add codegen pattern for sparse_tensor.indices_buffer. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140871	2023-01-05 15:42:57 -08:00
Jakub Kuderski	47232bea9e	[mlir][spirv] Fix extended umul expansion for WebGPU Fix an off-by-one error in extended umul extension for WebGPU. Revert to the long multiplication algorithm originally added to wide integer emulation, which was deleted in D139776. It is much easier to see why it is correct. Add runtime tests based on the mlir-vulkan-runner. These run both with and without umul extension. Issue: https://github.com/llvm/llvm-project/issues/59563 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D141085	2023-01-05 18:41:26 -05:00
Murali Vijayaraghavan	755e776849	[mlir][linalg] Vectorize 1D convolution Differential Revision: https://reviews.llvm.org/D140188	2023-01-05 23:08:32 +00:00
bixia1	9bde3d0cc5	[mlir][sparse] Add operator sparse_tensor.indices_buffer. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140762	2023-01-05 09:35:55 -08:00
Alex Zinenko	faac898987	[mlir] fix out-of-bounds in reduction tiling A transformation tiling a reduction dimension of a Linalg op needs a tile size for said dimension. When an insufficient number of dimensions was provided, it would segfault due to out-of-bounds access to a vector. Also fix incorrect error reporting in the structured transform op exercising this functionality. Reviewed By: springerm, ThomasRaoux Differential Revision: https://reviews.llvm.org/D141046	2023-01-05 15:20:26 +00:00
Mehdi Amini	551ec87883	Use --pass-pipeline syntax for mlir/test/Dialect/LLVMIR/canonicalize.mlir (NFC) This is just a cleanup to make the scheduling of the pass pipeline explicit.	2023-01-05 10:22:49 +00:00
bixia1	3fdd85da06	[mlir][sparse] Add AOS optimization. Use an array of structures to represent the indices for the trailing COO region of a sparse tensor. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140870	2023-01-04 18:16:04 -08:00
Alexander Shaposhnikov	11030c7d67	[MLIR][TOSA] Switch Tosa_IntArrayAttr[N], Tosa_IntArrayAttrUpto[N] to DenseI64ArrayAttr Switch Tosa_IntArrayAttr[N], Tosa_IntArrayAttrUpto[N] to DenseI64ArrayAttr. Test plan: ninja check-mlir check-all Differential revision: https://reviews.llvm.org/D140748 https://reviews.llvm.org/D140829, https://reviews.llvm.org/D140832, https://reviews.llvm.org/D140833, https://reviews.llvm.org/D140834	2023-01-04 21:58:20 +00:00
liqinweng	5c18ae3135	[MLIR][Tensor] Canonicalize expand/collapse_shape of splat to splat Collapsing / expanding a splatted value can be replaced with a single `tensor.splat` operation. Replace these cases with a simple `tensor.splat` operation. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D140552	2023-01-04 13:07:55 -08:00
Aart Bik	1c7ffe0c38	[mlir][sparse] add test that combines sparse codegen and lowering to llvm struct Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D141006	2023-01-04 12:12:31 -08:00
Jakub Kuderski	624ed0ddaf	[mlir][spirv] Relax instruction order checks in test Fix a windows buildbot failure: https://lab.llvm.org/buildbot#builders/13/builds/30439.	2023-01-04 14:08:03 -05:00
Jakub Kuderski	c957fe0f60	[mlir][spirv] Add pattern to expand UMulExtended for WebGPU This is needed because WGSL does not yet support extended multiplication ops. Set up pattern/pass stuff and handle the first op: `UMulExtended`. `SMulExtended` handling will go to a separate patch. Issue: https://github.com/llvm/llvm-project/issues/59563 Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D140995	2023-01-04 13:29:47 -05:00
bixia1	90aa436291	[mlir][sparse] Add layout to the memref for the indices buffers to prepare for the AOS storage optimization for COO regions. Fix relevant FileCheck tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140742	2023-01-04 07:36:11 -08:00
Matthias Springer	e7790fbed3	[mlir] Add `test-convergence` option to Canonicalizer tests This new option is set to `false` by default. It should be set only in Canonicalizer tests to detect faulty canonicalization patterns, i.e., patterns that prevent the canonicalizer from converging. The canonicalizer should always converge on the small unit tests that we have in `canonicalize.mlir`. Two faulty canonicalization patterns were detected and fixed with this change. Differential Revision: https://reviews.llvm.org/D140873	2023-01-04 12:02:21 +01:00
Rob Suderman	fcbf3fafdb	[mlir][tosa] Fix tosa.transpose_conv2d decompositions for new version The decomposition was no longer correct for transpose_conv2d to conv2d after the updated TOSA specification. Specifically the behavior for padding was changed to refer to padding the transpose_conv2d instead of referencing the conv applied to the inverse transform. Test was validated using the TOSA conformance tests. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D140503	2023-01-03 11:36:13 -08:00
Rob Suderman	06c440f2da	[mlir][tosa] Canonicalize tosa.transpose to tosa.reshape Added tosa.transpose canonicalization for case where a tosa.transpose is equivalent to a tosa.reshape. This occurs when the permutation does not permute non-unary dimensions. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D140356	2023-01-03 11:19:55 -08:00
Johannes Reifferscheid	998a3a3894	Add a math.cbrt instruction and lowering to libm. There's currently no way to get accurate cube roots in the math dialect. powf(x, 1/3.0) is too inaccurate in some cases. Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D140842	2023-01-03 08:44:12 +01:00
Krzysztof Drewniak	be575c5dfc	Re-land D139865 "Add known_block_size and known_grid_size to gpu.func" This should fix the MSVC warning that caused the previous revert. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D140766	2023-01-02 16:39:00 +00:00
Uday Bondhugula	5f9cd099d6	[MLIR] Fix affine LICM pass for unknown region holding ops Fix affine LICM pass for unknown region-holding ops. The logic was completely ignoring regions of unknown ops leading to generation of invalid IR on hoisting. Handle affine.parallel op among those with regions that are supported. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D140738	2022-12-31 20:26:50 +05:30
jacquesguan	6c295a932d	[mlir][Arith] Fold integer shift op with zero. This revision folds arith.shrui, arith.shrsi and arith.shli with zero rhs to lhs. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D140749	2022-12-30 17:19:23 +08:00
Andrzej Warzynski	fff0d1b836	[mlir] Simplify a test for vectorizing tensor.extract Remove unused arguments and the corresponding logic (e.g. affine maps). Differential Revision: https://reviews.llvm.org/D140755	2022-12-30 08:16:35 +00:00
bixia1	840e2ba336	[mlir][sparse] Use DLT in the mangled function names for insertion. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140484	2022-12-28 08:21:22 -08:00
Aart Bik	431f6a543e	[sparse][mlir][vectorization] add support for shift-by-invariant Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D140596	2022-12-27 11:07:13 -08:00
Nicolas Vasilache	83b582d51b	[mlir][Linalg] Properly propagate transform result in ScalarizeOp	2022-12-27 06:16:55 -08:00
Stella Stamenova	5759d9467c	Revert "Apply shortened printing/parsing form to linalg.reduce." This reverts commit `281c2d49c9`. This broke the windows mlir buildbot: https://lab.llvm.org/buildbot/#/builders/13/builds/30167	2022-12-23 17:31:08 -08:00
Stella Stamenova	828b4762ca	Revert "[mlir][GPU] Add known_block_size and known_grid_size to gpu.func" This reverts commit `85e38d7cd6`. This broke the windows mlir buildbot: https://lab.llvm.org/buildbot/#/builders/13/builds/30180/steps/6/logs/stdio	2022-12-23 17:29:42 -08:00
Aliia Khasanova	281c2d49c9	Apply shortened printing/parsing form to linalg.reduce. Differential Revision: https://reviews.llvm.org/D140622	2022-12-23 14:40:19 +01:00
liqinweng	00fd6958fb	[MLIR][Arith] Canonicalize xor with ext Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D139307	2022-12-23 12:40:30 +08:00
Peiming Liu	988733c600	[mlir][sparse] use sparse_tensor::StorageSpecifier to store dim/memSizes Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140130	2022-12-23 00:47:36 +00:00
Thomas Raoux	1a0453eb44	[mlir][vector] Fix bug in extractOp folding We were not checking for transpose when folding. Also add a new file to test folding independently of canonicalization as canonicalization was hiding the bug. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D140533	2022-12-22 14:51:02 -08:00

3369 Commits