Commit Graph

694 Commits

Author SHA1 Message Date
Peiming Liu
269c82d389 [mlir][sparse] introduce new 2:4 block sparsity level type.
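For context, 2:4 ("two out of four") structured sparsity requires that every contiguous group of four elements holds at most two nonzeros. A minimal standalone checker for that invariant (illustrative only, not part of this patch):

```cpp
#include <cstddef>
#include <vector>

// Returns true iff every contiguous group of four elements contains at
// most two nonzeros, i.e. the row satisfies the 2:4 sparsity constraint.
bool isTwoOutOfFour(const std::vector<double> &row) {
  for (std::size_t i = 0; i + 4 <= row.size(); i += 4) {
    int nonzeros = 0;
    for (std::size_t j = i; j < i + 4; ++j)
      if (row[j] != 0.0)
        ++nonzeros;
    if (nonzeros > 2)
      return false; // this group violates the 2:4 constraint
  }
  return true;
}
```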
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D155128
2023-07-12 23:33:53 +00:00
K-Wu
e37fc3cc39 [mlir][sparse][gpu] Impl 2:4 SpMM rewrite for linalg op w/ DENSE24 attr
Differential Revision: https://reviews.llvm.org/D154772
2023-07-10 22:36:57 +00:00
Peiming Liu
fc5d8fce7d [mlir][sparse] support dual sparse convolution.
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152601
2023-07-10 16:49:32 +00:00
wren romano
dcadb68a5c [mlir][sparse] Cleaning up OOB implementation details for VarSet
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D154674
2023-07-07 14:35:56 -07:00
wren romano
68785c1c44 [mlir][sparse] Correcting RTTI implementation for the Var class
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D154662
2023-07-06 18:57:02 -07:00
Aart Bik
03125e6894 [mlir][sparse][gpu] fix missing dealloc
This dealloc was incorrectly removed in
https://reviews.llvm.org/D153173

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D154564
2023-07-06 09:48:19 -07:00
Matthias Springer
cb7bda2ace [mlir][NFC] Use getConstantIntValue instead of casting to ConstantIndexOp
`getConstantIntValue` extracts constant values from all constant-like ops, not just `arith::ConstantIndexOp`.
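A minimal before/after sketch of this cleanup (hypothetical call site; assuming `getConstantIntValue` is the helper declared in `mlir/Dialect/Utils/StaticValueUtils.h`):

```cpp
#include "mlir/Dialect/Arith/IR/Arith.h"
#include "mlir/Dialect/Utils/StaticValueUtils.h"
#include <optional>

void useConstant(mlir::Value v) {
  // Before: only recognizes arith::ConstantIndexOp.
  if (auto constOp = v.getDefiningOp<mlir::arith::ConstantIndexOp>()) {
    int64_t c = constOp.value();
    (void)c; // ... use c ...
  }
  // After: getConstantIntValue extracts the value from any
  // constant-like op, not just arith::ConstantIndexOp.
  if (std::optional<int64_t> c = mlir::getConstantIntValue(v)) {
    (void)*c; // ... use *c ...
  }
}
```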

Differential Revision: https://reviews.llvm.org/D154356
2023-07-04 14:08:37 +02:00
Kun Wu
be2dd22b8f [mlir][sparse][gpu] reuse CUDA environment handle throughout instance lifetime
Differential Revision: https://reviews.llvm.org/D153173
2023-06-30 21:52:34 +00:00
Aart Bik
b939c015a4 [mlir][sparse] add affine parsing to new surface syntax for STEA
(1) uses the previously introduced API to reuse the AffineExpr parser without code duplication
(2) solves the look-ahead problem when parsing the level spec

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D154254
2023-06-30 14:48:23 -07:00
Peiming Liu
a63d6a0014 [mlir][sparse] make UnpackOp return the actual filled length of unpacked memory
This might simplify the frontend implementation by avoiding recomputation of the same value.

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D154244
2023-06-30 21:35:15 +00:00
Aart Bik
6b88c852b6 [mlir][sparse] Start migration to new surface syntax for STEA
We are in the process of migrating to a much improved surface syntax for the Sparse Tensor Encoding Attribute (STEA).

You can see a preview of this in the StableHLO RFC at

 https://github.com/openxla/stablehlo/blob/main/rfcs/20230210-sparsity.md

This design is courtesy of Wren Romano.

This initial revision
(1) Introduces the first version of a new parser written by Wren Romano
(2) Introduces a simple "migration plan" using NEW_SYNTAX on the STEA, which will allow us to test the new parser with new examples, as well as migrate existing examples over without the need to rewrite them all

This first "drop" merely provides the entry points to parse the new syntax. The parser is still under active development. For example, we need to address the "lookahead" issue when parsing the lvl spec (viz. do we see l0 = d0 or a direct d0). Another larger task is to actually implement "affine" parsing (since the MLIR affine parser is not accessible in other parts of the tree).

EXAMPLE:

Currently, CSR looks like

  #CSR = #sparse_tensor.encoding<{
    lvlTypes = ["dense","compressed"],
    dimToLvl = affine_map<(i,j) -> (i,j)>
  }>

but you can "force" the new parser with

  #CSR = #sparse_tensor.encoding<{
    NEW_SYNTAX =
    (d0, d1) -> (l0 = d0 : dense, l1 = d1 : compressed)
  }>

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D153997
2023-06-29 11:32:07 -07:00
Peiming Liu
df11a2b41a [mlir][sparse] admit un-sparsifiable operations if all their operands are loaded from dense inputs
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D153998
2023-06-28 21:27:50 +00:00
Andrzej Warzynski
f22af204ed [mlir][VectorType] Remove numScalableDims from the vector type
This is a follow-up of https://reviews.llvm.org/D153372 in which
`numScalableDims` (single integer) was effectively replaced with
`isScalableDim` bitmask.

This change is a part of a larger effort to enable scalable
vectorisation in Linalg. See this RFC for more context:
  * https://discourse.llvm.org/t/rfc-scalable-vectorisation-in-linalg/

Differential Revision: https://reviews.llvm.org/D153412
2023-06-28 13:53:45 +01:00
Andrzej Warzynski
79c83e12c8 [mlir][VectorType] Allow arbitrary dimensions to be scalable
At the moment, only the trailing dimensions in the vector type can be
scalable, i.e. this is supported:

    vector<2x[4]xf32>

and this is not allowed:

    vector<[2]x4xf32>

This patch extends the vector type so that arbitrary dimensions can be
scalable. To this end, an array of bool values is added to every vector
type to denote whether the corresponding dimensions are scalable or not.
For example, for this vector:

  vector<[2]x[3]x4xf32>

the following array would be created:

  {true, true, false}.

Additionally, the current syntax:

  vector<[2x3]x4xf32>

is replaced with:

  vector<[2]x[3]x4xf32>

This is primarily to simplify parsing (this way, the parser can easily
process one dimension at a time rather than e.g. tracking whether
"scalable block" has been entered/left).

NOTE: The `isScalableDim` parameter of `VectorType` (introduced in this
patch) makes `numScalableDims` redundant. For the time being,
`numScalableDims` is preserved to facilitate the transition between the
two parameters. `numScalableDims` will be removed in one of the
subsequent patches.

This change is a part of a larger effort to enable scalable
vectorisation in Linalg. See this RFC for more context:
  * https://discourse.llvm.org/t/rfc-scalable-vectorisation-in-linalg/

Differential Revision: https://reviews.llvm.org/D153372
2023-06-27 19:21:59 +01:00
Aart Bik
11a4f5bdfb [mlir][sparse] minor code changes
Submitting for Wren

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D153804
2023-06-26 12:57:52 -07:00
Aart Bik
f14c8eb595 [mlir][sparse][gpu] refine SDDMM pattern for cuSPARSE
The old pattern was missing some cases (e.g. swapped arguments)
but also allowed too many cases (e.g. a non-empty "absent" branch or
different arguments for add/mul). This fixes both issues.

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D153487
2023-06-21 18:31:55 -07:00
Peiming Liu
e7df82816b [mlir][sparse] rewrite arith::SelectOp into semiring operations to sparsify it.
Reviewed By: aartbik, K-Wu

Differential Revision: https://reviews.llvm.org/D153397
2023-06-21 21:22:18 +00:00
Kun Wu
9167dd46ba [mlir][sparse][gpu] recognizing the SDDMM pattern in the GPU libgen path
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151582
2023-06-15 23:48:11 +00:00
Peiming Liu
4e9526b9ea [mlir][sparse] using stable_sort to make sure the compiled code is consistent between different build configurations
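The motivation, sketched on plain std::sort vs std::stable_sort (illustrative of the general issue, not the compiler's actual sorting site): std::sort leaves the relative order of equal elements unspecified, so it can differ between standard-library implementations and build configurations, whereas std::stable_sort preserves the input order of ties.

```cpp
#include <algorithm>
#include <iostream>
#include <utility>
#include <vector>

int main() {
  // Entries that compare equal on .first but differ in .second.
  std::vector<std::pair<int, char>> v = {{1, 'a'}, {0, 'b'}, {1, 'c'}, {0, 'd'}};
  auto byKey = [](const auto &l, const auto &r) { return l.first < r.first; };

  // std::stable_sort always yields (0,b) (0,d) (1,a) (1,c); with
  // std::sort, the order within each key is unspecified.
  std::stable_sort(v.begin(), v.end(), byKey);
  for (const auto &[k, tag] : v)
    std::cout << '(' << k << ',' << tag << ") ";
  std::cout << '\n';
}
```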
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D153074
2023-06-15 21:09:35 +00:00
Aart Bik
65bfd5cb25 [mlir][sparse] proper in-place SDDMM with spy function
This specific operation matches the cuSPARSE SDDMM semantics exactly.

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D152969
2023-06-15 13:59:38 -07:00
Peiming Liu
faf7cd97d0 [mlir][sparse] merger extension to support sparsifying arith::CmpI/CmpF operations
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152761
2023-06-15 17:26:50 +00:00
Kazu Hirata
a39adc00db [mlir] Fix warnings in release builds
This patch fixes:

  mlir/lib/Dialect/SparseTensor/Transforms/LoopEmitter.cpp:846:16:
  error: unused variable 'lvlTp' [-Werror,-Wunused-variable]

  mlir/lib/Dialect/SparseTensor/Transforms/LoopEmitter.cpp:1059:13:
  error: unused variable '[t, l]' [-Werror,-Wunused-variable]
2023-06-14 14:22:17 -07:00
Peiming Liu
83b7f018fd [mlir][sparse] fix crashes when the tensor that defines the loop bound cannot be found
Reviewed By: aartbik, K-Wu

Differential Revision: https://reviews.llvm.org/D152877
2023-06-14 20:27:50 +00:00
Peiming Liu
fd68d36109 [mlir][sparse] unifying enterLoopOverTensorAtLvl and enterCoIterationOverTensorsAtLvls
The tensor levels are now explicitly categorized into different `LoopCondKind`s to instruct the LoopEmitter to generate different code for each kind of condition (e.g., `SparseCond`, `SparseSliceCond`, `SparseAffineIdxCond`, etc.).

The process of generating a while loop is now decomposed into three steps, each dispatched to the corresponding LoopCondKind handler (see the sketch after this list).
1. Generate LoopCondition (e.g., `pos <= posHi` for `SparseCond`, `slice.isNonEmpty` for `SparseAffineIdxCond`)
2. Generate LoopBody (e.g., compute the coordinates)
3. Generate ExtraChecks (e.g., `if (onSlice(crd))` for `SparseSliceCond`)
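A schematic of that three-step dispatch (toy code, not the actual LoopEmitter implementation; the emitted strings stand in for generated IR):

```cpp
#include <iostream>

enum class LoopCondKind { SparseCond, SparseSliceCond, SparseAffineIdxCond };

void genWhileLoop(LoopCondKind kind) {
  // Step 1: generate the loop condition.
  if (kind == LoopCondKind::SparseAffineIdxCond)
    std::cout << "while (slice.isNonEmpty()) {\n";
  else
    std::cout << "while (pos <= posHi) {\n";
  // Step 2: generate the loop body (e.g., compute the coordinates).
  std::cout << "  crd = coordinates[pos];\n";
  // Step 3: generate extra checks required by the condition kind.
  if (kind == LoopCondKind::SparseSliceCond)
    std::cout << "  if (onSlice(crd)) { ... }\n";
  std::cout << "}\n";
}
```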

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152464
2023-06-14 20:03:10 +00:00
Aart Bik
debdf7e0ff [mlir][sparse] refine single condition set up for semi-ring ops
Reviewed By: Peiming, K-Wu

Differential Revision: https://reviews.llvm.org/D152874
2023-06-14 09:23:09 -07:00
Aart Bik
1ea903e164 [mlir][sparse][gpu] guard matvec COO AoS
Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D152738
2023-06-12 16:49:58 -07:00
Aart Bik
80fe3168b5 [mlir][sparse] add support for direct prod/and/min/max reductions
We recently fixed a bug in "sparsifying" such reductions, since
it incorrectly changed this into reductions over stored elements
only, which only works for add/sub/or/xor. However, we still want
to be able to "sparsify" these reductions even in the general case,
and this is a first step: rewriting them into a custom reduction
that feeds in the implicit zeros. Note, however, that in the long
run we want to do this better and feed in any implicit zero only
once, for efficiency.

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D152580
2023-06-12 09:27:47 -07:00
Peiming Liu
0258a53521 Brings back "[mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp"
This reverts commit 07b927902d.

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D152566
2023-06-09 17:45:46 +00:00
Peiming Liu
07b927902d Revert "[mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp"
This reverts commit 853d704fd0.

Differential Revision: https://reviews.llvm.org/D152562
2023-06-09 17:21:40 +00:00
Kun Wu
97f4c22b3a [mlir][sparse][gpu] unify dnmat and dnvec handles and ops
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152465
2023-06-09 17:16:48 +00:00
Peiming Liu
853d704fd0 [mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp
This patch changes the while loop generated for iterating over a fully reduced sparse level with an affine index expression.
Before:
```
cont = true
while (cont) {
  if (inBound()) {
    ....
    cont = true;
  } else {
    cont = false;
  }
}
```
After:
```
while(inBound()) {
  ....
}
```

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D152463
2023-06-09 17:03:15 +00:00
Kun Wu
8ed59c53de [mlir][sparse][gpu] add sm8.0+ tensor core 2:4 sparsity support
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151775
2023-06-06 23:13:21 +00:00
Aart Bik
9fc02a7a08 [mlir][sparse][gpu] add AoS COO support to cuSPARSE
Even though this feature was deprecated in cuSPARSE release 11.2,
any library version before that still supports it, which is why we
are making it available under a macro.
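A sketch of such a version gate (the macro name and call site here are illustrative, not the patch's actual code; `cusparseCreateCooAoS` is, to the best of our knowledge, the AoS COO constructor that was deprecated in 11.2 and dropped in the 12.x releases):

```cpp
#include <cstdint>
#include <cusparse.h>

// Create an AoS COO matrix descriptor only when the library still ships
// the deprecated entry point.
cusparseSpMatDescr_t createCooAoS(int64_t rows, int64_t cols, int64_t nnz,
                                  void *ind, void *vals) {
  cusparseSpMatDescr_t mat = nullptr;
#if CUSPARSE_VER_MAJOR < 12
  // Deprecated since cuSPARSE 11.2, but still functional before 12.x.
  cusparseCreateCooAoS(&mat, rows, cols, nnz, ind, vals,
                       CUSPARSE_INDEX_64I, CUSPARSE_INDEX_BASE_ZERO,
                       CUDA_R_32F);
#endif
  return mat;
}
```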

Reviewed By: K-Wu

Differential Revision: https://reviews.llvm.org/D152290
2023-06-06 12:32:46 -07:00
Aart Bik
e2167d89db [mlir][sparse] refine absent branch feeding into custom op
Document better that unary/binary may only feed into the output
or the input of a custom reduction (not even a regular reduction,
since it may have "no value"!). Also fixes a bug where the present
branch is empty and feeds into a custom reduction.

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D152224
2023-06-06 09:57:15 -07:00
Peiming Liu
23dc96bbe4 [mlir][sparse] fix crashes when using custom reduce with a unary operation.
The test case is directly copied from https://reviews.llvm.org/D152179 authored by @aartbik

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152204
2023-06-05 23:41:26 +00:00
Peiming Liu
7d9677a9bd [mlir][sparse] Make getNumTensors() consistent between LoopEmitter and Merger.
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152178
2023-06-05 17:56:08 +00:00
Peiming Liu
e7b4c93f5e [mlir][sparse] fix crash when using sparse_tensor::UnaryOp and ReduceOp.
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D152048
2023-06-03 01:19:05 +00:00
Aart Bik
6a38c772d4 [mlir][sparse] fixed bug with unary op, dense output
Note that, by sparse compiler convention, dense output
is zeroed out when not set, so complement results in
zeros where elements were present.

Reviewed By: wrengr

Differential Revision: https://reviews.llvm.org/D152046
2023-06-02 18:15:33 -07:00
Kun Wu
fa98bdbd95 [mlir][sparse][gpu] make computeType mandatory
Differential Revision: https://reviews.llvm.org/D152018
2023-06-02 21:47:44 +00:00
Peiming Liu
ce6f8c5afe [mlir][sparse] fix various bugs to support sparse pooling
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151776
2023-06-02 17:34:47 +00:00
Aart Bik
378f1885e3 [mlir][sparse] enhance sparse reduction support
Formerly, we accepted and/prod reductions as standard
reductions, but these change the semantics after sparsification
by not looking at implicit zeros. Therefore, we now only accept
standard reductions that are insensitive to implicit vs.
explicit zeros, and leave the more complex reductions to
the sparse_tensor.reduce custom reduction implementation.
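A tiny worked example of why the distinction matters (pure arithmetic, independent of the patch):

```cpp
#include <iostream>
#include <vector>

int main() {
  // Dense view {2, 0, 3}; a sparse tensor stores only {2, 3}.
  std::vector<int> stored = {2, 3};

  int sum = 0, prod = 1;
  for (int v : stored) { sum += v; prod *= v; }

  // add is insensitive to implicit zeros: 2 + 0 + 3 == 2 + 3 == 5.
  std::cout << "sum  over stored = " << sum << "\n";  // 5 (correct)
  // prod is not: the true dense product is 2 * 0 * 3 == 0.
  std::cout << "prod over stored = " << prod << "\n"; // 6 (wrong)
}
```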

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D151929
2023-06-01 16:30:21 -07:00
Peiming Liu
54ac02dd16 [mlir][sparse] fix crashes when generating conv_2d_nchw_fchw with a Compressed-Dense-Compressed-Dense sparse encoding.
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151773
2023-05-31 18:06:01 +00:00
wren romano
7a1077baa0 [mlir][sparse] Improving SparseTensorDimSliceAttr methods
This patch makes the following changes to `SparseTensorDimSliceAttr` methods:
* Mark `isDynamic` constexpr.
* Add new helpers `getStatic` and `getStaticString` to avoid repetition.
* Move the definitions of `getStatic{Offset,Stride,Size}` and `isCompletelyDynamic` out of the class declaration, since there is no benefit to inlining them.
* Change `parse` to use `kDynamic` rather than literals.
* Change `verify` to use the `isDynamic` helper.

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D150919
2023-05-30 17:30:55 -07:00
wren romano
f58e67dee9 [mlir][sparse] Removing unused helper function
Depends On D151505

Reviewed By: aartbik, Peiming

Differential Revision: https://reviews.llvm.org/D151522
2023-05-30 15:59:26 -07:00
wren romano
af2bec7c4a [mlir][sparse] Adding new STEA::{with,without}DimSlices factories
(These factories are used in downstream code, despite not being used within the MLIR codebase.)

Depends On D151513

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D151518
2023-05-30 15:53:30 -07:00
wren romano
540d5e0ce6 [mlir][sparse] Updating STEA parser/printer to use the name "dimSlices"
Depends On D151505

Reviewed By: Peiming

Differential Revision: https://reviews.llvm.org/D151513
2023-05-30 15:50:07 -07:00
wren romano
76647fce13 [mlir][sparse] Combining dimOrdering+higherOrdering fields into dimToLvl
This is a major step along the way towards the new STEA design.  While a great deal of this patch is simple renaming, there are several significant changes as well.  I've done my best to ensure that this patch retains the previous behavior and error-conditions, even though those are at odds with the eventual intended semantics of the `dimToLvl` mapping.  Since the majority of the compiler does not yet support non-permutations, I've also added explicit assertions in places that previously had implicitly assumed it was dealing with permutations.

Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151505
2023-05-30 15:19:50 -07:00
Peiming Liu
db7f639b90 [mlir][sparse] fix a crash when generating sparse convolution with nchw input
Reviewed By: aartbik

Differential Revision: https://reviews.llvm.org/D151744
2023-05-30 20:16:54 +00:00
Kun Wu
235fbe792b [mlir][sparse][gpu] adding transpose support to SpMM and SpMV
Reviewed By: aartbik, wrengr

Differential Revision: https://reviews.llvm.org/D151259
2023-05-26 17:07:09 +00:00
Tres Popp
68f58812e3 [mlir] Move casting calls from methods to function calls
The MLIR classes Type/Attribute/Operation/Op/Value support
cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's
doCast mechanism, in addition to defining methods with the same name.
This change begins the migration from the method form to the
corresponding free-function call, which has been decided on as the
more consistent style.
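For example (a minimal sketch of the style change on `mlir::Type`):

```cpp
#include "mlir/IR/BuiltinTypes.h"
#include "llvm/Support/Casting.h"

void example(mlir::Type type) {
  // Before (deprecated member-function style):
  //   if (auto intTy = type.dyn_cast<mlir::IntegerType>()) ...
  // After (preferred free-function style):
  if (auto intTy = llvm::dyn_cast<mlir::IntegerType>(type)) {
    (void)intTy; // ... use intTy ...
  }
}
```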

Note that there still exist classes that only define the methods
directly, such as AffineExpr, and this patch does not currently
include work to support the functional cast/isa calls for them.

Context:
- https://mlir.llvm.org/deprecation/ at "Use the free function variants
  for dyn_cast/cast/isa/…"
- Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443

Implementation:
This patch updates all remaining uses of the deprecated functionality in
mlir/. This was done with clang-tidy as described below and further
modifications to GPUBase.td and OpenMPOpsInterfaces.td.

Steps are described per line, as comments are removed by git:
0. Retrieve the change from the following to build clang-tidy with an
   additional check:
   main...tpopp:llvm-project:tidy-cast-check
1. Build clang-tidy
2. Run clang-tidy over your entire codebase while disabling all checks
   and enabling the one relevant one. Run on all header files also.
3. Delete .inc files that were also modified, so the next build rebuilds
   them to a pure state.

```
ninja -C $BUILD_DIR clang-tidy

run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-*,misc-cast-functions'\
               -header-filter=mlir/ mlir/* -fix

rm -rf $BUILD_DIR/tools/mlir/**/*.inc
```

Differential Revision: https://reviews.llvm.org/D151542
2023-05-26 10:29:55 +02:00