clang-p2996

Author	SHA1	Message	Date
Aart Bik	6a45339bac	[mlir][sparse] refine sparse fusion with empty tensors materialization (#66563 ) This is a minor step towards deprecating bufferization.alloc_tensor(). It replaces the examples with tensor.empty() and adjusts the underlying rewriting logic to prepare for this upcoming change.	2023-09-18 09:01:11 -07:00
Martin Erhart	6bf043e743	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute (#66619 ) This commit removes the deallocation capabilities of one-shot-bufferization. One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the ownership-based-buffer-deallocation pass going forward. This means the `allow-return-allocs` pass option will default to true now, `create-deallocs` defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. A new `allow-return-allocs-from-loops` option is added as a temporary workaround for some bufferization limitations.	2023-09-18 16:44:48 +02:00
Yinying Li	2a07f0fd40	[mlir][sparse] Migrate more tests to use new syntax (#66443 ) Dense `lvlTypes = [ "dense", "dense" ]` to `map = (d0, d1) -> (d0 : dense, d1 : dense)` `lvlTypes = [ "dense", "dense" ], dimToLvl = affine_map<(i,j) -> (j,i)>` to `map = (d0, d1) -> (d1 : dense, d0 : dense)` DCSR `lvlTypes = [ "compressed", "compressed" ]` to `map = (d0, d1) -> (d0 : compressed, d1 : compressed)` DCSC `lvlTypes = [ "compressed", "compressed" ], dimToLvl = affine_map<(i,j) -> (j,i)>` to `map = (d0, d1) -> (d1 : compressed, d0 : compressed)` Block Row `lvlTypes = [ "compressed", "dense" ]` to `map = (d0, d1) -> (d0 : compressed, d1 : dense)` Block Column `lvlTypes = [ "compressed", "dense" ], dimToLvl = affine_map<(i,j) -> (j,i)>` to `map = (d0, d1) -> (d1 : compressed, d0 : dense)` This is an ongoing effort: #66146, #66309	2023-09-14 23:19:57 +00:00
Peiming Liu	098f46dce3	[sparse] allow unpack op to return 0-ranked tensor type. (#66269 ) Many frontends canonicalize scalar into 0-ranked tensor, it change will hopefully make the operation easier to use for those cases.	2023-09-13 11:33:01 -07:00
Martin Erhart	c199f7dc62	Revert "[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute" This reverts commit `6a91dfedeb`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:48 +00:00
Martin Erhart	6a91dfedeb	[mlir][bufferization] Remove allow-return-allocs and create-deallocs pass options, remove bufferization.escape attribute This is the first commit in a series with the goal to rework the BufferDeallocation pass. Currently, this pass heavily relies on copies to perform correct deallocations, which leads to very slow code and potentially high memory usage. Additionally, there are unsupported cases such as returning memrefs which this series of commits aims to add support for as well. This first commit removes the deallocation capabilities of one-shot-bufferization.One-shot-bufferization should never deallocate any memrefs as this should be entirely handled by the buffer-deallocation pass going forward. This means the allow-return-allocs pass option will default to true now, create-deallocs defaults to false and they, as well as the escape attribute indicating whether a memref escapes the current region, will be removed. The documentation should w.r.t. these pass option changes should also be updated in this commit. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D156662	2023-09-13 09:30:22 +00:00
Peiming Liu	64df1c08d0	[sparse] allow unpack op to return any integer type. (#66161 )	2023-09-12 17:27:51 -07:00
Daniil Dudkin	8a6e54c9b3	[mlir][arith] Rename operations: `maxf` → `maximumf`, `minf` → `minimumf` (#65800 ) This patch is part of a larger initiative aimed at fixing floating-point `max` and `min` operations in MLIR: https://discourse.llvm.org/t/rfc-fix-floating-point-max-and-min-operations-in-mlir/72671. This commit addresses Task 1.2 of the mentioned RFC. By renaming these operations, we align their names with LLVM intrinsics that have corresponding semantics.	2023-09-11 22:02:19 -07:00
Kazu Hirata	c86bb3d91a	[mlir] Use a range-based for loop (NFC)	2023-09-05 00:30:40 -07:00
Aart Bik	b86d3cbc12	[mlir][sparse] complete various FIXMEs in sparse support lib Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D159245	2023-08-30 21:30:25 -07:00
Peiming Liu	22e8d5b428	[mlir][sparse] Support strided convolution on dense level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D159020	2023-08-30 20:00:50 +00:00
Peiming Liu	07bd5f20bc	[mlir][sparse] Support strided convolution on compressed level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158912	2023-08-30 19:37:50 +00:00
Peiming Liu	e015d385c9	[mlir][sparse] Pass down constant coefficients of affine index expressions to LoopEmitter. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158914	2023-08-30 18:44:50 +00:00
Peiming Liu	96e1914aa2	[mlir][sparse] fix crash when generating convolution kernel with sparse input in DCCD format. Reviewed By: aartbik, anlunx Differential Revision: https://reviews.llvm.org/D159170	2023-08-30 17:49:36 +00:00
Matthias Springer	79ff70fda2	[mlir][sparse] Better error handling when bufferizing sparse_tensor ops sparse_tensor ops cannot be bufferized with One-Shot Bufferize. (They can only be analyzed.) The sparse compiler does the actual lowering to memref. Produce a proper error message instead of crashing. This fixes #61311. Differential Revision: https://reviews.llvm.org/D158728	2023-08-25 08:34:05 +02:00
Yinying Li	be556ee157	[mlir][sparse] Fix typos in comments Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158667	2023-08-24 18:13:36 +00:00
Peiming Liu	fa9c93e30e	[mlir][sparse] Remove view-based sparse tensor collapse_shape implementation. We will migrate to a cleaner and more complete implementation. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158658	2023-08-23 20:29:28 +00:00
Yinying Li	51ebecf309	[mlir][sparse] Changed sparsity properties to use _ instead of - Example: compressed-no -> compressed_no Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158567	2023-08-23 17:00:27 +00:00
Eymen Ünay	57035192d0	[mlir] Fix typo in comments Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157215	2023-08-22 17:51:57 -07:00
Aart Bik	d8ae3a9b25	[mlir][sparse] Fix strict weak ordering in sorting These checks have been fired when comparing same elements like comp(a, a). Let's fix it. I don't have commit rights. Danila Kutenin kutdanila@yandex.ru Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152152	2023-08-22 17:15:10 -07:00
Aart Bik	8154494e28	[mlir][sparse] refactor sparsification and bufferization pass into proper TD pass Registering the SparsificationAndBufferization into a proper TD pass has the advantage that it can be invoked and tested in isolation. This change also moves some bufferization specific set up from the pipeline file into the pass file, keeping the logic more locally. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D158219	2023-08-17 15:45:03 -07:00
Peiming Liu	fa6726e27b	[mlir][sparse] supports sparse_tensor.pack on libgen path Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158012	2023-08-15 20:20:54 +00:00
Matthias Springer	a02ad6c177	[mlir][bufferization] Generalize getAliasingOpResults to getAliasingValues This revision is needed to support bufferization of `cf.br`/`cf.cond_br`. It will also be useful for better analysis of loop ops. This revision generalizes `getAliasingOpResults` to `getAliasingValues`. An OpOperand can now not only alias with OpResults but also with BlockArguments. In the case of `cf.br` (will be added in a later revision): a `cf.br` operand will alias with the corresponding argument of the destination block. If an op does not implement the `BufferizableOpInterface`, the analysis in conservative. It previously assumed that an OpOperand may alias with each OpResult. It now assumes that an OpOperand may alias with each OpResult and each BlockArgument of the entry block. Differential Revision: https://reviews.llvm.org/D157957	2023-08-15 15:02:47 +02:00
Aart Bik	289f7231f9	[mlir][sparse][gpu] minor code cleanup for sparse gpu ops Consistent order of ops and related methods. Also, renamed SpGEMMGetSizeOp to SpMatGetSizeOp since this is a general utility for sparse matrices, not specific to GEMM ops only. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D157922	2023-08-14 15:08:57 -07:00
Aart Bik	76a80a0808	[mlir][sparse][gpu] sparsifier GPU libgen for SpGEMM in cuSparse With working integration end-to-end test Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D157652	2023-08-10 14:52:16 -07:00
Peiming Liu	372d88b051	[mlir][sparse] code cleanup. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D156941	2023-08-02 23:19:55 +00:00
K-Wu	cfa82f7783	[mlir][sparse][gpu] introduce flag that controls host to device copy strategies (regular dma default) Differential Revision: https://reviews.llvm.org/D155352	2023-08-01 22:30:40 +00:00
Kun Wu	1e491c425b	[mlir][sparse][gpu] add 2:4 spmm prune_and_check flag Differential Revision: https://reviews.llvm.org/D155909	2023-08-01 18:24:18 +00:00
Alex Zinenko	4a6b31b8d8	[mlir] NFC: untangle SCF Patterns.h and Transforms.h These two headers both contained a strange mix of definitions related to both patterns and non-pattern transforms. Put patterns and "populate" functions into Patterns.h and standalone transforms into Transforms.h. Depends On: D155223 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155454	2023-07-18 11:27:36 +00:00
K-Wu	e37fc3cc39	[mlir][sparse][gpu] Impl 2:4 SpMM rewrite for linalg op w/ DENSE24 attr Differential Revision: https://reviews.llvm.org/D154772	2023-07-10 22:36:57 +00:00
Peiming Liu	fc5d8fce7d	[mlir][sparse] support dual sparse convolution. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152601	2023-07-10 16:49:32 +00:00
Aart Bik	03125e6894	[mlir][sparse][gpu] fix missing dealloc This dealloc was incorrectly removed in https://reviews.llvm.org/D153173 Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D154564	2023-07-06 09:48:19 -07:00
Kun Wu	be2dd22b8f	[mlir][sparse][gpu] reuse CUDA environment handle throughout instance lifetime Differential Revision: https://reviews.llvm.org/D153173	2023-06-30 21:52:34 +00:00
Peiming Liu	a63d6a0014	[mlir][sparse] make UnpackOp return the actual filled length of unpacked memory This might simplify frontend implementation by avoiding recomputation for the same value. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D154244	2023-06-30 21:35:15 +00:00
Andrzej Warzynski	f22af204ed	[mlir][VectorType] Remove `numScalableDims` from the vector type This is a follow-up of https://reviews.llvm.org/D153372 in which `numScalableDims` (single integer) was effectively replaced with `isScalableDim` bitmask. This change is a part of a larger effort to enable scalable vectorisation in Linalg. See this RFC for more context: * https://discourse.llvm.org/t/rfc-scalable-vectorisation-in-linalg/ Differential Revision: https://reviews.llvm.org/D153412	2023-06-28 13:53:45 +01:00
Andrzej Warzynski	79c83e12c8	[mlir][VectorType] Allow arbitrary dimensions to be scalable At the moment, only the trailing dimensions in the vector type can be scalable, i.e. this is supported: vector<2x[4]xf32> and this is not allowed: vector<[2]x4xf32> This patch extends the vector type so that arbitrary dimensions can be scalable. To this end, an array of bool values is added to every vector type to denote whether the corresponding dimensions are scalable or not. For example, for this vector: vector<[2]x[3]x4xf32> the following array would be created: {true, true, false}. Additionally, the current syntax: vector<[2x3]x4xf32> is replaced with: vector<[2]x[3]x4xf32> This is primarily to simplify parsing (this way, the parser can easily process one dimension at a time rather than e.g. tracking whether "scalable block" has been entered/left). NOTE: The `isScalableDim` parameter of `VectorType` (introduced in this patch) makes `numScalableDims` redundant. For the time being, `numScalableDims` is preserved to facilitate the transition between the two parameters. `numScalableDims` will be removed in one of the subsequent patches. This change is a part of a larger effort to enable scalable vectorisation in Linalg. See this RFC for more context: * https://discourse.llvm.org/t/rfc-scalable-vectorisation-in-linalg/ Differential Revision: https://reviews.llvm.org/D153372	2023-06-27 19:21:59 +01:00
Aart Bik	f14c8eb595	[mlir][sparse][gpu] refine SDDMM pattern for cuSPARSE Old pattern was missing some cases (e.g. swapping the arguments) but it also allowed too many cases (e.g. non-empty "absent" or different arguments for add/mul). This fixes the issues. Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D153487	2023-06-21 18:31:55 -07:00
Peiming Liu	e7df82816b	[mlir][sparse] rewrite arith::SelectOp to semiring operations to sparsify it. Reviewed By: aartbik, K-Wu Differential Revision: https://reviews.llvm.org/D153397	2023-06-21 21:22:18 +00:00
Kun Wu	9167dd46ba	[mlir][sparse][gpu] recognizing sddmm pattern in GPU libgen path Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D151582	2023-06-15 23:48:11 +00:00
Peiming Liu	4e9526b9ea	[mlir][sparse] using stable_sort to make sure the compiled code are consistent between different builds configuration Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D153074	2023-06-15 21:09:35 +00:00
Aart Bik	65bfd5cb25	[mlir][sparse] proper in-place SDDMM with spy function This specific operation matches the cuSPARSE SDDMM semantics exactly. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D152969	2023-06-15 13:59:38 -07:00
Peiming Liu	faf7cd97d0	[mlir][sparse] merger extension to support sparsifying arith::CmpI/CmpF operation Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152761	2023-06-15 17:26:50 +00:00
Kazu Hirata	a39adc00db	[mlir] Fix warnings in release builds This patch fixes: mlir/lib/Dialect/SparseTensor/Transforms/LoopEmitter.cpp:846:16: error: unused variable 'lvlTp' [-Werror,-Wunused-variable] mlir/lib/Dialect/SparseTensor/Transforms/LoopEmitter.cpp:1059:13: error: unused variable '[t, l]' [-Werror,-Wunused-variable]	2023-06-14 14:22:17 -07:00
Peiming Liu	83b7f018fd	[mlir][sparse] fix crashes when the tensor that defines the loop bound can not be found Reviewed By: aartbik, K-Wu Differential Revision: https://reviews.llvm.org/D152877	2023-06-14 20:27:50 +00:00
Peiming Liu	fd68d36109	[mlir][sparse] unifying enterLoopOverTensorAtLvl and enterCoIterationOverTensorsAtLvls The tensor levels are now explicitly categorized into different `LoopCondKind` to instruct LoopEmitter generate different code for different kinds of condition (e.g., `SparseCond`, `SparseSliceCond`, `SparseAffineIdxCond`, etc) The process of generating a while loop is now dissembled into three steps and they are dispatched to different LoopCondKind handler. 1. Generate LoopCondition (e.g., `pos <= posHi` for `SparseCond`, `slice.isNonEmpty` for `SparseAffineIdxCond`) 2. Generate LoopBody (e.g., compute the coordinates) 3. Generate ExtraChecks (e.g., `if (onSlice(crd))` for `SparseSliceCond`) Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152464	2023-06-14 20:03:10 +00:00
Aart Bik	1ea903e164	[mlir][sparse][gpu] guard matvec COO AoS Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D152738	2023-06-12 16:49:58 -07:00
Aart Bik	80fe3168b5	[mlir][sparse] add support for direct prod/and/min/max reductions We recently fixed a bug in "sparsifying" such reductions, since it incorrectly changed this into reductions over stored elements only , which only works for add/sub/or/xor. However, we still want to be able to "sparsify" the reductions even in the general case, and this is a first step by rewriting them into a custom reduction that feeds in the implicit zeros. NOTE HOWEVER, that in the long run we want to do this better and feed in any implicit zero only ONCE for efficiency. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D152580	2023-06-12 09:27:47 -07:00
Peiming Liu	0258a53521	Brings back "[mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp" This reverts commit `07b927902d`. Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D152566	2023-06-09 17:45:46 +00:00
Peiming Liu	07b927902d	Revert "[mlir][sparse] moving inbound check for slice driven loop into before block of the WhileOp" This reverts commit `853d704fd0`. Differential Revision: https://reviews.llvm.org/D152562	2023-06-09 17:21:40 +00:00
Kun Wu	97f4c22b3a	[mlir][sparse][gpu] unify dnmat and dnvec handle and ops Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152465	2023-06-09 17:16:48 +00:00

1 2 3 4 5 ...

600 Commits