clang-p2996

Author	SHA1	Message	Date
Jacques Pienaar	d2c0572b2e	[mlir] Flip LinAlg dialect to _Both This one required more changes than ideal due to overlapping generated name with different return types. Changed getIndexingMaps to getIndexingMapsArray to move it out of the way/highlight that it returns (more expensively) a SmallVector and uses the prefixed name for the Attribute. Differential Revision: https://reviews.llvm.org/D129919	2022-07-19 14:42:58 -07:00
Matthias Springer	27a431f5e9	[mlir][bufferization][NFC] Move sparse_tensor.release to bufferization dialect This op used to belong to the sparse dialect, but there are use cases for dense bufferization as well. (E.g., when a tensor alloc is returned from a function and should be deallocated at the call site.) This change moves the op to the bufferization dialect, which now has an `alloc_tensor` and a `dealloc_tensor` op. Differential Revision: https://reviews.llvm.org/D129985	2022-07-19 09:18:19 +02:00
Aart Bik	28ebb0b61d	[mlir][sparse] migrate sparse rewriting to sparse transformations pass The rules in the linalg file were very specific to sparse tensors so will find a better home under sparse tensor dialect than linalg dialect. Also moved some rewriting from sparsification into this new "pre-rewriting" file. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D129910	2022-07-18 09:29:22 -07:00
Kazu Hirata	10bcfeebfa	[mlir] Remove unused using (NFC) Identified with misc-unused-using-decls.	2022-07-17 18:08:48 -07:00
Matthias Springer	c66303c287	[mlir][sparse] Switch to One-Shot Bufferize This change removes the partial bufferization passes from the sparse compilation pipeline and replaces them with One-Shot Bufferize. One-Shot Analysis (and TensorCopyInsertion) is used to resolve all out-of-place bufferizations, dense and sparse. Dense ops are then bufferized with BufferizableOpInterface. Sparse ops are still bufferized in the Sparsification pass. Details: * Dense allocations are automatically deallocated, unless they are yielded from a block. (In that case the alloc would leak.) All test cases are modified accordingly. E.g., some funcs now have an "out" tensor argument that is returned from the function. (That way, the allocation happens at the call site.) * Sparse allocations are not automatically deallocated. They must be "released" manually. (No change, this will be addressed in a future change.) * Sparse tensor copies are not supported yet. (Future change) * Sparsification no longer has to consider inplacability. If necessary, allocations and/or copies are inserted during TensorCopyInsertion. All tensors are inplaceable by the time Sparsification is running. Instead of marking a tensor as "not inplaceable", it can be marked as "not writable", which will trigger an allocation and/or copy during TensorCopyInsertion. Differential Revision: https://reviews.llvm.org/D129356	2022-07-14 09:52:48 +02:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Aart Bik	faa00c1313	[mlir][sparse] implement sparse2sparse reshaping (expand/collapse) A previous revision implemented expand/collapse reshaping between dense and sparse tensors for sparse2dense and dense2sparse since those could use the "cheap" view reshape on the already materialized dense tensor (at either the input or output side), and do some reshuffling from or to sparse. The dense2dense case, as always, is handled with a "cheap" view change. This revision implements the sparse2sparse cases. Lacking any "view" support on sparse tensors this operation necessarily has to perform data reshuffling on both ends. Tracker for improving this: https://github.com/llvm/llvm-project/issues/56477 Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D129416	2022-07-11 14:49:06 -07:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Aart Bik	6d8e2f1e51	[mlir][sparse] implement simple reshaping (expand/collapse) The revision makes a start with implementing expand/collapse reshaping for sparse tensors. When either source or destination is sparse, but the other is dense, the "cheap" dense reshape can be used prior to converting from or to a sparse tensor. Note1: sparse to sparse reshaping is still TBD. Note2: in the long run, we may want to implement a "view" into a sparse tensor so that the operation remains cheap and does not require data shuffling. Reviewed By: wrengr Differential Revision: https://reviews.llvm.org/D129031	2022-07-06 14:34:30 -07:00
wren romano	875ee0ed1c	[mlir][sparse] Reducing computational complexity This is a followup to D128847. The `AffineMap::getPermutedPosition` method performs a linear scan of the map, thus the previous implementation had asymptotic complexity of `O(|topSort| * |m|)`. This change reduces that to `O(|topSort| + |m|)`. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129011	2022-07-01 12:55:09 -07:00
Aart Bik	e057f25dee	[mlir][sparse] auto-insertion of conversion to resolve cycles When the iteration graph is cyclic (even after several attempts using fewer and fewer constraints), the current sparse compiler bails out, and no rewriting happens. However, this revision adds some new logic where the sparse compiler tries to find a single input sparse tensor that breaks the cycle, and then adds a proper sparse conversion operation. This way, more incoming kernels can be handled! Note, the resulting code is not optimal (although it keeps more or less proper "sparse" complexity), and more improvements should be added (especially when the kernel directly yields without computation, such as the transpose example). However, handling is better than not handling ;-) Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D128847	2022-06-29 18:28:18 -07:00
Aart Bik	eca6f9160f	[mlir][sparse][bufferization] refine bufferization assumption enforcement Enforce the assumption made on tensor buffers explicitly. When in-place, reuse the buffer, but fill with all zeroes for the non-update case, since the kernel assumes all elements are written to. When not in-place, zero out the new buffer when materializing or when no-updates occur. Copy the original tensor value when updates occur. This prepares migrating to the new bufferization strategy, where these assumptions must be made explicit. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D128691	2022-06-28 09:43:30 -07:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Matthias Springer	3798678bd1	[mlir][sparse][bufferize] Implement BufferizableOpInterface Only the analysis part of the interface is implemented. The bufferization itself is performed by the SparseTensorConversion pass. Differential Revision: https://reviews.llvm.org/D128138	2022-06-24 13:47:01 +02:00
Aart Bik	fde04aee33	[mlir][sparse] refine bufferization allocation lowering Marking bufferization allocation operation as invalid during sparse lowering is too strict, since dense and sparse allocation can co-exist. This revision refines the lowering with a dynamic type check. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D128305	2022-06-21 15:17:25 -07:00
Kazu Hirata	6d5fc1e3d5	[mlir] Don't use Optional::getValue (NFC)	2022-06-20 23:20:25 -07:00
Kazu Hirata	0916d96d12	Don't use Optional::hasValue (NFC)	2022-06-20 20:17:57 -07:00
Kazu Hirata	037f09959a	[mlir] Don't use Optional::hasValue (NFC)	2022-06-20 11:22:37 -07:00
Alex Zinenko	8b68da2c7d	[mlir] move SCF headers to SCF/{IR,Transforms} respectively This aligns the SCF dialect file layout with the majority of the dialects. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D128049	2022-06-20 10:18:01 +02:00
Jacques Pienaar	8df54a6a03	[mlir] Update accessors to prefixed form (NFC) Follow up from flipping dialects to both, flip accessor used to prefixed variant ahead to flipping from _Both to _Prefixed. This just flips to the accessors introduced in the preceding change which are just prefixed forms of the existing accessor changed from. Mechanical change using helper script https://github.com/jpienaar/llvm-project/blob/main/clang-tools-extra/clang-tidy/misc/AddGetterCheck.cpp and clang-format.	2022-06-18 17:53:22 -07:00
Aart Bik	aef20f59a5	[mlir][sparse] move from by-value to by-reference for data types This fixes all sorts of ABI issues due to passing by-value (using by-reference with memref's exclusively). Reviewed By: bkramer Differential Revision: https://reviews.llvm.org/D128018	2022-06-17 08:39:25 -07:00
Alex Zinenko	610139d2d9	[mlir] replace 'emit_c_wrappers' func->llvm conversion option with a pass The 'emit_c_wrappers' option in the FuncToLLVM conversion requests C interface wrappers to be emitted for every builtin function in the module. While this has been useful to bootstrap the interface, it is problematic in the longer term as it may unintentionally affect the functions that should retain their existing interface, e.g., libm functions obtained by lowering math operations (see D126964 for an example). Since D77314, we have a finer-grain control over interface generation via an attribute that avoids the problem entirely. Remove the 'emit_c_wrappers' option. Introduce the '-llvm-request-c-wrappers' pass that can be run in any pipeline that needs blanket emission of functions to annotate all builtin functions with the attribute before performing the usual lowering that accounts for the attribute. Reviewed By: chelini Differential Revision: https://reviews.llvm.org/D127952	2022-06-17 11:10:31 +02:00
Aart Bik	2a2886160d	[mlir][sparse] improved testing and codegen for semi-ring operations The semi-ring blocks were simply "inlined" by the sparse compiler but without any filtering or patching. This revision improves the analysis (rejecting blocks that use non-invariant computations from outside their blocks, except for linalg.index) and also improves the codegen by properly patching up index computations (previous version crashed). With a regression test. Also updated the documentation now that the example code is properly working. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D128000	2022-06-16 16:13:42 -07:00
Mogball	e16d13322b	[mlir] (NFC) Clean up bazel and CMake target names All dialect targets in bazel have been named Dialect and all dialect targets in CMake have been named MLIRDialect.	2022-06-13 16:24:15 +00:00
bixia1	ea8ed5cbcf	[mlir][sparse] Add F16 and BF16. This is the first PR to add `F16` and `BF16` support to the sparse codegen. There are still problems in supporting these two data types; for example, `BF16` is not quite working yet. Add test cases. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D127010	2022-06-08 09:51:05 -07:00
Matthias Springer	6232a8f3d6	[mlir][sparse][NFC] Switch InitOp to bufferization::AllocTensorOp Now that we have an AllocTensorOp (previously InitTensorOp) in the bufferization dialect, the InitOp in the sparse dialect is no longer needed. Differential Revision: https://reviews.llvm.org/D126180	2022-06-02 00:03:52 +02:00
wren romano	b364c76683	[mlir][sparse] Using non-empty function name suffix for OverheadType::kIndex The trick of using an empty token in the `FOREVERY_O` x-macro relies on preprocessor behavior which is only standard since C99 6.10.3/4 and C++11 N3290 16.3/4 (whereas it was undefined behavior up through C++03 16.3/10). Since the `ExecutionEngine/SparseTensorUtils.cpp` file is required to be compile-able under C++98 compatibility mode (unlike the C++11 used elsewhere in MLIR), we shouldn't rely on that behavior. Also, using a non-empty suffix helps improve uniformity of the API, since all other primary/overhead suffixes are also non-empty. I'm using the suffix `0` since that's the value used by the `SparseTensorEncoding` attribute for indicating the index overhead-type. Depends On D126720 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126724	2022-06-01 14:18:42 -07:00
wren romano	98e142cd4f	[mlir][sparse] Using x-macros in the function-suffix functions By defining the `{primary,overhead}TypeFunctionSuffix` functions via the same x-macros used to generate the runtime library's functions themselves, this helps avoid bugs from typos or things getting out of sync. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D126720	2022-05-31 17:36:43 -07:00
Aart Bik	5799f843a2	[mlir][sparse] add new complex ops to reduction recognition Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D126318	2022-05-24 15:00:56 -07:00
Aart Bik	28b6d412af	[mlir][sparse] add support for complex zero/one building Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D126039	2022-05-20 08:53:30 -07:00
wren romano	8cb332406c	[mlir][sparse] Enhancing sparse=>sparse conversion. Fixes: https://github.com/llvm/llvm-project/issues/51652 Depends On D122060 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122061	2022-05-16 15:42:19 -07:00
Aart Bik	736c1b66ef	[mlir][sparse] introduce complex type to sparse tensor support This is the first implementation of complex (f64 and f32) support in the sparse compiler, with complex add/mul as first operations. Note that various features are still TBD, such as other ops, and reading in complex values from file. Also, note that the std::complex<float> had a bit of an ABI issue when passed as single argument. It is still TBD if better solutions are possible. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D125596	2022-05-16 13:17:36 -07:00
Matthias Springer	e9fa559097	[mlir][sparse][NFC] Use RewriterBase/OpBuilder when possible Most functions do not need a PatternRewriter or ConversionPatternRewriter. Differential Revision: https://reviews.llvm.org/D125466	2022-05-13 11:37:26 +02:00
Aart Bik	2617f2f708	[mlir][sparse] fix build issue with unused local under opt builds Reviewed By: rdzhabarov Differential Revision: https://reviews.llvm.org/D124883	2022-05-03 14:55:32 -07:00
Jim Kitchen	2c33266084	[mlir][sparse] Add lowering for unary and binary ops Adding lowering for Unary and Binary required several changes due to their unique nature of containing custom code for different "regions" of the sparse structure being operated on. Along with a Kind, a pointer to the Operation is passed along to be merged once the lattice structure is figured out. The original operation is maintained, as it is required for subsequent lattice decisions. However, some branches of sparse_tensor.binary are considered fully handled and are therefore marked as kBinaryBranch to distinguish them. A unique aspect of the custom code is that sometimes the desired result is no result at all -- i.e. a user wants overlapping sparse entries to become empty in the output. The solution to this is to return an uninitialized Value(), which is checked and handled elsewhere in the code and results in nothing being written to the output tensor for that case. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D123057	2022-05-03 15:50:26 -05:00
Javier Setoain	6301574206	[mlir][SparseTensor] Enable VLA ops in index value generation Current index value generation uses fixed-length vector ops; this patch adds an alternative codegen path compatible with scalable vectors by using `LLVM::StepVectorOp`. Differential Revision: https://reviews.llvm.org/D124454	2022-04-28 09:39:07 +01:00
Nick Kreeger	4620032ee3	Revert "[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options." This reverts commit `d59cf901cb`. Build fails on NVIDIA Sparse tests: https://lab.llvm.org/buildbot/#/builders/61/builds/25447	2022-04-23 20:14:48 -05:00
Nick Kreeger	d59cf901cb	[mlir][sparse] Expose SpareTensor passes as enums instead of opaque numbers for vectorization and parallelization options. The SparseTensor passes currently use opaque numbers for the CLI, despite using an enum internally. This patch exposes the enums instead of numbered items that are matched back to the enum. Fixes GitHub issue #53389 Reviewed by: aartbik, mehdi_amini Differential Revision: https://reviews.llvm.org/D123876	2022-04-23 19:16:57 -05:00
River Riddle	eda6f907d2	[mlir][NFC] Shift a bunch of dialect includes from the .h to the .cpp Now that dialect constructors are generated in the .cpp file, we can drop all of the dependent dialect includes from the .h file. Differential Revision: https://reviews.llvm.org/D124298	2022-04-23 01:09:29 -07:00
River Riddle	58ceae9561	[mlir:NFC] Remove the forward declaration of FuncOp in the mlir namespace FuncOp has been moved to the `func` namespace for a little over a month, the using directive can be dropped now.	2022-04-18 12:01:55 -07:00
Aart Bik	0b55f94d2b	[mlir][sparse] replace stack-based access pattern with dyn-alloc Rationale: Allocating the temporary buffers for access pattern expansion on the stack (using alloca) is a bit too aggressive, since it easily runs out of stack space for large enveloping tensor dimensions. This revision replaces the stack allocation of these buffers with dynamic allocation using explicit alloc/dealloc pairs. Reviewed By: bixia, wrengr Differential Revision: https://reviews.llvm.org/D123253	2022-04-06 17:10:43 -07:00
wren romano	63bdcaf92a	[mlir][sparse] Moving `delete coo` into codegen instead of runtime library Prior to this change there were a number of places where the allocation and deallocation of SparseTensorCOO objects were not cleanly paired, leading to inconsistencies regarding whether each function released its tensor/coo arguments or not, as well as making it easy to run afoul of memory leaks, use-after-free, or double-free errors. This change cleans up the codegen vs runtime boundary to resolve those issues. Now, the only time the runtime library frees an object is either (a) because it's a function explicitly designed to do so, or (b) because the allocated object is entirely local to the function and would be a memory leak if not released. Thus, now the codegen takes complete responsibility for releasing any objects it caused to be allocated. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122435	2022-04-01 11:08:52 -07:00
Javier Setoain	7783a178f5	[mlir][Sparse] Add option for VLA sparsification Use "enable-vla-vectorization=vla" to generate vector-length-agnostic loops during vectorization. This option works for vectorization strategy 2. Differential Revision: https://reviews.llvm.org/D118379	2022-03-25 10:54:49 +00:00
wren romano	ebc8466481	[mlir][sparse] Adding {pointer,index}OverheadTypeEncoding Work towards: https://github.com/llvm/llvm-project/issues/51652 The new functions fill the gap between `overheadTypeEncoding` and `get{Pointer,Index}OverheadType`. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122056	2022-03-23 12:04:47 -07:00
wren romano	c7e24db412	[mlir][sparse] Introducing options for the SparseTensorConversion pass This is work towards: https://github.com/llvm/llvm-project/issues/51652 This differential sets up the options and threads them through everywhere, but doesn't actually use them yet. The differential that finally makes use of them is D122061, which is the final differential in the chain that fixes bug 51652. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D122054	2022-03-22 13:11:09 -07:00
Aart Bik	69a7759b40	[mlir][sparse] implement loop index value vectorization with CHECK and integration test Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D122040	2022-03-21 10:40:38 -07:00
Benjamin Kramer	89d8035e36	Use llvm::append_range where applicable It knows the size, so no need to call reserve beforehand. NFCI.	2022-03-18 20:05:48 +01:00
River Riddle	4a3460a791	[mlir:FunctionOpInterface] Rename the "type" attribute to "function_type" This removes any potential confusion with the `getType` accessors which correspond to SSA results of an operation, and makes it clear what the intent is (i.e. to represent the type of the function). Differential Revision: https://reviews.llvm.org/D121762	2022-03-16 17:07:04 -07:00

164 Commits