clang-p2996

Author	SHA1	Message	Date
Martin Erhart	8037deb7af	[mlir][memref] Add pass to expand realloc operations, simplify lowering to LLVM There are two motivations for this change: 1. It considerably simplifies adding support for the realloc operation to the new buffer deallocation pass by lowering the realloc such that no deallocation operation is inserted and the deallocation pass itself can insert that dealloc 2. The lowering is expressed on a higher level and thus easier to understand, and the lowerings of the memref operations it is composed of don't have to be duplicated in the MemRefToLLVM lowering (also see discussion in https://reviews.llvm.org/D133424) Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D159430	2023-09-05 08:58:40 +00:00
Kazu Hirata	c86bb3d91a	[mlir] Use a range-based for loop (NFC)	2023-09-05 00:30:40 -07:00
Yinying Li	898bf539a7	[mlir][sparse] Surface syntax change in parsing Example: compressed(nonunique, nonordered) or compressed(nonordered, nonunique) instead of compressed_nu_no. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D159366	2023-09-01 19:25:00 +00:00
Aart Bik	b86d3cbc12	[mlir][sparse] complete various FIXMEs in sparse support lib Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D159245	2023-08-30 21:30:25 -07:00
Peiming Liu	22e8d5b428	[mlir][sparse] Support strided convolution on dense level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D159020	2023-08-30 20:00:50 +00:00
Peiming Liu	07bd5f20bc	[mlir][sparse] Support strided convolution on compressed level. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158912	2023-08-30 19:37:50 +00:00
Peiming Liu	e015d385c9	[mlir][sparse] Pass down constant coefficients of affine index expressions to LoopEmitter. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158914	2023-08-30 18:44:50 +00:00
Peiming Liu	96e1914aa2	[mlir][sparse] fix crash when generating convolution kernel with sparse input in DCCD format. Reviewed By: aartbik, anlunx Differential Revision: https://reviews.llvm.org/D159170	2023-08-30 17:49:36 +00:00
Matthias Springer	79ff70fda2	[mlir][sparse] Better error handling when bufferizing sparse_tensor ops sparse_tensor ops cannot be bufferized with One-Shot Bufferize. (They can only be analyzed.) The sparse compiler does the actual lowering to memref. Produce a proper error message instead of crashing. This fixes #61311. Differential Revision: https://reviews.llvm.org/D158728	2023-08-25 08:34:05 +02:00
Yinying Li	be556ee157	[mlir][sparse] Fix typos in comments Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158667	2023-08-24 18:13:36 +00:00
Peiming Liu	fa9c93e30e	[mlir][sparse] Remove view-based sparse tensor collapse_shape implementation. We will migrate to a cleaner and more complete implementation. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158658	2023-08-23 20:29:28 +00:00
Yinying Li	51ebecf309	[mlir][sparse] Changed sparsity properties to use _ instead of - Example: compressed-no -> compressed_no Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158567	2023-08-23 17:00:27 +00:00
wren romano	f5b974b783	[mlir][sparse] Adding `{Var,DimLvlExpr,DimSpec,LvlSpec,DimLvlMap}::str` methods These methods are needed for use with `Diagnostic::operator<<` etc. The definitions follow the pattern of `Diagnostic::str` by simply wrapping the underlying `print(raw_ostream)` method. Although there is some overhead for constructing the `std::string`, this seems like the overall most-efficient option: since this overhead only occurs on the error path (under the current intended usage). An alternative approach would be to have one method construct a `Twine` directly, and then have the print method pass the twine to the stream; however, that would mean introducing the overhead of twine construction on the common/happy path of simply printing things out. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157643	2023-08-22 20:31:15 -07:00
wren romano	78921a64f7	[mlir][sparse] Add more helper methods for converting DimLvlExpr to Var These new methods help clean up some code for doing LvlExpr-analysis during DimExpr-inference. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157647	2023-08-22 20:00:59 -07:00
Eymen Ünay	57035192d0	[mlir] Fix typo in comments Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157215	2023-08-22 17:51:57 -07:00
Aart Bik	d8ae3a9b25	[mlir][sparse] Fix strict weak ordering in sorting These checks have been fired when comparing same elements like comp(a, a). Let's fix it. I don't have commit rights. Danila Kutenin kutdanila@yandex.ru Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152152	2023-08-22 17:15:10 -07:00
Aart Bik	bb44a6b7bb	[mlir][sparse] migrate more to new surface syntax Replaced the "NEW_SYNTAX" with the more readable "map" (which we may, or may not keep). Minor improvement in keyword parsing, migrated a few more examples over. Reviewed By: Peiming, yinying-lisa-li Differential Revision: https://reviews.llvm.org/D158325	2023-08-21 12:49:21 -07:00
Aart Bik	8154494e28	[mlir][sparse] refactor sparsification and bufferization pass into proper TD pass Registering the SparsificationAndBufferization into a proper TD pass has the advantage that it can be invoked and tested in isolation. This change also moves some bufferization specific set up from the pipeline file into the pass file, keeping the logic more locally. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D158219	2023-08-17 15:45:03 -07:00
Aart Bik	ada62568a8	[mlir][sparse] cleanup of CMake files Alphabetical order, less whitespace removed some verbose comments for readability (which are preserved in the history if ever needed) Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D158114	2023-08-16 13:46:02 -07:00
Peiming Liu	fa6726e27b	[mlir][sparse] supports sparse_tensor.pack on libgen path Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D158012	2023-08-15 20:20:54 +00:00
Matthias Springer	a02ad6c177	[mlir][bufferization] Generalize getAliasingOpResults to getAliasingValues This revision is needed to support bufferization of `cf.br`/`cf.cond_br`. It will also be useful for better analysis of loop ops. This revision generalizes `getAliasingOpResults` to `getAliasingValues`. An OpOperand can now not only alias with OpResults but also with BlockArguments. In the case of `cf.br` (will be added in a later revision): a `cf.br` operand will alias with the corresponding argument of the destination block. If an op does not implement the `BufferizableOpInterface`, the analysis in conservative. It previously assumed that an OpOperand may alias with each OpResult. It now assumes that an OpOperand may alias with each OpResult and each BlockArgument of the entry block. Differential Revision: https://reviews.llvm.org/D157957	2023-08-15 15:02:47 +02:00
Aart Bik	289f7231f9	[mlir][sparse][gpu] minor code cleanup for sparse gpu ops Consistent order of ops and related methods. Also, renamed SpGEMMGetSizeOp to SpMatGetSizeOp since this is a general utility for sparse matrices, not specific to GEMM ops only. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D157922	2023-08-14 15:08:57 -07:00
Aart Bik	76a80a0808	[mlir][sparse][gpu] sparsifier GPU libgen for SpGEMM in cuSparse With working integration end-to-end test Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D157652	2023-08-10 14:52:16 -07:00
wren romano	f9d505316b	[mlir][sparse] updating methods to get MLIRContext Renames the old `DimLvlExpr::getContext` to `tryGetContext` to make it clear that the result could be a nullptr. Also adds new delegation methods `DimSpec::tryGetContext` and `LvlSpec::getContext`. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157163	2023-08-04 17:56:09 -07:00
wren romano	cad4646733	[mlir][sparse] Improve handling of NEW_SYNTAX Improves the conversion from `DimLvlMap` to STEA, in order to correct rank-mismatch issues in the roundtrip tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157162	2023-08-04 17:53:34 -07:00
Peiming Liu	2b1bdc9f0b	fix build error. To fix build error introduced in https://reviews.llvm.org/D157025 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D157134	2023-08-04 20:37:49 +00:00
Peiming Liu	849187c95d	[mlir][transform][sparse] introduce sparse tensor transform extension. Reviewed By: aartbik, ftynse Differential Revision: https://reviews.llvm.org/D157025	2023-08-04 18:40:07 +00:00
Peiming Liu	372d88b051	[mlir][sparse] code cleanup. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D156941	2023-08-02 23:19:55 +00:00
K-Wu	cfa82f7783	[mlir][sparse][gpu] introduce flag that controls host to device copy strategies (regular dma default) Differential Revision: https://reviews.llvm.org/D155352	2023-08-01 22:30:40 +00:00
wren romano	fdbe9312b1	[mlir][sparse] Adding getters/setters to `DimLvlMap` Depends On D156768 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D156770	2023-08-01 12:55:45 -07:00
wren romano	0495912388	[mlir][sparse] Adding `LvlVar` forward-declarations to `DimLvlMap::print` This commit makes `DimLvlMap::print` consistent with `DimLvlMapParser` with respect to both the forward-decls per se, as well as the all-or-none constraint on LvlVar-bindings in LvlSpecs. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D156768	2023-08-01 12:52:17 -07:00
Kun Wu	1e491c425b	[mlir][sparse][gpu] add 2:4 spmm prune_and_check flag Differential Revision: https://reviews.llvm.org/D155909	2023-08-01 18:24:18 +00:00
wren romano	5df63ad826	[mlir][sparse] Adding `Ranks::operator==` and `VarSet::getRank` etc Depends On D156001 Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D156010	2023-07-31 17:09:00 -07:00
wren romano	2a68213858	[mlir][sparse] Adding nodiscard annotations to `isWF`/`isValid` predicates Depends On D156001 Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D156007	2023-07-31 14:40:05 -07:00
wren romano	3b00f44811	[mlir][sparse] Marking off todos and updating commentary Depends On D155999 Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D156001	2023-07-31 14:35:39 -07:00
wren romano	497050c961	[mlir][sparse] Fixes bug in VarSet ctor Previously, the commented out code in the `DimLvlMap` ctor would result in `VarSet::add` raising an OOB error; which should be impossible because the ctor asserted `DimLvlMap::isWF` which ensures that all variables occuring in the map are within bounds for the ranks. The root cause of that bug was the `VarSet` ctor using `SmallBitVector::reserve` which does not actually change the size of the bitvectors (hence the subsequent OOB). This is corrected by using any of `SmallBitVector::resize`, the move-ctor, or the copy-ctor. Since the default-initialized bitvectors being modified/overwritten have size zero, there shouldn't be any significant performance difference between these three implementations. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D155999	2023-07-31 14:27:59 -07:00
Fangrui Song	6a684dbc44	[Support] Remove llvm::is_trivially_{copy/move}_constructible This restores D132311, which was reverted in `29c841ce93` (Sep 2022) due to certain files not buildable with GCC 7.3.0. The previous attempt was reverted by `6cd9608fb3` (Dec 2020). This time, GCC 7.3.0 has existing build errors for a long time due to structured bindings for many files, e.g. ``` llvm/lib/Transforms/Vectorize/LoopVectorize.cpp:9098:13: error: cannot decompose class type ‘std::pair<llvm::Value, const llvm::SCEV>’: both it and it s base class ‘std::pair<llvm::Value, const llvm::SCEV>’ have non-static data members for (auto [_, Stride] : Legal->getLAI()->getSymbolicStrides()) { ^~~~~~~~~~~ ``` ... and also some `error: duplicate initialization of` instances due to llvm/Transforms/IPO/Attributor.h. --- GCC 7.5.0 has a bug that, without this change, certain `SmallVector` with a `std::pair` element type like `SmallVector<std::pair<Instruction * const, Info>, 0> X;` lead to spurious ``` /tmp/opt/gcc-7.5.0/include/c++/7.5.0/type_traits:878:48: error: constructor required before non-static data member for ‘...’ has been parsed ``` Switching to std::is_trivially_{copy/move}_constructible fixes the error.	2023-07-25 17:21:16 -07:00
Mehdi Amini	5e8a1164f2	Revert "[mlir][gpu] Fallback to JIT compilation" "[mlir][gpu] Increase default SM version from 35 to 50" and "[mlir][gpu] Improving Cubin Serialization with ptxas Compiler" This reverts commit `2e0e00ed84` and reverts commit `a6eb40692c` and reverts commit `585cbe3f63`. 15 tests are broken on the mlir-nvidia buildbot: 'cuModuleLoadData(&module, data)' failed with 'CUDA_ERROR_INVALID_SOURCE' 'cuModuleGetFunction(&function, module, name)' failed with 'CUDA_ERROR_INVALID_HANDLE' 'cuLaunchKernel(function, gridX, gridY, gridZ, blockX, blockY, blockZ, smem, stream, params, extra)' failed with 'CUDA_ERROR_INVALID_HANDLE' 'cuModuleUnload(module)' failed with 'CUDA_ERROR_INVALID_HANDLE'	2023-07-24 10:23:15 -07:00
Guray Ozen	585cbe3f63	[mlir][gpu] Improving Cubin Serialization with ptxas Compiler This work improves how we compile the generated PTX code using the `ptxas` compiler. Currently, we rely on the driver's jit API to compile the PTX code. However, this approach has some limitations. It doesn't always produce the same binary output as the ptxas compiler, leading to potential inconsistencies in the generated Cubin files. This work introduces a significant improvement by directly utilizing the ptxas compiler for PTX compilation. By doing so, we can achieve more consistent and reliable results in generating cubin files. Key Benefits: - Using the Ptxas compiler directly ensures that the cubin files generated during the build process remain consistent with CUDA compilation using `nvcc` or `clang`. - Another advantage of this work is that it allows developers to experiment with different ptxas compilers without the need to change the compiler. Performance among ptxas compiler versions are vary, therefore, one can easily try different ptxas compilers. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155563	2023-07-24 12:29:53 +02:00
wren romano	889f4bf264	[mlir][sparse] Improve `DimLvlMapParser`'s handling of variable bindings This commit comprises a number of related changes: (1) Reintroduces the semantic distinction between `parseVarUsage` vs `parseVarBinding`, adds documentation explaining the distinction, and adds commentary to the one place that violates the desired/intended semantics. (2) Improves documentation/commentary about the forward-declaration of level-vars, and about the meaning of the `bool` parameter to `parseLvlSpec`. (2) Removes the `VarEnv::addVars` method, and instead has `DimLvlMapParser` handle the conversion issues directly. In particular, the parser now stores and maintains the `{dims,lvls}AndSymbols` arrays, thereby avoiding the O(n^2) behavior of scanning through the entire `VarEnv` for each `parse{Dim,Lvl}Spec` call. Unfortunately there still remains another source of O(n^2) behavior, namely: the `AsmParser::parseAffineExpr` method will copy the `DimLvlMapParser::{dims,lvls}AndSymbols` arrays into `AffineParser::dimsAndSymbols` on each `parse{Dim,Lvl}Spec` call; but fixing that would require extensive changes to `AffineParser` itself. Depends On D155532 Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D155533	2023-07-20 15:56:03 -07:00
wren romano	ad7a6b672c	[mlir][sparse] Renaming `CreationPolicy` to `Policy` This change is mostly for brevity's sake; but it also paves the way for the `Policy` enum to be reuseable for other situations that require the same three-way semantics. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D155532	2023-07-19 12:13:02 -07:00
Alex Zinenko	4a6b31b8d8	[mlir] NFC: untangle SCF Patterns.h and Transforms.h These two headers both contained a strange mix of definitions related to both patterns and non-pattern transforms. Put patterns and "populate" functions into Patterns.h and standalone transforms into Transforms.h. Depends On: D155223 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D155454	2023-07-18 11:27:36 +00:00
Amir Bishara	1ad009ad3c	[mlir][cmake] Comment out redundant static assert regarding VarInfo struct Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D154940	2023-07-18 00:14:24 -07:00
wren romano	47cf7a4ba5	[llvm] Allow SMLoc to be used in constexpr context Since `StringRef::empty` can be used in constexpr context, it seems reasonable that `SMLoc::isValid` should be too. The default-ctor and `operator==` are made constexpr for consistency. In particular, the `constexpr` keyword is needed for silencing warnings on Windows (whereas Linux allows constexpr usage without the keyword). Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D154741	2023-07-17 11:45:22 -07:00
Peiming Liu	269c82d389	[mlir][sparse] introduce new 2:4 block sparsity level type. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D155128	2023-07-12 23:33:53 +00:00
K-Wu	e37fc3cc39	[mlir][sparse][gpu] Impl 2:4 SpMM rewrite for linalg op w/ DENSE24 attr Differential Revision: https://reviews.llvm.org/D154772	2023-07-10 22:36:57 +00:00
Peiming Liu	fc5d8fce7d	[mlir][sparse] support dual sparse convolution. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D152601	2023-07-10 16:49:32 +00:00
wren romano	dcadb68a5c	[mlir][sparse] Cleaning up OOB implementation details for `VarSet` Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D154674	2023-07-07 14:35:56 -07:00
wren romano	68785c1c44	[mlir][sparse] Correcting RTTI implementation for the Var class Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D154662	2023-07-06 18:57:02 -07:00
Aart Bik	03125e6894	[mlir][sparse][gpu] fix missing dealloc This dealloc was incorrectly removed in https://reviews.llvm.org/D153173 Reviewed By: K-Wu Differential Revision: https://reviews.llvm.org/D154564	2023-07-06 09:48:19 -07:00

1 2 3 4 5 ...

738 Commits