clang-p2996

Author	SHA1	Message	Date
Peiming Liu	260e45cff0	[mlir][sparse] fix stack UAF (#79353 )	2024-01-24 12:12:55 -08:00
Peiming Liu	298412b578	[mlir][sparse] setup `SparseIterator` to help generating code to traverse a sparse tensor level. (#78345 )	2024-01-24 11:33:06 -08:00
Matthias Springer	5fcf907b34	[mlir][IR] Rename "update root" to "modify op" in rewriter API (#78260 ) This commit renames 4 pattern rewriter API functions: * `updateRootInPlace` -> `modifyOpInPlace` * `startRootUpdate` -> `startOpModification` * `finalizeRootUpdate` -> `finalizeOpModification` * `cancelRootUpdate` -> `cancelOpModification` The term "root" is a misnomer. The root is the op that a rewrite pattern matches against (https://mlir.llvm.org/docs/PatternRewriter/#root-operation-name-optional). A rewriter must be notified of all in-place op modifications, not just in-place modifications of the root (https://mlir.llvm.org/docs/PatternRewriter/#pattern-rewriter). The old function names were confusing and have contributed to various broken rewrite patterns. Note: The new function names use the term "modify" instead of "update" for consistency with the `RewriterBase::Listener` terminology (`notifyOperationModified`).	2024-01-17 11:08:59 +01:00
Matthias Springer	0a8e3dd432	[mlir][Interfaces] `DestinationStyleOpInterface`: Rename `hasTensor/BufferSemantics` (#77574 ) Rename interface functions as follows: * `hasTensorSemantics` -> `hasPureTensorSemantics` * `hasBufferSemantics` -> `hasPureBufferSemantics` These two functions return "true" if the op has tensor/buffer operands but not buffer/tensor operands. Also drop the "ranked" part from the interface, i.e., do not distinguish between ranked/unranked types. The new function names describe the functions more accurately. They also align their semantics with the notion of "tensor semantics" with the bufferization framework. (An op is supposed to be bufferized if it has tensor operands, and we don't care if it also has memref operands.) This change is in preparation of #75273, which adds `BufferizableOpInterface::hasTensorSemantics`. By renaming the functions in the `DestinationStyleOpInterface`, we can avoid name clashes between the two interfaces.	2024-01-12 10:02:54 +01:00
Aart Bik	aec73eade7	[mlir][sparse] allow unknown ops in one-shot bufferization in mini-pipeline (#77688 ) Rationale: Since this mini-pipeline may be used in alternative pipelines (viz. different from the default "sparsifier" pipeline) where unknown ops are handled by alternative bufferization methods that are downstream of this mini-pipeline, we allow unknown ops by default (failure to bufferize is eventually apparent by failing to convert to LLVM IR). This is part of enabling e2e testing for TORCH-MLIR tests using a sparsifier backend	2024-01-10 13:36:20 -08:00
Peiming Liu	d933b88b71	[mlir][sparse] use a common util function to query the tensor level s… (#76764 ) …et in a lattice point.	2024-01-02 15:56:42 -08:00
Aart Bik	41a07e668c	[mlir][sparse] recognize NVidia 2:4 type for matmul (#76758 ) This removes the temporary DENSE24 attribute and replaces it with proper recognition of dense to 24 conversion. The compressionh will be performed on the device prior to performing the matrix mult. Note that we no longer need to start with the linalg version, we can lift this to the proper named linalg op. Also renames some files into more consistent names.	2024-01-02 14:44:24 -08:00
Peiming Liu	cf4dd91165	[mlir][sparse] initialize slice-driven loop-related fields in one place (#76099 )	2023-12-20 14:20:57 -08:00
Matthias Springer	10056c821a	[mlir][SCF] `scf.parallel`: Make reductions part of the terminator (#75314 ) This commit makes reductions part of the terminator. Instead of `scf.yield`, `scf.reduce` now terminates the body of `scf.parallel` ops. `scf.reduce` may contain an arbitrary number of reductions, with one region per reduction. Example: ```mlir %init = arith.constant 0.0 : f32 %r:2 = scf.parallel (%iv) = (%lb) to (%ub) step (%step) init (%init, %init) -> f32, f32 { %elem_to_reduce1 = load %buffer1[%iv] : memref<100xf32> %elem_to_reduce2 = load %buffer2[%iv] : memref<100xf32> scf.reduce(%elem_to_reduce1, %elem_to_reduce2 : f32, f32) { ^bb0(%lhs : f32, %rhs: f32): %res = arith.addf %lhs, %rhs : f32 scf.reduce.return %res : f32 }, { ^bb0(%lhs : f32, %rhs: f32): %res = arith.mulf %lhs, %rhs : f32 scf.reduce.return %res : f32 } } ``` `scf.reduce` operations can no longer be interleaved with other ops in the body of `scf.parallel`. This simplifies the op and makes it possible to assign the `RecursiveMemoryEffects` trait to `scf.reduce`. (This was not possible before because the op was not a terminator, causing the op to be DCE'd.)	2023-12-20 11:06:27 +09:00
Matthias Springer	ea979b24b0	[mlir][SparseTensor][NFC] Remove `isNestedIn` helper function (#75729 ) Use `Region::findAncestorBlockInRegion` instead of a custom IR traversal.	2023-12-17 13:19:27 +09:00
Peiming Liu	6c06bde7c4	[mlir][sparse] support loop range query using SparseTensorLevel. (#75670 )	2023-12-15 16:33:31 -08:00
Peiming Liu	21edad7d07	[mlir][sparse] set up the skeleton for SparseTensorLevel abstraction. (#75645 ) Note that at the current moment, the newly-introduced `SparseTensorLevel` classes are far from complete, we plan to migrate code generation related to accessing sparse tensor levels to these classes in the near future to simplify `LoopEmitter`.	2023-12-15 13:34:34 -08:00
Peiming Liu	4a72a4ef12	[NFC][mlir][sparse] remove redundant parameter. (#75551 )	2023-12-15 09:29:22 -08:00
Aart Bik	15c06bc4af	[mlir][sparse] comment cleanup in iteration graph sorter (#75508 )	2023-12-14 10:56:28 -08:00
Aart Bik	e52c941921	[mlir][sparse] minor cleanup of transform/utils (#75396 ) Consistent include macro naming Modified and added comments	2023-12-13 15:18:35 -08:00
Aart Bik	365777ecbe	[mlir][sparse] refactor utilities into transform/utils dir (#75250 ) Separates actual transformation files from supporting utility files in the transforms directory. Includes a bazel overlay fix for the build (as well as a bit of cleanup of that file to be less verbose and more flexible).	2023-12-12 15:34:31 -08:00
Aart Bik	047399c213	[mlir][sparse] cleanup of CodegenEnv reduction API (#75243 )	2023-12-12 12:44:46 -08:00
Aart Bik	d96f46dd20	[mlir][sparse] fix bug in custom reduction scalarization code (#74898 ) Bug found with BSR of "spy" SDDMM method	2023-12-11 10:22:17 -08:00
Peiming Liu	baa192ea65	[mlir][sparse] optimize memory loads to SSA values when generating sp… (#74787 ) …arse conv.	2023-12-08 09:22:19 -08:00
Peiming Liu	097d2f1417	[mlir][sparse] optimize memory load to SSA value when generating spar… (#74750 ) …se conv kernel.	2023-12-07 12:00:25 -08:00
Matthias Springer	986287e7f3	[mlir][SparseTensor] Fix invalid API usage in patterns (#74690 ) Rewrite patterns must return `success` if the IR was modified. This commit fixes sparse tensor tests such as `SparseTensor/sparse_fusion.mlir`, `SparseTensor/CPU/sparse_reduce_custom.mlir`, `SparseTensor/CPU/sparse_semiring_select.mlir` when running with `MLIR_ENABLE_EXPENSIVE_PATTERN_API_CHECKS`.	2023-12-07 12:05:20 +09:00
Peiming Liu	78e2b74f96	[mlir][sparse] fix bugs when generate sparse conv_3d kernels. (#74561 )	2023-12-06 15:59:10 -08:00
Matthias Springer	861600f175	[mlir][SparseTensor] Fix invalid IR in `ForallRewriter` pattern (#74547 ) The `ForallRewriter` pattern used to generate invalid IR: ``` mlir/test/Dialect/SparseTensor/GPU/gpu_combi.mlir:0:0: error: 'scf.for' op expects region #0 to have 0 or 1 blocks mlir/test/Dialect/SparseTensor/GPU/gpu_combi.mlir:0:0: note: see current operation: "scf.for"(%8, %2, %9) ({ ^bb0(%arg5: index): // ... "scf.yield"() : () -> () ^bb1(%10: index): // no predecessors "scf.yield"() : () -> () }) : (index, index, index) -> () ``` This commit fixes tests such as `mlir/test/Dialect/SparseTensor/GPU/gpu_combi.mlir` when verifying the IR after each pattern application (#74270).	2023-12-07 08:47:20 +09:00
Matthias Springer	851f85fffb	[mlir][SparseTensor] Fix insertion point in `createQuickSort` (#74549 ) `createQuickSort` used to generate invalid IR: ``` "func.func"() <{function_type = (index, index, memref<?xindex>, memref<?xf32>, memref<?xi32>) -> (), sym_name = "_sparse_qsort_0_1_index_coo_1_f32_i32", sym_visibility = "private"}> ({ ^bb0(%arg0: index, %arg1: index, %arg2: memref<?xindex>, %arg3: memref<?xf32>, %arg4: memref<?xi32>): %0:2 = "scf.while"(%arg0, %arg1) ({ ^bb0(%arg5: index, %arg6: index): // ... "scf.condition"(%3, %arg5, %arg6) : (i1, index, index) -> () }, { ^bb0(%arg5: index, %arg6: index): // ... %7:2 = "scf.if"(%6) ({ %8 = "arith.cmpi"(%2, %3) <{predicate = 7 : i64}> : (index, index) -> i1 // ... "scf.yield"(%9#0, %9#1) : (index, index) -> () %10 = "arith.constant"() <{value = 0 : index}> : () -> index }, { "scf.yield"(%arg5, %arg5) : (index, index) -> () }) : (i1) -> (index, index) "scf.yield"(%7#0, %7#1) : (index, index) -> () }) : (index, index) -> (index, index) "func.return"() : () -> () }) : () -> () within split at mlir/test/Dialect/SparseTensor/buffer_rewriting.mlir:76 offset :11:1: error: 'scf.yield' op must be the last operation in the parent block ``` This commit fixes tests such as `mlir/test/Dialect/SparseTensor/buffer_rewriting.mlir` when verifying the IR after each pattern application (#74270).	2023-12-07 08:47:05 +09:00
Aart Bik	c5a1732cf3	[mlir][sparse] use "current" and "curr" consistently (#74656 ) Removes at in favor of curr; also makes method delegates consistent	2023-12-06 14:12:46 -08:00
Aart Bik	98ce2debc6	[mlir][sparse] cleanup ldx/idx/depth/at usage (#74654 ) This adds a consistent usage with `at` for everything that refers to the current loop nesting. This cleans up some redundant legacy code from when we were still using topSort inside sparsifier code.	2023-12-06 13:23:50 -08:00
Aart Bik	5b0db27ace	[mlir][sparse] remove LoopOrd type (#74540 ) Rationale: We no longer deal with topsort during sparsification, so that LoopId == LoopOrd for all methods. This first revision removes the types. A follow up revision will simplify some other remaining constructs that deal with loop order (e.g. at and ldx).	2023-12-06 09:35:30 -08:00
Aart Bik	067bebb50f	[mlir][sparse] minor refactoring of sparsification file (#74403 ) Removed obsoleted TODOs and NOTEs, formatting, removed unused parameter	2023-12-05 09:31:17 -08:00
Peiming Liu	8206b75a1e	[mlir][sparse] fix crash when generate rotated convolution kernels. (#74146 )	2023-12-01 14:13:57 -08:00
Peiming Liu	b6cad75e07	[mlir][sparse] refactoring: using util functions to query the index to load from position array for slice-driven loop. (#73986 )	2023-11-30 16:40:11 -08:00
Aart Bik	5b72950394	[mlir][sparse] move all COO related methods into SparseTensorType (#73881 ) This centralizes all COO methods, and provides a cleaner API. Note that the "enc" only constructor is a temporary workaround the need for COO methods inside the "enc" only storage specifier.	2023-11-30 09:40:39 -08:00
Aart Bik	45288085b5	[mlir][sparse] move toCOOType into SparseTensorType class (#73708 ) Migrates dangling convenience method into proper SparseTensorType class. Also cleans up some details (picking right dim2lvl/lvl2dim). Removes more dead code.	2023-11-28 16:04:01 -08:00
Peiming Liu	1ece4d3a0d	[mlir][sparse] code simplification: always use synthetical tensor for… (#73597 ) … loop bound.	2023-11-27 17:41:45 -08:00
Peiming Liu	4e2f1521ec	[mlir][sparse] code cleanup, remove FIXMEs (#73575 )	2023-11-27 14:57:08 -08:00
Aart Bik	1944c4f76b	[mlir][sparse] rename DimLevelType to LevelType (#73561 ) The "Dim" prefix is a legacy left-over that no longer makes sense, since we have a very strict "Dimension" vs. "Level" definition for sparse tensor types and their storage.	2023-11-27 14:27:52 -08:00
Aart Bik	1dd387e106	[mlir][sparse] change dim level type -> level type (#73058 ) The "dimension" before "level" does not really make sense Note that renaming the actual type DimLevelType to LevelType is still TBD, since this is an externally visible change (e.g. visible to Python API).	2023-11-22 09:06:22 -08:00
Aart Bik	d213220a9a	[mlir][sparse] fixed naming consistency (#73053 ) All DLT related methods have DLT at end, removed stale TODO	2023-11-21 16:26:09 -08:00
Peiming Liu	b52eb7c2fe	[mlir][sparse] add a csr x bsr matmul test case (#73012 )	2023-11-21 09:14:45 -08:00
Peiming Liu	2cc4b3d07c	[mlir][sparse] code cleanup using the assumption that dim2lvl maps ar… (#72894 ) …e simplified.	2023-11-20 10:25:42 -08:00
Aart Bik	83cf0dc982	[mlir][sparse] implement direct IR alloc/empty/new for non-permutations (#72585 ) This change implements the correct level sizes set up for the direct IR codegen fields in the sparse storage scheme. This brings libgen and codegen together again. This is step 3 out of 3 to make sparse_tensor.new work for BSR	2023-11-16 17:17:41 -08:00
Peiming Liu	ccd923e3cb	[mlir][sparse] code cleanup (remove dead code related to filter loop). (#72573 )	2023-11-16 14:26:09 -08:00
Peiming Liu	ff8815e597	[mlir][sparse] code cleanup (remove topSort in CodegenEnv). (#72550 )	2023-11-16 13:21:49 -08:00
Aart Bik	2323f48e0d	[mlir][sparse] refactor dim2lvl/lvl2dim lvlsizes setup (#72474 ) This change provides access to the individual components of dim sizes and lvl sizes after each codegenutil call. This is step 2 out of 3 to make sparse_tensor.new work for BSR	2023-11-15 21:41:43 -08:00
Aart Bik	e8fc282ff2	[mlir][sparse] avoid non-perm on sparse tensor convert for new (#72459 ) This avoids seeing non-perm on the convert from COO to non-COO for higher dimensional new operators (viz. reading in BSR). This is step 1 out of 3 to make sparse_tensor.new work for BSR	2023-11-15 20:47:37 -08:00
Peiming Liu	06a65ce500	[mlir][sparse] schedule sparse kernels in a separate pass from sparsification. (#72423 )	2023-11-15 12:16:05 -08:00
Maksim Levental	e35b606280	[mlir][sparsifier] fix `isAdmissibleBSR` (#72195 ) Fixes https://github.com/llvm/llvm-project/issues/72194.	2023-11-14 16:56:34 -06:00
Aart Bik	5f32bcfbae	[mlir][sparse][gpu] re-enable all GPU libgen tests (#72185 ) Previous change no longer properly used the GPU libgen pass (even though most tests still passed falling back to CPU). This revision puts the proper pass order into place. Also bit of a cleanup of CPU codegen vs. libgen setup.	2023-11-14 09:06:15 -08:00
long.chen	1609f1c2a5	[mlir][affine][nfc] cleanup deprecated T.cast style functions (#71269 ) detail see the docment: https://mlir.llvm.org/deprecation/ Not all changes are made manually, most of them are made through a clang tool I wrote https://github.com/lipracer/cpp-refactor.	2023-11-14 13:01:19 +08:00
Peiming Liu	269685545e	[mlir][sparse] remove filter-loop based algorithm support to handle a… (#71840 ) …ffine subscript expressions.	2023-11-13 11:36:49 -08:00
Aart Bik	af8428c0d9	[mlir][sparse] unify support of (dis)assemble between direct IR/lib path (#71880 ) Note that the (dis)assemble operations still make some simplfying assumptions (e.g. trailing 2-D COO in AoS format) but now at least both the direct IR and support library path behave exactly the same. Generalizing the ops is still TBD.	2023-11-13 10:05:00 -08:00

1 2 3 4 5 ...

717 Commits