clang-p2996

Author	SHA1	Message	Date
Matthias Springer	325b58d59f	[mlir][cf] Print message in cf.assert to LLVM lowering The assert message was previously ignored. The lowered IR now calls `puts` it in case of a failed assertion. Differential Revision: https://reviews.llvm.org/D138647	2022-12-15 17:45:34 +01:00
Matthias Kramm	4e98d611ef	[mlir] Implement backward dataflow. This enables interprocedural lifeness analysis, very busy expression analysis, etc. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D138935	2022-12-13 18:35:27 +01:00
Hanhan Wang	0f297cad4d	[mlir][tensor][linalg] Introduce DataLayoutPropagation pass. It introduces a pattern that swaps `linalg.generic + tensor.pack` to `tensor.pack + linalg.generic`. It requires all the iteration types being parallel; the indexing map of output operand is identiy. They can all be relaxed in the future. The user can decide whether the propagation should be applied or not by passing a control function. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D138882	2022-12-06 15:00:07 -08:00
Matthias Springer	c1fef4e88a	[mlir][bufferization] Make `TensorCopyInsertionPass` a test pass TensorCopyInsertion should not have been exposed as a pass. This was a flaw in the original design. It is a preparation step for bufferization and certain transforms (that would otherwise be legal) are illegal between TensorCopyInsertion and actual rewrite to MemRef ops. Therefore, even if broken down as two separate steps internally, they should be exposed as a single pass. This change affects the sparse compiler, which uses `TensorCopyInsertionPass`. A new `SparsificationAndBufferizationPass` is added to replace all passes in the sparse tensor pipeline from `TensorCopyInsertionPass` until the actual bufferization (rewrite to memref/non-tensor). It is generally unsafe to run arbitrary passes in-between, in particular passes that hoist tensor ops out of loops or change SSA use-def chains along tensor ops. Differential Revision: https://reviews.llvm.org/D138915	2022-12-02 15:38:02 +01:00
Nicolas Vasilache	6e92d3fead	[mlir][Test] Add a test pass to act as a sink towards LLVM conversion This allows writing simple e2e tests where we can check for the proper materialization of specific LLVM IR (e.g. `llvm.intr.fmuladd`). Differential Revision: https://reviews.llvm.org/D138776	2022-11-28 00:59:55 -08:00
River Riddle	8c66344ee9	[mlir:PDL] Add support for DialectConversion with pattern configurations Up until now PDL(L) has not supported dialect conversion because we had no way of remapping values or integrating with type conversions. This commit rectifies that by adding a new "pattern configuration" concept to PDL. This essentially allows for attaching external configurations to patterns, which can hook into pattern events (for now just the scope of a rewrite, but we could also pass configs to native rewrites as well). This allows for injecting the type converter into the conversion pattern rewriter. Differential Revision: https://reviews.llvm.org/D133142	2022-11-08 01:57:57 -08:00
Nicolas Vasilache	44cfea0279	[mlir][Linalg] Retire LinalgStrategyTilePass and filter-based pattern. Context: https://discourse.llvm.org/t/psa-retire-linalg-filter-based-patterns/63785 Uses of `LinalgTilingPattern::returningMatchAndRewrite` are replaced by a top-level `tileWithLinalgTilingOptions` function that is marked obsolete and serves as a temporary means to transition away from `LinalgTilingOptions`-based tiling. LinalgTilingOptions supports too many options that have been orthogonalized with the use of the transform dialect. Additionally, the revision introduces a `transform.structured.tile_to_scf_for` structured transform operation that is needed to properly tile `tensor.pad` via the TilingInterface. Uses of `transform.structured.tile` will be deprecated and replaced by this new op. This will achieve the deprecation of `linalg::tileLinalgOp`. Context: https://discourse.llvm.org/t/psa-retire-tileandfuselinalgops-method/63850 In the process of transitioning, tests that were performing tile and distribute on tensors are retired: transformations should be orthogonalized better in the future. In particular, tiling to specific loop types and tileAndDistribute behavior are not available via the transform ops. The behavior is still available as part of the `tileWithLinalgTilingOptions` method to allow downstream clients to transition without breakages but is meant to be retired soon. As more tests are ported to the transform dialect, it became necessary to introduce a test-transform-dialect-erase-schedule-pass to discard the transform specification once applied so that e2e lowering and execution is possible. Lastly, a number of redundant tests that were testing composition of patterns are retired as they are available with a better mechanism via the transform dialect. Differential Revision: https://reviews.llvm.org/D135573	2022-10-11 02:42:56 -07:00
Yuanqiang Liu	9f77909a5e	[mlir][shape] add outline-shape-computation pass Add outline-shape-computation pass. This pass his pass outlines the shape computation part in high level IR by adding shape.func and populate corresponding mapping information into ShapeMappingAnalysis. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D131810	2022-10-02 20:24:49 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Jakub Kuderski	242d558658	[mlir][arith] Add test pass for wide integer emulation The new test pass allows for running wide integer emulation conversion within specified functions only. I intend to use it in integration tests in a way that allows me print both original and emulated results in the same format, or even compare both results at runtime and print on mismatch only. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D134120	2022-09-20 11:22:28 -04:00
Mathieu Fehr	ba8424a251	[mlir] Add Dynamic Dialects Dynamic dialects are dialects that can be defined at runtime. Dynamic dialects are extensible by new operations, types, and attributes at runtime. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125201	2022-09-19 09:58:18 -07:00
Matthias Springer	31fbdab376	[mlir][transforms] Add topological sort analysis This change add a helper function for computing a topological sorting of a list of ops. E.g. this can be useful in transforms where a subset of ops should be cloned without dominance errors. The analysis reuses the existing implementation in TopologicalSortUtils.cpp. Differential Revision: https://reviews.llvm.org/D131669	2022-08-15 21:09:18 +02:00
Manish Gupta	14d79afeae	[mlir][NVGPU] nvgpu.mmasync on F32 through TF32 Adds optional attribute to support tensor cores on F32 datatype by lowering to `mma.sync` with TF32 operands. Since, TF32 is not a native datatype in LLVM we are adding `tf32Enabled` as an attribute to allow the IR to be aware of `MmaSyncOp` datatype. Additionally, this patch adds placeholders for nvgpu-to-nvgpu transformation targeting higher precision tf32x3. For mma.sync on f32 input using tensor cores there are two possibilites: (a) tf32 (1 `mma.sync` per warp-level matrix-multiply-accumulate) (b) tf32x3 (3 `mma.sync` per warp-level matrix-multiply-accumulate) Typically, tf32 tensor core acceleration comes at a cost of accuracy from missing precision bits. While f32 has 23 precision bits, tf32 has only 10 precision bits. tf32x3 aims to recover the precision bits by splitting each operand into two tf32 values and issue three `mma.sync` tensor core operations. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D130294	2022-08-01 23:23:27 +00:00
srishti-cb	b508c5649f	[MLIR] Add a utility to sort the operands of commutative ops Added a commutativity utility pattern and a function to populate it. The pattern sorts the operands of an op in ascending order of the "key" associated with each operand iff the op is commutative. This sorting is stable. The function is intended to be used inside passes to simplify the matching of commutative operations. After the application of the above-mentioned pattern, since the commutative operands now have a deterministic order in which they occur in an op, the matching of large DAGs becomes much simpler, i.e., requires much less number of checks to be written by a user in her/his pattern matching function. The "key" associated with an operand is the list of the "AncestorKeys" associated with the ancestors of this operand, in a breadth-first order. The operand of any op is produced by a set of ops and block arguments. Each of these ops and block arguments is called an "ancestor" of this operand. Now, the "AncestorKey" associated with: 1. A block argument is `{type: BLOCK_ARGUMENT, opName: ""}`. 2. A non-constant-like op, for example, `arith.addi`, is `{type: NON_CONSTANT_OP, opName: "arith.addi"}`. 3. A constant-like op, for example, `arith.constant`, is `{type: CONSTANT_OP, opName: "arith.constant"}`. So, if an operand, say `A`, was produced as follows: ``` `<block argument>` `<block argument>` \ / \ / `arith.subi` `arith.constant` \ / `arith.addi` \| returns `A` ``` Then, the block arguments and operations present in the backward slice of `A`, in the breadth-first order are: `arith.addi`, `arith.subi`, `arith.constant`, `<block argument>`, and `<block argument>`. Thus, the "key" associated with operand `A` is: ``` { {type: NON_CONSTANT_OP, opName: "arith.addi"}, {type: NON_CONSTANT_OP, opName: "arith.subi"}, {type: CONSTANT_OP, opName: "arith.constant"}, {type: BLOCK_ARGUMENT, opName: ""}, {type: BLOCK_ARGUMENT, opName: ""} } ``` Now, if "keyA" is the key associated with operand `A` and "keyB" is the key associated with operand `B`, then: "keyA" < "keyB" iff: 1. In the first unequal pair of corresponding AncestorKeys, the AncestorKey in operand `A` is smaller, or, 2. Both the AncestorKeys in every pair are the same and the size of operand `A`'s "key" is smaller. AncestorKeys of type `BLOCK_ARGUMENT` are considered the smallest, those of type `CONSTANT_OP`, the largest, and `NON_CONSTANT_OP` types come in between. Within the types `NON_CONSTANT_OP` and `CONSTANT_OP`, the smaller ones are the ones with smaller op names (lexicographically). --- Some examples of such a sorting: Assume that the sorting is being applied to `foo.commutative`, which is a commutative op. Example 1: > %1 = foo.const 0 > %2 = foo.mul <block argument>, <block argument> > %3 = foo.commutative %1, %2 Here, 1. The key associated with %1 is: ``` { {CONSTANT_OP, "foo.const"} } ``` 2. The key associated with %2 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` The key of %2 < the key of %1 Thus, the sorted `foo.commutative` is: > %3 = foo.commutative %2, %1 Example 2: > %1 = foo.const 0 > %2 = foo.mul <block argument>, <block argument> > %3 = foo.mul %2, %1 > %4 = foo.add %2, %1 > %5 = foo.commutative %1, %2, %3, %4 Here, 1. The key associated with %1 is: ``` { {CONSTANT_OP, "foo.const"} } ``` 2. The key associated with %2 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {BLOCK_ARGUMENT, ""} } ``` 3. The key associated with %3 is: ``` { {NON_CONSTANT_OP, "foo.mul"}, {NON_CONSTANT_OP, "foo.mul"}, {CONSTANT_OP, "foo.const"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` 4. The key associated with %4 is: ``` { {NON_CONSTANT_OP, "foo.add"}, {NON_CONSTANT_OP, "foo.mul"}, {CONSTANT_OP, "foo.const"}, {BLOCK_ARGUMENT, ""}, {BLOCK_ARGUMENT, ""} } ``` Thus, the sorted `foo.commutative` is: > %5 = foo.commutative %4, %3, %2, %1 Signed-off-by: Srishti Srivastava <srishti.srivastava@polymagelabs.com> Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124750	2022-07-30 19:25:18 -04:00
Mahesh Ravishankar	485190df95	[mlir][Linalg] Deprecate `tileAndFuseLinalgOps` method and associated patterns. The `tileAndFuseLinalgOps` is a legacy approach for tiling + fusion of Linalg operations. Since it was also intended to work on operations with buffer operands, this method had fairly complex logic to make sure tile and fuse was correct even with side-effecting linalg ops. While complex, it still wasnt robust enough. This patch deprecates this method and thereby deprecating the tiling + fusion method for ops with buffer semantics. Note that the core transformation to do fusion of a producer with a tiled consumer still exists. The deprecation here only removes methods that auto-magically tried to tile and fuse correctly in presence of side-effects. The `tileAndFuseLinalgOps` also works with operations with tensor semantics. There are at least two other ways the same functionality exists. 1) The `tileConsumerAndFuseProducers` method. This does a similar transformation, but using a slightly different logic to automatically figure out the legal tile + fuse code. Note that this is also to be deprecated soon. 2) The prefered way uses the `TilingInterface` for tile + fuse, and relies on the caller to set the tiling options correctly to ensure that the generated code is correct. As proof that (2) is equivalent to the functionality provided by `tileAndFuseLinalgOps`, relevant tests have been moved to use the interface, where the test driver sets the tile sizes appropriately to generate the expected code. Differential Revision: https://reviews.llvm.org/D129901	2022-07-21 05:05:06 +00:00
Mahesh Ravishankar	3139cc766c	[mlir][Linalg] Add a pattern to decompose `linalg.generic` ops. This patch adds a pattern to decompose a `linalg.generic` operations that - has only parallel iterator types - has more than 2 statements (including the yield) into multiple `linalg.generic` operation such that each operation has a single statement and a yield. The pattern added here just splits the matching `linalg.generic` into two `linalg.generic`s, one containing the first statement, and the other containing the remaining. The same pattern can be applied repeatedly on the second op to ultimately fully decompose the generic op. Differential Revision: https://reviews.llvm.org/D129704	2022-07-15 23:01:18 +00:00
Nicolas Vasilache	cd6e02eebc	[mlir][Linalg] Retire TestLinalgCodegenStrategy pass. This pass tests patterns that are already tested elsewhere by applying them in a semi-targeted fashion using anchor function and op names. From now on, targeted tests should use the transform dialect interpreter. Differential Revision: https://reviews.llvm.org/D129627	2022-07-13 04:20:42 -07:00
Mogball	c20a581a8d	[mlir] Delete ForwardDataFlowAnalysis With SCCP and integer range analysis ported to the new framework, this old framework is redundant. Delete it. Depends on D128866 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D128867	2022-07-07 21:08:27 -07:00
Mogball	d80c271c8a	[mlir] An implementation of dense data-flow analysis This patch introduces an implementation of dense data-flow analysis. Dense data-flow analysis attaches a lattice before and after the execution of every operation. The lattice state is propagated across operations by a user-defined transfer function. The state is joined across control-flow and callgraph edges. Thge patch provides an example pass that uses both a dense and a sparse analysis together. Depends on D127139 Reviewed By: rriddle, phisiart Differential Revision: https://reviews.llvm.org/D127173	2022-07-07 15:12:46 -07:00
Mogball	c095afcba6	[mlir] Add Dead Code Analysis This patch implements the analysis state classes needed for sparse data-flow analysis and implements a dead-code analysis using those states to determine liveness of blocks, control-flow edges, region predecessors, and function callsites. Depends on D126751 Reviewed By: rriddle, phisiart Differential Revision: https://reviews.llvm.org/D127064	2022-06-30 13:51:25 -07:00
Mogball	ead75d9434	(Reland)[mlir] Add a generic data-flow analysis framework Removes one element of the pointer union to make it work on 32-bit systems. This patch introduces a generic data-flow analysis framework to MLIR. The framework implements a fixed-point iteration algorithm and a dependency graph between lattice states and analysis. Lattice states and points are fully extensible to support highly-customizable analyses. Reviewed By: phisiart, rriddle Differential Revision: https://reviews.llvm.org/D126751	2022-06-14 21:33:05 +00:00
Frederik Gossen	a6fa12ab3b	Revert "[mlir] Add a generic data-flow analysis framework" This reverts commit `9dea117283`. The PointerUnion assumes 3 available bits, which is not the case on 32-bit machines.	2022-06-14 17:14:27 -04:00
Mogball	9dea117283	[mlir] Add a generic data-flow analysis framework This patch introduces a generic data-flow analysis framework to MLIR. The framework implements a fixed-point iteration algorithm and a dependency graph between lattice states and analysis. Lattice states and points are fully extensible to support highly-customizable analyses. Reviewed By: phisiart, rriddle Differential Revision: https://reviews.llvm.org/D126751	2022-06-14 16:54:15 +00:00
Mahesh Ravishankar	cf6a7c1947	[mlir][TilingInterface] Add pattern to tile using TilingInterface and implement TilingInterface for Linalg ops. This patch adds support for tiling operations that implement the TilingInterface. - It separates the loop constructs that are used to iterate over tile from the implementation of the tiling itself. For example, the use of destructive updates is more related to use of scf.for for iterating over tiles that are tensors. - To test the transformation, TilingInterface is implemented for LinalgOps. The separation of the looping constructs used from the implementation of tile code generation greatly simplifies the latter. - The implementation of TilingInterface for LinalgOp is kept as an external model for now till this approach can be fully flushed out to replace the existing tiling + fusion approaches in Linalg. Differential Revision: https://reviews.llvm.org/D127133	2022-06-13 20:37:44 +00:00
Krzysztof Drewniak	95aff23e29	Re-land "[mlir] Add integer range inference analysis"" This reverts commit `4e5ce2056e`. This relands commit `1350c9887d`. Reinstates the range analysis with the build issue fixed. Differential Revision: https://reviews.llvm.org/D126926	2022-06-03 17:13:48 +00:00
Mehdi Amini	4e5ce2056e	Revert "[mlir] Add integer range inference analysis" This reverts commit `1350c9887d`. Shared library build is broken with undefined references.	2022-06-02 21:24:06 +00:00
Krzysztof Drewniak	1350c9887d	[mlir] Add integer range inference analysis This commit defines a dataflow analysis for integer ranges, which uses a newly-added InferIntRangeInterface to compute the lower and upper bounds on the results of an operation from the bounds on the arguments. The range inference is a flow-insensitive dataflow analysis that can be used to simplify code, such as by statically identifying bounds checks that cannot fail in order to eliminate them. The InferIntRangeInterface has one method, inferResultRanges(), which takes a vector of inferred ranges for each argument to an op implementing the interface and a callback allowing the implementation to define the ranges for each result. These ranges are stored as ConstantIntRanges, which hold the lower and upper bounds for a value. Bounds are tracked separately for the signed and unsigned interpretations of a value, which ensures that the impact of arithmetic overflows is correctly tracked during the analysis. The commit also adds a -test-int-range-inference pass to test the analysis until it is integrated into SCCP or otherwise exposed. Finally, this commit fixes some bugs relating to the handling of region iteration arguments and terminators in the data flow analysis framework. Depends on D124020 Depends on D124021 Reviewed By: rriddle, Mogball Differential Revision: https://reviews.llvm.org/D124023	2022-06-02 20:24:11 +00:00
Rob Suderman	f3bdb56d61	[mlir][math] Add math.ctlz expansion to control flow + arith operations Ctlz is an intrinsic in LLVM but does not have equivalent operations in SPIR-V. Including a decomposition gives an alternative path for these platforms. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D126261	2022-06-01 11:45:04 -07:00
Christian Sigg	bcf3d52486	[MLIR][GPU] Expose GpuParallelLoopMapping as non-test pass. Reviewed By: bondhugula, herhut Differential Revision: https://reviews.llvm.org/D126199	2022-05-30 09:20:48 +02:00
River Riddle	597fde54a8	[mlir][PDLL] Add support for generating PDL patterns from PDLL at build time This essentially sets up mlir-pdll to function in a similar manner to mlir-tblgen. Aside from the boilerplate of configuring CMake and setting up a basic initial test, two new options are added to mlir-pdll to mirror options provided by tblgen: * -d This option generates a dependency file (i.e. a set of build time dependencies) while processing the input file. * --write-if-changed This option only writes to the output file if the data would have changed, which for the build system prevents unnecesarry rebuilds if the file was touched but not actually changed. Differential Revision: https://reviews.llvm.org/D124075	2022-04-26 18:33:16 -07:00
Krzysztof Drewniak	d35f7f254f	[mlir] Allow data flow analysis of non-control flow branch arguments This commit adds the visitNonControlFlowArguments method to DataFlowAnalysis, allowing analyses to provide lattice values for the arguments to a RegionSuccessor block that aren't directly tied to an op's inputs. For example, integer range interface can use this method to infer bounds for the step values in loops. This method has a default implementation that keeps the old behavior of assigning a pessimistic fixedpoint state to all such arguments. Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D124021	2022-04-25 20:19:34 +00:00
Markus Böck	850b2c6b3c	[mlir] Fix `Region`s `takeBody` method if the region is not empty The current implementation of takeBody first clears the Region, before then taking ownership of the blocks of the other regions. The issue here however, is that when clearing the region, it does not take into account references of operations to each other. In particular, blocks are deleted from front to back, and operations within a block are very likely to be deleted despite still having uses, causing an assertion to trigger [0]. This patch fixes that issue by simply calling dropAllReferences()before clearing the blocks. [0] `9a8bb4bc63/mlir/lib/IR/Operation.cpp (L154)` Differential Revision: https://reviews.llvm.org/D123913	2022-04-21 15:32:59 +02:00
William S. Moses	ed499ddcda	[MLIR] Fix operation clone Operation clone is currently faulty. Suppose you have a block like as follows: ``` (%x0 : i32) { %x1 = f(%x0) return %x1 } ``` The test case we have is that we want to "unroll" this, in which we want to change this to compute `f(f(x0))` instead of just `f(x0)`. We do so by making a copy of the body at the end of the block and set the uses of the argument in the copy operations with the value returned from the original block. This is implemented as follows: 1) map to the block arguments to the returned value (`map[x0] = x1`). 2) clone the body Now for this small example, this works as intended and we get the following. ``` (%x0 : i32) { %x1 = f(%x0) %x2 = f(%x1) return %x2 } ``` This is because the current logic to clone `x1 = f(x0)` first looks up the arguments in the map (which finds `x0` maps to `x1` from the initialization), and then sets the map of the result to the cloned result (`map[x1] = x2`). However, this fails if `x0` is not an argument to the op, but instead used inside the region, like below. ``` (%x0 : i32) { %x1 = f() { yield %x0 } return %x1 } ``` This is because cloning an op currently first looks up the args (none), sets the map of the result (`map[%x1] = %x2`), and then clones the regions. This results in the following, which is clearly illegal: ``` (%x0 : i32) { %x1 = f() { yield %x0 } %x2 = f() { yield %x2 } return %x2 } ``` Diving deeper, this is partially due to the ordering (how this PR fixes it), as well as how region cloning works. Namely it will first clone with the mapping, and then it will remap all operands. Since the ordering above now has a map of `x0 -> x1` and `x1 -> x2`, we end up with the incorrect behavior here. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D122531	2022-04-15 13:09:13 -04:00
Alex Zinenko	d064c4801c	[mlir] Introduce Transform dialect This dialect provides operations that can be used to control transformation of the IR using a different portion of the IR. It refers to the IR being transformed as payload IR, and to the IR guiding the transformation as transform IR. The main use case for this dialect is orchestrating fine-grain transformations on individual operations or sets thereof. For example, it may involve finding loop-like operations with specific properties (e.g., large size) in the payload IR, applying loop tiling to those and only those operations, and then applying loop unrolling to the inner loops produced by the previous transformations. As such, it is not intended as a replacement for the pass infrastructure, nor for the pattern rewriting infrastructure. In the most common case, the transform IR will be processed and applied to payload IR by a pass. Transformations expressed by the transform dialect may be implemented using the pattern infrastructure or any other relevant MLIR component. This dialect is designed to be extensible, that is, clients of this dialect are allowed to inject additional operations into this dialect using the newly introduced in this patch `TransformDialectExtension` mechanism. This allows the dialect to avoid a dependency on the implementation of the transformation as well as to avoid introducing dialect-specific transform dialects. See https://discourse.llvm.org/t/rfc-interfaces-and-dialects-for-precise-ir-transformation-control/60927. Reviewed By: nicolasvasilache, Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123135	2022-04-14 13:48:45 +02:00
Mogball	b73f1d2c5d	[mlir][cf-sink] Accept a callback for sinking operations (This was a TODO from the initial patch). The control-flow sink utility accepts a callback that is used to sink an operation into a region. The `moveIntoRegion` is called on the same operation and region that return true for `shouldMoveIntoRegion`. The callback must preserve the dominance of the operation within the region. In the default control-flow sink implementation, this is moving the operation to the start of the entry block. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D122445	2022-03-28 19:31:23 +00:00
Lei Zhang	86fe16b67d	[mlir][spirv] NFC: Move GLSL canonicalization pass to Transforms/ This is a pass that can be used by downstream consumers directly to avoid the boilerplate to wrap around the `populate*Patterns`. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D121222	2022-03-08 13:49:14 -05:00
Sergei Grechanik	27df7158fe	[mlir] Fix dumping invalid ops This patch fixes the crash when printing some ops (like affine.for and scf.for) when they are dumped in invalid state, e.g. during pattern application. Now the AsmState constructor verifies the operation first and switches to generic operation printing when the verification fails. Also operations are now printed in generic form when emitting diagnostics and the severity level is Error. Reviewed By: rriddle, mehdi_amini Differential Revision: https://reviews.llvm.org/D117834	2022-03-07 08:32:31 -08:00
River Riddle	6b7d211a1b	[mlir][NFC] Move MlirOptMain to the Tools/ directory MlirOptMain is currently awkwardly shoved into mlir/Support. This commit moves it to the Tools/ directory, which is intended for libraries used to implement tools. Differential Revision: https://reviews.llvm.org/D121025	2022-03-07 01:05:38 -08:00
Alexander Belyaev	1a829d2d06	[mlir] Purge linalg.tiled_loop. Differential Revision: https://reviews.llvm.org/D119415	2022-02-28 09:05:18 +01:00
Thomas Raoux	b1357fe618	[mlir][memref] Add transformation to do loop multi-buffering This transformation is useful to break dependency between consecutive loop iterations by increasing the size of a temporary buffer. This is usually combined with heavy software pipelining. Differential Revision: https://reviews.llvm.org/D119406	2022-02-24 09:41:21 -08:00
Matthias Springer	d2dacde5d8	[mlir][bufferize][NFC] Rename `comprehensive-function-bufferize` to `one-shot-bufferize` The related functionality is moved over to the bufferization dialect. Test cases are cleaned up a bit. Differential Revision: https://reviews.llvm.org/D120191	2022-02-22 17:19:20 +09:00
Lei Zhang	e027c00821	[mlir][tensor] Add a pattern to split tensor.pad ops This commit adds a pattern to wrap a tensor.pad op with an scf.if op to separate the cases where we don't need padding (all pad sizes are actually zeros) and where we indeed need padding. This pattern is meant to handle padding inside tiled loops. Under such cases the padding sizes typically depend on the loop induction variables. Splitting them would allow treating perfect tiles and edge tiles separately. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D117018	2022-02-16 13:43:57 -05:00
Mahesh Ravishankar	2abd7f13bc	[mlir][Linalg] NFC: Combine elementwise fusion test passes. There are a few different test passes that check elementwise fusion in Linalg. Consolidate them to a single pass controlled by different pass options (in keeping with how `TestLinalgTransforms` exists).	2022-02-08 18:08:37 +00:00
Mahesh Ravishankar	7568f7101f	Revert "[mlir][Linalg] NFC: Combine elementwise fusion test passes." This reverts commit `d730336411`.	2022-02-07 22:51:29 +00:00
Mahesh Ravishankar	d730336411	[mlir][Linalg] NFC: Combine elementwise fusion test passes. There are a few different test passes that check elementwise fusion in Linalg. Consolidate them to a single pass controlled by different pass options (in keeping with how `TestLinalgTransforms` exists).	2022-02-07 22:46:57 +00:00
Nicolas Vasilache	cc0d208805	[mlir][Linalg] Drop deprecated convolution vectorization patterns Differential revision: https://reviews.llvm.org/D117326	2022-01-18 09:26:50 +00:00
Nicolas Vasilache	f40a579bea	Revert "[mlir][Linalg] NFC - Drop vectorization reliance on ConvolutionOpInterface" This reverts commit `c8f5735301`. The integration tests are broken.	2022-01-17 19:38:07 +00:00
Nicolas Vasilache	c8f5735301	[mlir][Linalg] NFC - Drop vectorization reliance on ConvolutionOpInterface Differential Revision: https://reviews.llvm.org/D117323	2022-01-17 17:01:36 +00:00
Eugene Zhulenev	69bc334be5	[mlir] Remove getNumberOfExecutions from RegionBranchOpInterface `getNumRegionInvocations` was originally added for the async reference counting, but turned out to be not useful, and currently is not used anywhere (couldn't find any uses in public github repos). Removing dead code. Reviewed By: Mogball, mehdi_amini Differential Revision: https://reviews.llvm.org/D117347	2022-01-14 13:15:27 -08:00
Rahul Joshi	8067ced144	[MLIR] Introduce generic visitors. - Generic visitors invoke operation callbacks before/in-between/after visiting the regions attached to an operation and use a `WalkStage` to indicate which regions have been visited. - This can be useful for cases where we need to visit the operation in between visiting regions attached to the operation. Differential Revision: https://reviews.llvm.org/D116230	2022-01-14 09:15:27 -08:00

1 2 3 4 5

241 Commits