clang-p2996

Author	SHA1	Message	Date
Matthias Springer	5e4a44380e	[mlir][Interfaces][NFC] `ValueBoundsConstraintSet`: Pass stop condition in the constructor (#86099 ) This commit changes the API of `ValueBoundsConstraintSet`: the stop condition is now passed to the constructor instead of `processWorklist`. That makes it easier to add items to the worklist multiple times and process them in a consistent manner. The current `ValueBoundsConstraintSet` is passed as a reference to the stop function, so that the stop function can be defined before the the `ValueBoundsConstraintSet` is constructed. This change is in preparation of adding support for branches.	2024-04-04 17:05:47 +09:00
Fehr Mathieu	e03f16f9fd	[mlir] [arith] Remove buggy illegal operation in --arith-unsigned-when-equivalent (#87298 ) `CeilDivUIOp` seemed to have been added by mistake to the list of dynamically illegal operations in `arith-unsigned-when-equivalent`. The only illegal operations should be the signed operations that can be converted to their unsigned counterpart.	2024-04-02 14:47:27 +01:00
Victor Perez	8827ff92b9	[MLIR][Arith] Add rounding mode attribute to `truncf` (#86152 ) Add rounding mode attribute to `arith`. This attribute can be used in different FP `arith` operations to control rounding mode. Rounding modes correspond to IEEE 754-specified rounding modes. Use in `arith.truncf` folding. As this is not supported in dialects other than LLVM, conversion should fail for now in case this attribute is present. --------- Signed-off-by: Victor Perez <victor.perez@codeplay.com>	2024-04-01 11:57:14 +02:00
Matthias Springer	a45e58af1b	[mlir][bufferization] Add `BufferViewFlowOpInterface` (#78718 ) This commit adds the `BufferViewFlowOpInterface` to the bufferization dialect. This interface can be implemented by ops that operate on buffers to indicate that a buffer op result and/or region entry block argument may be the same buffer as a buffer operand (or a view thereof). This interface is queried by the `BufferViewFlowAnalysis`. The new interface has two interface methods: * `populateDependencies`: Implementations use the provided callback to declare dependencies between operands and op results/region entry block arguments. E.g., for `%r = arith.select %c, %m1, %m2 : memref<5xf32>`, the interface implementation should declare two dependencies: %m1 -> %r and %m2 -> %r. * `mayBeTerminalBuffer`: An SSA value is a terminal buffer if the buffer view flow analysis stops at the specified value. E.g., because the value is a newly allocated buffer or because no further information is available about the origin of the buffer. Ops that implement the `RegionBranchOpInterface` or `BranchOpInterface` do not have to implement the `BufferViewFlowOpInterface`. The buffer dependencies can be inferred from those two interfaces. This commit makes the `BufferViewFlowAnalysis` more accurate. For unknown ops, it conservatively used to declare all combinations of operands and op results/region entry block arguments as dependencies (false positives). This is no longer the case. While the analysis is still a "maybe" analysis with false positives (e.g., when analyzing ops such as `arith.select` or `scf.if` where the taken branch is not known at compile time), results and region entry block arguments of unknown ops are now marked as terminal buffers. This commit addresses a TODO in `BufferViewFlowAnalysis.cpp`: ``` // TODO: We should have an op interface instead of a hard-coded list of // interfaces/ops. ``` It is no longer needed to hard-code ops.	2024-03-24 12:48:19 +09:00
long.chen	631e54aa1a	[mlir][arith] fix wrong floordivsi fold (#83248 ) Fixs https://github.com/llvm/llvm-project/issues/83079	2024-03-22 23:52:47 +08:00
Benoit Jacob	9c7cde64e6	Fix the lowering of `arith.truncf : f32 to bf16`. (#83180 ) This lowering was not correctly handling the case where saturation of the mantissa results in an increase of the exponent value. The new code borrows, with credit, the idea from `e1502c0cdb/c10/util/BFloat16.h (L60-L79)` and adds comments to explain the magic trick going on here and why it's correct. Hat tip to its original author, whom I believe to be @Maratyszcza. A testcase was also requiring a tie to be broken upwards in a case where "to nearest-even" required going downward. The fact that it used to pass suggests that there was another bug in the old code.	2024-02-28 13:56:18 -05:00
ian Bearman	067d2779fc	[MLIR] Setting MemorySpace During Bufferization (#78484 ) Collection of changes with the goal of being able to convert `encoding` to `memorySpace` during bufferization - new API for encoder to allow implementation to select destination memory space - update existing bufferization implementations to support the new interface	2024-02-08 16:59:37 +01:00
Krzysztof Drewniak	750e90e440	[mlir][ArithToAMDGPU] Add option for saturating truncation to fp8 (#74153 ) Many machine-learning applications (and most software written at AMD) expect the operation that truncates floats to 8-bit floats to be saturatinng. That is, they expect `truncf 256.0 : f32 to f8E4M3FNUZ` to yield `240.0`, not `NaN`, and similarly for negative numbers. However, the underlying hardware instruction that can be used for this truncation implements overflow-to-NaN semantics. To enable handling this usecase, we add the saturate-fp8-truncf option to ArithToAMDGPU (off by default), which causes the requisite clamping code to be emitted. Said clamping code ensures that Inf and NaN are passed through exactly (and thus trancate to NaN). Per review feedback, this commit efactors createScalarOrSplatConstant() to the Arith dialect utilities and uses it in this code. It also fixes naming of existing patterns and switches from vector.extractelement/insertelement to vector.extract/insert.	2024-01-23 16:52:21 -06:00
Mehdi Amini	e5e08955af	Apply clang-tidy fixes for performance-move-const-arg in IntRangeOptimizations.cpp (NFC)	2024-01-15 20:59:12 -08:00
Han-Chung Wang	b33a131c82	[mlir][arith] Add support for expanding arith.maxnumf/minnumf ops. (#75989 ) The maxnum/minnum semantics can be found at https://llvm.org/docs/LangRef.html#llvm-minnum-intrinsic. The revision also updates function names in lit tests to match op name. Take arith.maxnumf as example: ``` func.func @maxnumf(%lhs: f32, %rhs: f32) -> f32 { %result = arith.maxnumf %lhs, %rhs : f32 return %result : f32 } ``` will be expanded to ``` func.func @maxnumf(%lhs: f32, %rhs: f32) -> f32 { %0 = arith.cmpf ugt, %lhs, %rhs : f32 %1 = arith.select %0, %lhs, %rhs : f32 %2 = arith.cmpf uno, %lhs, %lhs : f32 %3 = arith.select %2, %rhs, %1 : f32 return %3 : f32 } ``` Case 1: Both LHS and RHS are not NaN; LHS > RHS In this case, `%1` is LHS. `%3` and `%1` have the same value, so `%3` is LHS. Case 2: LHS is NaN and RHS is not NaN In this case, `%2` is true, so `%3` is always RHS. Case 3: LHS is not NaN and RHS is NaN In this case, `%0` is true and `%1` is LHS. `%2` is false, so `%3` and `%1` have the same value, which is LHS. Case 4: Both LHS and RHS are NaN: `%1` and RHS are all NaN, so the result is still NaN.	2023-12-20 10:35:12 -08:00
Matthias Springer	32c3decb77	[mlir][vector] Modernize `vector.transpose` op (#72594 ) * Declare arguments/results with `let` statements. * Rename `transp` to `permutation`. * Change type of `transp` from `I64ArrayAttr` to `DenseI64ArrayAttr` (provides direct access to `ArrayRef<int64_t>` instead of `ArrayAttr`).	2023-11-20 11:25:35 +01:00
long.chen	1609f1c2a5	[mlir][affine][nfc] cleanup deprecated T.cast style functions (#71269 ) detail see the docment: https://mlir.llvm.org/deprecation/ Not all changes are made manually, most of them are made through a clang tool I wrote https://github.com/lipracer/cpp-refactor.	2023-11-14 13:01:19 +08:00
Mehdi Amini	b97aaa72d9	Remove `let construct =` from ArithExpandOpsPass definition (NFC) Note that the `Pass` suffix is added in tablegen, and as a side effect the options are renamed from `ArithExpandOpsOptions` to `ArithExpandOpsPassOptions`.	2023-10-02 15:54:22 -07:00
Mehdi Amini	b1c10dfd72	Fixup on ArithBufferizePass: add the Pass suffix in TableGen to ensure consitency of the generated code	2023-10-02 15:50:41 -07:00
Mehdi Amini	c1c56ae49e	Remove `let constructor =` from ArithBufferizePass and rely on TableGen to generate the glue (NFC)	2023-10-02 15:41:16 -07:00
Martin Erhart	6a651c7f44	Revert "[mlir][bufferization] Don't clone on unknown ownership and verify function boundary ABI (#66626 )" This reverts commit `aa9eb47da2`. It introduced a double free in a test case. Reverting to have some time for fixing this and relanding later.	2023-09-28 09:14:46 +00:00
Martin Erhart	aa9eb47da2	[mlir][bufferization] Don't clone on unknown ownership and verify function boundary ABI (#66626 ) Inserting clones requires a lot of assumptions to hold on the input IR, e.g., all writes to a buffer need to dominate all reads. This is not guaranteed by one-shot bufferization and isn't easy to verify, thus it could quickly lead to incorrect results that are hard to debug. This commit changes the mechanism of how an ownership indicator is materialized when there is not already a unique ownership present. Additionally, we don't create copies of returned memrefs anymore when we don't have ownership. Instead, we insert assert operations to make sure we have ownership at runtime, or otherwise report to the user that correctness could not be guaranteed.	2023-09-28 10:45:35 +02:00
Diego Caballero	98f6289a34	[mlir][Vector] Add support for Value indices to vector.extract/insert `vector.extract/insert` ops only support constant indices. This PR is extending them so that arbitrary values can be used instead. This work is part of the RFC: https://discourse.llvm.org/t/rfc-psa-remove-vector-extractelement-and-vector-insertelement-ops-in-favor-of-vector-extract-and-vector-insert-ops Differential Revision: https://reviews.llvm.org/D155034	2023-09-22 00:39:32 +00:00
Martin Erhart	942ce31985	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic (#66350 ) Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation.	2023-09-14 14:34:04 +02:00
Martin Erhart	9782232ec7	Revert "[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic" This reverts commit `89117f1807`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	99cd1e7004	Revert "[mlir][bufferization] Remove unused function" This reverts commit `cb5fe6ce75`. This caused problems in downstream projects. We are reverting to give them more time for integration.	2023-09-13 13:53:47 +00:00
Martin Erhart	cb5fe6ce75	[mlir][bufferization] Remove unused function	2023-09-13 09:30:24 +00:00
Martin Erhart	89117f1807	[mlir][bufferization] BufferDeallocationOpInterface: support custom ownership update logic Add a method to the BufferDeallocationOpInterface that allows operations to implement the interface and provide custom logic to compute the ownership indicators of values it defines. As a demonstrating example, this new method is implemented by the `arith.select` operation. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158828	2023-09-13 09:30:23 +00:00
Daniil Dudkin	8a6e54c9b3	[mlir][arith] Rename operations: `maxf` → `maximumf`, `minf` → `minimumf` (#65800 ) This patch is part of a larger initiative aimed at fixing floating-point `max` and `min` operations in MLIR: https://discourse.llvm.org/t/rfc-fix-floating-point-max-and-min-operations-in-mlir/72671. This commit addresses Task 1.2 of the mentioned RFC. By renaming these operations, we align their names with LLVM intrinsics that have corresponding semantics.	2023-09-11 22:02:19 -07:00
Mikhail Goncharov	0a0aff2d24	fix unused variable warnings in conditionals warning was updated in `92023b1509`	2023-08-30 19:09:27 +02:00
Matthias Springer	2257ebd33a	[mlir][arith][bufferize] Fix crash when bufferizing elementwise arith.select Elementwise arith.select are currently not supported. Emit an error message instead of crashing. This fixes #61707. Differential Revision: https://reviews.llvm.org/D158617	2023-08-23 17:40:53 +02:00
Krzysztof Drewniak	51b65d0895	[mlir][AMDGPU] Improve BF16 handling through AMDGPU compilation Many previous sets of AMDGPU dialect code have been incorrect in the presence of the bf16 type (when lowered to LLVM's bfloat) as they were developed in a setting that run a custom bf16-to-i16 pass before LLVM lowering. An overall effect of this patch is that you should run --arith-emulate-unsupported-floats="source-types=bf16 target-type=f32" on your GPU module before calling --convert-gpu-to-rocdl if your code performs bf16 arithmetic. While LLVM now supports software bfloat, initial experiments showed that using this support on AMDGPU inserted a large number of conversions around loads and stores which had substantial performance imparts. Furthermore, all of the native AMDGPU operations on bf16 types (like the WMMA operations) operate on 16-bit integers instead of the bfloat type. First, we make the following changes to preserve compatibility once the LLVM bfloat type is reenabled. 1. The matrix multiplication operations (MFMA and WMMA) will bitcast bfloat vectors to i16 vectors. 2. Buffer loads and stores will operate on the relevant integer datatype and then cast to bfloat if needed. Second, we add type conversions to convert bf16 and vectors of it to equivalent i16 types. Third, we add the bfloat <-> f32 expansion patterns to the set of operations run before the main LLVM conversion so that MLIR's implementation of these conversion routines is used. Finally, we extend the "floats treated as integers" support in the LLVM exporter to handle types other than fp8. We also fix a bug in the unsupported floats emulation where it tried to operate on `arith.bitcast` due to an oversight. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D156361	2023-08-17 18:31:28 +00:00
Matthias Springer	878950b82c	[mlir][bufferization] Simplify `getBufferType` `getBufferType` computes the bufferized type of an SSA value without bufferizing any IR. This is useful for predicting the bufferized type of iter_args of a loop. To avoid endless recursion (e.g., in the case of "scf.for", the type of the iter_arg depends on the type of init_arg and the type of the yielded value; the type of the yielded value depends on the type of the iter_arg again), `fixedTypes` was used to fall back to "fixed" type. A simpler way is to maintain an "invocation stack". `getBufferType` implementations can then inspect the invocation stack to detect repetitive computations (typically when computing the bufferized type of a block argument). Also improve error messages in case of inconsistent memory spaces inside of a loop. Differential Revision: https://reviews.llvm.org/D158060	2023-08-16 15:02:07 +02:00
Matthias Springer	a02ad6c177	[mlir][bufferization] Generalize getAliasingOpResults to getAliasingValues This revision is needed to support bufferization of `cf.br`/`cf.cond_br`. It will also be useful for better analysis of loop ops. This revision generalizes `getAliasingOpResults` to `getAliasingValues`. An OpOperand can now not only alias with OpResults but also with BlockArguments. In the case of `cf.br` (will be added in a later revision): a `cf.br` operand will alias with the corresponding argument of the destination block. If an op does not implement the `BufferizableOpInterface`, the analysis in conservative. It previously assumed that an OpOperand may alias with each OpResult. It now assumes that an OpOperand may alias with each OpResult and each BlockArgument of the entry block. Differential Revision: https://reviews.llvm.org/D157957	2023-08-15 15:02:47 +02:00
Matthias Springer	ce254598b7	[mlir][Conversion] Store const type converter in ConversionPattern ConversionPatterns do not (and should not) modify the type converter that they are using. * Make `ConversionPattern::typeConverter` const. * Make member functions of the `LLVMTypeConverter` const. * Conversion patterns take a const type converter. * Various helper functions (that are called from patterns) now also take a const type converter. Differential Revision: https://reviews.llvm.org/D157601	2023-08-14 09:03:11 +02:00
Krzysztof Drewniak	3fe8ec7906	Revert "[mlir][Arith] Add arith.is_nan and arith.is_inf predicates" This reverts commit `7c349c3698`. Per discussion at https://reviews.llvm.org/rG7c349c369847dc2f1736efb9c90d03521cd44a90 and elsewhere, the lowering to LLVM defined here isn't what it should be and the fastmath flag usage isn't correct, so `arith.is_nan` and `arith.is_inf` cannot exist in their current form. It's unclear if those operations should be introduced in the future, since they make the dialect more complex and don't add any expressive power. Further discussion may be moved to an RFC (or I'll drop this patch). Differential Revision: https://reviews.llvm.org/D157543	2023-08-10 14:13:25 +00:00
Krzysztof Drewniak	7c349c3698	[mlir][Arith] Add arith.is_nan and arith.is_inf predicates Both LLVM and SPIR-V have some form of "is this float a NaN/Inf" operation (though LLVM's uses the rather opaque "is.fpclass" intrinsic), which is not exposed in MLIR. This has lead to awkward workarounds in -arith-expands-ops where a NaN test was performed by comparing an operation to itself. This commit resolves that issue. Reviewed By: dcaballe, kuhar Differential Revision: https://reviews.llvm.org/D156169	2023-08-02 14:42:25 +00:00
Jakub Kuderski	4a3899577a	[mlir][arith] Validate pass options in arith-int-narrowing Exist gracefully instead of triggering an assertions. Fixes: https://github.com/llvm/llvm-project/issues/64257 Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D156694	2023-07-31 11:32:01 -04:00
Matthias Springer	ba745eea40	[mlir][bufferization] Remove cleanup pipeline from bufferization pass To keep the pass simple, users should apply cleanup passes manually when necessary. In particular, `-cse -canonicalize` are often desireable to fold away self-copies that are created by the bufferization. This addresses a comment in D120191. Differential Revision: https://reviews.llvm.org/D155923	2023-07-21 12:11:25 +02:00
Krzysztof Drewniak	636f772871	[mlir][Arith] Make previous load-bearing assert into a real error When I landed the EmulateUnsupportedFloats, I'd negligently included an assert that needed to run for the pass to be correct. Previous emergency fix commits removed the assert. This commit re-adds the "can't happen" testing as an emitOpError() and aborting the rewrite, thus allowing it to function in no-assertions builds. Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D155088	2023-07-13 14:49:29 +00:00
Sterling Augustine	5f4d96ebef	Don't to real work inside an assertion. asserts get compiled to empty when built in opt mode, so that makes certain tests fail, such as emulate-unsupported-floats.mlir.test. This removes the assert altogether, which is also suboptimal, but I have reported to the original author.	2023-07-11 17:44:58 -07:00
Sterling Augustine	5671f02304	Fix unused variable warning.	2023-07-11 14:54:29 -07:00
Krzysztof Drewniak	10b56e0210	[mlir][Arith] Add pass for emulating unsupported float ops (#1079 ) To complement the bf16 expansion and truncation patterns added to ExpandOps, define a pass that replaces, for any arithmetic operation op, %y = arith.op %v0, %v1, ... : T with %e0 = arith.expf %v0 : T to U %e1 = arith.expf %v1 : T to U ... %y.exp = arith.op %e0, %e1, ... : U %y = arith.truncf %y.exp : U to T This allows for "emulating" floating-point operations not supported on a given target (such as bfloat operations or most arithmetic on 8-bit floats) by extending those types to supported ones, performing the arithmetic operation, and then truncating back to the original type (which ensures appropriate rounding behavior). The lowering of the extf and truncf ops introduced by this transformation should be handled by subsequent passes. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D154539	2023-07-11 20:32:35 +00:00
yzhang93	5a1cdcbd86	[mlir] Narrow bitwidth emulation for MemRef load This patch adds support for narrow bitwidth storage emulation. The goal is to support sub-byte type codegen for LLVM CPU. Specifically, a type converter is added to convert memref of narrow bitwidth (e.g., i4) into supported wider bitwidth (e.g., i8). Another focus of this patch is to populate the pattern for int4 memref.load. memref.store pattern should be added in a seperate patch. Reviewed By: hanchung, mravishankar Differential Revision: https://reviews.llvm.org/D151519	2023-06-26 14:18:30 -07:00
Tres Popp	68f58812e3	[mlir] Move casting calls from methods to function calls The MLIR classes Type/Attribute/Operation/Op/Value support cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast functionality in addition to defining methods with the same name. This change begins the migration of uses of the method to the corresponding function call as has been decided as more consistent. Note that there still exist classes that only define methods directly, such as AffineExpr, and this does not include work currently to support a functional cast/isa call. Context: - https://mlir.llvm.org/deprecation/ at "Use the free function variants for dyn_cast/cast/isa/…" - Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443 Implementation: This patch updates all remaining uses of the deprecated functionality in mlir/. This was done with clang-tidy as described below and further modifications to GPUBase.td and OpenMPOpsInterfaces.td. Steps are described per line, as comments are removed by git: 0. Retrieve the change from the following to build clang-tidy with an additional check: main...tpopp:llvm-project:tidy-cast-check 1. Build clang-tidy 2. Run clang-tidy over your entire codebase while disabling all checks and enabling the one relevant one. Run on all header files also. 3. Delete .inc files that were also modified, so the next build rebuilds them to a pure state. ``` ninja -C $BUILD_DIR clang-tidy run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-,misc-cast-functions'\ -header-filter=mlir/ mlir/ -fix rm -rf $BUILD_DIR/tools/mlir/*/.inc ``` Differential Revision: https://reviews.llvm.org/D151542	2023-05-26 10:29:55 +02:00
Tres Popp	5550c82189	[mlir] Move casting calls from methods to function calls The MLIR classes Type/Attribute/Operation/Op/Value support cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast functionality in addition to defining methods with the same name. This change begins the migration of uses of the method to the corresponding function call as has been decided as more consistent. Note that there still exist classes that only define methods directly, such as AffineExpr, and this does not include work currently to support a functional cast/isa call. Caveats include: - This clang-tidy script probably has more problems. - This only touches C++ code, so nothing that is being generated. Context: - https://mlir.llvm.org/deprecation/ at "Use the free function variants for dyn_cast/cast/isa/…" - Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443 Implementation: This first patch was created with the following steps. The intention is to only do automated changes at first, so I waste less time if it's reverted, and so the first mass change is more clear as an example to other teams that will need to follow similar steps. Steps are described per line, as comments are removed by git: 0. Retrieve the change from the following to build clang-tidy with an additional check: https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check 1. Build clang-tidy 2. Run clang-tidy over your entire codebase while disabling all checks and enabling the one relevant one. Run on all header files also. 3. Delete .inc files that were also modified, so the next build rebuilds them to a pure state. 4. Some changes have been deleted for the following reasons: - Some files had a variable also named cast - Some files had not included a header file that defines the cast functions - Some files are definitions of the classes that have the casting methods, so the code still refers to the method instead of the function without adding a prefix or removing the method declaration at the same time. ``` ninja -C $BUILD_DIR clang-tidy run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-,misc-cast-functions'\ -header-filter=mlir/ mlir/ -fix rm -rf $BUILD_DIR/tools/mlir/*/.inc git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\ mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\ mlir/lib/**/IR/\ mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\ mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\ mlir/test/lib/Dialect/Test/TestTypes.cpp\ mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\ mlir/test/lib/Dialect/Test/TestAttributes.cpp\ mlir/unittests/TableGen/EnumsGenTest.cpp\ mlir/test/python/lib/PythonTestCAPI.cpp\ mlir/include/mlir/IR/ ``` Differential Revision: https://reviews.llvm.org/D150123	2023-05-12 11:21:25 +02:00
Jakub Kuderski	fc19204918	[mlir][arith] Add narrowing patterns for index casts These rely on the `ValueBounds` interace and its utility function to compute constant bounds. This allows us to optimize `linalg.index` values cast to integer types. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149538	2023-05-03 13:55:02 -04:00
Jakub Kuderski	9701c5abd6	[mlir][arith] Add narrowing patterns for `maxi` and `mini` Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149583	2023-05-02 10:50:07 -04:00
Jakub Kuderski	46740dd02b	[mlir][arith] Add narrowing patterns for subi, divsi, and divui Each of these ops is compatible with only one extension kind and produces an extra result bit. I checked these transformation in Alive2: 1. subi + extsi: https://alive2.llvm.org/ce/z/ipmZZA 2. divsi + extsi: https://alive2.llvm.org/ce/z/fAcqUv 3. divui + extui: https://alive2.llvm.org/ce/z/QZJpFp Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149531	2023-05-02 10:44:29 -04:00
Jakub Kuderski	e2f7563d7c	[mlir][arith] Add narrowing patterns for `addi` and `muli` These two ops are handled in a very similar way -- the only difference in the number result bits produced. I checked these transformation with Alive2: 1. addi + sext: https://alive2.llvm.org/ce/z/3NSs9T 2. addi + zext: https://alive2.llvm.org/ce/z/t7XHOT 3. muli + sext: https://alive2.llvm.org/ce/z/-7sfW9 4. muli + zext: https://alive2.llvm.org/ce/z/h4yntF Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149530	2023-05-02 10:10:10 -04:00
Jakub Kuderski	7f3b0e5845	[mlir][arith] Add narrowing patterns to commute more vector ops This commutes the extension (`arith.extsi`, `arith.extui`) over the following vector ops: `vector.broadcast`, `vector.shape_cast`, `vector.transpose`, `vector.flat_transpose`. I focused on these as I saw them getting created by vector unroll patterns. Maybe except `vector.flat_transpose`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149534	2023-05-01 14:32:57 -04:00
Jakub Kuderski	3ff870881f	[mlir][arith] Add narrowing patterns for other insertion ops Allow to commute extension ops over `vector.insertelement` and `vector.insert_strided_slice`. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149509	2023-05-01 14:29:02 -04:00
Jakub Kuderski	49df068836	[mlir][arith][NFC] Simplify narrowing patterns with a wrapper type Add a new wraper type that represents either of `ExtSIOp` or `ExtUIOp`. This is to simplify the code by using a single type, so that we do not have to use templates or branching to handle both extension kinds. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149485	2023-05-01 13:31:31 -04:00
Jakub Kuderski	33017e5a3f	[mlir][arith] Add narrowing pattern to commute extension over insertion This enabled more optimization opportunities by moving zero/sign-extension closer to the use. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149282	2023-04-28 16:17:44 -04:00
Jakub Kuderski	09043a26c8	[mlir][arith] Add patterns to commute extension over vector extraction This moves zero/sign-extension ops closer to their use and exposes more narrowing optimization opportunities. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D149233	2023-04-28 13:48:50 -04:00

1 2

88 Commits