clang-p2996

Author	SHA1	Message	Date
Hyunsung Lee	3a3377579f	[mlir][math]Update `convertPowfOp` `ExpandPatterns.cpp` (#124402 ) The current implementation of `convertPowfOp` requires a calculation of `a * a` but, max\<fp16\> ~= 65,504, and if `a` is about 16, it will overflow so get INF in fp8 or fp16 easily. Remove support when `a < 0`. Overhead of handling negative value of `a` is large and easy to overflow; - related issue in iree: https://github.com/iree-org/iree/issues/15936	2025-01-28 22:56:43 -05:00
donald chen	45d83ae7df	[mlir] [math] Fix the precision issue of expand math (#120865 ) The convertFloorOp pattern incurs precision loss when floating-point numbers exceed the representable range of int64. This pattern should be removed. Fixes https://github.com/llvm/llvm-project/issues/119836	2025-01-24 14:46:41 +08:00
Jacques Pienaar	09dfc5713d	[mlir] Enable decoupling two kinds of greedy behavior. (#104649 ) The greedy rewriter is used in many different flows and it has a lot of convenience (work list management, debugging actions, tracing, etc). But it combines two kinds of greedy behavior 1) how ops are matched, 2) folding wherever it can. These are independent forms of greedy and leads to inefficiency. E.g., cases where one need to create different phases in lowering and is required to applying patterns in specific order split across different passes. Using the driver one ends up needlessly retrying folding/having multiple rounds of folding attempts, where one final run would have sufficed. Of course folks can locally avoid this behavior by just building their own, but this is also a common requested feature that folks keep on working around locally in suboptimal ways. For downstream users, there should be no behavioral change. Updating from the deprecated should just be a find and replace (e.g., `find ./ -type f -exec sed -i 's\|applyPatternsAndFoldGreedily\|applyPatternsGreedily\|g' {} \;` variety) as the API arguments hasn't changed between the two.	2024-12-20 08:15:48 -08:00
Christopher Bate	a92e3df300	[mlir][math] Fix `math.powf` expansion case for `pow(x, 0)` (#119015 ) Lowering `math.powf` to `llvm.intr.powf` will result in `pow(x, 0) = 1`, even for `x=0`. When using the Math dialect expansion patterns, `pow(0, 0)` will result in `-nan`, however, This change adds two additional instructions to the lowering to ensure the `pow(x, 0)` case lowers to to `1` regardless of the value of `x`. Resolves https://github.com/llvm/llvm-project/issues/118945.	2024-12-09 14:52:55 -07:00
Kunwar Grover	72f362170f	[mlir][Math] Fix 0-rank support for PolynomialApproximation (#114826 ) This patch disambiguates 0-rank vectors and scalars in PolynomialApproximation. This fixes a bug in PolynomialApproximation where 0-rank vectors would be treated as scalars and arguments would not be broadcasted properly.	2024-11-05 09:40:52 +00:00
Matthias Springer	206fad0e21	[mlir][NFC] Mark type converter in `populate...` functions as `const` (#111250 ) This commit marks the type converter in `populate...` functions as `const`. This is useful for debugging. Patterns already take a `const` type converter. However, some `populate...` functions do not only add new patterns, but also add additional type conversion rules. That makes it difficult to find the place where a type conversion was added in the code base. With this change, all `populate...` functions that only populate pattern now have a `const` type converter. Programmers can then conclude from the function signature that these functions do not register any new type conversion rules. Also some minor cleanups around the 1:N dialect conversion infrastructure, which did not always pass the type converter as a `const` object internally.	2024-10-05 21:32:40 +02:00
Daniel Hernandez-Juarez	1fd1f65569	[mlir] Refactor LegalizeToF32 to specify extra supported float types and target type as arguments (#108815 ) Instead of hardcoding all fp smaller than 32 bits are unsupported we provide a way to pass supported floating point types as well as the target type. fp64 and fp32 are implicitly supported. CC: @krzysz00 @manupak	2024-09-27 10:02:16 -05:00
Ivy Zhang	b96f18b20c	[MLIR][MathDialect] fix fp32 promotion crash when encounters scf.if (#104451 ) 1. Expand legal op list in `legalizeToF32` 2. add legalization support for `math::rsqrtOp` in `mathToLibm`.	2024-08-21 10:08:56 +08:00
Kazu Hirata	5262865aac	[mlir] Construct SmallVector with ArrayRef (NFC) (#101896 )	2024-08-04 11:43:05 -07:00
Rob Suderman	a3fb301435	[mlir][math] Fix polynomial `math.asin` approximation (#101247 ) The polynomial approximation for asin is only good between [-9/16, 9/16]. Values beyond that range must be remapped to achieve good numeric results. This is done by the equation below: `arcsin(x) = PI/2 - arcsin(sqrt(1.0 - x*x))`	2024-07-31 10:06:20 -07:00
Ivy Zhang	7042fcc638	[MLIR][Arith][Resubmit] add fastMathAttr on arith::extf and arith::truncf (#95346 ) Add an `fastMathAttr` on `arith::extf` and `arith::truncf`. If these two ops are inserted by some promotion passes (like legalize-to-f32 / emulate-unsupported-floats), they will be labeled as `FastMathFlags::contract`, denoting that they can be then `eliminated by canonicalizer`. The `elimination` can help improve performance, while may introduce some numerical differences.	2024-06-15 07:42:29 +08:00
Ivy Zhang	f941908d77	Revert "[MLIR][Arith] add fastMathAttr on arith::extf and arith::truncf" (#95344 ) Reverts llvm/llvm-project#93443	2024-06-13 11:23:20 +08:00
Ivy Zhang	6784bf7642	[MLIR][Arith] add fastMathAttr on arith::extf and arith::truncf (#93443 ) Add an `fastMathAttr` on `arith::extf` and `arith::truncf`. If these two ops are inserted by some promotion passes (like legalize-to-f32 / emulate-unsupported-floats), they will be labeled as `FastMathFlags::contract`, denoting that they can be then `eliminated by canonicalizer`. The `elimination` can help improve performance, while may introduce some numerical differences.	2024-06-13 09:27:44 +08:00
Corentin Ferry	279a659e97	[mlir][math] lower rsqrt to sqrt + fdiv (#91344 ) This commit creates an expansion pattern to lower math.rsqrt(x) into fdiv(1, sqrt(x)).	2024-05-13 10:15:39 +02:00
Prashant Kumar	72085698a2	[mlir][math] Add Polynomial Approximation for acos, asin op (#90962 ) Adds the Polynomial Approximation for math.acos and math.asin op. Also, it adds integration tests. The Approximation has been borrowed from https://stackoverflow.com/a/42683455 I added this script: https://gist.github.com/pashu123/cd3e682b21a64ac306f650fb842a422b to test 50 values between -1 and 1. The results are https://gist.github.com/pashu123/8acb233bd045bacabfa8c992d4040465. It's well within the bounds.	2024-05-07 22:19:28 +05:30
jinchen	a62a702416	[mlir][math] Add expand patterns for acosh, asinh, atanh (#90718 )	2024-05-07 13:10:37 +05:30
Prashant Kumar	5b702be1e8	[mlir][math] Convert math.fpowi to math.powf in case of non constant (#87472 ) Convert math.fpowi to math.powf by converting dtype of power operand to floating point.	2024-04-03 22:19:26 +05:30
Prashant Kumar	10a57f3aff	[mlir][math] Expand powfI operation for constant power operand. (#87081 ) -- Convert `math.fpowi` to a series of `arith.mulf` operations. -- If the power is negative, we divide the result by 1.	2024-04-01 13:18:27 +05:30
Justin Fargnoli	35d55f2894	[NFC][mlir] Reorder `declarePromisedInterface()` operands (#86628 ) Reorder the template operands of `declarePromisedInterface()` to match `declarePromisedInterfaces()`.	2024-03-27 10:30:17 -07:00
srcarroll	d39ac3a8e0	[mlir][math] Reland `58ef9bec07` (#85436 ) The previous implementation decomposes tanh(x) into `(exp(2x) - 1)/(exp(2x)+1), x < 0` `(1 - exp(-2x))/(1 + exp(-2x)), x >= 0` This is fine as it avoids overflow with the exponential, but the whole decomposition is computed for both cases unconditionally, then the result is chosen based off the sign of the input. This results in doing two expensive exp computations. The proposed change avoids doing the whole computation twice by exploiting the reflection symmetry `tanh(-x) = -tanh(x)`. We can "normalize" the input to be positive by setting `y = sign(x) * x`, where the sign of `x` is computed as `sign(x) = (float)(x > 0) * (-2) + 1`. Then compute `z = tanh(y) `with the decomposition above for `x >=0` and "denormalize" the result `z * sign(x)` to retain the sign. The reason it is done this way is that it is very amenable to vectorization. This method trades the duplicate decomposition computations (which takes 5 instructions including an extra expensive exp and div) for 4 cheap instructions to compute the signs value `arith.cmpf `(which is a pre-existing instruction in the previous impl) `arith.sitofp` `arith.mulf` `arith.addf` and 1 more instruction to get the right sign in the result 5. `arith.mulf`. Moreover, numerically, this implementation will yield the exact same results as the previous implementation. As part of the relanding, a casting issue from the original commit has been fixed, i.e. casting bool to float with `uitofp`. Additionally a correctness test with `mlir-cpu-runner` has been added.	2024-03-17 11:23:30 -05:00
Benjamin Maxwell	e74bcecd36	[mlir][math] Propagate scalability in polynomial approximation (#84949 ) This simply updates the rewrites to propagate the scalable flags (which as they do not alter the vector shape, is pretty simple). The added tests are simply scalable versions of the existing vector tests.	2024-03-15 20:08:52 +00:00
srcarroll	f75d164eea	Revert "[mlir][math] Implement alternative decomposition for tanh (#8… (#85429 ) …5025)" This reverts commit `58ef9bec07`. There is a bool to float casting issue that needs to be sorted out to make sure this is target independent	2024-03-15 14:39:57 -05:00
srcarroll	58ef9bec07	[mlir][math] Implement alternative decomposition for tanh (#85025 ) The previous implementation decomposes `tanh(x)` into `(exp(2x) - 1)/(exp(2x)+1), x < 0` `(1 - exp(-2x))/(1 + exp(-2x)), x >= 0` This is fine as it avoids overflow with the exponential, but the whole decomposition is computed for both cases unconditionally, then the result is chosen based off the sign of the input. This results in doing two expensive `exp` computations. The proposed change avoids doing the whole computation twice by exploiting the reflection symmetry `tanh(-x) = -tanh(x)`. We can "normalize" the input to be positive by setting `y = sign(x) * x`, where the sign of `x` is computed as `sign(x) = (float)(x > 0) * (-2) + 1`. Then compute `z = tanh(y)` with the decomposition above for `x >=0` and "denormalize" the result `z * sign(x)` to retain the sign. The reason it is done this way is that it is very amenable to vectorization. This method trades the duplicate decomposition computations (which takes 5 instructions including an extra expensive `exp` and `div`) for 4 cheap instructions to compute the signs value 1. `arith.cmpf` (which is a pre-existing instruction in the previous impl) 2. `arith.sitofp` 3. `arith.mulf` 4. `arith.addf` and 1 more instruction to get the right sign in the result 5. `arith.mulf`. Moreover, numerically, this implementation will yield the exact same results as the previous implementation.	2024-03-14 19:18:56 -05:00
Johannes Reifferscheid	bcf9826a53	[MLIR] Expose approximation patterns for tanh/erf. (#82750 ) These patterns can already be used via populateMathPolynomialApproximationPatterns, but that includes a number of other patterns that may not be needed. There are already similar functions for expansion. For now only adding tanh and erf since I have a concrete use case for these two.	2024-02-23 13:15:08 +01:00
Ivan Butygin	35ef3994bf	[mlir][vector] ND vectors linearization pass (#81159 ) Common backends (LLVM, SPIR-V) only supports 1D vectors, LLVM conversion handles ND vectors (N >= 2) as `array<array<... vector>>` and SPIR-V conversion doesn't handle them at all at the moment. Sometimes it's preferable to treat multidim vectors as linearized 1D. Add pass to do this. Only constants and simple elementwise ops are supported for now. @krzysz00 I've extracted yours result type conversion code from LegalizeToF32 and moved it to common place. Also, add ConversionPattern class operating on traits.	2024-02-13 15:30:58 +03:00
Mehdi Amini	d4933b3241	Apply clang-tidy fixes for readability-identifier-naming in PolynomialApproximation.cpp (NFC)	2024-01-22 17:34:56 -08:00
Krzysztof Drewniak	05e85e4fc5	[mlir][Math] Add pass to legalize math functions to f32-or-higher (#78361 ) Since most of the operations in the `math` dialect don't have low-precision implementations, add the -math-legalize-to-f32 pass that goes through and brackets low-precision math funcitons (like `math.sin %0 : f16`) with `arith.extf` and `arith.truncf`. This preserves the original semantics of the math operation but allows lowering to proceed. Versions of this lowering are already implicitly present in some passes, like ConvertGPUToROCDL. However, because those are implicit rewrites, they hide the floating-point extension and truncation, preventing anyone from writing passes that operate on those implitic extf/truncf pairs. Exposing this legalization explicitly is needed to allow lowening 8-bit floats on AMD GPUs, as the implementation of extf and truncf on that platform requires the complex logic found in ArithToAMDGPU, which runs before the GPU to ROCDL lowering.	2024-01-18 09:37:43 -06:00
Vivek Khandelwal	b8dca4fa72	[mlir][math] Add math.acosh\|asin\|asinh\|atanh op (#77463 ) Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>	2024-01-10 13:39:32 +01:00
Justin Fargnoli	b43c50490c	[mlir] Declare promised interfaces for the ConvertToLLVM extension (#76341 ) This PR adds promised interface declarations for `ConvertToLLVMPatternInterface` in all the dialects that support the `ConvertToLLVM` dialect extension. Promised interfaces allow a dialect to declare that it will have an implementation of a particular interface, crashing the program if one isn't provided when the interface is used.	2024-01-08 20:19:18 -08:00
Rob Suderman	aa165edca8	[mlir][math] Added `math.sinh` with expansions to `math.exp` (#75517 ) Includes end-to-end tests for the cpu running, folders using `libm` and lowerings to the corresponding `libm` operations.	2023-12-15 11:35:40 -08:00
Sungsoon Cho	762964e97f	Add cosh op to the math dialect. (#75153 )	2023-12-13 12:25:37 +01:00
Frederik Harwath	f7250179e2	Implement acos operator in MLIR Math Dialect (#74584 ) Required for torch-mlir. Cf. llvm/torch-mlir#2604 "Implement torch.aten.acos".	2023-12-08 09:08:43 -08:00
Ivan Butygin	5dce74817b	[mlir][ub] Add poison support to CommonFolders.h Return poison from foldBinary/unary if argument(s) is poison. Add ub dialect as dependency to affected dialects (arith, math, spirv, shape). Add poison materialization to dialects. Add tests for some ops from each dialect. Not all affected ops are covered as it will involve a huge copypaste. Differential Revision: https://reviews.llvm.org/D159013	2023-09-07 12:30:29 +02:00
Balaji V. Iyer	f66e4bd67a	[mlir][math] Modify math.powf to handle negative bases. Powf expansion currently returns NaN when the base is negative. This is because taking natural log of a negative number gives NaN. This patch will square the base and half the exponent, thereby getting around the negative base problem. Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D158797	2023-08-25 15:35:05 -07:00
Alexander Shaposhnikov	fe355a44e7	[MLIR][Math] Add support for f64 in the expansion of math.roundeven Add support for f64 in the expansion of math.roundeven. Associated GitHub issue: https://github.com/openxla/iree/issues/13522 This is based on the offline discussion and essentially recommits https://reviews.llvm.org/D158234. Test plan: ninja check-mlir check-all	2023-08-24 21:41:26 +00:00
Alexander Shaposhnikov	d22883e384	Revert "[MLIR][Math] Add support for f16 in the expansion of math.roundeven" This reverts commit `40bf36319e`. The build bot ppc64le-mlir-rhel-test got broken by these changes, see https://lab.llvm.org/buildbot#builders/88/builds/61048 .	2023-08-18 18:20:52 +00:00
Alexander Shaposhnikov	40bf36319e	[MLIR][Math] Add support for f16 in the expansion of math.roundeven Add support for f16 in the expansion of math.roundeven. Associated GitHub issue: https://github.com/openxla/iree/issues/13522 This version addresses the build issues on Windows reported on https://reviews.llvm.org/D157204 Test plan: ninja check-mlir check-all Differential revision: https://reviews.llvm.org/D158234	2023-08-18 17:48:34 +00:00
Alexander Shaposhnikov	f745c91f61	Revert "[MLIR][Math] Add support for f16 in the expansion of math.roundeven" This reverts commit `b96f6cf629` since it has broken some Windows build bots (see https://reviews.llvm.org/D157204). Will recommit a fixed version later.	2023-08-17 23:22:05 +00:00
Alexander Shaposhnikov	b96f6cf629	[MLIR][Math] Add support for f16 in the expansion of math.roundeven Add support for f16 in the expansion of math.roundeven. Associated GitHub issue: https://github.com/openxla/iree/issues/13522 Test plan: ninja check-mlir check-all Differential revision: https://reviews.llvm.org/D157204	2023-08-17 18:40:26 +00:00
Jacques Pienaar	75bd41f61d	[mlir][polyapprox] Use llvm::numbers for constants. Fixes windows build.	2023-06-24 17:03:39 -07:00
Robert Suderman	0bedb667af	[mlir][math] Improved math.atan approximation Used the cephes numerical approximation for `math.atan`. This is a significant accuracy improvement over the previous taylor series approximation. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D153656	2023-06-23 17:25:34 -07:00
Robert Suderman	710dc7282a	[mlir][math] Modified the 'math.exp' lowering for higher precision The existing lowering has lower precision for certain use cases, e.g. tanh. Improved version should demonstrate an overall higher level of precision. Reviewed By: cota, jpienaar Differential Revision: https://reviews.llvm.org/D153592	2023-06-23 12:25:18 -07:00
Ivan Butygin	ee8b8d6b58	[mlir][math] Uplift from arith to math.fma Add pass to uplift from arith mulf + addf ops to math.fma if fastmath flags allow it. Differential Revision: https://reviews.llvm.org/D152633	2023-06-18 17:11:21 +02:00
Tres Popp	5550c82189	[mlir] Move casting calls from methods to function calls The MLIR classes Type/Attribute/Operation/Op/Value support cast/dyn_cast/isa/dyn_cast_or_null functionality through llvm's doCast functionality in addition to defining methods with the same name. This change begins the migration of uses of the method to the corresponding function call as has been decided as more consistent. Note that there still exist classes that only define methods directly, such as AffineExpr, and this does not include work currently to support a functional cast/isa call. Caveats include: - This clang-tidy script probably has more problems. - This only touches C++ code, so nothing that is being generated. Context: - https://mlir.llvm.org/deprecation/ at "Use the free function variants for dyn_cast/cast/isa/…" - Original discussion at https://discourse.llvm.org/t/preferred-casting-style-going-forward/68443 Implementation: This first patch was created with the following steps. The intention is to only do automated changes at first, so I waste less time if it's reverted, and so the first mass change is more clear as an example to other teams that will need to follow similar steps. Steps are described per line, as comments are removed by git: 0. Retrieve the change from the following to build clang-tidy with an additional check: https://github.com/llvm/llvm-project/compare/main...tpopp:llvm-project:tidy-cast-check 1. Build clang-tidy 2. Run clang-tidy over your entire codebase while disabling all checks and enabling the one relevant one. Run on all header files also. 3. Delete .inc files that were also modified, so the next build rebuilds them to a pure state. 4. Some changes have been deleted for the following reasons: - Some files had a variable also named cast - Some files had not included a header file that defines the cast functions - Some files are definitions of the classes that have the casting methods, so the code still refers to the method instead of the function without adding a prefix or removing the method declaration at the same time. ``` ninja -C $BUILD_DIR clang-tidy run-clang-tidy -clang-tidy-binary=$BUILD_DIR/bin/clang-tidy -checks='-,misc-cast-functions'\ -header-filter=mlir/ mlir/ -fix rm -rf $BUILD_DIR/tools/mlir/*/.inc git restore mlir/lib/IR mlir/lib/Dialect/DLTI/DLTI.cpp\ mlir/lib/Dialect/Complex/IR/ComplexDialect.cpp\ mlir/lib/**/IR/\ mlir/lib/Dialect/SparseTensor/Transforms/SparseVectorization.cpp\ mlir/lib/Dialect/Vector/Transforms/LowerVectorMultiReduction.cpp\ mlir/test/lib/Dialect/Test/TestTypes.cpp\ mlir/test/lib/Dialect/Transform/TestTransformDialectExtension.cpp\ mlir/test/lib/Dialect/Test/TestAttributes.cpp\ mlir/unittests/TableGen/EnumsGenTest.cpp\ mlir/test/python/lib/PythonTestCAPI.cpp\ mlir/include/mlir/IR/ ``` Differential Revision: https://reviews.llvm.org/D150123	2023-05-12 11:21:25 +02:00
Uday Bondhugula	20b12fa898	Fix MathTransforms library dependencies The dependencies were set up improperly likely due to past code locations. MathTransforms shouldn't depend on VectorUtils which add a whole bunch of additional dependencies; it instead depends on the SCF dialect. Differential Revision: https://reviews.llvm.org/D149797	2023-05-04 05:41:48 +05:30
Ramiro Leal-Cavazos	44baa65589	Revert "Revert "Fix handling of special and large vals in expand pattern for `round`" and "Add pattern that expands `math.roundeven` into `math.round` + arith"" This reverts commit `87cef78fa1`. The issue in the original revert is that a lit test expecting a `-nan` as an output was failing on M2. Since the IEEE 754-2008 standard does not require the sign to be printed when displaying a `nan`, this commit changes the `CHECK` for `-nan` to one that checks the result value bitcasted to an `i32` to ensure that input is being left unchanged. This check should now be independent of platform being used to run test. Reviewed By: jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D148941	2023-04-22 07:15:40 -07:00
Mehdi Amini	87cef78fa1	Revert "Fix handling of special and large vals in expand pattern for `round`" and "Add pattern that expands `math.roundeven` into `math.round` + arith" This reverts commit `8d2bae9abd` and commit `ab2fc9521e`. Tests are broken on Mac M2	2023-04-21 00:16:32 -06:00
Rahul Kayaith	6089d612a5	[mlir] Prevent implicit downcasting to interfaces Currently conversions to interfaces may happen implicitly (e.g. `Attribute -> TypedAttr`), failing a runtime assert if the interface isn't actually implemented. This change marks the `Interface(ValueT)` constructor as explicit so that a cast is required. Where it was straightforward to I adjusted code to not require casts, otherwise I just made them explicit. Depends on D148491, D148492 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D148493	2023-04-20 16:31:54 -04:00
Rahul Kayaith	00e3566d6c	[mlir][arith] Add arith.constant materialization helper This adds `arith::ConstantOp::materialize`, which builds a constant from an attribute and type only if it would result in a valid op. This is useful for dialect `materializeConstant` hooks, and allows for removing the previous `Attribute, Type` builder which was only used during materialization. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D148491	2023-04-20 16:31:52 -04:00
Ramiro Leal-Cavazos	8d2bae9abd	Add pattern that expands `math.roundeven` into `math.round` + arith This commit adds a pattern that expands `math.roundeven` into `math.round` + some ops from `arith`. This is needed to be able to run `math.roundeven` in a vectorized manner. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D148285	2023-04-20 12:48:12 -07:00

1 2 3 4

163 Commits