clang-p2996

Author	SHA1	Message	Date
Christian Sigg	bd9fdce69b	[flang] Use `isa/dyn_cast/cast/...` free functions. (#90432) The corresponding member functions are deprecated.	2024-04-29 09:16:22 +02:00
Christian Sigg	fac349a169	Reapply "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998)" (#90250) (#90406) This partially reverts commit `7aedd7dc75`. This change removes calls to the deprecated member functions. It does not mark the functions deprecated yet and does not disable the deprecation warning in TypeSwitch. This seems to cause problems with MSVC.	2024-04-28 22:01:42 +02:00
dyung	7aedd7dc75	Revert "[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998)" (#90250) This reverts commit `950b7ce0b8`. This change is causing build failures on a bot https://lab.llvm.org/buildbot/#/builders/216/builds/38157	2024-04-26 12:09:13 -07:00
Christian Sigg	950b7ce0b8	[mlir] Mark `isa/dyn_cast/cast/...` member functions deprecated. (#89998) See https://mlir.llvm.org/deprecation and https://discourse.llvm.org/t/preferred-casting-style-going-forward.	2024-04-26 16:28:30 +02:00
Tom Eccles	81442f8d97	[flang][NFC] Use tablegen to create SimplifyIntrinsics constructor (#89963) This pass runs on ModuleOp, internally walking all func::CallOps so it shouldn't need anything special to work on other top level operations.	2024-04-25 10:26:05 +01:00
jeanPerier	a4798bb0b6	[flang][NFC] use mlir::SymbolTable in lowering (#86673) Whenever lowering is checking if a function or global already exists in the mlir::Module, it was doing module->lookup. On big programs (~5000 globals and functions), this causes significant slowdowns because these lookups are linear. Use mlir::SymbolTable to speed up these lookups. The SymbolTable has to be created from the ModuleOp and maintained in sync. It is therefore placed in the converter, and FirOPBuilders can take a pointer to it to speed up the lookups. This patch does not bring mlir::SymbolTable to FIR/HLFIR passes, but some passes creating a lot of runtime calls could benefit from it too. More analysis will be needed. As an example of the speed-ups, this patch speeds up compilation of Whizard compare_amplitude_UFO.F90 from 5 mins to 2 mins on my machine (there is still room for speed-ups).	2024-04-02 14:29:29 +02:00
David Green	2a95fe481d	[Flang] Allow Intrinsic simplification with min/maxloc dim and scalar result (#81619) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions. This is a recommit of #76194 with an extra alteration to the end of genRuntimeMinMaxlocBody to make sure we convert the output array to the correct type (a `box<heap<i32>>`, not `box<heap<array<1xi32>>>`) to prevent writing the wrong type of box into it. This still allocates the data as an `array<1xi32>`, converting it into an i32 assuming that is safe. An alternative would be to allocate the data as an i32 and change more of the accesses to it throughout genRuntimeMinMaxlocBody.	2024-03-02 14:39:59 +00:00
David Green	7242896233	[Flang] Attempt to fix NaN handling in Minloc/Maxloc intrinsic simplification (#82313) In certain cases "extreme" values like NaN, Inf and 0xffffffff could lead to generating different code via the inline-generated intrinsics vs the versions in the runtimes (and other compilers like gfortran). There are some examples I was using for testing in https://godbolt.org/z/x4EfqEss5. This changes the generation for the intrinsics to be more like the runtimes, using a condition that is similar to: isFirst || (prev != prev && elem == elem) || elem < prev The middle part is only used for floating point operations, and checks if the values are NaN. This should then hopefully make the logic closer to: return the first element with the lowest value, with NaNs ignored unless there are only NaNs. The initial limit value for floats is also changed from the largest float to Inf, to make sure it is handled correctly. The integer reductions are also changed to use a similar scheme to make sure they work with masked values. This means that the preamble after the loop can be removed.	2024-02-21 09:31:29 +00:00
David Green	815a846552	[Flang] Move genMinMaxlocReductionLoop to Transforms/Utils.cpp (#81380) This is one option for attempting to move genMinMaxlocReductionLoop to a better location. It moves it into Transforms and makes HLFIRTransforms depend upon FIRTransforms. It passes a build locally, both with and without -DBUILD_SHARED_LIBS, and does OK on the windows CI.	2024-02-13 08:31:07 +00:00
David Green	202917f86e	[Flang] Move genMinMaxlocReductionLoop to a common location. The shared library build doesn't like references of genMinMaxlocReductionLoop, in Optimizer/Transforms, from HLFIR/Optimizer/Transforms. For the moment I've moved the code to the header file where it can be shared, like other methods in Utils.h	2024-01-25 13:31:18 +00:00
David Green	223d3dabc8	[Flang] Minloc elemental intrinsic lowering (#74828) Currently the lowering of a minloc intrinsic with a mask will look something like: %e = hlfir.elemental %shape ({ ... }) %m = hlfir.minloc %array mask %e hlfir.assign %m to %result hlfir.destroy %m The elemental will be expanded into a temporary+loop, the minloc into a FortranAMinloc call (which hopefully gets simplified to a specialized call that can be inlined at the call site), and the assign might get expanded to a FortranAAssign. It would be better to generate the entire construct as a single loop if we can: one that performs the minloc calculation with the mask elemental computed inline. This patch attempts to do that, adding a hlfir version of the expansion code from SimplifyIntrinsics that turns a minloc+elemental into a single combined loop nest. It attempts to reuse the methods in genMinlocReductionLoop for constructing the loop with a modified loop body. The declaration for the function is currently in Optimizer/Support/Utils.h, but there might be a better place for it. It is added as part of the OptimizedBufferizationPass, like the similar count/any/all that have been added recently.	2024-01-25 12:17:12 +00:00
Pete Steinfeld	4f59a38821	Revert #76194 (#76987) [Flang] Revert "Allow Intrinsic simplification with min/maxloc dim and scalar result (#76194)" This reverts commit `9b7cf5bfb0`. See merge request #76194. This change was causing several failures in our internal tests. I'm reverting now and will work on creating a test that David Green can use to reproduce the problem.	2024-01-04 10:19:50 -08:00
David Green	9b7cf5bfb0	[Flang] Allow Intrinsic simplification with min/maxloc dim and scalar result (#76194) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions. This is a recommit of #75820 with the typename added to the generated function.	2024-01-02 11:09:18 +00:00
Pete Steinfeld	0cf3af0c51	Revert "[Flang] Allow Intrinsic simplification with min/maxloc dim and scalar result. (#75820)" (#76184) This reverts commit `701f647905`. The commit breaks some uses of the 'maxloc' intrinsic. See PR #75820	2023-12-21 13:14:05 -08:00
David Green	701f647905	[Flang] Allow Intrinsic simplification with min/maxloc dim and scalar result. (#75820) This makes an adjustment to the existing fir minloc/maxloc generation code to handle functions with a dim=1 that produce a scalar result. This should allow us to get the same benefits as the existing generated minmax reductions.	2023-12-20 12:12:12 +00:00
David Green	9bb47f7f8b	[Flang] Add Maxloc to fir simplify intrinsics pass (#75463) This takes the code from D144103 and extends it to maxloc, to allow the simplifyMinMaxlocReduction method to work with both min and max intrinsics by switching condition and limit/initial value.	2023-12-18 07:59:51 +00:00
Kazu Hirata	11efccea8f	[flang] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-13 23:48:53 -08:00
Slava Zakharin	89b98c13e0	[flang] Fixed simplification for FP maxval. On x86, a simplified F128 maxval ends up calling fmaxl that does not work properly for F128 arguments. It is probably an LLVM issue, but we also should not use arith.maxf if NaN or -0.0 operands are possible. The change is to use cmpf and select. Unfortunately, these arith ops do not support FastMathFlags currently, so I will have to fix this sooner or later (depending on how this affects performance). Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D158200	2023-08-21 19:33:56 -07:00
David Truby	f52c64b115	[flang] Add fastmath flags to localBuilder in IntrinsicCall Currently the local builder used in IntrinsicCall doesn't have the fastmath flags passed to it. This results in the fastmath attribute not being added to certain runtime calls. This patch simply forwards the fastmath flags from the parent builder. Differential Revision: https://reviews.llvm.org/D154611	2023-07-11 18:53:31 +01:00
Slava Zakharin	7a607e253d	[flang] Removed unnecessary llvm/CodeGen/SelectionDAGNodes.h include. Required after D148767 for flang+debug+slibs build. Reviewed By: chapuni, clementval Differential Revision: https://reviews.llvm.org/D149764	2023-05-03 15:10:09 -07:00
Renaud-K	b07ef9e7cd	Break circular dependency between FIR dialect and utilities	2023-03-09 15:24:51 -08:00
Sacha Ballantyne	242bb0b652	[flang] Fix a bug with simplified minloc that treated logicals with even values > 1 as 0 Previously the mask would be loaded as the appropriate integer type and cast to I1 to pass to fir.if, however this truncates the integer and so would cast 6 to 0. By loading values as logicals and casting to I1 this problem is avoided. Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D144974	2023-02-28 17:15:36 +00:00
Sacha Ballantyne	79dccded69	[flang] Change COUNT intrinsic to support different kind logicals Previously COUNT would cast the mask input to logical<4> before passing it to the runtime function, this has been changed to allow different types of logical. Reviewed By: tblah Differential Revision: https://reviews.llvm.org/D144867	2023-02-28 12:26:33 +00:00
Sacha Ballantyne	614cd721e1	[Flang] Add Minloc to simplify intrinsics pass This patch adds minloc to the simplify intrinsics pass, supporting calls with KIND or MASK arguments while calls which have BACK, DIM or have a CHARACTER input array are rejected. This patch is targeting exchange2, and in benchmarks provides a ~11% improvement in performance. Also included are some minor style changes / cleanup in simplifyIntrinsics.cpp. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D144103	2023-02-27 11:36:55 +00:00
Sacha Ballantyne	98ecc3ac77	[Flang] Fix for Any/All simplification to properly propagate the initial value When rank > 1, the initial value would be lost on inner loops, leading to the wrong value being returned; e.g. the following would return T. This patch fixes this to use the correct initial value for all cases. ``` Integer :: m(0,10) Any(m .eq. 0) ``` Reviewed By: vdonaldson Differential Revision: https://reviews.llvm.org/D143899	2023-02-14 10:28:56 +00:00
Sacha Ballantyne	20fba03f96	[Flang] Add Any and All intrinsics to simplify intrinsics pass This patch provides a simplified version of the Any intrinsic as well as the All intrinsic that can be used for inlining or simpler use cases. These changes are targeting exchange2, and provide a ~9% performance increase. Reviewed By: Leporacanthicus, vzakhari Differential Revision: https://reviews.llvm.org/D142977	2023-02-09 19:52:15 +00:00
Sacha Ballantyne	bb94d33aac	[flang] Fix simplify intrinsic for count not checking for rank = 0 properly Simple fix to check for rank in the same way as other intrinsics to allow runtime count to take over when dealing with unknown dimension arrays. Fixes #60356 Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D142877	2023-01-30 12:23:37 +00:00
Sacha Ballantyne	7d2e198729	[flang] Add Count to simplified intrinsics This patch adds a simplified version of count for the simplify intrinsics pass, allowing the function to be inlined. This was done specifically to help improve performance for exchange2, and provides a ~12% performance increase. Reviewed By: vzakhari, Leporacanthicus Differential Revision: https://reviews.llvm.org/D142209	2023-01-27 16:30:11 +00:00
Kazu Hirata	0db88db5d9	[flang] Remove remaining uses of llvm::Optional (NFC) This patch removes the unused "using" declaration and removes #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 22:32:38 -08:00
Kazu Hirata	c09215860f	[flang] Use std::optional instead of llvm::Optional (NFC) This patch replaces (llvm::|)Optional< with std::optional<. I'll post a separate patch to remove #include "llvm/ADT/Optional.h". This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 22:26:48 -08:00
Kazu Hirata	4d4d4785e0	[flang] Add #include <optional> (NFC) This patch adds #include <optional> to those files containing llvm::Optional<...> or Optional<...>. I'll post a separate patch to actually replace llvm::Optional with std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-07 20:55:47 -08:00
Kazu Hirata	c15a925ada	[flang] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 17:00:21 -08:00
Kazu Hirata	9a41739565	[flang] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 12:14:21 -08:00
Slava Zakharin	ffe1661fab	[flang] Propagate fastmath flags during intrinsics simplification. In general, the meaning of fastmath flags on a call during inlining is that the call's operation flags must be ignored. For user functions that means that the fastmath flags used for the function definition override any call site's fastmath flags. For intrinsic functions we can use the call site's fastmath flags, but we have to make sure that the call sites with different flags produce/use different simplified versions of the same intrinsic function. Differential Revision: https://reviews.llvm.org/D138048	2022-11-17 10:16:47 -08:00
David Truby	d983f5f39e	[flang] Add cpowi function to runtime and use instead of pgmath This patch adds a cpowi function to the flang runtime, and switches to using that function instead of pgmath for complex number to integer power operations. Differential Revision: https://reviews.llvm.org/D134889	2022-10-11 12:34:58 +00:00
Slava Zakharin	8bd76ac151	[flang] Support multidimensional reductions in SimplifyIntrinsicsPass. Create simplified functions for each rank with "x<rank>" suffix that implement multidimensional reductions. To enable this I had to fix an issue with taking incorrect box shape in cases of sliced embox/rebox. Differential Revision: https://reviews.llvm.org/D133820	2022-09-19 12:16:23 -07:00
Slava Zakharin	2b138567e0	[flang] Support more data types for reduction in SimplifyIntrinsicsPass. All floating point (not complex) and integer types should be supported now. Differential Revision: https://reviews.llvm.org/D133818	2022-09-19 12:16:22 -07:00
Mats Petersson	aa94eb3877	[FLANG][NFC]Use RTNAME instead of hard-coding for simplify intrinsics Use the RTNAME macro (via stringify macros) to generate the name strings for runtime functions, instead of using strings. The sequence of macros generates exactly the same string as the ones used previously, but this will support future changes in runtime function names. No functional change. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132652	2022-09-05 13:06:44 +01:00
Mats Petersson	43159b5808	[FLANG][NFCI]De-duplicate code in SimplifyIntrinsics This removes a bunch of duplicated code, by adding an intermediate function simplifyReduction that takes a std::function argument for the actual replacement of the code. No functional change intended. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132588	2022-09-02 10:49:25 +01:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit `2be8af8f0e`.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Mats Petersson	5653884e34	[FLANG]Remove experimental flag from SUM simplification The SUM function does appear to be safe to use, so remove the experimental flag for the SUM operation. Reviewed By: vzakhari, awarzynski Differential Revision: https://reviews.llvm.org/D132567	2022-08-25 14:11:41 +01:00
Mats Petersson	afa520ab34	[FLANG]Add maxval simplification support Add simplification pass for the MAXVAL intrinsic function. This refactors some of the code to allow variation on the initialization value and operation performed within the loop, reusing the majority of code for both SUM and MAXVAL. Adding tests for the test-cases that produce different output than the SUM function. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132234	2022-08-24 14:08:19 +01:00
Mats Petersson	72e599197c	[Flang]Fix another way to crash SimplifyIntrinsics Under some conditions, the defining op may be NULL, so accept that rather than try to use it and crash! Adds test to prevent regression. Fixes github issue #57201 Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D132238	2022-08-19 19:00:30 +01:00
Slava Zakharin	11db65bab8	[flang] Control SUM simplification with a pass option. The current code may not always work correctly, e.g.: https://github.com/llvm/llvm-project/issues/57201 I added 'enable-experimental' pass option so that SUM simplification may be enabled in LIT tests, but it is not enabled when the pass is added to the passes pipeline. Differential Revision: https://reviews.llvm.org/D131640	2022-08-17 13:37:44 -07:00
Mats Petersson	726786083f	[flang]Avoid asking for operands when there are none Fix one encountered (issue #57072) and two potential scenarios where the code would ask for an operand that isn't there. Add test for the encountered case. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D131671	2022-08-17 12:31:42 +01:00
Slava Zakharin	56eda98f0c	[flang] Handle mixed types in DOT_PRODUCT simplification. Fortran runtime supports mixed types by casting the loaded values to the result type, so DOT_PRODUCT simplification has to do the same. Differential Revision: https://reviews.llvm.org/D131726	2022-08-15 09:03:38 -07:00
Slava Zakharin	1d5e7a498f	[flang] Support DOT_PRODUCT in late inlining. This change inlines DOT_PRODUCT calls for real and integer types. Differential Revision: https://reviews.llvm.org/D131538	2022-08-10 16:30:35 -07:00
Slava Zakharin	80dcc907a8	[NFC] Restructured SimplifyIntrinsicsPass::getOrCreateFunction. I would like to add DOT_PRODUCT support in this pass, so this restructuring is the first step to allow some code reuse inside getOrCreateFunction(). Differential Revision: https://reviews.llvm.org/D131530	2022-08-10 09:40:57 -07:00
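
The selection condition quoted in commit 7242896233 (`isFirst || (prev != prev && elem == elem) || elem < prev`) can be illustrated outside the compiler. The following is a minimal C++ sketch of that condition applied to a plain array, not the code flang generates; `minlocSketch` and `minlocSketchExamples` are hypothetical names introduced here for illustration, and the 1-based result with 0 for an empty array is an assumption modelled on Fortran's MINLOC.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// Hypothetical helper (not flang code): returns the 1-based position of the
// first minimum element, or 0 for an empty array.
template <typename T>
std::size_t minlocSketch(const std::vector<T> &a) {
  bool isFirst = true;  // no element has been accepted yet
  T prev{};             // current best value; only meaningful once isFirst is false
  std::size_t pos = 0;
  for (std::size_t i = 0; i < a.size(); ++i) {
    const T elem = a[i];
    // (prev != prev && elem == elem) is true when prev is NaN and elem is not,
    // so a NaN picked up earlier is replaced by the first non-NaN value.
    // For integer types that middle term is always false.
    if (isFirst || (prev != prev && elem == elem) || elem < prev) {
      prev = elem;
      pos = i + 1;
      isFirst = false;
    }
  }
  return pos;
}

// Usage sketch: NaNs are ignored unless the array contains nothing else.
inline void minlocSketchExamples() {
  const double nan = std::numeric_limits<double>::quiet_NaN();
  assert(minlocSketch<double>({nan, 3.0, 1.0, 1.0}) == 3);  // first of the two 1.0s
  assert(minlocSketch<double>({nan, nan}) == 1);            // only NaNs: first element
  assert(minlocSketch<int>({4, 2, 2}) == 2);                // integers: first minimum
}
```

Because the strict `elem < prev` comparison is false whenever `elem` is NaN, a NaN can only be selected through the `isFirst` term, which matches the "NaNs ignored unless there are only NaNs" behaviour the commit message describes.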


51 Commits