clang-p2996

Author	SHA1	Message	Date
Andrew Gozillon	062fce6f4d	[Flang][OpenMP][MLIR] An mlir transformation pass for marking FuncOp's implicitly called from TargetOp's and declare target marked FuncOp's as implicitly declare target This pass will mark functions called from TargetOp's and declare target functions as implicitly declare target by adding the MLIR declare target attribute directly to the function. This pass executes after the initial lowering of Fortran's PFT to MLIR (FIR/OMP+Arith etc.) and is one of a series of passes that aim to clean up the MLIR for offloading (seperate passes in different patches, one for early outlining, another for declare target function filtering). Reviewers: jsjodin, skatrak, kiaranchandramohan Differential Revision: https://reviews.llvm.org/D154247	2023-07-17 08:32:26 -05:00
Sergio Afonso	debdfc0ae2	[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device This patch adds support for selecting which functions are lowered to LLVM IR from MLIR depending on declare target information and whether host or device code is being generated. The approach proposed by this patch is to perform the filtering in two stages: - An MLIR transformation pass, which is added to the Flang translation flow after the `OMPEarlyOutliningPass`. The functions that are kept are those that match the OpenMP processor (host or device) the compiler invocation is targeting, according to the presence of the `-fopenmp-is-target-device` compiler option and declare target information. All functions contaning an `omp.target` are also kept, regardless of the declare target information of the function, due to the need for keeping target regions visible for both host and device compilation. - A filtering step during translation to LLVM IR, which is peformed for those functions that were kept because of the presence of a target region inside. If the targeted OpenMP processor does not match the declare target information of the function, then it is removed from the LLVM IR after its contents have been processed and translated. Since they should only contain an omp.target operation which, in turn, should have been outlined into another LLVM IR function, the wrapper can be deleted at that point. Depends on D150328 and D150329. Differential Revision: https://reviews.llvm.org/D147641	2023-07-17 09:07:54 +01:00
Jan Sjodin	22a167779a	[flang] Fix OMPEarlyOutlining erasing declare target functions The early outlining pass was erasing target functions that need to be kept. It should only erase functions that contain target ops.	2023-07-13 13:00:23 -04:00
Mark Danial	d85b94bf00	[Flang] -funderscoring bug fix There was a bug with the -funderscoring / -fno-underscoring options from (https://reviews.llvm.org/D140795) that prevented the driver option from controlling the underscoring behaviour and instead the behaviour could only be controlled by the pass option instead of the driver option. The driver test case did not catch the bug and also needed to be updated. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D155042	2023-07-13 11:30:35 -04:00
Jan Sjodin	45a9604417	[Flang][OpenMP][MLIR] Add early outlining pass for omp.target operations to flang This patch implements an early outlining transform of omp.target operations in flang. The pass is needed because optimizations may cross target op region boundaries, but with the outlining the resulting functions only contain a single omp.target op plus a func.return, so there should not be any opportunity to optimize across region boundaries. The patch also adds an interface to be able to store and retrieve the parent function name of the original target operation. This is needed to be able to create correct kernel function names when lowering to LLVM-IR. Reviewed By: kiranchandramohan, domada Differential Revision: https://reviews.llvm.org/D154879	2023-07-13 09:14:42 -04:00
David Truby	f52c64b115	[flang] Add fastmath flags to localBuilder in IntrinsicCall Currently the local builder used in IntrinsicCall doesn't have the fastmath flags passed to it. This results in the fastmath attribute not being added to certain runtime calls. This patch simply forwards the fastmath flags from the parent builder. Differential Revision: https://reviews.llvm.org/D154611	2023-07-11 18:53:31 +01:00
Tom Eccles	76c3c5bca0	[flang] [stack-arrays] fix unused variable warning	2023-06-05 15:36:02 +00:00
Tom Eccles	53cc33b00b	[flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKindMapping(mod)}; This is incorrect because the KindMapping returned will go out of scope as soon as FirOpBuilder's constructor had run. This led to an infinite loop running some tests using HLFIR (because the stack space containing the kind mapping was re-used and corrupted). One solution would have just been to fix the incorrect call sites, however, as a large number of these had already made it past review, I decided to instead change FirOpBuilder to store its own copy of the KindMapping. This is not costly because nearly every time we construct a KindMapping is exclusively to construct a FirOpBuilder. To make this common pattern simpler, I added a new constructor to FirOpBuilder which calls getKindMapping(). Differential Revision: https://reviews.llvm.org/D151881	2023-06-05 09:57:57 +00:00
Tom Eccles	775de6754a	[flang] convert stack arrays allocation to match old type The old fir.allocmem operation returned a !fir.heap<.> type. The new fir.alloca operation returns a !fir.ref<.> type. This patch inserts a fir.convert so that the old type is preserved. This prevents verifier failures when types returned from fir.if statements don't match the expected type. Differential Revision: https://reviews.llvm.org/D151921	2023-06-05 09:57:57 +00:00
Mats Petersson	b812932b35	[FLANG] Change loop versioning to use shift instead of divide Despite me being convinced that the use of divide didn't produce any divide instructions, it does in fact add more instructions than using a plain shift operation. This patch simply changes the divide to a shift right, with an assert to check that the "divisor" is a power of two. Reviewed By: kiranchandramohan, tblah Differential Revision: https://reviews.llvm.org/D151880	2023-06-01 19:29:57 +01:00
Tom Eccles	408f4196ba	[flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Whereas, transformations within the same dialect tend to use applyPatternsAndFoldGreedily. In this case, the full complexity of applyPatternsAndFoldGreedily isn't needed so we can get away with the simpler applyOpPatternsAndFold. This change was suggested by @jeanPerier The old differential revision for this patch was https://reviews.llvm.org/D150853 Re-applying here fixing the issue which led to the patch being reverted. The issue was from erasing uses of the allocation operation while still iterating over those uses (leading to a use-after-free). I have added a regression test which catches this bug for -fsanitize=address builds, but it is hard to reliably cause a crash from the use-after-free in normal builds. Differential Revision: https://reviews.llvm.org/D151728	2023-05-31 14:06:57 +00:00
Mats Petersson	b75f9ce3fe	[FLANG] Support all arrays for LoopVersioning This patch makes more than 2D arrays work, with a fix for the way that loop index is calculated. Removing the restriction of number of dimensions. This also changes the way that the actual index is calculated, such that the stride is used rather than the extent of the previous dimension. Some tests failed without fixing this - this was likely a latent bug in the 2D version too, but found in a test using 3D arrays, so wouldn't have been found with 2D only. This introduces a division on the index calculation - however it should be a nice and constant value allowing a shift to be used to actually divide - or otherwise removed by using other methods to calculate the result. In analysing code generated with optimisation at -O3, there are no divides produced. Some minor refactoring to avoid repeatedly asking for the "rank" of the array being worked on. This improves some of the SPEC-2017 ROMS code, in the same way as the limited 2D array improvements - less overhead spent calculating array indices in the inner-most loop and better use of vector-instructions. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D151140	2023-05-30 18:54:40 +01:00
Tom Eccles	2dfaec7781	Revert "[flang] use greedy mlir driver for stack arrays pass" This reverts commit `74c2ec50f3`. This caused a regression building spec2017 with -Ofast.	2023-05-24 16:15:52 +00:00
Tom Eccles	74c2ec50f3	[flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Whereas, transformations within the same dialect tend to use applyPatternsAndFoldGreedily. In this case, the full complexity of applyPatternsAndFoldGreedily isn't needed so we can get away with the simpler applyOpPatternsAndFold. This change was suggested by @jeanPerier Differential Revision: https://reviews.llvm.org/D150853	2023-05-23 14:51:42 +00:00
Valentin Clement	677f7cc55a	[mlir][flang][openacc] Remove obsolete operand legalization passes The information needed for translation is now encoded in the dialect operations and does not require a dedicated pass to be extracted. Remove the obsolete passes that were performing operand legalization. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150248	2023-05-11 10:33:00 -07:00
Valentin Clement	5e983942d5	[mlir][openacc] Cleanup acc.parallel from old data clause operands Remove old clause operands from acc.parallel operation since the new dataOperands is now in place. private, firstprivate and reductions will receive some redesign but are not part of the new dataOperands. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D150207	2023-05-09 14:57:50 -07:00
Valentin Clement	46e1b095c9	[mlir][openacc] Cleanup acc.data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.data in D149673, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D150155	2023-05-09 13:21:37 -07:00
Valentin Clement	15a480c05e	[mlir][openacc] Cleanup acc.exit_data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.exit_data in D149601, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150145	2023-05-09 11:36:48 -07:00
Valentin Clement	9dec07f44a	[mlir][openacc] Cleanup acc.enter_data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.enter_data in D148721, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150132	2023-05-09 09:01:30 -07:00
Valentin Clement	689afa88ae	[mlir][openacc] Cleanup acc.update from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.update in D149909, the old clause operands are no longer needed. This is a first patch to start cleaning the OpenACC operations with data clause operands. The `LegalizeDataOpForLLVMTranslation` will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. `processOperands` will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: razvanlupusoru, jeanPerier Differential Revision: https://reviews.llvm.org/D150053	2023-05-08 10:03:28 -07:00
Slava Zakharin	7a607e253d	[flang] Removed unnecessary llvm/CodeGen/SelectionDAGNodes.h include. Required after D148767 for flang+debug+slibs build. Reviewed By: chapuni, clementval Differential Revision: https://reviews.llvm.org/D149764	2023-05-03 15:10:09 -07:00
Slava Zakharin	7e584357ac	[flang] Fixed branch-to-default generation for select_type. When the default case requires block arguments, they have to be passed through the cf.br - this piece was missing. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D149484	2023-04-28 15:45:09 -07:00
Jean Perier	c203850ad5	[flang][hlfir] Support fir.declare in AbstractResult pass The AbstractResult pass replaces allocation of function result on the callee side per an extra argument so that the allocation of the result can be done on the caller stack. It looks for the result allocation from the fir.return op, so it needs to handle (in a transparent way) a fir.declare in the chain between the allocation and the fir.return. Reviewed By: vzakhari, clementval Differential Revision: https://reviews.llvm.org/D149057	2023-04-25 09:04:38 +02:00
Matthias Springer	4c48f016ef	[mlir][Affine][NFC] Wrap dialect in "affine" namespace This cleanup aligns the affine dialect with all the other dialects. Differential Revision: https://reviews.llvm.org/D148687	2023-04-20 11:19:21 +09:00
Mats Petersson	a716ace13d	Add loop-versioning pass to improve unit-stride Introduce conditional code to identify stride of "one element", and simplify the array accesses for that case. This allows better loop performance in various benchmarks. Reviewed By: tblah, kiranchandramohan Differential Revision: https://reviews.llvm.org/D141306	2023-04-18 09:53:07 +01:00
V Donaldson	bddd7a6436	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-17 09:10:47 -07:00
V Donaldson	4add0e3db9	Revert "[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors" This reverts commit `17a4fcecf4`.	2023-04-13 18:34:18 -07:00
V Donaldson	17a4fcecf4	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-13 18:02:13 -07:00
Valentin Clement	49a813b4a6	[flang][openacc] Keep region when applying data operand conversion Similar to D148039 but for the FIR to LLVM IR conversion pass. The inner part of the acc.loop has been removed since the rest of the pipeline is not ready and would raise an error here. This was passing until now because the acc.loop was discarded completely. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D148057	2023-04-12 08:20:34 -07:00
Valentin Clement	30408f5ccf	[flang][NFC] Move TypeConverter.h header file to include dir After the extraction of the TypeConverter, move the header files to the include dir so the shared library build is fine. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D147979	2023-04-10 17:01:50 -07:00
Valentin Clement	cd9cdc6837	[flang][openacc] Add missing piece to translate to LLVM IR dialect Add missing pieces to translate handle OpenACC dialect in the translation. Depends on D147825 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147828	2023-04-10 14:30:25 -07:00
Valentin Clement	42598ec745	[flang][openacc] Add data operands conversion from FIR This patch revive an old PR attempt [1] to perform the data operands conversion needed for translation to LLVMIR. This is currently not supporting box/class type since they will normally not reach this pass when the proposed change in this RFC [2] are implemented. [1] https://github.com/flang-compiler/f18-llvm-project/pull/915 [2] https://discourse.llvm.org/t/rfc-openacc-dialect-data-operation-improvements/69825/2 Depends on D147824 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147825	2023-04-10 13:34:57 -07:00
Valentin Clement	1c624633a6	Revert "[flang][openacc] Add data operands conversion from FIR" This reverts commit `68bcd647c9`.	2023-04-10 13:05:37 -07:00
Valentin Clement	cc0a0044bf	Revert "[flang][openacc] Add missing piece to translate to LLVM IR dialect" This reverts commit `03289dc7af`.	2023-04-10 13:05:23 -07:00
Valentin Clement	03289dc7af	[flang][openacc] Add missing piece to translate to LLVM IR dialect Add missing pieces to translate handle OpenACC dialect in the translation. Depends on D147825 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147828	2023-04-10 12:18:59 -07:00
Valentin Clement	68bcd647c9	[flang][openacc] Add data operands conversion from FIR This patch revive an old PR attempt [1] to perform the data operands conversion needed for translation to LLVMIR. This is currently not supporting box/class type since they will normally not reach this pass when the proposed change in this RFC [2] are implemented. [1] https://github.com/flang-compiler/f18-llvm-project/pull/915 [2] https://discourse.llvm.org/t/rfc-openacc-dialect-data-operation-improvements/69825/2 Depends on D147824 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147825	2023-04-10 12:18:05 -07:00
Renaud-K	4c5dee7773	[flang] Lowering fir.dispatch in the polymorphic op pass Differential revision: https://reviews.llvm.org/D146594	2023-03-23 09:40:47 -07:00
Renaud-K	b07ef9e7cd	Break circular dependency between FIR dialect and utilities	2023-03-09 15:24:51 -08:00
Renaud-K	ff761f2ce4	[flang] Move fir.select_type into the PolymorphicOpConversion pass https://reviews.llvm.org/D144921	2023-03-01 11:33:31 -08:00
Sacha Ballantyne	242bb0b652	[flang] Fix a bug with simplified minloc that treated logicals with even values > 1 as 0 Previously the mask would be loaded as the appropriate integer type and cast to I1 to pass to fir.if, however this truncates the integer and so would cast 6 to 0. By loading values as logicals and casting to I1 this problem is avoided. Reviewed By: Leporacanthicus Differential Revision: https://reviews.llvm.org/D144974	2023-02-28 17:15:36 +00:00
Sacha Ballantyne	79dccded69	[flang] Change COUNT intrinsic to support different kind logicals Previously COUNT would cast the mask input to logical<4> before passing it to the runtime function, this has been changed to allow different types of logical. Reviewed By: tblah Differential Revision: https://reviews.llvm.org/D144867	2023-02-28 12:26:33 +00:00
Sacha Ballantyne	614cd721e1	[Flang] Add Minloc to simplify intrinsics pass This patch adds minloc to the simplify intrinsics pass, supporting calls with KIND or MASK arguments while calls which have BACK, DIM or have a CHARACTER input array are rejected. This patch is targeting exchange2, and in benchmarks provides a ~11% improvement in performance. Also included are some minor style changes / cleanup in simplifyIntrinsics.cpp. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D144103	2023-02-27 11:36:55 +00:00
Mark Danial	1360bfb05b	[Flang] Add user option -funderscoring/-fnounderscoring to control trailing underscore added to external names This patch adds user option -funderscoring/-fnounderscoring to control the trailing underscore being appended to external names (e.g. procedure names, common block names). The option in gfortran is documented in https://gcc.gnu.org/onlinedocs/gfortran/Code-Gen-Options.html. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D140795	2023-02-21 16:34:26 -05:00
Tom Eccles	7a49d50f22	[flang] support fir.unreachable in stack arrays pass Some functions (e.g. the main function) end with a call to the STOP statement instead of a func.return. This is lowered as a call to the stop runtime function followed by a fir.unreachable. fir.unreachable is a terminator and so this can cause functions to have no func.return. The stack arrays pass looks to see which heap allocations have always been freed by the time a function returns. Without any returns, the pass does not detect any freed allocations. This patch changes this behaviour so that fir.unreachable is checked as well as func.return. This allows 15 heap allocations for array temporaries in spec2017 exchange2's main function to be moved to the stack. Differential Revision: https://reviews.llvm.org/D143918	2023-02-14 13:44:59 +00:00
Sacha Ballantyne	98ecc3ac77	[Flang] Fix for Any/All simplification to properly propogate the inital value When rank > 1, the inital value would be lost on inner loops, leading to the wrong value to be returned, e.g. This would return T. This patch fixes this to use the correct inital value for all cases. ``` Integer :: m(0,10) Any(m .eq 0) ``` Reviewed By: vdonaldson Differential Revision: https://reviews.llvm.org/D143899	2023-02-14 10:28:56 +00:00
Tom Eccles	d5ea1b22cb	[flang] use mlir::LoopLikeOpInterface::blockIsInLoop The inlined version of this function can now go away because https://reviews.llvm.org/D141401 has been merged. Differential Revision: https://reviews.llvm.org/D143659	2023-02-13 10:29:36 +00:00
Slava Zakharin	8c85550549	[flang] Fixed build after D142977. Added missing link to HLFIRDialect. Differential Revision: https://reviews.llvm.org/D142977	2023-02-09 14:12:03 -08:00
Sacha Ballantyne	20fba03f96	[Flang] Add Any and All intrinsics to simplify intrinsics pass This patch provides a simplified version of the Any intrinsic as well as the All intrinsic that can be used for inlining or simpiler use cases. These changes are targeting exchange2, and provide a ~9% performance increase. Reviewed By: Leporacanthicus, vzakhari Differential Revision: https://reviews.llvm.org/D142977	2023-02-09 19:52:15 +00:00
Tom Eccles	cc14bf22bd	[flang] add a pass to move array temporaries to the stack This pass implements the `-fstack-arrays` flag. See the RFC in `flang/docs/fstack-arrays.md` for more information. Differential revision: https://reviews.llvm.org/D140415	2023-02-07 10:27:52 +00:00
Andrew Gozillon	f86209fc80	[FLANG][MLIR] Update all module symbol references after changing FuncOp symbol during external name mangling This fixes an issue where the symbols for operations that were not directly handled by the rewriting in ExternalNameConversion.cpp were not updated accurately when a FuncOp symbol was modified. Resulting in a name mismatch between the FuncOp and the operation holding a symbol to the FuncOp. This fix works by updating all of the symbols relating to a FuncOp in a module, this did not show up as an issue previously as fir::CallOps were getting specific handling and only fir::CallOps were being tested. So as the more larger case is now being handled the specific handling for fir::CallOps has been removed (but is still handled by the fix). Reviewers: clementval Differential Revision: https://reviews.llvm.org/D142918	2023-02-02 04:59:32 -06:00

1 2 3 4

167 Commits