clang-p2996

Author	SHA1	Message	Date
Slava Zakharin	7beb65ae2d	[flang] Fixed LoopVersioning for array slices. (#65703 ) The first test case added in the LIT test demonstrates the problem. Even though we did not consider the inner loop as a candidate for the transformation due to the array_coor with a slice, we decided to version the outer loop for the same function argument. During the cloning of the outer loop we dropped the slicing completely producing invalid code. I restructured the code so that we record all arg uses that cannot be transformed (regardless of the reason), and then fixup the usage information across the loop nests. I also noticed that we may generate redundant contiguity checks for the inner loops, so I fixed it since it was easy with the new way of keeping the usage data.	2023-09-08 09:01:10 -07:00
jeanPerier	6ffea74f7c	[flang] Use BIND name, if any, when consolidating common blocks (#65613 ) This patch changes how common blocks are aggregated and named in lowering in order to: * fix one obvious issue where BIND(C) and non BIND(C) with the same Fortran name were "merged" * go further and deal with a derivative where the BIND(C) C name matches the assembly name of a Fortran common block. This is a bit unspecified IMHO, but gfortran, ifort, and nvfortran "merge" the common block without complaints as a linker would have done. This required getting rid of all the common block mangling early in FIR (\_QC) instead of leaving that to the phase that emits LLVM from FIR because BIND(C) common blocks did not have mangled names. Care has to be taken to deal with the underscoring option of flang-new. See added flang/test/Lower/HLFIR/common-block-bindc-conflicts.f90 for an illustration.	2023-09-08 10:43:55 +02:00
Tom Eccles	ad9af7de90	[flang][LoopVersioning] support fir.array_coor This is the last piece required for the loop versioning patch to work on code lowered via HLFIR. With this patch, HLFIR performance on spec2017 roms is now similar to the FIR lowering. Adding support for fir.array_coor means that many more loops will be versioned, even in the FIR lowering. So far as I have seen, these do not seem to have an impact on performance for the benchmarks I tried, but I expect it would speed up some programs, if the loop being versioned happened to be the hot code. The main difference between fir.array_coor and fir.coordinate_of is that fir.coordinate_of uses zero-based indices, whereas fir.array_coor uses the indices as specified in the Fortran program (starting from 1 by default, but also supporting non default lower bounds). I opted to transform fir.array_coor operations into fir.coordinate_of operations because this allows both to share the same offset calculation logic. The tricky bit of this patch is getting the correct lower bounds for the array operand to subtract from the fir.array_coor indices to get a zero-based indices. So far as I can tell, the FIR lowering will always provide lower bounds (shift) information in the shape operand to the fir.array_coor when non-default lower bounds are used. If none is given, I originally tried falling back to reading lower bounds from the box, but this led to misscompilation in SPEC2017 cam4. Therefore the pass instead assumes that if it can't already find an SSA value for the shift information, the default lower bound (1) should be used. A suspect the incorrect lower bounds in the box for the FIR lowering was already a known issue (see https://reviews.llvm.org/D158119). Differential Revision: https://reviews.llvm.org/D158597	2023-09-04 10:40:40 +00:00
Slava Zakharin	cccf4d6e4a	[flang] Skip OPTIONAL arguments in LoopVersioning. This patch fixes multiple tests failing with segfault due to accessing absent argument box before the loop versioning check. The absent arguments might be treated as contiguous for the purpose of loop versioning, but this is not done in this patch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D158800	2023-08-25 08:33:49 -07:00
Tom Eccles	8d24b7322e	[flang][LoopVersioning] support reboxed operands Since https://reviews.llvm.org/D158119, many boxes lowered via HLFIR are reboxed with better lower bounds information after they are declared. For the loop versioning pass to support FIR lowered via HLFIR, it needs to dereference fir.rebox operations to figure out that the variable was a function argument. I decided to modify the existing dereferencing of fir.declare so that the declared/reboxed value is used in the versioned loop instead of the function argument. This makes it easier for the improved lower bounds information to be accessed. In doing this, I changed ArgInfo to store ArgInfo::arg by value instead of by pointer because mlir::Value has value-type semantics. Differential Revision: https://reviews.llvm.org/D158408	2023-08-23 09:53:05 +00:00
Slava Zakharin	668f261bfa	[flang] Make ISO_Fortran_binding.h a standalone header again. This implements the proposal from https://discourse.llvm.org/t/adding-flang-specific-header-files-to-clang/72442/6 Since ISO_Fortran_binding.h is supposed to be included from users' C/C++ codes, it would better have no dependencies on other header files. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D158549	2023-08-22 18:56:27 -07:00
Slava Zakharin	89b98c13e0	[flang] Fixed simplification for FP maxval. On x86, a simplified F128 maxval ends up calling fmaxl that does not work properly for F128 arguments. It is probably an LLVM issue, but we also should not use arith.maxf if NaN or -0.0 operands are possible. The change is to use cmpf and select. Unfortunately, these arith ops do not support FastMathFlags currently, so I will have to fix this sooner or later (depending on how this affects performance). Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D158200	2023-08-21 19:33:56 -07:00
Mark Danial	bfe390cf9a	[Flang] funderscoring intermittent failure fix There is an intermittent failure in the tests for the funderscoring driver option reported in (https://lab.llvm.org/buildbot/#/builders/21/builds/78228) that is caused by an uninitialized member variable. Reviewed By: kkwli0 Differential Revision: https://reviews.llvm.org/D158187	2023-08-21 14:42:33 -04:00
Tom Eccles	05011024fd	[flang][LoopVersioning] support fir.declare When FIR comes from HLFIR, there will be a fir.declare operation between the source and the usage of each source variable (and some temporary allocations). This pass needs to be able to follow these so that it can still transform loops when HLFIR is used, otherwise it mistakenly assumes these values are not function arguments. More work is needed after this patch to fully support HLFIR, because the generated code tends to use fir.array_coor instead of fir.coordinate_of. Differential Revision: https://reviews.llvm.org/D157964	2023-08-18 09:51:22 +00:00
Sergio Afonso	f20b67a81c	[Flang][MLIR][OpenMP] Improve device-only function filtering This patch improves the implementation of a recent function filtering workaround to address problems uncovered by D154247. In particular, the problem was related to the removal of functions called from within target regions. Since target regions have to remain until LLVM IR is generated, removing these functions from MLIR results in undefined references any time there are calls to them in a target region. This patch modifies the MLIR function filtering pass to make these functions "external" rather than removing them. This way, the processing and lowering of MLIR functions that will eventually be discarded is still prevented, but no calls to undefined functions remain either. Additionally, the approach of just filtering host-only functions during device compilation, and not filtering device-only functions during host compilation, is maintained. This is because code generation for device-only functions is required for host fallback to work. Depends on D156988 Differential Revision: https://reviews.llvm.org/D155827	2023-08-10 11:29:45 +01:00
Valentin Clement	103907bc5f	[flang] Add missing dependency on tablegen files This issue was raised on https://github.com/llvm/llvm-project/issues/64268. `flang/lib/Optimizer/Transforms/SimplifyIntrinsics.cpp` includes `flang/Optimizer/HLFIR/HLFIRDialect.h` and might fails if the HLFIR related tablegen files have not been generated. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D156751	2023-08-01 09:48:07 -07:00
Alex Zinenko	b2b7efb96d	[mlir] NFC: rename XDataFlowAnalysis to XForwardDataFlowAnalysis This makes naming consisnt with XBackwardDataFlowAnalysis. Reviewed By: Mogball, phisiart Differential Revision: https://reviews.llvm.org/D155930	2023-07-27 11:11:40 +00:00
Andrew Gozillon	062fce6f4d	[Flang][OpenMP][MLIR] An mlir transformation pass for marking FuncOp's implicitly called from TargetOp's and declare target marked FuncOp's as implicitly declare target This pass will mark functions called from TargetOp's and declare target functions as implicitly declare target by adding the MLIR declare target attribute directly to the function. This pass executes after the initial lowering of Fortran's PFT to MLIR (FIR/OMP+Arith etc.) and is one of a series of passes that aim to clean up the MLIR for offloading (seperate passes in different patches, one for early outlining, another for declare target function filtering). Reviewers: jsjodin, skatrak, kiaranchandramohan Differential Revision: https://reviews.llvm.org/D154247	2023-07-17 08:32:26 -05:00
Sergio Afonso	debdfc0ae2	[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device This patch adds support for selecting which functions are lowered to LLVM IR from MLIR depending on declare target information and whether host or device code is being generated. The approach proposed by this patch is to perform the filtering in two stages: - An MLIR transformation pass, which is added to the Flang translation flow after the `OMPEarlyOutliningPass`. The functions that are kept are those that match the OpenMP processor (host or device) the compiler invocation is targeting, according to the presence of the `-fopenmp-is-target-device` compiler option and declare target information. All functions contaning an `omp.target` are also kept, regardless of the declare target information of the function, due to the need for keeping target regions visible for both host and device compilation. - A filtering step during translation to LLVM IR, which is peformed for those functions that were kept because of the presence of a target region inside. If the targeted OpenMP processor does not match the declare target information of the function, then it is removed from the LLVM IR after its contents have been processed and translated. Since they should only contain an omp.target operation which, in turn, should have been outlined into another LLVM IR function, the wrapper can be deleted at that point. Depends on D150328 and D150329. Differential Revision: https://reviews.llvm.org/D147641	2023-07-17 09:07:54 +01:00
Jan Sjodin	22a167779a	[flang] Fix OMPEarlyOutlining erasing declare target functions The early outlining pass was erasing target functions that need to be kept. It should only erase functions that contain target ops.	2023-07-13 13:00:23 -04:00
Mark Danial	d85b94bf00	[Flang] -funderscoring bug fix There was a bug with the -funderscoring / -fno-underscoring options from (https://reviews.llvm.org/D140795) that prevented the driver option from controlling the underscoring behaviour and instead the behaviour could only be controlled by the pass option instead of the driver option. The driver test case did not catch the bug and also needed to be updated. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D155042	2023-07-13 11:30:35 -04:00
Jan Sjodin	45a9604417	[Flang][OpenMP][MLIR] Add early outlining pass for omp.target operations to flang This patch implements an early outlining transform of omp.target operations in flang. The pass is needed because optimizations may cross target op region boundaries, but with the outlining the resulting functions only contain a single omp.target op plus a func.return, so there should not be any opportunity to optimize across region boundaries. The patch also adds an interface to be able to store and retrieve the parent function name of the original target operation. This is needed to be able to create correct kernel function names when lowering to LLVM-IR. Reviewed By: kiranchandramohan, domada Differential Revision: https://reviews.llvm.org/D154879	2023-07-13 09:14:42 -04:00
David Truby	f52c64b115	[flang] Add fastmath flags to localBuilder in IntrinsicCall Currently the local builder used in IntrinsicCall doesn't have the fastmath flags passed to it. This results in the fastmath attribute not being added to certain runtime calls. This patch simply forwards the fastmath flags from the parent builder. Differential Revision: https://reviews.llvm.org/D154611	2023-07-11 18:53:31 +01:00
Tom Eccles	76c3c5bca0	[flang] [stack-arrays] fix unused variable warning	2023-06-05 15:36:02 +00:00
Tom Eccles	53cc33b00b	[flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKindMapping(mod)}; This is incorrect because the KindMapping returned will go out of scope as soon as FirOpBuilder's constructor had run. This led to an infinite loop running some tests using HLFIR (because the stack space containing the kind mapping was re-used and corrupted). One solution would have just been to fix the incorrect call sites, however, as a large number of these had already made it past review, I decided to instead change FirOpBuilder to store its own copy of the KindMapping. This is not costly because nearly every time we construct a KindMapping is exclusively to construct a FirOpBuilder. To make this common pattern simpler, I added a new constructor to FirOpBuilder which calls getKindMapping(). Differential Revision: https://reviews.llvm.org/D151881	2023-06-05 09:57:57 +00:00
Tom Eccles	775de6754a	[flang] convert stack arrays allocation to match old type The old fir.allocmem operation returned a !fir.heap<.> type. The new fir.alloca operation returns a !fir.ref<.> type. This patch inserts a fir.convert so that the old type is preserved. This prevents verifier failures when types returned from fir.if statements don't match the expected type. Differential Revision: https://reviews.llvm.org/D151921	2023-06-05 09:57:57 +00:00
Mats Petersson	b812932b35	[FLANG] Change loop versioning to use shift instead of divide Despite me being convinced that the use of divide didn't produce any divide instructions, it does in fact add more instructions than using a plain shift operation. This patch simply changes the divide to a shift right, with an assert to check that the "divisor" is a power of two. Reviewed By: kiranchandramohan, tblah Differential Revision: https://reviews.llvm.org/D151880	2023-06-01 19:29:57 +01:00
Tom Eccles	408f4196ba	[flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Whereas, transformations within the same dialect tend to use applyPatternsAndFoldGreedily. In this case, the full complexity of applyPatternsAndFoldGreedily isn't needed so we can get away with the simpler applyOpPatternsAndFold. This change was suggested by @jeanPerier The old differential revision for this patch was https://reviews.llvm.org/D150853 Re-applying here fixing the issue which led to the patch being reverted. The issue was from erasing uses of the allocation operation while still iterating over those uses (leading to a use-after-free). I have added a regression test which catches this bug for -fsanitize=address builds, but it is hard to reliably cause a crash from the use-after-free in normal builds. Differential Revision: https://reviews.llvm.org/D151728	2023-05-31 14:06:57 +00:00
Mats Petersson	b75f9ce3fe	[FLANG] Support all arrays for LoopVersioning This patch makes more than 2D arrays work, with a fix for the way that loop index is calculated. Removing the restriction of number of dimensions. This also changes the way that the actual index is calculated, such that the stride is used rather than the extent of the previous dimension. Some tests failed without fixing this - this was likely a latent bug in the 2D version too, but found in a test using 3D arrays, so wouldn't have been found with 2D only. This introduces a division on the index calculation - however it should be a nice and constant value allowing a shift to be used to actually divide - or otherwise removed by using other methods to calculate the result. In analysing code generated with optimisation at -O3, there are no divides produced. Some minor refactoring to avoid repeatedly asking for the "rank" of the array being worked on. This improves some of the SPEC-2017 ROMS code, in the same way as the limited 2D array improvements - less overhead spent calculating array indices in the inner-most loop and better use of vector-instructions. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D151140	2023-05-30 18:54:40 +01:00
Tom Eccles	2dfaec7781	Revert "[flang] use greedy mlir driver for stack arrays pass" This reverts commit `74c2ec50f3`. This caused a regression building spec2017 with -Ofast.	2023-05-24 16:15:52 +00:00
Tom Eccles	74c2ec50f3	[flang] use greedy mlir driver for stack arrays pass In upstream mlir, the dialect conversion infrastructure is used for lowering from one dialect to another: the passes are of the form XToYPass. Whereas, transformations within the same dialect tend to use applyPatternsAndFoldGreedily. In this case, the full complexity of applyPatternsAndFoldGreedily isn't needed so we can get away with the simpler applyOpPatternsAndFold. This change was suggested by @jeanPerier Differential Revision: https://reviews.llvm.org/D150853	2023-05-23 14:51:42 +00:00
Valentin Clement	677f7cc55a	[mlir][flang][openacc] Remove obsolete operand legalization passes The information needed for translation is now encoded in the dialect operations and does not require a dedicated pass to be extracted. Remove the obsolete passes that were performing operand legalization. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150248	2023-05-11 10:33:00 -07:00
Valentin Clement	5e983942d5	[mlir][openacc] Cleanup acc.parallel from old data clause operands Remove old clause operands from acc.parallel operation since the new dataOperands is now in place. private, firstprivate and reductions will receive some redesign but are not part of the new dataOperands. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D150207	2023-05-09 14:57:50 -07:00
Valentin Clement	46e1b095c9	[mlir][openacc] Cleanup acc.data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.data in D149673, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D150155	2023-05-09 13:21:37 -07:00
Valentin Clement	15a480c05e	[mlir][openacc] Cleanup acc.exit_data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.exit_data in D149601, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150145	2023-05-09 11:36:48 -07:00
Valentin Clement	9dec07f44a	[mlir][openacc] Cleanup acc.enter_data from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.enter_data in D148721, the old clause operands are no longer needed. The LegalizeDataOpForLLVMTranslation will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. processOperands will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150132	2023-05-09 09:01:30 -07:00
Valentin Clement	689afa88ae	[mlir][openacc] Cleanup acc.update from old data clause operands Since the new data operand operations have been added in D148389 and adopted on acc.update in D149909, the old clause operands are no longer needed. This is a first patch to start cleaning the OpenACC operations with data clause operands. The `LegalizeDataOpForLLVMTranslation` will become obsolete when all operations will be cleaned. For the time being only the appropriate part are being removed. `processOperands` will also receive some updates once all the operands will be coming from an acc data operand operation. Reviewed By: razvanlupusoru, jeanPerier Differential Revision: https://reviews.llvm.org/D150053	2023-05-08 10:03:28 -07:00
Slava Zakharin	7a607e253d	[flang] Removed unnecessary llvm/CodeGen/SelectionDAGNodes.h include. Required after D148767 for flang+debug+slibs build. Reviewed By: chapuni, clementval Differential Revision: https://reviews.llvm.org/D149764	2023-05-03 15:10:09 -07:00
Slava Zakharin	7e584357ac	[flang] Fixed branch-to-default generation for select_type. When the default case requires block arguments, they have to be passed through the cf.br - this piece was missing. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D149484	2023-04-28 15:45:09 -07:00
Jean Perier	c203850ad5	[flang][hlfir] Support fir.declare in AbstractResult pass The AbstractResult pass replaces allocation of function result on the callee side per an extra argument so that the allocation of the result can be done on the caller stack. It looks for the result allocation from the fir.return op, so it needs to handle (in a transparent way) a fir.declare in the chain between the allocation and the fir.return. Reviewed By: vzakhari, clementval Differential Revision: https://reviews.llvm.org/D149057	2023-04-25 09:04:38 +02:00
Matthias Springer	4c48f016ef	[mlir][Affine][NFC] Wrap dialect in "affine" namespace This cleanup aligns the affine dialect with all the other dialects. Differential Revision: https://reviews.llvm.org/D148687	2023-04-20 11:19:21 +09:00
Mats Petersson	a716ace13d	Add loop-versioning pass to improve unit-stride Introduce conditional code to identify stride of "one element", and simplify the array accesses for that case. This allows better loop performance in various benchmarks. Reviewed By: tblah, kiranchandramohan Differential Revision: https://reviews.llvm.org/D141306	2023-04-18 09:53:07 +01:00
V Donaldson	bddd7a6436	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-17 09:10:47 -07:00
V Donaldson	4add0e3db9	Revert "[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors" This reverts commit `17a4fcecf4`.	2023-04-13 18:34:18 -07:00
V Donaldson	17a4fcecf4	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-13 18:02:13 -07:00
Valentin Clement	49a813b4a6	[flang][openacc] Keep region when applying data operand conversion Similar to D148039 but for the FIR to LLVM IR conversion pass. The inner part of the acc.loop has been removed since the rest of the pipeline is not ready and would raise an error here. This was passing until now because the acc.loop was discarded completely. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D148057	2023-04-12 08:20:34 -07:00
Valentin Clement	30408f5ccf	[flang][NFC] Move TypeConverter.h header file to include dir After the extraction of the TypeConverter, move the header files to the include dir so the shared library build is fine. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D147979	2023-04-10 17:01:50 -07:00
Valentin Clement	cd9cdc6837	[flang][openacc] Add missing piece to translate to LLVM IR dialect Add missing pieces to translate handle OpenACC dialect in the translation. Depends on D147825 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147828	2023-04-10 14:30:25 -07:00
Valentin Clement	42598ec745	[flang][openacc] Add data operands conversion from FIR This patch revive an old PR attempt [1] to perform the data operands conversion needed for translation to LLVMIR. This is currently not supporting box/class type since they will normally not reach this pass when the proposed change in this RFC [2] are implemented. [1] https://github.com/flang-compiler/f18-llvm-project/pull/915 [2] https://discourse.llvm.org/t/rfc-openacc-dialect-data-operation-improvements/69825/2 Depends on D147824 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147825	2023-04-10 13:34:57 -07:00
Valentin Clement	1c624633a6	Revert "[flang][openacc] Add data operands conversion from FIR" This reverts commit `68bcd647c9`.	2023-04-10 13:05:37 -07:00
Valentin Clement	cc0a0044bf	Revert "[flang][openacc] Add missing piece to translate to LLVM IR dialect" This reverts commit `03289dc7af`.	2023-04-10 13:05:23 -07:00
Valentin Clement	03289dc7af	[flang][openacc] Add missing piece to translate to LLVM IR dialect Add missing pieces to translate handle OpenACC dialect in the translation. Depends on D147825 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147828	2023-04-10 12:18:59 -07:00
Valentin Clement	68bcd647c9	[flang][openacc] Add data operands conversion from FIR This patch revive an old PR attempt [1] to perform the data operands conversion needed for translation to LLVMIR. This is currently not supporting box/class type since they will normally not reach this pass when the proposed change in this RFC [2] are implemented. [1] https://github.com/flang-compiler/f18-llvm-project/pull/915 [2] https://discourse.llvm.org/t/rfc-openacc-dialect-data-operation-improvements/69825/2 Depends on D147824 Reviewed By: PeteSteinfeld, razvanlupusoru Differential Revision: https://reviews.llvm.org/D147825	2023-04-10 12:18:05 -07:00
Renaud-K	4c5dee7773	[flang] Lowering fir.dispatch in the polymorphic op pass Differential revision: https://reviews.llvm.org/D146594	2023-03-23 09:40:47 -07:00
Renaud-K	b07ef9e7cd	Break circular dependency between FIR dialect and utilities	2023-03-09 15:24:51 -08:00

1 2 3 4

179 Commits