clang-p2996

Author	SHA1	Message	Date
Daniel Chen	8136ac1c42	[Flang] Define c_int_fast16_t and c_int_fast32_t for PowerPC. (#88292 ) On Linux, PowerPC defines `int_fast16_t` and `int_fast32_t` as `long`. Need to update the corresponding type, `c_int_fast16_t` and `c_int_fast32_t` in `iso_c_binding` module so they are interoparable.	2024-04-10 19:22:38 -04:00
Krzysztof Parzyszek	7d60232b38	[flang][Frontend] Implement printing defined macros via -dM (#87627 ) This should work the same way as in clang.	2024-04-10 10:41:20 -05:00
jeanPerier	d0829fbded	[flang] Enable polymorphic lowering by default (#83285 ) Polymorphic entity lowering status is good. The main remaining TODO is to allow lowering of vector subscripted polymorphic entity, but this does not deserve blocking all application using polymorphism. Remove experimental option and enable lowering of polymorphic entity by default.	2024-03-19 11:45:31 +01:00
Valentin Clement (バレンタインクレメン)	8a8ef1cacf	[flang][cuda] Enable cuda with -x cuda option (#84944 ) Flang driver was already able to enable the CUDA language feature base on the file extension but there was no command line option. This PR adds one.	2024-03-13 09:14:40 -07:00
Matthias Springer	a622b21f46	[mlir][Transforms] Make `ConversionPatternRewriter` constructor private (#82244 ) `ConversionPatternRewriter` objects should not be constructed outside of dialect conversions. Some IR modifications performed through a `ConversionPatternRewriter` are reflected in the IR in a delayed fashion (e.g., only when the dialect conversion is guaranteed to succeed). Using a `ConversionPatternRewriter` outside of the dialect conversion is incorrect API usage and can bring the IR in an inconsistent state. Migration guide: Use `IRRewriter` instead of `ConversionPatternRewriter`.	2024-02-23 10:31:55 +01:00
Julien Schueller	da04f306a8	[flang] Fix missing CGPasses.h.inc (#71691 ) I could reproduce using unix makefiles with -j1 option, with version 16.0.6 Closes #61543 Closes #57680	2024-02-19 10:19:12 +00:00
Daniel Chen	987258f5c7	[Flang] Add __powerpc__ macro to set c_intmax_t to c_int64_t rather than c_int128_t as PowerPC only supports up to c_int64_t. (#81222 ) PowerPC only supports up to `c_int64_t`. Add macro `__powerpc__` and preprocess it for setting `c_intmax_t` in `iso_c_binding` intrinsic module.	2024-02-13 11:03:54 -05:00
Vijay Kandiah	369b822184	[flang] Introducing a method to dynamically and conditionally register dialect interfaces. (#80881 ) This change introduces the `addFIRExtensions` method to dynamically and conditionally register dialect interfaces. As a use case of `addFIRExtensions`, this change moves the static registration of `FIRInlinerInterface` out of the constructor of `FIROpsDialect` to be dynamically registered while loading the necessary MLIR dialects required by Flang. This registration of `FIRInlinerInterface` is also guarded by a boolean `addFIRInlinerInterface` which defaults to true. --------- Co-authored-by: Vijay Kandiah <vkandiah@nvidia.com>	2024-02-07 12:39:44 -08:00
Alex Bradbury	22544e2a54	[flang] Set fast math related function attributes for -Ofast/-ffast-math (#79301 ) The implemented logic matches the logic used for Clang in emitting these attributes. Although it's hoped that function attributes won't be needed in the future (vs using fast math flags in individual IR instructions), there are codegen differences currently with/without these attributes, as can be seen in issues like #79257 or by hacking Clang to avoid producing these attributes and observing codegen changes.	2024-02-05 19:39:12 +00:00
Pierre van Houtryve	500846d2f5	[AMDGPU] Introduce Code Object V6 (#76954 ) Introduce Code Object V6 in Clang, LLD, Flang and LLVM. This is the same as V5 except a new "generic version" flag can be present in EFLAGS. This is related to new generic targets that'll be added in a follow-up patch. It's also likely V6 will have new changes (possibly new metadata entries) added later. Docs change are part of the follow-up patch #76955	2024-02-05 08:19:53 +01:00
Sergio Afonso	92bbf615f5	[Flang][MLIR][OpenMP] Use function-attached target attributes for OpenMP lowering (#78291 ) This patch removes the omp.target module attribute, since the information it held on the target CPU and features is available through the fir.target_cpu and fir.target_features module attributes. Target outlining during the MLIR to LLVM IR translation stage is updated, so that these attributes, at that point available as llvm.func attributes, are passed along to the newly created function.	2024-02-02 13:16:36 +00:00
Sergio Afonso	837bff11cb	[Flang][Lower] Attach target_cpu and target_features attributes to MLIR functions (#78289 ) This patch forwards the target CPU and features information from the Flang frontend to MLIR func.func operation attributes, which are later used to populate the target_cpu and target_features llvm.func attributes. This is achieved in two stages: 1. Introduce the `fir.target_cpu` and `fir.target_features` module attributes with information from the target machine immediately after the initial creation of the MLIR module in the lowering bridge. 2. Update the target rewrite flang pass to get this information from the module and pass it along to all func.func MLIR operations, respectively as attributes named `target_cpu` and `target_features`. These attributes will be automatically picked up during Func to LLVM dialect lowering and used to initialize the corresponding llvm.func named attributes. The target rewrite and FIR to LLVM lowering passes are updated with the ability to override these module attributes, and the `CodeGenSpecifics` optimizer class is augmented to make this information available to target-specific MLIR transformations. This completes a full flow by which target CPU and features make it all the way from compiler options to LLVM IR function attributes.	2024-01-30 13:45:56 +00:00
Kazu Hirata	f0346a5862	[Frontend] Use SmallString::operator std::string (NFC)	2024-01-25 18:17:18 -08:00
Kareem Ergawy	10317da203	[flang][re-apply] Fix seg fault CodeGenAction::executeAction() (#78672 ) If `generateLLVMIR()` fails, we still continue using the module we failed to generate which causes a seg fault if LLVM code-gen failed for some reason or another. This commit fixes this issue. Re-applies PR #78269 and adds LLVM and MLIR dependencies that were missed in the PR. The missing libs were: `LLVMCore` & `MLIRIR`. This reverts commit `4fc7506274`.	2024-01-19 10:40:25 +01:00
Kareem Ergawy	4fc7506274	Revert "[flang] Fix seg fault `CodeGenAction::executeAction()` (#78269 )" (#78667 ) This reverts commit `99cae9a44f`. Temporarily until I reproduce and fix a linker issue: ``` FAILED: tools/flang/unittests/Frontend/FlangFrontendTests ... /usr/bin/ld: tools/flang/unittests/Frontend/CMakeFiles/FlangFrontendTests.dir/CodeGenActionTest.cpp.o: undefined reference to symbol '_ZN4llvm11LLVMContextC1Ev' /usr/bin/ld: /work1/omp-nightly/build/git/trunk18.0/build/llvm-project/lib/libLLVMCore.so.18git: error adding symbols: DSO missing from command line ```	2024-01-19 05:30:29 +01:00
Kareem Ergawy	99cae9a44f	[flang] Fix seg fault `CodeGenAction::executeAction()` (#78269 )	2024-01-18 20:36:27 +01:00
Luke Lau	219c14a260	[Flang] Remove dead -mvscale-{min,max} logic from getVScaleRange. NFCI (#78133 ) After #77905, setting -mvscale-min or -mvscale-max on targets other than AArch64 and RISC-V should be an error now, so we no longer need this target-agnostic code in getVScaleRange.	2024-01-16 00:20:35 +07:00
Luke Lau	c4b591a10f	[Flang][RISCV] Set vscale_range based off zvlb (#77277 ) This patch implements the logic (for now, copied from RISCVTargetInfo::getVScaleRange) so that we can compute the vscale_range based off of the zvlb extension, e.g. using an arch with zvl256b now implies vscale_range(2,1024). It's worth noting that we don't have to exactly copy the behaviour of clang with regards to how it interacts with the -mvscale-min/-mvscale-max flags, but changing it can be left to a future patch. This also adds a guard for +sve so that we only check for it on aarch64, which was the behaviour prior to `898db1136e`	2024-01-15 15:52:20 +07:00
Andrzej Warzyński	8cac995ead	[flang][driver] Limit the usage of -mvscale-max and -mvscale-min (#77905 ) Make sure that `-mvscale-max` and `-mvscale-min` are only available for targets that are known to support vscale and scalable vectors. Also fix capitalization of function variables.	2024-01-15 08:49:58 +00:00
Dominik Adamski	f443fbc49b	[Flang][OpenMP][MLIR] Add support for -nogpulib option (#71045 ) If -nogpulib option is passed by the user, then the OpenMP device runtime is not used and we should not emit globals to configure debugging at compile-time for the device runtime. Link to -nogpulib flag implementation for Clang: https://reviews.llvm.org/D125314	2024-01-10 09:38:58 +01:00
Luke Lau	c8c525678e	[Flang] Remove unused triple variable. NFC (#77275 ) I'm not sure why we don't get an unused variable warning, but triple doesn't seem to be used after `898db1136e`.	2024-01-08 18:48:35 +07:00
Radu Salavat	0487377382	[flang] Pass to add frame pointer attribute (#74598 ) Pass to add frame pointer attribute in Flang	2023-12-28 15:41:27 +00:00
Andrzej Warzyński	48e96e97d1	[flang][driver][nfc] Rename one variable (res -> invoc) (#75535 ) The new name better reflects what the variable represents.	2023-12-15 15:04:54 +00:00
Kazu Hirata	11efccea8f	[flang] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-13 23:48:53 -08:00
Philip Reames	898db1136e	[flang] Support -mvscale-min and mvscale-max on all targets (#74633 ) We have other targets with scalable vectors (e.g.RISC-V), and there doesn't seem to be any particular reason these options can't be used on those targets.	2023-12-11 08:34:02 -08:00
Dominik Adamski	276a024b49	[NFC][AMDGPU] Unify AMDGPU address space enum (#73944 ) Types of AMDGPU address space were defined not only in Clang-specific class but also in LLVM header. If we unify the AMD GPU address space enumeration, then we can reuse it in Clang, Flang and LLVM.	2023-12-11 10:45:21 +01:00
jeanPerier	e59e848805	[flang] Updating drivers to create data layout before semantics (#73301 ) Preliminary patch to change lowering/code generation to use llvm::DataLayout information instead of generating "sizeof" GEP (see https://github.com/llvm/llvm-project/issues/71507). Fortran Semantic analysis needs to know about the target type size and alignment to deal with common blocks, and intrinsics like C_SIZEOF/TRANSFER. This information should be obtained from the llvm::DataLayout so that it is consistent during the whole compilation flow. This change is changing flang-new and bbc drivers to: 1. Create the llvm::TargetMachine so that the data layout of the target can be obtained before semantics. 2. Sharing bbc/flang-new set-up of the SemanticConstext.targetCharateristics from the llvm::TargetMachine. For now, the actual part that set-up the Fortran type size and alignment from the llvm::DataLayout is left TODO so that this change is mostly an NFC impacting the drivers. 3. Let the lowering bridge set-up the mlir::Module datalayout attributes since it is doing it for the target attribute, and that allows the llvm data layout information to be available during lowering. For flang-new, the changes are code shuffling: the `llvm::TargetMachine` instance is moved to `CompilerInvocation` class so that it can be used to set-up the semantic contexts. `setMLIRDataLayout` is moved to `flang/Optimizer/Support/DataLayout.h` (it will need to be used from codegen pass for fir-opt target independent testing.)), and the code setting-up semantics targetCharacteristics is moved to `Tools/TargetSetup.h` so that it can be shared with bbc. As a consequence, LLVM targets must be registered when running semantics, and it is not possible to run semantics for a target that is not registered with the -triple option (hence the power pc specific modules can only be built if the PowerPC target is available.	2023-12-06 14:20:06 +01:00
Tom Eccles	6b8d659062	[flang] remove -f[no-]alias-analysis (#74343 ) Now that tbaa tags pass is enabled by default, I would like to remove these flags. `-fno-alias-analysis` was originally intended to be useful for debugging, but as it also disables tbaa tag generation in codegen, it turned out to be too noisy. @banach-space expressed that these flags felt too non-standard. The tbaa tags pass can be toggled using `-mllvm -disable-fir-alias-tags=0`	2023-12-05 10:03:57 +00:00
madanial0	8dc474c6b7	[flang] Pass Argv0 to getIntriniscDir and getOpenMPHeadersDir (#73254 ) The `llvm::sys::fs::getMainExecutable(nullptr, nullptr)` is not able to obtain the correct executable path on AIX without Argv0 due to the lack of a current process on AIX's `proc` filesystem. This causes a build failure on AIX as intrinsic module directory is missing. --------- Co-authored-by: Mark Danial <mak.danial@ibm.com>	2023-12-04 16:34:08 -05:00
Tom Eccles	374e8288e0	[flang] (Re-)Enable alias tags pass by default (#74250 ) Enable by default for optimization levels higher than 0 (same behavior as clang). For simplicity, only forward the flag to the frontend driver when it contradicts what is implied by the optimization level. This was first landed in https://github.com/llvm/llvm-project/pull/73111 but was later reverted due to a performance regression. That regression was fixed by https://github.com/llvm/llvm-project/pull/74065.	2023-12-04 15:28:15 +00:00
Andrzej Warzyński	ae4d7ac9c8	[flang][driver][nfc] Move the definition of SemanticsContext (#73669 ) Moves the defintion of `SemanticsContext` within the Flang driver. Rather than in `CompilerInvocation`, semantic context fits better within `CompilerInstance` that encapsulates the objects that are required to run the frontend. `CompilerInvocation` is better suited for objects encapsulating compiler configuration (e.g. set-up resulting from user input or host set-up).	2023-11-29 21:20:43 +00:00
Tom Eccles	5ce5ea3786	Revert "[flang] Enable alias tags pass by default (#73111 )" (#73821 ) This reverts commit `caba0314cf`. Serious performance regressions were reported by @vzakhari https://github.com/llvm/llvm-project/issues/58303#issuecomment-1830754173 Fixing this doesn't look quick so I will revert for now.	2023-11-29 17:10:18 +00:00
Dominik Adamski	95943d2fab	[Flang] Add code-object-version option (#72638 ) Information about code object version can be configured by the user for AMD GPU target and it needs to be placed in LLVM IR generated by Flang. Information about code object version in MLIR generated by the parser can be reused by other tools. There is no need to specify extra flags if we want to invoke MLIR tools (like fir-opt) separately. Changes in comparison to a8ac93: * added information about required targets for test flang/test/Driver/driver-help.f90	2023-11-29 03:01:01 -06:00
Dominik Adamski	f00ffcdb58	Revert "[Flang] Add code-object-version option (#72638 )" This commit causes test errors on buildbots. This reverts commit `a8ac930b99`.	2023-11-28 13:18:46 -06:00
Dominik Adamski	a8ac930b99	[Flang] Add code-object-version option (#72638 ) Information about code object version can be configured by the user for AMD GPU target and it needs to be placed in LLVM IR generated by Flang. Information about code object version in MLIR generated by the parser can be reused by other tools. There is no need to specify extra flags if we want to invoke MLIR tools (like fir-opt) separately.	2023-11-28 19:57:36 +01:00
Tom Eccles	caba0314cf	[flang] Enable alias tags pass by default (#73111 ) Enable by default for optimization levels higher than 0 (same behavior as clang). For simplicity, only forward the flag to the frontend driver when it contradicts what is implied by the optimization level. Since https://github.com/llvm/llvm-project/pull/72903 there are now no known performance regressions.	2023-11-27 15:10:21 +00:00
David Truby	5e36c64cb6	[flang] Remove extra space added with --dependent-lib option This patch fixes a bug with the --dependent-lib option where an extra space is added to the directive causing linking to fail.	2023-11-20 16:51:02 +00:00
Fabian Mora	be9fa9dee5	[flang][NVPTX] Add initial support to the NVPTX target (#71992 ) This patch adds initial support to the NVPTX target, enabling `flang` to produce OpenMP offload code for NVPTX targets.	2023-11-16 11:34:28 -05:00
David Truby	77ecb9a49b	[flang] Add dependent-lib option to flang -fc1 on Windows (#72121 ) This patch adds a --dependent-lib option to flang -fc1 on Windows to embed library link options into the object file. This is needed to properly select the Windows CRT to link against.	2023-11-15 15:26:43 +00:00
Peter Klausler	1c91d9bdea	[flang] Ensure that portability warnings are conditional (#71857 ) Before emitting a warning message, code should check that the usage in question should be diagnosed by calling ShouldWarn(). A fair number of sites in the code do not, and can emit portability warnings unconditionally, which can confuse a user that hasn't asked for them (-pedantic) and isn't terribly concerned about portability to other compilers. Add calls to ShouldWarn() or IsEnabled() around messages that need them, and add -pedantic to tests that now require it to test their portability messages, and add more expected message lines to those tests when -pedantic causes other diagnostics to fire.	2023-11-13 16:13:50 -08:00
Tom Eccles	a207e6307a	[flang] add fveclib flag (#71734 ) -fveclib= allows users to choose a vectorized libm so that loops containing math functions are vectorized. This is implemented as much as possible in the same way as in clang. The driver test in veclib.f90 is copied from the clang test.	2023-11-13 10:04:50 +00:00
jeanPerier	9f265c3871	[flang][driver] add -flang-deprecated-no-hlfir hidden option (#71820 ) Patch 1/3 of the transition to use the HLFIR step by default in lowering as described in https://discourse.llvm.org/t/rfc-enabling-the-hlfir-lowering-by-default/72778/7 This option will allow to lower code without the HLFIR step during a grace period as described in the RFC. It is not meant to be a long term switch for flang.	2023-11-10 11:54:10 +01:00
Tom Eccles	ac0015fe21	[flang][driver] add command line arguments for alias tags pass The ultimate intention is to have this pass enabled by default whenever we are optimizing for speed. But for now, just add the arguments so this can be more easily tested. PR: https://github.com/llvm/llvm-project/pull/68595	2023-10-12 09:37:58 +00:00
Mats Petersson	6180964a01	[flang]Pass to add vscale range attribute (#68103 ) Add vscale range attirbute for the Scalable Vector Extension (SVE) if provided on the command-line (options in a previous commit) If no command-line option is provided, if the target-feature of SVE is specified and the architecture is AArch64, it defualts to 128-2048. in other words a vscale-min of 1, vscale-max of 16. A pass is used to add the atribute to all functions. The vectorizer will use this attribute to generate the SVE instruction to match the range specified. The attribute is harmless if there is no vectorizable operations in the function.	2023-10-05 11:06:00 +01:00
Mats Petersson	7006b90a06	[flang][NFCI]Use config structure for MLIR to LLVM pass creation (#67792 ) The CreateMLIRToLLVMPassPipeline function has quite a few arguments, all of which has default values. Create a struct, with a constructor for the default values, and pass that struct instead. Re-arrange a few include files to make everything available. No functional change intended.	2023-10-03 14:01:50 +01:00
Mats Petersson	11e68c7e7f	[flang]Add vscale argument parsing (#67676 ) Support for vector scale range arguments, for AArch64 scalable vector extension (SVE) support. Adds -msve-vector-bits to the flang frontend, and for flang fc1 the options are -mvscale-min and -mvscale-max (optional). These match the clang and clang cc1 options for the same purposes. A further patch will actually USE these arguments.	2023-10-02 12:01:12 +01:00
Sergio Afonso	fb4bdf361f	[Flang][OpenMP] Run Flang-specific OpenMP MLIR passes in bbc This patch moves the group of OpenMP MLIR passes using after lowering of Fortran to MLIR into a pipeline to be shared by `flang-new` and `bbc`. Currently, the `bbc` tool does not produce the expected FIR for offloading- enabled OpenMP codes due to not running these passes. Unit tests exercising these passes are updated to check `bbc` output as well.	2023-09-18 14:10:04 +01:00
Arthur Eubanks	0a1aa6cda2	[NFC][CodeGen] Change CodeGenOpt::Level/CodeGenFileType into enum classes (#66295 ) This will make it easy for callers to see issues with and fix up calls to createTargetMachine after a future change to the params of TargetMachine. This matches other nearby enums. For downstream users, this should be a fairly straightforward replacement, e.g. s/CodeGenOpt::Aggressive/CodeGenOptLevel::Aggressive or s/CGFT_/CodeGenFileType::	2023-09-14 14:10:14 -07:00
jeanPerier	6ffea74f7c	[flang] Use BIND name, if any, when consolidating common blocks (#65613 ) This patch changes how common blocks are aggregated and named in lowering in order to: * fix one obvious issue where BIND(C) and non BIND(C) with the same Fortran name were "merged" * go further and deal with a derivative where the BIND(C) C name matches the assembly name of a Fortran common block. This is a bit unspecified IMHO, but gfortran, ifort, and nvfortran "merge" the common block without complaints as a linker would have done. This required getting rid of all the common block mangling early in FIR (\_QC) instead of leaving that to the phase that emits LLVM from FIR because BIND(C) common blocks did not have mangled names. Care has to be taken to deal with the underscoring option of flang-new. See added flang/test/Lower/HLFIR/common-block-bindc-conflicts.f90 for an illustration.	2023-09-08 10:43:55 +02:00
Victor Kingi	12da8ef0e3	[Flang][Driver] Add location and remark option printing to R_Group Diagnostics For each R_Group diagnostic produced, this patch gives more information about it by printing the absolute file path, the line and column number the pass was applied to and finally the remark option that was used. Clang does the same with the exception of printing the relative path rather than absolute path. Depends on D159260. That patch adds support for backend passes while this patch adds remark options to the backend test cases. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D159258	2023-08-31 15:41:16 +00:00

1 2 3 4 5 ...

269 Commits