clang-p2996

Author	SHA1	Message	Date
jeanPerier	36a073a5f4	[flang] Add option to skip struct argument rewrite in target-rewrite (#75939 ) Be consistent with complex and character rewrite so that the pass can be run selectively.	2023-12-20 10:15:09 +01:00
jeanPerier	c373f58134	[flang] Lower procedure pointer components (#75453 ) Lower procedure pointer components, except in the context of structure constructor (left TODO). Procedure pointer components lowering share most of the lowering logic of procedure poionters with the following particularities: - They are components, so an hlfir.designate must be generated to retrieve the procedure pointer address from its derived type base. - They may have a PASS argument. While there is no dispatching as with type bound procedure, special care must be taken to retrieve the derived type component base in this case since semantics placed it in the argument list and not in the evaluate::ProcedureDesignator. These components also bring a new level of recursive MLIR types since a fir.type may now contain a component with an MLIR function type where one of the argument is the fir.type itself. This required moving the "derived type in construction" stackto the converter so that the object and function type lowering utilities share the same state (currently the function type utilty would end-up creating a new stack when lowering its arguments, leading to infinite loops). The BoxedProcedurePass also needed an update to deal with this recursive aspect.	2023-12-19 17:17:09 +01:00
jeanPerier	1d57b9a5b1	[flang] Pass one element struct by register on X86-64 (#75802 ) Implement the C struct passing ABI on X86-64 for the trivial case where the structs have one element. This is required to cover some cases of BIND(C) derived type pass with the VALUE attribute.	2023-12-19 09:50:58 +01:00
jeanPerier	27d9a479c0	[flang] Add struct passing target rewrite hooks and partial X86-64 impl (#74829 ) In the context of C/Fortran interoperability (BIND(C)), it is possible to give the VALUE attribute to a BIND(C) derived type dummy, which according to Fortran 2018 18.3.6 - 2. (4) implies that it must be passed like the equivalent C structure value. The way C structure value are passed is ABI dependent. LLVM does not implement the C struct ABI passing for LLVM aggregate type arguments. It is up to the front-end, like clang is doing, to split the struct into registers or pass the struct on the stack (llvm "byval") as required by the target ABI. So the logic for C struct passing sits in clang. Using it from flang requires setting up a lot of clang context and to bridge FIR/MLIR representation to clang AST representation for function signatures (in both directions). It is a non trivial task. See https://stackoverflow.com/questions/39438033/passing-structs-by-value-in-llvm-ir/75002581#75002581. Since BIND(C) struct are rather limited as opposed to generic C struct (e.g. no bit fields). It is easier to provide a limited implementation of it for the case that matter to Fortran. This patch: - Updates the generic target rewrite pass to keep track of both the new argument type and attributes. The motivation for this is to be able to tell if a previously marshalled argument is passed in memory (it is a C pointer), or if it is being passed on the stack (has the byval llvm attributes). - Adds an entry point in the target specific codegen to marshal struct arguments, and use it in the generic target rewrite pass. - Implements limited support for the X86-64 case. So far, the support allows telling if a struct must be passed in register or on the stack, and to deal with the stack case. The register case is left TODO in this patch. The X86-64 ABI implemented is the System V ABI for AMD64 version 1.0	2023-12-12 11:52:39 +01:00
Tom Eccles	bdacd56fd1	[flang][CodeGen] add nsw to address calculations (#74709 ) `nsw` is a flag for LLVM arithmetic operations meaning "no signed wrap". If this keyword is present, the result of the operation is a poison value if overflow occurs. Adding this keyword permits LLVM to re-order integer arithmetic more aggressively. In https://discourse.llvm.org/t/rfc-changes-to-fircg-xarray-coor-codegen-to-allow-better-hoisting/75257/16 @vzakhari observed that adding nsw is useful to enable hoisting of address calculations after some loops (or is at least a step in that direction). Classic flang also adds nsw to address calculations.	2023-12-08 10:51:20 +00:00
Tom Eccles	fcd06d774d	[mlir][flang] add fast math attribute to fcmp (#74315 ) `llvm.fcmp` does support fast math attributes therefore so should `arith.cmpf`. The heavy churn in flang tests are because flang sets `fastmath<contract>` by default on all operations that support the fast math interface. Downstream users of MLIR should not be so effected. This was requested in https://github.com/llvm/llvm-project/issues/74263	2023-12-06 10:19:48 +00:00
jeanPerier	f65e3af73d	[flang] Implement COMPLEX(10) passing and return ABI for X86-64 linux (#74094 ) COMPLEX(10) passing by value and returning follows C complex passing/returning ABI. Cover the COMPLEX(10) case (X87 / __Complex long double on X86-64). Implements System V ABI for AMD64 version 1.0. The LLVM signatures match the one generated by clang for the __Complex long double case. Note that a FIXME is added for the COMPLEX(8) case that is incorrect in a corner case. This will be fixed when dealing with passing derived type by value in BIND(C) context.	2023-12-04 09:47:12 +01:00
Peter Klausler	33b54f01fe	[flang] Move internal Fortran::ISO namespace out of user-facing ISO_F… (#72909 ) …ortran_binding.h ... and into the ISO_Fortran_binding_wrapper.h header, through which the compiler and runtime access its contents. This change ensures that user code that #includes ISO_Fortran_binding.h within 'extern "C" {' doesn't encounter mysterious namespace errors.	2023-11-30 12:59:06 -08:00
Pete Steinfeld	04b185302b	[flang] Cleanup of NYI messages (#73740 ) This update makes the user visible messages relating to features that are not yet implemented be more consistent. I also cleaned up some of the code. For NYI messages that refer to intrinsics, I made sure the the message begins with "not yet implemented: intrinsic:" to make them easier to recognize. I created some utility functions for NYI reporting that I put into .../include/Optimizer/Support/Utils.h. These mainly convert MLIR types to their Fortran equivalents. I converted the NYI code to use the newly created utility functions.	2023-11-29 09:20:46 -08:00
jeanPerier	740f14edb4	[flang] fix codegen warning from #73641 (#73808 )	2023-11-29 18:00:40 +01:00
jeanPerier	91e1b4a64f	[flang] add fir.box_offset operation (#73641 ) This operation allows computing the address of descriptor fields. It is needed to help attaching descriptors in OpenMP/OpenACC target region. The pointers inside the descriptor structure must be mapped too, but the fir.box is abstract, so these fields cannot be computed with fir.coordinate_of. To preserve the abstraction of the descriptor layout in FIR, introduce an operation specifically to !fir.ref<fir.box<>> address fields based on field names (base_addr or derived_type).	2023-11-29 10:27:27 +01:00
Fabian Mora	be9fa9dee5	[flang][NVPTX] Add initial support to the NVPTX target (#71992 ) This patch adds initial support to the NVPTX target, enabling `flang` to produce OpenMP offload code for NVPTX targets.	2023-11-16 11:34:28 -05:00
Tom Eccles	6be0e97989	[flang] Add fastmath attributes to complex arithmetic (#70690 ) Propagate fast math flags through complex number lowering (when lowering fir.*c directly to llvm floating point operations). The lowering path through the MLIR complex dialect is unchanged. This leads to a small improvement in spec2017 fotonik3d_r.	2023-10-31 16:15:13 +00:00
jeanPerier	8a1ce2d6c2	[flang][codegen] Update FIR codegen to use mlir.llvm opaque pointers (#69692 ) !llvm.ptr<T> typed pointers are depreciated in MLIR LLVM dialects. Flang codegen still generated them and relied on mlir.llvm codegen to LLVM to turn them into opaque pointers. This patch update FIR codegen to directly emit and work with LLVM opaque pointers. Addresses https://github.com/llvm/llvm-project/issues/69303 - All places generating GEPs need to add an extra type argument with the base type (the T that was previously in the llvm.ptr<T> of the base). - llvm.alloca must also be provided the object type. In the process, I doscovered that we were shamelessly copying all the attribute from fir.alloca to the llvm.alloca, which makes no sense for the operand segments. The updated code that cannot take an attribute dictionnary in the llvm.alloca builder with opaque pointers only propagate the "pinned" and "bindc_name" attributes to help debugging the generated IR. - Updating all the places that rely on getting the llvm object type from lowered llvm.ptr<T> arguments to get it from a type conversion of the original fir types. - Updating all the places that were generating llvm.ptr<T> types to generate the opaque llvm.ptr type. - Updating all the codegen tests checking generated MLIR llvm dialect. Many tests are testing directly LLVM IR, and this change is a no-op for those (which is expected).	2023-10-25 09:42:28 +02:00
Tom Eccles	ac0015fe21	[flang][driver] add command line arguments for alias tags pass The ultimate intention is to have this pass enabled by default whenever we are optimizing for speed. But for now, just add the arguments so this can be more easily tested. PR: https://github.com/llvm/llvm-project/pull/68595	2023-10-12 09:37:58 +00:00
Tom Eccles	ad547cecf4	[flang] Add missing dependency FIRCodeGen -> FIRAnalysis After https://github.com/llvm/llvm-project/pull/68437	2023-10-11 16:38:24 +00:00
Tom Eccles	6042c2eb9e	[flang] use TBAAForest in TBAABuilder This is important to ensure that tags end up in the same trees that were created in the FIR TBAA pass. If they are in different trees then everything in one tree will be assumed to MayAlias with everything in the other tree. This leads to poor performance. @vzakhari requested that the old (not-per-function) trees are maintained so I left the old test intact. PR: https://github.com/llvm/llvm-project/pull/68437	2023-10-11 16:16:53 +00:00
Tom Eccles	8301e48500	[flang][FIR] add FirAliasAnalysisOpInterface (#68317 ) This interface allows (HL)FIR passes to add TBAA information to fir.load and fir.store. If present, these TBAA tags take precedence over those added during CodeGen. We can't reuse mlir::LLVMIR::AliasAnalysisOpInterface because that uses the mlir::LLVMIR namespace so it tries to define methods for fir operations in the wrong namespace. But I did re-use the tbaa tag type to minimise boilerplate code. The new builders are to preserve the old interface without the tbaa tag.	2023-10-11 15:06:50 +01:00
jeanPerier	4ccd57ddb1	[flang][nfc] replace fir.dispatch_table with more generic fir.type_info (#68309 ) The goal is to progressively propagate all the derived type info that is currently in the runtime type info globals into a FIR operation that can be easily queried and used by FIR/HLFIR passes. When this will be complete, the last step will be to stop generating the runtime info global in lowering, but to do that later in or just before codegen to keep the FIR files readable (on the added type-info.f90 tests, the lowered runtime info globals takes a whooping 2.6 millions characters on 1600 lines of the FIR textual output. The fir.type_info that contains all the info required to generate those globals for such "trivial" types takes 1721 characters on 9 lines). So far this patch simply starts by replacing the fir.dispatch_table operation by the fir.type_info operation and to add the noinit/ nofinal/nodestroy flags to it. These flags will soon be used in HLFIR to better rewrite hlfir.assign with derived types.	2023-10-06 09:29:57 +02:00
Slava Zakharin	cfe8ae3805	[flang] TBAA for memory accesses of derived type values. (#68047 ) Since HLFIR bufferization can introduce shallow copies of derived type values we have to be careful not to treat these load/store operations as data-only-accesses. If a derived type has descriptor members, we attach any-access tag now.	2023-10-03 13:10:06 -07:00
jeanPerier	bb38f268e1	[flang] zero initialized all saved values without initial values (#67693 ) This is not standard but is vastly expected by existing code. This was implemented by https://reviews.llvm.org/D149877 for simple scalars, but MLIR lacked a generic way to deal with aggregate types (arrays and derived type). Support was recently added in https://github.com/llvm/llvm-project/pull/65508. Leverage it to zero initialize all types.	2023-09-29 08:51:30 +02:00
Tobias Gysi	85175edd4e	[mlir][llvm] Replace NullOp by ZeroOp (#67183 ) This revision replaces the LLVM dialect NullOp by the recently introduced ZeroOp. The ZeroOp is more generic in the sense that it represents zero values of any LLVM type rather than null pointers only. This is a follow to https://github.com/llvm/llvm-project/pull/65508	2023-09-25 11:11:52 +02:00
David Truby	5f476b80e3	[flang] Add comdats to functions with linkonce linkage (#66516 ) This fixes a bug where functions generated by the MLIR Math dialect, for example ipowi, would fail to link with link.exe on Windows due to having linkonce linkage but no associated comdat. Adding the comdat on ELF also allows linkers to perform better garbage collection in the binary. Simply adding comdats to all functions with this linkage type should also cover future cases where linkonce or linkonce_odr functions might be necessary.	2023-09-19 15:00:04 +01:00
Hao Jin	f3fdc967a8	[flang] Fix the incorrect insertion point for alloca (#65999 ) While creating a temporary alloca for a box in OpenMp region, the insertion point should be the OpenMP region block instead of the function entry block.	2023-09-13 00:11:50 -04:00
Fangrui Song	fc04472aa2	[flang] Fix duplicate word typos; NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 18:41:05 -07:00
Jie Fu	910b9372d1	[flang] Function 'attributeTypeIsCompatible' should be debug only (NFC) /data/home/jiefu/llvm-project/flang/lib/Optimizer/CodeGen/CodeGen.cpp:2905:20: error: unused function 'attributeTypeIsCompatible' [-Werror,-Wunused-function] static inline bool attributeTypeIsCompatible(mlir::MLIRContext *ctx, ^ 1 error generated.	2023-08-30 22:28:35 +08:00
Leandro Lupori	c8517f1752	[flang] Add support for dense complex constants Add support for representing complex array constants with MLIR dense attribute. This improves compile time and greatly reduces memory usage of programs with large complex array constants. Fixes https://github.com/llvm/llvm-project/issues/63610 Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D155951	2023-08-30 10:51:02 -03:00
Mehdi Amini	dc3dc97410	Remove the `conversionCallStack` from the MLIR TypeConverter This vector keeps tracks of recursive types through the recursive invocations of `convertType()`. However this is something only useful for some specific cases, in which the dedicated conversion callbacks can handle this stack privately. This allows removing a mutable member of the type converter. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D158351	2023-08-27 16:14:31 -07:00
Slava Zakharin	668f261bfa	[flang] Make ISO_Fortran_binding.h a standalone header again. This implements the proposal from https://discourse.llvm.org/t/adding-flang-specific-header-files-to-clang/72442/6 Since ISO_Fortran_binding.h is supposed to be included from users' C/C++ codes, it would better have no dependencies on other header files. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D158549	2023-08-22 18:56:27 -07:00
Matthias Springer	ce254598b7	[mlir][Conversion] Store const type converter in ConversionPattern ConversionPatterns do not (and should not) modify the type converter that they are using. * Make `ConversionPattern::typeConverter` const. * Make member functions of the `LLVMTypeConverter` const. * Conversion patterns take a const type converter. * Various helper functions (that are called from patterns) now also take a const type converter. Differential Revision: https://reviews.llvm.org/D157601	2023-08-14 09:03:11 +02:00
Slava Zakharin	315939fd61	[flang] Fixed slice offset computation in XEmbox codegen. For character type with unknown length we end up generating a GEP with the base type `llvm.ptr<i[width]>`. The GEP produces the address of the first element of the slice, and it should be using the offset computed in the number of characters, while we were providing the offset in bytes. Simple reproducer fails with and w/o HLFIR: ``` program test integer,parameter :: ck = 4 character(:,ck),allocatable :: res(:,:) allocate(character(3,ck) :: res(2,2)) res(1,1) = ck_'111' res(1,2) = ck_'222' res(2,1) = ck_'333' res(2,2) = ck_'444' call check(res) contains subroutine check(res) character(:,ck),allocatable :: res(:,:) print *, res(2,:) end subroutine check end program test ``` Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D156849	2023-08-02 10:45:00 -07:00
Markus Böck	1dda134f85	[mlir][flang] Convert TBAA metadata to an attribute representation The current representation of TBAA is the very last in-tree user of the `llvm.metadata` operation. Using ops to model metadata has a few disadvantages: * Building a graph has to be done through some weakly typed indirection mechanism such as `SymbolRefAttr` * Creating the metadata has to be done through a builder within a metadata op. * It is not multithreading safe as operation insertion into the same block is not thread-safe This patch therefore converts TBAA metadata into an attribute representation, in a similar manner as it has been done for alias groups and access groups in previous patches. This additionally has the large benefit of giving us more "correctness by construction" as it makes things like cycles in a TBAA graph, or references to an incorrectly typed metadata node impossible. Differential Revision: https://reviews.llvm.org/D155444	2023-07-19 16:42:50 +02:00
Leandro Lupori	783222efde	[flang] Fix codegen of subcomponents' indexing Identify multidimensional array indices in subcomponents and convert them from column-major to row-major ordering. This fixes codegen for fircg.ext_array_coor, fircg.ext_embox and, possibly, fircg.ext_rebox. Fixes https://github.com/llvm/llvm-project/issues/62038 Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D154214	2023-07-03 08:59:53 -03:00
Jean Perier	b881fc2737	[flang] Fix array substring emboxing code generation The code generation of the fir.embox op creating descriptors for array substring with a non constant length base was using the substring length to compute the first dimension result stride. Fix it to use the input length instead. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D154086	2023-06-29 18:40:01 +02:00
Jean Perier	51a3468150	[flang] Support CHARACTER(4) pointer targets fir.rebox is emitting an llvm.sdiv to compute the character length given the byte size from the input descriptor. Inside a fir.global, this is not needed given the target length must be accessible via the type, and it caused MLIR to fail LLVM IR code generation (and crash). Use the input type length when available instead. Reviewed By: PeteSteinfeld, vzakhari Differential Revision: https://reviews.llvm.org/D154072	2023-06-29 18:36:44 +02:00
David Truby	8cb0c3bb21	[flang] Add COMDAT to global variables where needed On platforms which support COMDAT sections we should use them when linkonce or linkonce_odr linkage is requested. This is required on Windows (PE/COFF) and provides better behaviour than weak symbols on ELF-based platforms. This patch also reverts string literals to use linkonce instead of internal linkage now that comdats are supported. Differential Revision: https://reviews.llvm.org/D153768	2023-06-28 13:49:30 +01:00
Kelvin Li	a9e1d2e760	[flang] Add PowerPC vec_add, vec_and, vec_mul, vec_sub and vec_xor intrinsics Differential Revision: https://reviews.llvm.org/D151857	2023-06-13 16:05:21 -04:00
Tom Eccles	53cc33b00b	[flang] Store KindMapping by value in FirOpBuilder Previously only a constant reference was stored in the FirOpBuilder. However, a lot of code was merged using FirOpBuilder builder{rewriter, getKindMapping(mod)}; This is incorrect because the KindMapping returned will go out of scope as soon as FirOpBuilder's constructor had run. This led to an infinite loop running some tests using HLFIR (because the stack space containing the kind mapping was re-used and corrupted). One solution would have just been to fix the incorrect call sites, however, as a large number of these had already made it past review, I decided to instead change FirOpBuilder to store its own copy of the KindMapping. This is not costly because nearly every time we construct a KindMapping is exclusively to construct a FirOpBuilder. To make this common pattern simpler, I added a new constructor to FirOpBuilder which calls getKindMapping(). Differential Revision: https://reviews.llvm.org/D151881	2023-06-05 09:57:57 +00:00
Valentin Clement	677f7cc55a	[mlir][flang][openacc] Remove obsolete operand legalization passes The information needed for translation is now encoded in the dialect operations and does not require a dedicated pass to be extracted. Remove the obsolete passes that were performing operand legalization. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D150248	2023-05-11 10:33:00 -07:00
Mats Petersson	43cf32a1c0	[flang]Zero Initialize simple types Instead of filling uninitialized global variables with "undef", initialize them with 0. Only for Integer, Float or Logical type variables. Complex, user defined data structures, arrays, etc are not supported at this point. This patch fixes the main problem of https://github.com/llvm/llvm-project/issues/62432 Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D149877	2023-05-05 17:37:41 +01:00
Slava Zakharin	2384e84bbb	[flang] Restore stack after allocas created by TargetRewrite. This resolves issues with running out of stack on examples like https://fortran-lang.discourse.group/t/modern-fortran-sample-code/2019/18 reported by @clementval. When target rewrite creates alloca(s) around a call, we need to insert stacksave/stackrestore to free the allocated stack. Better performant code may be achieved by placing the alloca(s) outside of loops, but the placement has to behave correctly with regards to OpenMP/OpenACC/etc. dialect operations that have special representation for "private" objects. This is a concervative fix for correctness issue. Differential Revision: https://reviews.llvm.org/D149222	2023-04-26 10:33:00 -07:00
Andrew Gozillon	6b44274d83	[Flang][MLIR] Alter Fir.GlobalOp to print and lower external attributes Fir.GlobalOp's currently do not respect attributes that are applied to them, this change will do two things: - Allow lowering of arbitrary attributes applied to Fir.GlobalOp's to LLVMGlobalOp's during CodeGen - Allow printing and parsing of arbitrarily applied attributes This allows applying other dialects attributes (or other fir attributes) to fir.GlobalOps on the fly and have them exist in the resulting LLVM dialect IR or FIR IR. Reviewer: jeanPerier Differential Revision: https://reviews.llvm.org/D148352	2023-04-20 07:06:05 -05:00
Slava Zakharin	a45ca5d999	[flang] Fixed substr access in embox/rebox CodeGen. The code was using the original operand of the operation, while it should have been using the remapped operands via the adaptor. Differential Revision: https://reviews.llvm.org/D148587	2023-04-18 08:39:35 -07:00
Kiran Chandramohan	96e1d2b5b2	Revert "[Flang] Change fir.divc to perform library call rather than generate inline operations." This reverts commit `a7bb8e273f`. Revertin since this runs into an ABI issue.	2023-04-18 11:08:16 +00:00
Markus Mützel	774703ec08	[flang] Complex numbers in function arguments on Windows Function arguments or return values that are complex floating point values aren't correctly lowered for Windows x86 32-bit and 64-bit targets. See: https://github.com/llvm/llvm-project/issues/61976 Add targets that are specific for these platforms and OS. With thanks to @mstorsjo for pointing out the fix. Reviewed By: vzakhari Differential Revision: https://reviews.llvm.org/D147768	2023-04-17 11:02:26 -07:00
V Donaldson	bddd7a6436	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-17 09:10:47 -07:00
Jean Perier	3ce7e4b28d	[flang] fix fir.array_coor of fir.box with component references When dealing with "derived_array(j)%component" where derived_array is not a contiguous array, but for which we know the extent, lowering generates a fir.array_coor op on a !fir.box<!fir.array<cst x T>> with a fir.slice containing "j" in the component path. Codegen first computes "derived_array(j)" address using the byte strides inside the descriptor, and then computes the offset of "j" from that address with a second GEP. The type of the address in that second GEP matters since "j" is passed in the GEP via an index indicating its component position in the type. The code was using the LLVM type of "derived_array" instead of "derived_array(j)". In general, with fir.box, the extent ("cst" above) is unknown and those types match. But if the extent of "derived_array" is a compile time constant, its LLVM type will be [cst x T] instead of T*, and the produced GEP will compute the address of the nth T instead of the nth component inside T leading to undefined behaviors. Fix this by computing the element type for the second GEP. Differential Revision: https://reviews.llvm.org/D148226	2023-04-14 08:47:50 +02:00
V Donaldson	4add0e3db9	Revert "[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors" This reverts commit `17a4fcecf4`.	2023-04-13 18:34:18 -07:00
V Donaldson	17a4fcecf4	[flang] REAL(KIND=3) and COMPLEX(KIND=3) descriptors Update descriptor generation to correctly set the `type` field for REAL(3) and COMPLEX(3) objects.	2023-04-13 18:02:13 -07:00
Valentin Clement	30408f5ccf	[flang][NFC] Move TypeConverter.h header file to include dir After the extraction of the TypeConverter, move the header files to the include dir so the shared library build is fine. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D147979	2023-04-10 17:01:50 -07:00

1 2 3 4 5 ...

318 Commits