clang-p2996

Author	SHA1	Message	Date
Kazu Hirata	938cdb30f1	[flang] Migrate away from std::nullopt (NFC) (#145928 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to new comers, I would like to move away from the constructor and eventually remove it. This patch replaces std::nullopt with {}. There are a couple of places where std::nullopt is replaced with TypeRange() to accommodate perfect forwarding.	2025-06-26 12:41:49 -07:00
Jacques Pienaar	d0469d1d3c	[mlir] Move WalkResult to Support (#145649 ) This also enables moving StateStack, both are relatively generic helper structs not tied to IR.	2025-06-25 01:58:44 -07:00
Lance Wang	77af8bff97	[mlir]Moves the StateStack to IR folder from Support folder. (#145598 ) [MLIR] Fix circular dependency introduced in In https://github.com/llvm/llvm-project/pull/144897. This PR is to break the dependency. by moving StateStack to IR folder This commit resolves a circular dependency issue between mlir/Support and mlir/IR: - Move StateStack.h and StateStack.cpp from Support to IR folder - Update CMakeLists.txt files to reflect the new locations - Update Bazel BUILD file to maintain correct dependencies - Update includes in affected files (flang, Target/LLVMIR) The circular dependency was caused by StateStack.h depending on IR/Visitors.h while other IR files depended on Support. Moving StateStack to IR eliminates this cycle while maintaining proper separation of concerns.	2025-06-25 00:00:13 -04:00
Tom Eccles	8f7f48a97e	[flang][OpenMP][NFC] remove globals with mlir::StateStack (#144898 ) Idea suggested by @skatrak	2025-06-24 18:30:37 +01:00
Kareem Ergawy	97e17e1595	Revert "[flang] Enable delayed localization by default for `do concurrent` (#144074 )" (#144476 ) This reverts commit `b5dbf8210a`. Reverting again due to gfortran failure: https://lab.llvm.org/buildbot/#/builders/17/builds/8868	2025-06-17 11:34:05 +02:00
Kareem Ergawy	b5dbf8210a	[flang] Enable delayed localization by default for `do concurrent` (#144074 ) Reintroduces changes from https://github.com/llvm/llvm-project/issues/143897. A fix for the reported problem in https://github.com/llvm/llvm-project/issues/143897 is hopefully resolved in https://github.com/llvm/llvm-project/pull/144027. This PR aims to make it easier and more self-contained to revert the switch/flag if we discover any problems with enabling it by default.	2025-06-17 06:08:38 +02:00
Kareem Ergawy	4bd0a0e50b	Revert "[flang] Enable delayed localization by default for `do concurrent` (#142567 )" (#143905 ) This reverts commit `937be17752`. Resolves https://github.com/llvm/llvm-project/issues/143897 until the todo is properly handled.	2025-06-12 17:09:55 +02:00
Kareem Ergawy	937be17752	[flang] Enable delayed localization by default for `do concurrent` (#142567 ) This PR aims to make it easier and more self-contained to revert the switch/flag if we discover any problems with enabling it by default.	2025-06-11 10:10:22 +02:00
Kareem Ergawy	bac4aa440c	[flang] Extend localization support for `do concurrent` (`init` regions) (#142564 ) Extends support for locality specifiers in `do concurrent` by supporting data types that need `init` regions. This further unifies the paths taken by the compiler for OpenMP privatization clauses and `do concurrent` locality specifiers.	2025-06-05 01:01:53 +02:00
Leandro Lupori	aac1f85393	[flang][OpenMP] Explicitly set Shared DSA in symbols (#142154 ) Before this change, OmpShared was not always set in shared symbols. Instead, absence of private flags was interpreted as shared DSA. The problem was that symbols with no flags, with only a host association, could also mean "has same DSA as in the enclosing context". Now shared symbols behave the same as private and can be treated the same way. Because of the host association symbols with no flags mentioned above, it was also incorrect to simply test the flags of a given symbol to find out if it was private or shared. The function GetSymbolDSA() was added to fix this. It would be better to avoid the need of these special symbols, but this would require changes to how symbols are collected in lowering. Besides that, some semantic checks need to know if a DSA clause was used or not. To avoid confusing implicit symbols with DSA clauses a new flag was added: OmpExplicit. It is now set for all symbols with explicitly determined data-sharing attributes. With the changes above, AddToContextObjectWithDSA() and the symbol to DSA map could probably be removed and the DSA could be obtained directly from the symbol, but this was not attempted. Some debug messages were also added, with the "omp" DEBUG_TYPE, to make it easier to debug the creation of implicit symbols and to visualize all associations of a given symbol. Fixes #130533 Fixes #140882	2025-06-03 10:58:23 -03:00
Kareem Ergawy	f8dcb059ae	[flang][fir][OpenMP] Refactor privtization code into shared location (#141767 ) Refactors the utils needed to create privtization/locatization ops for both the fir and OpenMP dialects into a shared location isolating OpenMP stuff out of it as much as possible.	2025-05-29 13:13:44 +02:00
Kareem Ergawy	7e9887a50d	[flang] Generlize names of delayed privatization CLI flags (#138816 ) Remove the `openmp` prefix from delayed privatization/localization flags since they are now used for `do concurrent` as well. PR stack: - https://github.com/llvm/llvm-project/pull/137928 - https://github.com/llvm/llvm-project/pull/138505 - https://github.com/llvm/llvm-project/pull/138506 - https://github.com/llvm/llvm-project/pull/138512 - https://github.com/llvm/llvm-project/pull/138534 - https://github.com/llvm/llvm-project/pull/138816 (this PR)	2025-05-29 12:27:03 +02:00
Kareem Ergawy	e33cd9690f	[flang][fir] Basic PFT to MLIR lowering for do concurrent locality specifiers (#138534 ) Extends support for `fir.do_concurrent` locality specifiers to the PFT to MLIR level. This adds code-gen for generating the newly added `fir.local` ops and referencing these ops from `fir.do_concurrent.loop` ops that have locality specifiers attached to them. This reuses the `DataSharingProcessor` component and generalizes it a bit more to allow for handling `omp.private` ops and `fir.local` ops as well. PR stack: - https://github.com/llvm/llvm-project/pull/137928 - https://github.com/llvm/llvm-project/pull/138505 - https://github.com/llvm/llvm-project/pull/138506 - https://github.com/llvm/llvm-project/pull/138512 - https://github.com/llvm/llvm-project/pull/138534 (this PR) - https://github.com/llvm/llvm-project/pull/138816	2025-05-29 11:04:27 +02:00
Asher Mancinelli	bbb7f01481	[flang] Fix volatile attribute propagation on allocatables (#139183 ) Ensure volatility is reflected not just on the reference to an allocatable, but on the box, too. When we declare a volatile allocatable, we now get a volatile reference to a volatile box. Some related cleanups: * SELECT TYPE constructs check the selector's type for volatility when creating and designating the type used in the selecting block. * Refine the verifier for fir.convert. In general, I think it is ok to implicitly drop volatility in any ptr-to-int conversion because it means we are in codegen (and representing volatility on the LLVM ops and intrinsics) or we are calling an external function (are there any cases I'm not thinking of?) * An allocatable test that was XFAILed is now passing. Making allocatables' boxes volatile resulted in accesses of those boxes being volatile, which resolved some errors coming from the strict verifier. * I noticed a runtime function was missing the fir.runtime attribute.	2025-05-13 08:13:47 -07:00
Zhen Wang	eef4b5a0cd	[flang] [cuda] Fix CUDA implicit data transfer entity creation (#139414 ) Fixed an issue in `genCUDAImplicitDataTransfer` where creating an `hlfir::Entity` from a symbol address could fail when the address comes from a `hlfir.declare` operation. Fix is to check if the address comes from a `hlfir.declare` operation. If so, use the base value from the declare op when available. Falling back to the original address otherwise.	2025-05-12 10:06:39 -07:00
Andre Kuhlenschmidt	4d9479fa8f	[flang][openacc] Allow open acc routines from other modules. (#136012 ) OpenACC routines annotations in separate compilation units currently get ignored, which leads to errors in compilation. There are two reason for currently ignoring open acc routine information and this PR is addressing both. - The module file reader doesn't read back in openacc directives from module files. - Simple fix in `flang/lib/Semantics/mod-file.cpp` - The lowering to HLFIR doesn't generate routine directives for symbols imported from other modules that are openacc routines. - This is the majority of this diff, and is address by the changes that start in `flang/lib/Lower/CallInterface.cpp`.	2025-05-09 11:12:24 -07:00
Kareem Ergawy	227e1ff73b	[flang][fir] Add locality specifiers modeling to `fir.do_concurrent.loop` (#138506 )	2025-05-08 21:42:52 +02:00
Kareem Ergawy	2fb288d4b8	[flang][fir] Lower `do concurrent` loop nests to `fir.do_concurrent` (#137928 ) Adds support for lowering `do concurrent` nests from PFT to the new `fir.do_concurrent` MLIR op as well as its special terminator `fir.do_concurrent.loop` which models the actual loop nest. To that end, this PR emits the allocations for the iteration variables within the block of the `fir.do_concurrent` op and creates a region for the `fir.do_concurrent.loop` op that accepts arguments equal in number to the number of the input `do concurrent` iteration ranges. For example, given the following input: ```fortran do concurrent(i=1:10, j=11:20) end do ``` the changes in this PR emit the following MLIR: ```mlir fir.do_concurrent { %22 = fir.alloca i32 {bindc_name = "i"} %23:2 = hlfir.declare %22 {uniq_name = "_QFsub1Ei"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>) %24 = fir.alloca i32 {bindc_name = "j"} %25:2 = hlfir.declare %24 {uniq_name = "_QFsub1Ej"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>) fir.do_concurrent.loop (%arg1, %arg2) = (%18, %20) to (%19, %21) step (%c1, %c1_0) { %26 = fir.convert %arg1 : (index) -> i32 fir.store %26 to %23#0 : !fir.ref<i32> %27 = fir.convert %arg2 : (index) -> i32 fir.store %27 to %25#0 : !fir.ref<i32> } } ```	2025-05-07 12:52:25 +02:00
Asher Mancinelli	8836bce842	[flang] Add lowering of volatile references (#132486 ) [RFC on discourse](https://discourse.llvm.org/t/rfc-volatile-representation-in-flang/85404/1) Flang currently lacks support for volatile variables. For some cases, the compiler produces TODO error messages and others are ignored. Some of our tests are like the example from _C.4 Clause 8 notes: The VOLATILE attribute (8.5.20)_ and require volatile variables. Prior commits: ``` `c9ec1bc753` [flang] Handle volatility in lowering and codegen (#135311) `e42f860985` [flang][nfc] Support volatility in Fir ops (#134858) `b2711e1526` [flang][nfc] Support volatile on ref, box, and class types (#134386) ```	2025-04-30 08:46:33 -07:00
Valentin Clement (バレンタインクレメン)	46e734746d	[flang][cuda] Update stream type for cuf kernel op (#136627 ) Update the type of the stream operand to be similar to KernelLaunchOp.	2025-04-21 19:22:07 -07:00
Slava Zakharin	50db7a7d26	[flang] Fixed fir.dummy_scope generation to work for TBAA. (#136382 ) The nesting of fir.dummy_scope operations defines the roots of the TBAA forest. If we do not generate fir.dummy_scope in functions that do not have any dummy arguments, then the globals accessed in the function and the dummy arguments accessed by the callee may end up in different sub-trees of the same root. The added tbaa-with-dummy-scope2.fir demonstrates the issue.	2025-04-18 17:19:12 -07:00
Kareem Ergawy	30990c09c9	Revert "[flang][fir] Lower `do concurrent` loop nests to `fir.do_concurrent` (#132904 )" (#135904 ) This reverts commit `04b87e15e4`. The reasons for reverting is that the following: 1. I still need need to upstream some part of the do concurrent to OpenMP pass from our downstream implementation and taking this in downstream will make things more difficult. 2. I still need to work on a solution for modeling locality specifiers on `hlfir.do_concurrent` ops. I would prefer to do that and merge the entire stack together instead of having a partial solution. After merging the revert I will reopen the origianl PR and keep it updated against main until I finish the above.	2025-04-16 07:20:27 -05:00
Kareem Ergawy	04b87e15e4	[flang][fir] Lower `do concurrent` loop nests to `fir.do_concurrent` (#132904 ) Adds support for lowering `do concurrent` nests from PFT to the new `fir.do_concurrent` MLIR op as well as its special terminator `fir.do_concurrent.loop` which models the actual loop nest. To that end, this PR emits the allocations for the iteration variables within the block of the `fir.do_concurrent` op and creates a region for the `fir.do_concurrent.loop` op that accepts arguments equal in number to the number of the input `do concurrent` iteration ranges. For example, given the following input: ```fortran do concurrent(i=1:10, j=11:20) end do ``` the changes in this PR emit the following MLIR: ```mlir fir.do_concurrent { %22 = fir.alloca i32 {bindc_name = "i"} %23:2 = hlfir.declare %22 {uniq_name = "_QFsub1Ei"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>) %24 = fir.alloca i32 {bindc_name = "j"} %25:2 = hlfir.declare %24 {uniq_name = "_QFsub1Ej"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>) fir.do_concurrent.loop (%arg1, %arg2) = (%18, %20) to (%19, %21) step (%c1, %c1_0) { %26 = fir.convert %arg1 : (index) -> i32 fir.store %26 to %23#0 : !fir.ref<i32> %27 = fir.convert %arg2 : (index) -> i32 fir.store %27 to %25#0 : !fir.ref<i32> } } ```	2025-04-16 06:14:38 +02:00
Zhen Wang	8f0d8d28cc	Delete duplicated hlfir.declare op of induction variables of do concurrent when inside cuf kernel directive. (#134467 ) Delete duplicated creation of hlfir.declare op of do concurrent induction variables when inside cuf kernel directive. Obtain the correct hlfir.declare op generated from bindSymbol, and add it to ivValues.	2025-04-06 19:31:09 -07:00
Jean-Didier PAILLEUX	c309abd925	[flang] Implement !DIR$ NOVECTOR and !DIR$ NOUNROLL[_AND_JAM] (#133885 ) Hi, This patch implements support for the following directives : - `!DIR$ NOUNROLL_AND_JAM` to disable unrolling and jamming on a DO LOOP. - `!DIR$ NOUNROLL` to disable unrolling on a DO LOOP. - `!DIR$ NOVECTOR` to disable vectorization on a DO LOOP.	2025-04-02 14:30:01 +02:00
Thirumalai Shaktivel	091dcb8fc2	[Flang] Make a private copy for the common block variables in copyin clause (#111359 ) Fixes: https://github.com/llvm/llvm-project/issues/82949	2025-04-01 11:35:44 +05:30
Michael Kruse	123eb75cd4	[Flang] Do not emit numeric_storage_size into object file (#131463 ) The value of numeric_storage_size depends on compilation options and therefore its value is not yet known when building the builtins runtime. Instead, the parameter is folding a __numeric_storage_size() expression which is loaded into the user program. For the iso_fortran_env object file, omit the symbol as it is never used. Similar tests that ensure that __numeric_storage_size() is not folded until compiling the actual user program exist in FortranEvalutate: `1e6ba3cd2f/flang/lib/Evaluate/check-expression.cpp (L487-L492)` `1e6ba3cd2f/flang/lib/Evaluate/fold-integer.cpp (L1457-L1460)` Required for using CMake to compile the builtin module files. See RFC at https://discourse.llvm.org/t/rfc-building-flangs-builtin-mod-files/84626	2025-03-21 12:32:54 +01:00
jeanPerier	3ff3b29dd6	[flang] lower remaining cases of pointer assignments inside forall (#130772 ) Implement handling of `NULL()` RHS, polymorphic pointers, as well as lower bounds or bounds remapping in pointer assignment inside FORALL. These cases eventually do not require updating hlfir.region_assign, lowering can simply prepare the new descriptor for the LHS inside the RHS region. Looking more closely at the polymorphic cases, there is not need to call the runtime, fir.rebox and fir.embox do handle the dynamic type setting correctly. After this patch, the last remaining TODO is the allocatable assignment inside FORALL, which like some cases here, is more likely an accidental feature given FORALL was deprecated in F2003 at the same time than allocatable components where added.	2025-03-14 10:51:46 +01:00
Leandro Lupori	29f5d5bea9	[flang][OpenMP] Fix privatization of procedure pointers (#130336 ) Fixes #121720	2025-03-11 09:38:40 -03:00
jeanPerier	40e245a9aa	[flang] add support for procedure pointer assignment inside FORALL (#130114 ) Very similar to object pointer assignment, the difference is the SSA types of the LHS (!fir.ref<!fir.boxproc<()->()>> and RHS (!fir.boxproc<()->()). The RHS must be saved as simple address, not descriptors (it is not possible to make CFI descriptor out of procedure entity).	2025-03-07 10:28:02 +01:00
Valentin Clement (バレンタインクレメン)	478e516140	[flang][cuda] Sync double descriptor after c_f_pointer call (#130194 ) After a global device pointer is set through `c_f_pointer`, we need to sync the double descriptor so the version on the device is also up to date.	2025-03-06 19:19:51 -08:00
Zhen Wang	d1abbb4dc5	[flang][cuda] Change induction variable from i32 to index for doconcurrent inside cuf kernel directive (#129924 ) Use `index` instead of `i32` for induction variables for doconcurrent inside cuf kernel directive. Regular do loop inside cuf kernel directive also uses `index`: ``` cuf.kernel<<<, >>> (%arg0 : index) = ... ```	2025-03-05 14:50:42 -08:00
jeanPerier	7302e1b94e	[flang] implement simple pointer assignments inside FORALL (#129522 ) The semantic of pointer assignments inside FORALL requires evaluating the targets (RHS) and pointer variables (LHS) of all iterations before evaluating the assignments. In practice, if the compiler can prove that the RHS and LHS evaluations are not impacted by the assignments, the evaluation of the FORALL assignment statement can be done in a single loop. However, if the compiler cannot prove this, it needs to "save" the addresses of the targets and/or the pointer descriptors of each iterations before doing the assignments. This patch implements the most common cases where there is no lower bound spec, no bounds remapping, the LHS is not polymorphic, and the RHS is not NULL. The HLFIR operation used to represent assignments inside FORALL can be used for pointer assignments to (the only difference being that the LHS is a descriptor address). The analysis for intrinsic assignment can be reused, with the distinction that the RHS data is not read during the assignment. The logic that is used to save LHS in intrinsic assignments inside FORALL is extracted to be used for the RHS of pointer assignments when needed (saving a descriptor value). Pointer assignment LHS are just descriptor addresses and are saved as int_ptr values.	2025-03-05 11:24:04 +01:00
Valentin Clement (バレンタインクレメン)	d1fd3698a9	[flang][cuda] Allow unsupported data transfer to be done on the host (#129160 ) Some data transfer marked as unsupported can actually be deferred to an assignment on the host when the variables involved are unified or managed.	2025-03-02 16:12:01 -08:00
Zhen Wang	a67566b185	Allow do concurrent inside cuf kernel directive (#127693 ) Allow do concurrent inside cuf kernel directive to avoid the following Lowering error: ``` void {anonymous}::FirConverter::genFIR(const Fortran::parser::CUFKernelDoConstruct&): Assertion `bounds && "Expected bounds on the loop construct"' failed. ``` --------- Co-authored-by: Valentin Clement (バレンタインクレメン) <clementval@gmail.com>	2025-02-20 14:05:44 -08:00
Jean-Didier PAILLEUX	d6c6bde9db	[flang] Implement !DIR$ UNROLL_AND_JAM [N] (#125046 ) This patch implements support for the UNROLL_AND_JAM directive to enable or disable unrolling and jamming on a `DO LOOP`. It must be placed immediately before a `DO LOOP` and applies only to the loop that follows. N is an integer that specifying the unrolling factor. This is done by adding an attribute to the branch into the loop in LLVM to indicate that the loop should unrolled and jammed.	2025-02-19 15:00:09 +00:00
Akash Banerjee	9905728e2f	[MLIR][OpenMP] Add Lowering support for OpenMP Declare Mapper directive (#117046 ) This patch adds HLFIR/FIR lowering support for OpenMP Declare Mapper directive. Depends on #117045.	2025-02-18 16:36:01 +00:00
Asher Mancinelli	6b52fb25b9	[flang] Correctly handle `!dir$ unroll` with unrolling factors of 0 and 1 (#126170 ) https://github.com/llvm/llvm-project/pull/123331 added support for the unrolling directive. In the presence of an explicit unrolling factor, that unrolling factor would be unconditionally passed into the metadata even when it was 1 or 0. These special cases should instead disable unrolling. Adding an explicit unrolling factor of 0 triggered this assertion which is fixed by this patch: ``` unsigned int unrollCountPragmaValue(const llvm::Loop*): Assertion `Count >= 1 && "Unroll count must be positive."' failed. ``` Updated tests and documentation.	2025-02-10 08:21:22 -08:00
Michael Kruse	b815a3942a	[Flang] Move non-common headers to FortranSupport (#124416 ) Move non-common files from FortranCommon to FortranSupport (analogous to LLVMSupport) such that * declarations and definitions that are only used by the Flang compiler, but not by the runtime, are moved to FortranSupport * declarations and definitions that are used by both ("common"), the compiler and the runtime, remain in FortranCommon * generic STL-like/ADT/utility classes and algorithms remain in FortranCommon This allows a for cleaner separation between compiler and runtime components, which are compiled differently. For instance, runtime sources must not use STL's `<optional>` which causes problems with CUDA support. Instead, the surrogate header `flang/Common/optional.h` must be used. This PR fixes this for `fast-int-sel.h`. Declarations in include/Runtime are also used by both, but are header-only. `ISO_Fortran_binding_wrapper.h`, a header used by compiler and runtime, is also moved into FortranCommon.	2025-02-06 15:29:10 +01:00
Jean-Didier PAILLEUX	e811cb00e5	[flang] Implement !DIR$ UNROLL [N] (#123331 ) This patch implements support for the UNROLL directive to control how many times a loop should be unrolled. It must be placed immediately before a `DO LOOP` and applies only to the loop that follows. N is an integer that specifying the unrolling factor. This is done by adding an attribute to the branch into the loop in LLVM to indicate that the loop should unrolled. The code pushed to support the directive `VECTOR ALWAYS` has been modified to take account of the fact that several directives can be used before a `DO LOOP`.	2025-01-29 09:44:09 +01:00
Valentin Clement (バレンタインクレメン)	654b76321a	[flang][cuda] Allow to set the stack limit size (#124859 ) This patch adds a call to the CUFInit function just after `ProgramStart` when CUDA Fortran is enabled to initialize the CUDA context. This allows us to set up some context information like the stack limit that can be defined by an environment variable `ACC_OFFLOAD_STACKSIZE=<value>`.	2025-01-28 20:57:33 -08:00
Kaviya Rajendiran	daa18205c6	[Flang][OpenMP] Fix copyin allocatable lowering to MLIR (#122097 ) Fixes https://github.com/llvm/llvm-project/issues/113191 Issue: [flang][OpenMP] Runtime segfault when an allocatable variable is used with copyin Rootcause: The value of the threadprivate variable is not being copied from the primary thread to the other threads within a parallel region. As a result it tries to access a null pointer inside a parallel region which causes segfault. Fix: When allocatables used with copyin clause need to ensure that, on entry to any parallel region each thread’s copy of a variable will acquire the allocation status of the primary thread, before copying the value of a threadprivate variable of the primary thread to the threadprivate variable of each other member of the team.	2025-01-23 11:14:00 +05:30
Kareem Ergawy	a0406ce823	[flang][OpenMP] Add `hostIsSource` paramemter to `copyHostAssociateVar` (#123162 ) This fixes a bug when the same variable is used in `firstprivate` and `lastprivate` clauses on the same construct. The issue boils down to the fact that `copyHostAssociateVar` was deciding the direction of the copy assignment (i.e. the `lhs` and `rhs`) based on whether the `copyAssignIP` parameter is set. This is not the best way to do it since it is not related to whether we doing a copy from host to localized copy or the other way around. When we set the insertion for `firstprivate` in delayed privatization, this resulted in switching the direction of the copy assignment. Instead, this PR adds a new paramter to explicitely tell the function the direction of the assignment. This is a follow up PR for https://github.com/llvm/llvm-project/pull/122471, only the latest commit is relevant.	2025-01-16 19:10:12 +01:00
jeanPerier	d82d53b2e3	[flang][openmp] initialize allocatable components of firstprivate copies (#121808 ) Descriptors of allocatable components of firstprivate derived type copies need to be set-up. Otherwise the program later die when manipulating them inside OpenMP region.	2025-01-07 10:04:27 +01:00
Valentin Clement (バレンタインクレメン)	9165848c82	[flang][cuda] Sync global descriptor when nullifying pointer (#121595 )	2025-01-03 14:37:14 -08:00
Matthias Springer	c870632ef6	[flang] Fix some memory leaks (#121050 ) This commit fixes some but not all memory leaks in Flang. There are still 91 tests that fail with ASAN. - Use `mlir::OwningOpRef` instead of `std::unique_ptr`. The latter does not free allocations of nested blocks. - Pass `ModuleOp` as value instead of reference. - Add few missing deallocations in test cases and other places.	2024-12-25 09:42:03 +01:00
Leandro Lupori	1fcb6a9754	[flang][OpenMP] Initialize allocatable members of derived types (#120295 ) Allocatable members of privatized derived types must be allocated, with the same bounds as the original object, whenever that member is also allocated in it, but Flang was not performing such initialization. The `Initialize` runtime function can't perform this task unless its signature is changed to receive an additional parameter, the original object, that is needed to find out which allocatable members, with their bounds, must also be allocated in the clone. As `Initialize` is used not only for privatization, sometimes this other object won't even exist, so this new parameter would need to be optional. Because of this, it seemed better to add a new runtime function: `InitializeClone`. To avoid unnecessary calls, lowering inserts a call to it only for privatized items that are derived types with allocatable members. Fixes https://github.com/llvm/llvm-project/issues/114888 Fixes https://github.com/llvm/llvm-project/issues/114889	2024-12-19 17:26:50 -03:00
Peter Klausler	fc97d2e68b	[flang] Add UNSIGNED (#113504 ) Implement the UNSIGNED extension type and operations under control of a language feature flag (-funsigned). This is nearly identical to the UNSIGNED feature that has been available in Sun Fortran for years, and now implemented in GNU Fortran for gfortran 15, and proposed for ISO standardization in J3/24-116.txt. See the new documentation for details; but in short, this is C's unsigned type, with guaranteed modular arithmetic for +, -, and *, and the related transformational intrinsic functions SUM & al.	2024-12-18 07:02:37 -08:00
Kareem Ergawy	e532241b02	Re-apply (#117867 ): [flang][OpenMP] Implicitly map allocatable record fields (#120374 ) This re-applies #117867 with a small fix that hopefully prevents build bot failures. The fix is avoiding `dyn_cast` for the result of `getOperation()`. Instead we can assign the result to `mlir::ModuleOp` directly since the type of the operation is known statically (`OpT` in `OperationPass`).	2024-12-18 09:19:45 +01:00
Kareem Ergawy	dc936f3c19	Revert "[flang][OpenMP] Implicitly map allocatable record fields (#117867 )" (#120360 )	2024-12-18 06:52:24 +01:00

1 2 3 4 5 ...

403 Commits