clang-p2996

Author	SHA1	Message	Date
Andrew Gozillon	062fce6f4d	[Flang][OpenMP][MLIR] An mlir transformation pass for marking FuncOp's implicitly called from TargetOp's and declare target marked FuncOp's as implicitly declare target This pass will mark functions called from TargetOp's and declare target functions as implicitly declare target by adding the MLIR declare target attribute directly to the function. This pass executes after the initial lowering of Fortran's PFT to MLIR (FIR/OMP+Arith etc.) and is one of a series of passes that aim to clean up the MLIR for offloading (seperate passes in different patches, one for early outlining, another for declare target function filtering). Reviewers: jsjodin, skatrak, kiaranchandramohan Differential Revision: https://reviews.llvm.org/D154247	2023-07-17 08:32:26 -05:00
Sergio Afonso	debdfc0ae2	[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device This patch adds support for selecting which functions are lowered to LLVM IR from MLIR depending on declare target information and whether host or device code is being generated. The approach proposed by this patch is to perform the filtering in two stages: - An MLIR transformation pass, which is added to the Flang translation flow after the `OMPEarlyOutliningPass`. The functions that are kept are those that match the OpenMP processor (host or device) the compiler invocation is targeting, according to the presence of the `-fopenmp-is-target-device` compiler option and declare target information. All functions contaning an `omp.target` are also kept, regardless of the declare target information of the function, due to the need for keeping target regions visible for both host and device compilation. - A filtering step during translation to LLVM IR, which is peformed for those functions that were kept because of the presence of a target region inside. If the targeted OpenMP processor does not match the declare target information of the function, then it is removed from the LLVM IR after its contents have been processed and translated. Since they should only contain an omp.target operation which, in turn, should have been outlined into another LLVM IR function, the wrapper can be deleted at that point. Depends on D150328 and D150329. Differential Revision: https://reviews.llvm.org/D147641	2023-07-17 09:07:54 +01:00
Hideto Ueno	cf40fde4ed	[mlir] Don't emit forward declaration for user defined storage classes Currently DefGen::emitDecl always emits forward declarations of storage classes even for user define ones, which makes it difficult to use template class directly in ODS. This patch changes `DefGen` not to emit forward decl when `genStorageClass` is false. Original discussion: https://discourse.llvm.org/t/use-template-classes-as-user-defined-storage-classes/72015 Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D155225	2023-07-13 21:14:48 -07:00
Andrew Gozillon	e909a2c1ca	[Flang][OpenMP][Lower] Program level implicit SAVE variable handling for declare target This is an attempt at mimicing the method in which threadprivate handles the following type of variables: program main integer :: i !$omp declare target to(i) end Which essentially generates a GlobalOp for the variable (which would normally only be an alloca) when it's instantiated. The main difference is there is no operation generated within the function, instead the declare target attribute is appended later within handleDeclareTarget. Reviewers: kiranchandramohan Differential Revision: https://reviews.llvm.org/D152037	2023-07-13 12:07:21 -05:00
Slava Zakharin	1fa4a0a012	[flang][hlfir] Fixed character allocatable in structure constructor. The problem appeared as a segfault for case like this: ``` type t character(11), allocatable :: c end type character(12), alloctable :: x type(t) y y = t(x) ``` The frontend representes `y = t(x)` as `y=t(c=%SET_LENGTH(x,11_8))`. When 'x' is unallocated the hlfir.set_length lowering results in segfault. It could probably be handled in hlfir.set_length lowering by using NULL base for the hlfir.declare depending on the allocation status of 'x', but I am not sure if !hlfir.expr, in general, is supposed to represent an expression created from unallocated allocatable. I believe in Fortran that would mean referencing an unallocated allocatable, which is not allowed. I decided to special case `SET_LENGTH` in structure constructor, so that we use its 'x' operand as the RHS for the assign operation implying the isAllocatable check for cases when 'x' is allocatable. This requires setting keep_lhs_length_if_realloc flag for the assign operation. Note that when the component being intialized has deferred length the frontend does not produce `SET_LENGTH`. Differential Revision: https://reviews.llvm.org/D155151	2023-07-13 09:44:39 -07:00
Valentin Clement	a25edba7b5	[flang][NFC] Remove duplicate of getDesignatorNameIfDataRef function Remove duplicate of the getDesignatorNameIfDataRef() function. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D155105	2023-07-13 08:53:19 -07:00
Mark Danial	d85b94bf00	[Flang] -funderscoring bug fix There was a bug with the -funderscoring / -fno-underscoring options from (https://reviews.llvm.org/D140795) that prevented the driver option from controlling the underscoring behaviour and instead the behaviour could only be controlled by the pass option instead of the driver option. The driver test case did not catch the bug and also needed to be updated. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D155042	2023-07-13 11:30:35 -04:00
Jan Sjodin	45a9604417	[Flang][OpenMP][MLIR] Add early outlining pass for omp.target operations to flang This patch implements an early outlining transform of omp.target operations in flang. The pass is needed because optimizations may cross target op region boundaries, but with the outlining the resulting functions only contain a single omp.target op plus a func.return, so there should not be any opportunity to optimize across region boundaries. The patch also adds an interface to be able to store and retrieve the parent function name of the original target operation. This is needed to be able to create correct kernel function names when lowering to LLVM-IR. Reviewed By: kiranchandramohan, domada Differential Revision: https://reviews.llvm.org/D154879	2023-07-13 09:14:42 -04:00
Kelvin Li	10124b3e1e	[flang] Add PowerPC vec_sl, vec_sld, vec_sldw, vec_sll, vec_slo, vec_srl and vec_sro intrinsic Co-authored-by: pscoro Differential Revision: https://reviews.llvm.org/D154563	2023-07-12 17:39:13 -04:00
David Truby	f52c64b115	[flang] Add fastmath flags to localBuilder in IntrinsicCall Currently the local builder used in IntrinsicCall doesn't have the fastmath flags passed to it. This results in the fastmath attribute not being added to certain runtime calls. This patch simply forwards the fastmath flags from the parent builder. Differential Revision: https://reviews.llvm.org/D154611	2023-07-11 18:53:31 +01:00
Sergio Afonso	63ca93c7d1	[OpenMP][OMPIRBuilder] Rename IsEmbedded and IsTargetCodegen flags This patch renames the `OpenMPIRBuilderConfig` flags to reduce confusion over their meaning. `IsTargetCodegen` becomes `IsGPU`, whereas `IsEmbedded` becomes `IsTargetDevice`. The `-fopenmp-is-device` compiler option is also renamed to `-fopenmp-is-target-device` and the `omp.is_device` MLIR attribute is renamed to `omp.is_target_device`. Getters and setters of all these renamed properties are also updated accordingly. Many unit tests have been updated to use the new names, but an alias for the `-fopenmp-is-device` option is created so that external programs do not stop working after the name change. `IsGPU` is set when the target triple is AMDGCN or NVIDIA PTX, and it is only valid if `IsTargetDevice` is specified as well. `IsTargetDevice` is set by the `-fopenmp-is-target-device` compiler frontend option, which is only added to the OpenMP device invocation for offloading-enabled programs. Differential Revision: https://reviews.llvm.org/D154591	2023-07-10 14:14:16 +01:00
Slava Zakharin	b77a06084a	[flang][hlfir] Added hlfir.get_length to inquire char length from hlfir.expr. We will use hlfir.get_length to lower inquiries of char length applied to hlfir.expr character values. Reviewed By: tblah, jeanPerier Differential Revision: https://reviews.llvm.org/D154560	2023-07-06 13:21:45 -07:00
Valentin Clement	85128d8b6a	[flang][openacc] Fix false error when common block is in copy clause Wrong error was reported mentioning that the common block was in more than one data sharing clause. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D154393	2023-07-05 08:34:22 -07:00
Tom Eccles	d2d213018d	[flang][hlfir][NFC] refactor transformational intrinsic lowering The old code had overgrown itself and become difficult to read and modify. I've rewritten it and moved it into its own translation unit. I moved PreparedActualArgument to the header file for the transformational intrinsic lowering. Logically, it belongs in ConvertCall.h, but putting it there would create a circular dependency between HlfirIntrinsics and ConvertCall. Differential Revision: https://reviews.llvm.org/D154235	2023-07-04 09:34:43 +00:00
Dmitriy Smirnov	bc4586da6e	[Flang][OpenMP] Lower allocatable or pointer in private clause This patch lowers allocatables and pointers named in "private" OpenMP clause. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D148570	2023-07-03 16:46:02 +00:00
Peter Klausler	7871deb821	[flang] Add optional portability warning for upcoming Fortran 202X/3 breaking change The soon-to-be-published next revision of the ISO Fortran language standard contains a couple of breaking changes to previous specifications that may cause existing programs to silently change their behavior. For the change that introduces automatic reallocation of deferred length allocatable character scalar variables when they appear as the targets of internal WRITE statements, as IOMSG=/ERRMSG= variables, as outputs of INQUIRE specifiers, or as INTENT(OUT) arguments to intrinsic procedures, this patch adds an optional portability warning. Differential Revision: https://reviews.llvm.org/D154242	2023-07-03 09:07:00 -07:00
Kelvin Li	e435129ba9	[flang] Add PowerPC vec_cmpge, vec_cmpgt, vec_cmple, vec_cmplt and vec_any_ge intrinsic Differential Revision: https://reviews.llvm.org/D154167	2023-07-03 00:14:51 -04:00
Valentin Clement	0d6017cd62	[openacc][NFC] Bump parser support number to OpenACC 3.3 Parser support reached OpenACC 3.3 specification. Bump the numbers to reflect the latest specs. Reviewed By: razvanlupusoru, awarzynski Differential Revision: https://reviews.llvm.org/D154249	2023-06-30 12:56:29 -07:00
Jean Perier	0446bfcc5c	[flang][hlfir] Codegen of hlfir.region_assign where LHS conflicts When the analysis of hlfir.region_assign determined that the LHS region evaluation may be impacted by the assignment effects, all LHS must be fully evaluated and saved before any assignment is done. This patch adds TemporaryStorage variants to save address, including vector subscripted entities addresses whose shape must be saved. It uses the DescriptorStack runtime to deal with complex cases inside forall. For the sake of simplicity, this is also used for vector subscripted LHS outside of foralls (each element address is saved as a descriptor on this stack. This is a bit suboptimal, but it is a safe start that will work with all kinds of type (polymorphic, PDTs...) without further work). Another approach would be to saved only the values that are conflicting in the LHS computation, but this would require a much more complex analysis of the LHS region DAG. Differential Revision: https://reviews.llvm.org/D154057	2023-06-30 09:20:52 +02:00
V Donaldson	09ea692d16	[flang] IEEE_ARITHMETIC intrinsic module procedures Implement - IEEE_CLASS - IEEE_COPY_SIGN - IEEE_GET_ROUNDING_MODE - IEEE_IS_FINITE - IEEE_IS_NAN - IEEE_IS_NEGATIVE - IEEE_IS_NORMAL - IEEE_SET_ROUNDING_MODE - IEEE_SIGNBIT - IEEE_SUPPORT_ROUNDING - IEEE_UNORDERED - IEEE_VALUE for all REAL kinds (2, 3, 4, 8, 10, 16) where applicable.	2023-06-29 16:46:22 -07:00
Raghu Maddhipatla	f36a25479b	[Flang] [OpenMP] [Semantics] Change SIMD ALIGNED clause support from parsing a std::list<Name> to OmpObjectlist This is an assisting patch which is implemented to address review comment to switch std::list<Name> to OmpObjectlist from https://reviews.llvm.org/D142722. Also addressed a semantic check https://github.com/llvm/llvm-project/issues/61161 OpenMP 5.2 standard states that only pointer variables (C_PTR, Cray pointers, POINTER or ALLOCATABLE items) can appear in SIMD aligned clause (section 5.11). And not to allow common block names on an ALIGNED clause. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D152637	2023-06-29 17:24:54 -05:00
Slava Zakharin	7b4aa95d7c	[flang][hlfir] Set/propagate 'unordered' attribute for elementals. This patch adds 'unordered' attribute handling the HLFIR elementals' builders and fixes the attribute handling in lowering and transformations. Depends on D154031, D154032 Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D154035	2023-06-29 11:16:38 -07:00
Slava Zakharin	583168ee86	[flang][hlfir] Parse unordered attribute for elemental operations. By default, `hlfir.elemental` and `hlfir.elemental_addr` must process the elements in order. The `unordered` attribute may be set, if it is safe to process the elements out of order. This patch just adds parsing support for the new attribute. Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D154032	2023-06-29 10:35:43 -07:00
Slava Zakharin	5983b8b6d3	[flang][hlfir] Lower ordered elemental subroutine calls. This patch sets `unordered` `fir.do_loop` attribute during lowering of elemental subroutine calls to HLFIR, when it is safe to do so. Proper handling of `hlfir.elemental` will be done in a separate patch. Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D154031	2023-06-29 10:35:43 -07:00
Peter Klausler	e12ffe6a93	[flang] Honor #line and related preprocessing directives Extend the SourceFile class to take account of #line directives when computing source file positions for error messages. Adjust the output of #line directives to -E output so that they reflect any #line directives that were in the input. Differential Revision: https://reviews.llvm.org/D153910	2023-06-29 08:27:37 -07:00
Valentin Clement	eeba037e0b	[flang][openacc] Resolve symbol in device, host and self clause Some symbols were not resolved in the device, host and self clause resulting in an `Internal: no symbol found` error. This patch adds symbol resolution for these clauses. Reviewed By: razvanlupusoru Differential Revision: https://reviews.llvm.org/D153919	2023-06-28 09:08:42 -07:00
Jean Perier	fc2c8fed0b	[flang][hlfir] Do not reuse hlfir.expr mask when saving RHS. In WHERE and masked FORALL assignment, both the mask and the RHS may need to be saved in some temporary storage before evaluating the assignment. The code was trying to "optimize" that case when evaluating the RHS by not fetching the mask temporary that was just created, but in simple cases of WHERE construct where the evaluated mask is an hlfir.expr, this caused the hlfir.expr to be both used in an hlfir.associate and later in an hlfir.apply to create the fir.if to mask the RHS evaluation. This double usage prevents codegen from inlining the hlfir.expr at the hlfir.apply, and from "moving" the hlfir.expr storage into the temp during hlfir.associate bufferization. So this is pessimizing the code: this would lead to created two mask array temporary storages This was caught by the unexpectedly high number of "not yet implemented: hlfir.associate of hlfir.expr with more than one use" that were firing. Use the mask temporary instead (the hlfir.associate result) when possible. Some temporary (the "inlined stack") do not support fetching and pushing in the same run (a single counter is used to keep track of the fetching and pushing position). Add a canBeFetchedAfterPush() for safety, but this limitation is anyway not relevant for hlfir.expr since the inlined stack is only used to save "trivial" scalars. Also update the temporary storage name to only indicate "forall" if the top level construct is a FORALL. This is not a very precise name, but it should at least give a correct context to indicate in the IR why some temporary array storage was created. Differential Revision: https://reviews.llvm.org/D153880	2023-06-28 08:34:22 +02:00
Slava Zakharin	3212051c91	[RFC][flang] Experimental device build of Flang runtime. These are initial changes to experiment with building the Fortran runtime as a CUDA or OpenMP target offload library. The initial patch defines a set of macros that have to be used consistently in Flang runtime source code so that it can be built for different offload devices using different programming models (CUDA, HIP, OpenMP target offload). Currently supported modes are: * CUDA: Flang runtime may be built as a fatlib for the host and a set of CUDA architectures specified during the build. The packaging of the device code is done by the CUDA toolchain and may differ from toolchan to toolchain. * OpenMP offload: - host_device mode: Flang runtime may be built as a fatlib for the host and a set of OpenMP offload architectures. The packaging of the device code is done by the OpenMP offload compiler and may differ from compiler to compiler. OpenMP offload 'nohost' mode is a TODO to match the build setup of libomptarget/DeviceRTL. Flang runtime will be built as LLVM Bitcode library using Clang/LLVM toolchain. The host part of the library will be "empty", so there will be two distributable object: the host Flang runtime and dummy host library with device Flang runtime pieces packaged using clang-offload-packager and clang. In all supported modes, enabling parts of Flang runtime for the device compilation can be done iteratively to make the patches observable. Note that at any point in time the resulting library may have unresolved references to not yet enabled parts of Flang runtime. Example cmake/make commands for building with Clang for NVPTX target: cmake \ -DFLANG_EXPERIMENTAL_CUDA_RUNTIME=ON \ -DCMAKE_CUDA_ARCHITECTURES=80 \ -DCMAKE_C_COMPILER=/clang_nvptx/bin/clang \ -DCMAKE_CXX_COMPILER=/clang_nvptx/bin/clang++ \ -DCMAKE_CUDA_COMPILER=/clang_nvptx/bin/clang \ /llvm-project/flang/runtime/ make -j FortranRuntime Example cmake/make commands for building with Clang OpenMP offload: cmake \ -DFLANG_EXPERIMENTAL_OMP_OFFLOAD_BUILD="host_device" \ -DCMAKE_C_COMPILER=clang \ -DCMAKE_CXX_COMPILER=clang++ \ -DFLANG_OMP_DEVICE_ARCHITECTURES="sm_80" \ ../flang/runtime/ make -j FortranRuntime Differential Revision: https://reviews.llvm.org/D151173	2023-06-27 17:38:01 -07:00
Jean Perier	6c14e84926	[flang][hlfir] Add codegen for vector subscripted LHS This patch adds support for vector subscripted assignment left-hand side. It does not yet add support for the cases where the LHS must be saved because its evaluation could be impacted by the assignment. The implementation adds an hlfir::ElementalOpInterface to share the elemental inlining utility and some other tools between hlfir::ElementalOp and hlfir::ElelemntalAddrOp. It adds generateYieldedLHS() to allow retrieving the LHS value in lowering, whether or not it is vector subscripted. If it is vector subscripted, this utility creates a loop nest iterating over the elements and returns the address of an element. Differential Revision: https://reviews.llvm.org/D153759	2023-06-27 13:30:24 +02:00
Slava Zakharin	ebd0b8a047	[flang][hlfir] Special handling for temporary LHS in AssignOp. When `AssignOp` is used with LHS that is a compiler generated temporary special care must be taken to initialize the temporary and avoid finalizations of its components. This change-set adds optional `temporary_lhs` attribute for `AssignOp` to convey this information to HLFIR-to-FIR conversion pass. Currently, this results in calling `AssignTemporary` runtime for doing the assignment. Reviewed By: jeanPerier, tblah Differential Revision: https://reviews.llvm.org/D152482	2023-06-26 18:28:10 -07:00
Kelvin Li	d45b1b6bc6	[flang] Fix flang-aarch64-latest-gcc build failure due to `f295c88` Insert a return statement	2023-06-26 18:07:25 -04:00
Anthony Cabrera	4ffdc3ac36	[flang][hlfir] `hlfir.char_extremum` op definition and codegen This patch adds an hlfir operation called `char_extremum`, which takes the lexicographic comparison between a variadic number (minimum of 2 arguments) of characters. Discussion for this work can be found in the draft revision found [here](https://reviews.llvm.org/D143326). The reason I'm not promoting that draft to a true patch for review was because I needed to separate out the op definition/codegen and lowering as two separate patches, as preferred by @jeanPerier. Differential Revision: https://reviews.llvm.org/D152474	2023-06-26 15:41:30 -04:00
Kelvin Li	f295c88937	[flang] Add PPC vec_max, vec_min, vec_madd and vec_nmsub intrinsic Differential Revision: https://reviews.llvm.org/D152938	2023-06-26 10:52:47 -04:00
Jean Perier	6716923332	[flang][hlfir] Lower user defined assignment Lower user defined assignment inside the hlfir.region_assign "userDefinedAssignment" mlir region. This is done by adding an entry point to ConvertCall.h in order to call genUserCall with the region block arguments as arguments. The codegen for hlfir.region_assign with user defined assignment will be added in a later patch. Differential Revision: https://reviews.llvm.org/D153404	2023-06-26 13:06:59 +02:00
Shao-Ce SUN	db8e902b30	[flang] Rename remaining `__Fortran_PPC_intrinsics` to `__ppc_intrinsics` Reviewed By: kkwli0 Differential Revision: https://reviews.llvm.org/D153703	2023-06-25 10:15:57 +08:00
Peter Klausler	73a0ae021e	[flang] Fix looping on LEN type parameter usage When a LEN type parameter of one PDT is being used as the value of a LEN type parameter in another PDT, expression rewriting can loop infinitely due to an incorrect assumption that the same PDT's parameters are being referenced. Fixes LLVM bug https://github.com/llvm/llvm-project/issues/63198 Differential Revision: https://reviews.llvm.org/D153465	2023-06-22 07:05:33 -07:00
Elliot Goodrich	cea0eea28e	[llvm] Split out DenseMapInfo<variant> specialization Remove the `DenseMapInfo<std::variant<Ts...>>` variant out from `llvm/ADT/DenseMapInfo.h` into a separate header `llvm/ADT/DenseMapInfoVariant.h` This allows us to remove the `<variant>` include, which is being transitively and unncessary included in all translation units that include `llvm/ADT/DenseMap.h`. There have been similar changes to move out specializations for * `APInt.h` `fd7e309e02` and * `StringRef.h`/`ArrayRef.h` `983565a6fe` to reduce the compilation time. As we are unable to move the specialization into `<variant>`, we create a separate `DenseMapInfoVariant.h` header that can be used by anyone who needs this specialization. This reduces the total number of preprocessing tokens across the LLVM source files in lib from (roughly) 1,964,876,961 to 1,936,551,496 - a reduction of ~1.44%. This should result in a small improvement in compilation time. Differential Revision: https://reviews.llvm.org/D150997	2023-06-22 06:50:54 +01:00
Tom Eccles	74adc3e0eb	[flang][hlfir] fix missing conversion in transpose simplification It seems just replacing the operation was not replacing all of the uses when the types of the expression before and after this pass differ (due to differing shape information). Now the shape information is always kept the same. This fixes https://github.com/llvm/llvm-project/issues/63399 Differential Revision: https://reviews.llvm.org/D153333	2023-06-21 16:54:58 +00:00
Jacob Crawley	3f8d8c1aac	[flang][hlfir] Add hlfir.count intrinsic Adds a new HLFIR operation for the COUNT intrinsic according to the design set out in flang/docs/HighLevel.md. This patch includes all the necessary changes to create a new HLFIR operation and lower it into the fir runtime call. Author was @jacob-crawley. Minor adjustments by @tblah Differential Revision: https://reviews.llvm.org/D152521	2023-06-19 09:09:26 +00:00
Tom Eccles	f523b9a55a	[flang] don't allow conversions between logical and floating point Codegen only supports conversions between logicals and integers. The verifier should reflect this. Differential Revision: https://reviews.llvm.org/D152935	2023-06-19 09:09:01 +00:00
Paul Scoropan	ce782ebdc7	[Flang] Split PowerPC-specific code out of IntrinsicCall into PPCIntrinsicCall This patch moves PPC intrinsic generator code to PPCIntrinsicCall.cpp. In order to move PowerPC intrinsic code out of IntrinsicCall.cpp, we need to also move some declarations to IntrinsicCall.h. handlers[] and mathOperations[] were also chosen to be moved to the IntrinsicCall header. Similarly, ppcHandlers[] and ppcMathOperations[] were moved to the PPCIntrinsicCall header. There are future patches coming up that will introduce many new PPC intrinsics, these will now be defined in PPCIntrinsicCall. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D152460	2023-06-16 17:26:41 +00:00
Peixin Qiao	ed27d28f9a	[flang][OpenMP][OpenACC] Support stop statement in OpenMP/OpenACC region [flang][OpenMP][OpenACC] Support stop statement in OpenMP/OpenACC region This supports lowering of stop statement in OpenMP/OpenACC region. * OpenMP/OpenACC: Emit `fir.unreachable` only if the block is not terminated by any terminator. This avoids knocking off an existing OpenMP/OpenACC terminator. * OpenMP: Emit the OpenMP terminator instead of `fir.unreachable` since OpenMP regions can only be terminated by OpenMP terminators. This is currently skipped for OpenACC since unstructured code is not yet handled specially in OpenACC lowering. Fixes #60737 Fixes #61877 Co-authored-by: Kiran Chandramohan <kiranchandramohan@gmail.com> Co-authored-by: Val Donaldson <vdonaldson@nvidia.com> Reviewed By: vdonaldson, peixin Differential Revision: https://reviews.llvm.org/D129969	2023-06-15 10:29:42 +00:00
Kelvin Li	7e82379d11	[flang] rename PPC specific intrinsic modules (NFC)	2023-06-14 11:21:03 -04:00
Valentin Clement	c1ca4d6fe7	[flang][openacc] Add lowering for min operator Add lowering support for the min operator in reduction clause. Depends on D151565 Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D151671	2023-06-14 08:18:03 -07:00
Valentin Clement	5923e46fce	[flang][openacc] Add parsing support for dim in gang clause Add parsing supprot for dim in gang clause Depends on D151971 Reviewed By: razvanlupusoru, jeanPerier Differential Revision: https://reviews.llvm.org/D151972	2023-06-13 20:33:36 -07:00
Valentin Clement	e6d8598e13	[mlir][flang][openacc] Use new firstprivate representation for compute construct Use the new firstprivate representation on the comupte construct. Reviewed By: razvanlupusoru, jeanPerier Differential Revision: https://reviews.llvm.org/D151975	2023-06-13 20:32:23 -07:00
Kelvin Li	a9e1d2e760	[flang] Add PowerPC vec_add, vec_and, vec_mul, vec_sub and vec_xor intrinsics Differential Revision: https://reviews.llvm.org/D151857	2023-06-13 16:05:21 -04:00
Valentin Clement	ae86fe8591	[flang][openacc] Add parser support for the force modifier in the collapse clause This patch adds parser support for the force modifier on the collapse clause introduced in OpenACC 3.3. Lowering will currently hit a TODO as the MLIR representation of the acc.loop might need some update. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D151974	2023-06-12 11:01:59 -07:00
River Riddle	a5ef51d786	[mlir] Add support for "promised" interfaces Promised interfaces allow for a dialect to "promise" the implementation of an interface, i.e. declare that it supports an interface, but have the interface defined in an extension in a library separate from the dialect itself. A promised interface is powerful in that it alerts the user when the interface is attempted to be used (e.g. via cast/dyn_cast/etc.) and the implementation has not yet been provided. This makes the system much more robust against misconfiguration, and ensures that we do not lose the benefit we currently have of defining the interface in the dialect library. Differential Revision: https://reviews.llvm.org/D120368	2023-06-09 11:30:13 -07:00
Paul Scoropan	e96648a9bd	[Flang] Replace intrinsic function type generators with single generic function type generator In a future patch we plan on introducing a large set of Power-PC specific intrinsics. During our prototyping we found that the number of function type generators we were defining, plus those already defined, were reaching an unreasonable number. This patch introduces a generic function type generator function that can be used for almost all cases. The generator supports creating function types with up to 4 arguments and with arguments/return type of types: void, integer, real, and comlex. The intention is for a future patch, which introduces a set of PowerPC-specific vector intrinsics, to also introduce support in the generator for: integer vector, unsigned vector, and real vector types. Reviewed By: luporl Differential Revision: https://reviews.llvm.org/D151812	2023-06-08 15:17:22 +00:00

1 2 3 4 5 ...

1474 Commits