clang-p2996

Author	SHA1	Message	Date
Joseph Huber	f81879c0f7	[Libomptarget] Add RPC-based printf implementation for OpenMP #85638 Summary: Relanding after reverting, only applies to AMDGPU for now. This patch adds an implementation of printf that's provided by the GPU C library runtime. This pritnf currently implemented using the same wrapper handling that OpenMP sets up. This will be removed once we have proper varargs support. This printf differs from the one CUDA offers in that it is synchronous and uses a finite size. Additionally we support pretty much every format specifier except the %n option. Depends on #85331	2024-04-10 13:36:25 -05:00
Joseph Huber	a8f9f85ab0	[Libomptarget][NFC] Fix unused variable warnings Summary: This patch fixes a few warnings that would show up while building.	2024-04-10 10:01:15 -05:00
Joseph Huber	943f39d29e	Revert "[Libomptarget] Add RPC-based `printf` implementation for OpenMP (#85638 )" This reverts commit `2cf8118e3a`. Failing tests, revert until I can fix it	2024-04-02 20:53:45 -05:00
Joseph Huber	2cf8118e3a	[Libomptarget] Add RPC-based `printf` implementation for OpenMP (#85638 ) Summary: This patch adds an implementation of `printf` that's provided by the GPU C library runtime. This `pritnf` currently implemented using the same wrapper handling that OpenMP sets up. This will be removed once we have proper varargs support. This `printf` differs from the one CUDA offers in that it is synchronous and uses a finite size. Additionally we support pretty much every format specifier except the `%n` option. Depends on https://github.com/llvm/llvm-project/pull/85331	2024-04-02 16:36:36 -05:00
Joseph Huber	4213f4a9ae	[Libomptarget] Fix resizing the buffer of RPC handles Summary: The previous code would potentially make it smaller if a device with a lower ID touched it later. Also we should minimize changes to the state for multi threaded reasons. This just sets up an owned slot for each at initialization time.	2024-04-01 07:29:57 -05:00
Sitnikov Sergey	7467dc188a	Fix apostrophes in assert error message (#69881 )	2024-03-29 18:43:01 -04:00
dhruvachak	cc8c6b037c	[OpenMP] [amdgpu] Added a synchronous version of data exchange. (#87032 ) Similar to H2D and D2H, use synchronous mode for large data transfers beyond a certain size for D2D as well. As with H2D and D2H, this size is controlled by an env-var.	2024-03-29 13:33:43 -07:00
Joseph Huber	a1a8bb1d3a	[libc] Change RPC interface to not use device ids (#87087 ) Summary: The current implementation of RPC tied everything to device IDs and forced us to do init / shutdown to manage some global state. This turned out to be a bad idea in situations where we want to track multiple hetergeneous devices that may report the same device ID in the same process. This patch changes the interface to instead create an opaque handle to the internal device and simply allocates it via `new`. The user will then take this device and store it to interface with the attached device. This interface puts the burden of tracking the device identifier to mapped d evices onto the user, but in return heavily simplifies the implementation.	2024-03-29 12:49:16 -05:00
Joseph Huber	ea707baca2	[Libomptarget][NFCI] Move logic out of PluginAdaptorTy (#86971 ) Summary: This patch removes most of the special handling from the `PluginAdaptorTy` in preparation for changing this to be the `GenericPluginTy`. Doing this requires that the OpenMP specific handling of stuff like device offsets be contained within the OpenMP plugin manager. Generally this was uninvasive expect for the change to tracking the offset and size of the used devices. The eaiest way I could think to do this was to use some maps, which double as indicators for which plugins have devices active. This should not affect the logic.	2024-03-29 07:19:22 -05:00
Ye Luo	19185d5a72	[Libomptarget] Make dynamic loading libffi more verbose. (#86891 )	2024-03-27 18:40:57 -05:00
Joseph Huber	4a5056be2e	[Libomptarget][NFC] Remove trivially true checks on function pointers (#86804 ) Summary: Previously we had an interface that checked these functions pointers to see if they are implemented by the plugin. This was removed as currently every single function is implemented as a part of the common interface. These checks are now always true and do nothing.	2024-03-27 14:37:55 -05:00
Joseph Huber	ed68aac9f2	[Libomptarget] Move API implementations into GenericPluginTy (#86683 ) Summary: The plan is to remove the entire plugin interface and simply use the `GenericPluginTy` inside of `libomptarget` by statically linking against it. This means that inside of `libomptarget` we will simply do `Plugin.data_alloc` without the dynamically loaded interface. To reduce the amount of code required, this patch simply moves all of the RTL implementation functions inside of the Generic device. Now the `__tgt_rtl_` interface is simply a shallow wrapper that will soon go away. There is some redundancy here, this will be improved later. For now what is important is minimizing the changes to the API.	2024-03-27 14:10:54 -05:00
Joseph Huber	4dc3225248	[Libomptarget] Replace global `PluginTy::get` interface with references (#86595 ) Summary: We have a plugin singleton that implements the Plugin interface. This then spawns separate device and kernels. Previously when these needed to reach into the global singleton they would use the `PluginTy::get` routine to get access to it. In the future we will move away from this as the lifetime of the plugin will be handled by `libomptarget` directly. This patch removes uses of this inside of the plugin implementaion themselves by simply keeping a reference to the plugin inside of the device. The external `__tgt_rtl` functions still use the global method, but will be removed later.	2024-03-26 07:13:59 -05:00
Joseph Huber	2cad43c1ba	[Libomptarget] Factor functions out of 'Plugin' interface (#86528 ) Summary: This patch factors common functions out of the `Plugin` interface prior to its removal in a future patch. This simply temporarily renames it to `PluginTy` so that we could re-use `Plugin::check` internally as this needs to be defined statically per plugin now. We can refactor this later. The future patch will delete `PluginTy` and `PluginTy::get` entirely. This simply tries to minimize a few changes to make it easier to land.	2024-03-25 15:24:39 -05:00
Joseph Huber	9f0321ccf1	[Libomptarget] Make plugins depend explicitly on `intrinsics_gen` Summary: It's possible for the OpenMP offloading plugins to be build before tablegen is run despite the fact that we rely on it. Simply make it depend on it currently.	2024-03-24 15:24:35 -05:00
Joseph Huber	909ea28ac6	[Libomptarget] Specificall add LLVM include dirs in plugins	2024-03-24 14:29:39 -05:00
Michał Górny	3f5e649ff6	[Libomptarget] Fix linking to LLVM dylib (#86397 ) Use `LINK_COMPONENTS` parameter of `add_llvm_library` rather than passing LLVM components directly to `target_link_libraries`, in order to ensure that LLVM dylib is linked correctly when used. Otherwise, CMake insists on linking to static libraries that aren't present on distributions doing pure dylib installs, such as Gentoo. This fixes a regression introduced in `dcbddc2525`.	2024-03-23 15:46:32 +00:00
Joseph Huber	20e0bacd05	[Libomptarget][Fix] Remove duplicate version script for host builds Summary: This causes an error on some linkers and was mistakenly kept in.	2024-03-22 19:09:16 -05:00
Joseph Huber	85af772f3b	[Libomptarget][FIX] Fix unintentinally used PUBLIC interface Summary: This was supposed to be private and caused some issues with certain configs.	2024-03-22 16:38:51 -05:00
Joseph Huber	dcbddc2525	[Libomptarget] Unify and simplify plugin CMake (#86191 ) Summary: This patch reworks the CMake handling for building plugins. All this does is pull a lot of shared and common logic into a single helper function. This also simplifies the OMPT libraries from being built separately instead of just added.	2024-03-22 16:13:58 -05:00
agozillon	8612fa0d84	[MLIR][OpenMP] Refactor bounds offsetting and fix to apply to all directives (#84349 ) This PR refactors bounds offsetting by combining the two differing implementations (one applying to initial derived type member map implementation for descriptors and the other for regular arrays, effectively allocatable array vs regular array in fortran) now that it's a little simpler to do. The PR also moves the utilization of createAlteredByCaptureMap into genMapInfoOp, where it will be correctly applied to all MapInfoData, appropriately offsetting and altering Pointer data set in the kernel argument structure on the host. This primarily means bounds offsets will now correctly apply to enter/exit/update map clauses as opposed to just the Target directive that is currently the case. A few fortran runtime tests have been added to verify this new behavior. This PR depends on: https://github.com/llvm/llvm-project/pull/84328 and is an extraction of the larger derived type member map PR stack (so a requirement for it to land).	2024-03-22 15:32:39 +01:00
Ulrich Weigand	cb071942f8	[OpenMP] Fix SystemZ build failure Commit `a7d5f73a03` introduced an error in a target_compile_definitions on the SystemZ, causing the build to break. Fixed by adding the missing "PRIVATE".	2024-03-21 12:02:50 +01:00
Joseph Huber	a7d5f73a03	[Libomptarget] Consolidate CPU offloading into 'host' directory (#86014 ) Summary: All of these CPU targets use the same underlying implementation. We should consolidate them into a single target to make it easier to update this to a static library based approach. I have decided to call this the 'host' target so it can be given a single name. We still only build these if the system processor matches and we are on Linux.	2024-03-20 17:41:53 -05:00
Gheorghe-Teodor Bercea	c25e77436e	Revert "[libomptarget][nextgen-plugin] Use SCRELEASE/SCACQUIRE in packet headers" (#85950 ) Reverts llvm/llvm-project#85678	2024-03-20 11:40:12 -04:00
Gheorghe-Teodor Bercea	927308a52b	[libomptarget][nextgen-plugin] Use SCRELEASE/SCACQUIRE in packet headers (#85678 ) This patch updates the construction of packet headers to replace the usage of ACQUIRE/RELEASE with SCACQUIRE/SCRELEASE which is now recommended. The patch also ensures consistency across kernel dispatches.	2024-03-20 11:22:01 -04:00
Ulrich Weigand	fe1341248d	[OpenMP] Disable workshare_chunk.c test case on SystemZ The test added by https://github.com/llvm/llvm-project/pull/83261 has been consistently failing. Mark as UNSUPPORTED just like on x86_64 and aarch64.	2024-03-20 11:12:26 +01:00
dhruvachak	b5d02bbd0d	[OpenMP] Increment kernel args version, used by runtime for detecting dyn_ptr. (#85363 ) A kernel implicit parameter (dyn_ptr) was introduced some time back. This patch increments the kernel args version for a compiler supporting dyn_ptr. The version will be used by the runtime to determine whether the implicit parameter is generated by the compiler. The versioning is required to support use cases where code generated by an older compiler is linked with a newer runtime. If approved, this patch should be backported to release 18.	2024-03-19 16:40:22 -07:00
Joseph Huber	5cccc405e3	Revert "[OpenMP] Disable flaky barrier fence test (#85093 )" This reverts commit `cd8843f87a`. Originally disabled to try to unstick the AMD build bot, didn't make a difference after a week so it goes back in.	2024-03-19 14:46:03 -05:00
Dominik Adamski	b9a41b9e9b	[NFC][OpenMP] Add test checking clang offload chunking policy (#83261 ) Verify how clang handles `dist_schedule(static, block_chunk)` and `schedule(static, thread_chunk)` clauses for OpenMP offload loop workshare pragmas.	2024-03-19 10:29:35 -07:00
Akash Banerjee	e9da5f0083	[OpenMP] Fix target data region codegen being omitted for device pass (#85218 ) This patch enables the BodyCodeGen callback to still trigger for the TargetData nested region during the device pass. There maybe Target code nested within the TargetData region for which this is required. Also add tests for the same.	2024-03-19 13:04:23 +00:00
nicebert	29849d589c	[OpenMP] Fix ompx_dump_mapping_tables lit test (#85754 ) Fixes ompx_dump_mapping_tables test by only using one device after breaking built bots	2024-03-19 10:26:41 +01:00
nicebert	20f5bcfb1a	[OpenMP] Add OpenMP extension API to dump mapping tables (#85381 ) This adds an API call ompx_dump_mapping_tables. This allows users to debug the mapping tables and can be especially useful for unified shared memory applications to check if the code behaves in the way it should. The implementation reuses code already present to dump mapping tables (in a debug setting). --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>	2024-03-18 14:09:20 -05:00
Joseph Huber	470040bd4d	[Libomptarget][NFC] Remove warning on return value const	2024-03-15 18:50:33 -05:00
Felipe Cabarcas	0e21672d99	Fixing LIBOMPTARGET_INFO message, for Copying data from device to host (#85444 ) When running OpenMP offloading application with LIBOMPTARGET_INFO=-1, the addresses of the Copying data from device to host, the address are swap. As an example, Currently the address would be ``` omptarget device 0 info: Mapping exists with HstPtrBegin=0x00007ffddaf0fb90, TgtPtrBegin=0x00007fb385404000, Size=8000, DynRefCount=0 (decremented, delayed deletion), HoldRefCount=0 omptarget device 0 info: Copying data from device to host, TgtPtr=0x00007ffddaf0fb90, HstPtr=0x00007fb385404000, Size=8000, Name=d ``` And it should be ``` omptarget device 0 info: Copying data from device to host, TgtPtr=0x00007fb385404000, HstPtr=0x00007ffddaf0fb90, Size=8000, Name=d ``` --------- Co-authored-by: fel-cab <fel-cab@github.com>	2024-03-15 15:25:14 -04:00
Ulrich Weigand	c9062e8f78	Reapply [libomptarget] Build plugins-nextgen for SystemZ (#83978 ) The plugin was not getting built as the build_generic_elf64 macro assumes the LLVM triple processor name matches the CMake processor name, which is unfortunately not the case for SystemZ. Fix this by providing two separate arguments instead. Actually building the plugin exposed a number of other issues causing various test failures. Specifically, I've had to add the SystemZ target to - CompilerInvocation::ParseLangArgs - linkDevice in ClangLinuxWrapper.cpp - OMPContext::OMPContext (to set the device_kind_cpu trait) - LIBOMPTARGET_ALL_TARGETS in libomptarget/CMakeLists.txt - a check_plugin_target call in libomptarget/src/CMakeLists.txt Finally, I've had to set a number of test cases to UNSUPPORTED on s390x-ibm-linux-gnu; all these tests were already marked as UNSUPPORTED for x86_64-pc-linux-gnu and aarch64-unknown-linux-gnu and are failing on s390x for what seem to be the same reason. In addition, this also requires support for BE ELF files in plugins-nextgen: https://github.com/llvm/llvm-project/pull/85246	2024-03-15 19:06:43 +01:00
Ulrich Weigand	2210c85a66	Reapply [libomptarget] Support BE ELF files in plugins-nextgen (#85246 ) Code in plugins-nextgen reading ELF files is currently hard-coded to assume a 64-bit little-endian ELF format. Unfortunately, this assumption is even embedded in the interface between GlobalHandler and Utils/ELF routines, which use ELF64LE types. To fix this, I've refactored the interface to use generic types, in particular by using (a unique_ptr to) ObjectFile instead of ELF64LEObjectFile, and ELFSymbolRef instead of ELF64LE::Sym. This allows properly templating over multiple ELF format variants inside Utils/ELF; specifically, this patch adds support for 64-bit big-endian ELF files in addition to 64-bit little-endian files.	2024-03-15 18:28:28 +01:00
Ulrich Weigand	4c8714efc5	Revert "[libomptarget] Support BE ELF files in plugins-nextgen (#85246 )" This reverts commit `611c62b30d`.	2024-03-14 18:38:13 +01:00
Ulrich Weigand	611c62b30d	[libomptarget] Support BE ELF files in plugins-nextgen (#85246 ) Code in plugins-nextgen reading ELF files is currently hard-coded to assume a 64-bit little-endian ELF format. Unfortunately, this assumption is even embedded in the interface between GlobalHandler and Utils/ELF routines, which use ELF64LE types. To fix this, I've refactored the interface to use generic types, in particular by using (a unique_ptr to) ObjectFile instead of ELF64LEObjectFile, and ELFSymbolRef instead of ELF64LE::Sym. This allows properly templating over multiple ELF format variants inside Utils/ELF; specifically, this patch adds support for 64-bit big-endian ELF files in addition to 64-bit little-endian files.	2024-03-14 18:19:12 +01:00
Joseph Huber	cd8843f87a	[OpenMP] Disable flaky barrier fence test (#85093 ) Summary: This test is flaky on all targets I know of. We should disable it for now so running the test suite doesn't randomly fail 50% of the time.	2024-03-13 15:05:22 -05:00
Joseph Huber	9f69d3cf88	[Libomptarget] Use NVPTX lane id intrinsic in DeviceRTL (#84928 ) Summary: We are currently taking the lower 5 bites of the thread ID as the warp ID. This doesn't work in non-1D grids and is also slower than just using the dedicated hardware register.	2024-03-12 10:39:40 -05:00
Joseph Huber	8a79003307	[libc] Move RPC opcodes include out of the header Summary: This header isn't strictly necessary, and is currently broken because we install these to separate locations.	2024-03-10 14:07:47 -05:00
Gheorghe-Teodor Bercea	5c752df1e1	[libomptarget][nextgen-plugin][NFC] Clean-up InputSignal checks (#83458 ) Clean-up InputSignal checks.	2024-03-07 12:01:42 -05:00
Anchu Rajendran S	c03fd37d9b	[flang] Changes to map variables in link clause of declare target (#83643 ) As per the OpenMP standard, "If a variable appears in a link clause on a declare target directive that does not have a device_type clause with the nohost device-type-description then it is treated as if it had appeared in a map clause with a map-type of tofrom" is an implicit mapping rule. Before this change, such variables were mapped as to by default.	2024-03-07 08:23:58 -08:00
Ulrich Weigand	fb7cc73975	Revert "[libomptarget] Support BE ELF files in plugins-nextgen (#83976 )" This reverts commit `15b7b3182c`.	2024-03-06 21:37:45 +01:00
Ulrich Weigand	70677c81de	Revert "[libomptarget] Build plugins-nextgen for SystemZ (#83978 )" This reverts commit `3ecd38c8e1`.	2024-03-06 21:37:43 +01:00
Ulrich Weigand	d4f4f80236	Revert "[libomptarget] Fix CUDA plugin build regression" This reverts commit `b64482e23e`.	2024-03-06 21:37:35 +01:00
Ulrich Weigand	b64482e23e	[libomptarget] Fix CUDA plugin build regression After `3ecd38c8e`, the Handler.getELFObjectFile routine is no longer available. Call ELF64LEObjectFile::create directly, which should always be suitable for CUDA images.	2024-03-06 21:00:25 +01:00
Ulrich Weigand	3ecd38c8e1	[libomptarget] Build plugins-nextgen for SystemZ (#83978 ) The plugin was not getting built as the build_generic_elf64 macro assumes the LLVM triple processor name matches the CMake processor name, which is unfortunately not the case for SystemZ. Fix this by providing two separate arguments instead. Actually building the plugin exposed a number of other issues causing various test failures. Specifically, I've had to add the SystemZ target to - CompilerInvocation::ParseLangArgs - linkDevice in ClangLinuxWrapper.cpp - OMPContext::OMPContext (to set the device_kind_cpu trait) - LIBOMPTARGET_ALL_TARGETS in libomptarget/CMakeLists.txt - a check_plugin_target call in libomptarget/src/CMakeLists.txt Finally, I've had to set a number of test cases to UNSUPPORTED on s390x-ibm-linux-gnu; all these tests were already marked as UNSUPPORTED for x86_64-pc-linux-gnu and aarch64-unknown-linux-gnu and are failing on s390x for what seem to be the same reason. In addition, this also requires support for BE ELF files in plugins-nextgen: https://github.com/llvm/llvm-project/pull/83976	2024-03-06 20:50:01 +01:00
Ulrich Weigand	15b7b3182c	[libomptarget] Support BE ELF files in plugins-nextgen (#83976 ) Code in plugins-nextgen reading ELF files is currently hard-coded to assume a 64-bit little-endian ELF format. Unfortunately, this assumption is even embedded in the interface between GlobalHandler and Utils/ELF routines, which use ELF64LE types. To fix this, I've refactored the interface to push all ELF specific types into Utils/ELF. Specifically, this patch removes both the getSymbol and getSymbolAddress routines and replaces them with a single findSymbolInImage, which gets a StringRef identifying the raw object file image as input, and returns a StringRef covering the data addressed by the symbol (address and size) if found, or std::nullopt otherwise. This allows properly templating over multiple ELF format variants inside Utils/ELF; specifically, this patch adds support for 64-bit big-endian ELF files in addition to 64-bit little-endian files.	2024-03-06 20:49:12 +01:00
Ye Luo	0fa04b6e2c	[libomptarget] Fix libomptarget.rtl.amdgpu.so installation If AMD GPUs don't exist when building libomptarget, the early return causes skipping the plugin installation.	2024-03-05 21:32:48 -06:00

1 2 3 4 5 ...

1676 Commits