Files

Kewen12 bbe59e19b6 [OpenMP][Offload] Update the Logic for Configuring Auto Zero-Copy (#143638)

Summary:

Currently, Auto Zero-Copy is enabled by checking every initialized
device to ensure that no dGPU is attached to an APU. However, an APU is
designed to comprise a homogeneous set of GPUs, so it should be
sufficient to check any one device when configuring Auto Zero-Copy.
This PR checks the first initialized device in the list.

The changes in this PR make the design and logic of enabling the
feature explicit and improve readability.

2025-06-11 14:12:54 -04:00
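
The difference between the two checks described in the commit summary above can be sketched as follows. This is a minimal illustration only, not the libomptarget code: `DeviceInfo`, `IsAPU`, `autoZeroCopyCheckAll`, and `autoZeroCopyCheckFirst` are hypothetical names, and the single boolean stands in for whatever per-device query the runtime actually performs.

```cpp
#include <vector>

// Hypothetical device descriptor (an assumption for this sketch, not a
// libomptarget type). IsAPU stands in for the real per-device query.
struct DeviceInfo {
  bool IsAPU;
};

// Previous approach as described in the summary: inspect every initialized
// device and refuse Auto Zero-Copy if any of them is not an APU.
bool autoZeroCopyCheckAll(const std::vector<DeviceInfo> &Devices) {
  if (Devices.empty())
    return false;
  for (const DeviceInfo &D : Devices)
    if (!D.IsAPU)
      return false; // a discrete GPU is present
  return true;
}

// Updated approach as described in the summary: the set of GPUs is assumed
// to be homogeneous, so the first initialized device decides.
bool autoZeroCopyCheckFirst(const std::vector<DeviceInfo> &Devices) {
  return !Devices.empty() && Devices.front().IsAPU;
}
```

Under the homogeneity assumption stated in the summary, both functions return the same answer for any device list; the second simply makes that assumption visible in the code.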

cmake

[offload] Fix finding amdgpu/nvptx-arch to generate tests (#135072)

2025-04-09 15:54:29 -04:00

DeviceRTL

[OpenMP][GPU][FIX] Enable generic barriers in single threaded contexts (#140786)

2025-05-20 19:33:54 -07:00

docs

[Offload][NFC] Factor out and rename the __tgt_offload_entry struct (#123785)

2025-01-21 12:05:24 -06:00

include

[Offload] Don't check in generated files (#141982)

2025-06-03 10:39:04 -05:00

liboffload

[Offload] Make olMemcpy src parameter const (#143161)

2025-06-06 10:25:00 -05:00

libomptarget

[OpenMP][Offload] Update the Logic for Configuring Auto Zero-Copy (#143638)

2025-06-11 14:12:54 -04:00

plugins-nextgen

[PGO][Offload] Fix offload coverage mapping (#143490)

2025-06-10 20:19:38 -05:00

test

[Offload] Fix APU detection for MI300 testing (#143026)

2025-06-05 15:31:55 -05:00

tools

[Offload] Use llvm::Error throughout liboffload internals (#140879)

2025-05-27 13:42:56 -05:00

unittests

[Offload] Allow setting null arguments in olLaunchKernel (#141958)

2025-06-06 07:05:11 -05:00

utils

…

CMakeLists.txt

[Offload] Add OFFLOAD_INCLUDE_TESTS (#143388)

2025-06-09 10:27:40 -05:00

Maintainers.md

[Offload] Add 'Maintainers.md' file for offload (#138177)

2025-05-01 14:06:33 -05:00

README.md

[Offload][NFC] Update README.md

2024-11-17 07:32:29 -08:00

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims to provide tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to CPUs, GPUs, FPGAs, AI/ML accelerators, and distributed resources.

For OpenMP offload users, the project is ready and fully usable. The final API design is still under development. More content will show up here and on our webpage soon. In the meantime, people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting.

Links: invite.ics, Meeting Minutes and Agenda