Files

Ethan Luis McDonough fde2d23ee2 [PGO][OpenMP] Instrumentation for GPU devices (Revision of #76587 ) (#102691 )

This pull request is a revised version of #76587. This pull request
fixes some build issues that were present in the previous version of
this change.

> This pull request is the first part of an ongoing effort to extends
PGO instrumentation to GPU device code. This PR makes the following
changes:
>
> - Adds blank registration functions to device RTL
> - Gives PGO globals protected visibility when targeting a supported
GPU
> - Handles any addrspace casts for PGO calls
> - Implements PGO global extraction in GPU plugins (currently only
dumps info)
>
> These changes can be tested by supplying `-fprofile-instrument=clang`
while targeting a GPU.

2024-08-22 01:10:54 -05:00

cmake

[Offload] Repair and rename llvm-omp-device-info (to -offload-) (#100309 )

2024-07-24 09:35:09 -07:00

DeviceRTL

[PGO][OpenMP] Instrumentation for GPU devices (Revision of #76587 ) (#102691 )

2024-08-22 01:10:54 -05:00

docs

…

include

[Offload] Ensure to load images when the device is used (#103002 )

2024-08-13 14:41:26 -07:00

plugins-nextgen

[PGO][OpenMP] Instrumentation for GPU devices (Revision of #76587 ) (#102691 )

2024-08-22 01:10:54 -05:00

src

[Offload] Ensure to load images when the device is used (#103002 )

2024-08-13 14:41:26 -07:00

test

[PGO][OpenMP] Instrumentation for GPU devices (Revision of #76587 ) (#102691 )

2024-08-22 01:10:54 -05:00

tools

[Offload] Repair and rename llvm-omp-device-info (to -offload-) (#100309 )

2024-07-24 09:35:09 -07:00

unittests

[Offload][NFC] Remove 'libomptarget' message helpers (#92581 )

2024-05-17 13:24:32 -05:00

utils

…

CMakeLists.txt

[offload][cmake] always define pythonize_bool macro (#96028 )

2024-06-20 07:00:19 -05:00

README.md

…

README.txt

…

README.md

The LLVM/Offload Subproject

The Offload subproject aims at providing tooling, runtimes, and APIs that allow users to execute code on accelerators or other "co-processors" that may or may not match the architecture of their "host". In the long run, all kinds of targets are in scope of this effort, including but not limited to: CPUs, GPUs, FPGAs, AI/ML accelerators, distributed resources, etc.

The project is just starting and the design is still not ironed out. More content will show up here and on our webpage soon. In the meantime people are encouraged to participate in our meetings (see below) and check our development board as well as the discussions on Discourse.

Meetings

Every second Wednesday, 7:00 - 8:00am PT, starting Jan 24, 2024. Alternates with the OpenMP in LLVM meeting. invite.ics Meeting Minutes and Agenda