Summary: This removes the use of OpenMP offloading to build the device runtime. The main benefit here is that we no longer need to rely on offloading semantics to build a device only runtime. Things like variants are now no longer needed and can just be simple if-defs. In the future, I will remove most of the special handling here and fold it into calls to the `<gpuintrin.h>` functions instead. Additionally I will rework the compilation to make this a separate runtime. The current plan is to have this, but make including OpenMP and offloading either automatically add it, or print a warning if it's missing. This will allow us to use a normal CMake workflow and delete all the weird 'lets pull the clang binary out of the build' business. ``` -DRUNTIMES_amdgcn-amd-amdhsa_LLVM_ENABLE_RUNTIMES=offload -DLLVM_RUNTIME_TARGETS=amdgcn-amd-amdhsa ``` After that, linking the OpenMP device runtime will be `-Xoffload-linker -lomp`. I.e. no more fat binary business. Only look at the most recent commit since this includes the two dependencies (fix to AMDGPUEmitPrintfBinding and the PointerToMember bug).
46 lines
1.1 KiB
C++
46 lines
1.1 KiB
C++
//===-------- Allocator.h - OpenMP memory allocator interface ---- C++ -*-===//
|
|
//
|
|
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
|
|
// See https://llvm.org/LICENSE.txt for license information.
|
|
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
|
|
//
|
|
//===----------------------------------------------------------------------===//
|
|
//
|
|
//
|
|
//===----------------------------------------------------------------------===//
|
|
|
|
#ifndef OMPTARGET_ALLOCATOR_H
|
|
#define OMPTARGET_ALLOCATOR_H
|
|
|
|
#include "DeviceTypes.h"
|
|
|
|
// Forward declaration.
|
|
struct KernelEnvironmentTy;
|
|
|
|
namespace ompx {
|
|
|
|
namespace allocator {
|
|
|
|
static uint64_t constexpr ALIGNMENT = 16;
|
|
|
|
/// Initialize the allocator according to \p KernelEnvironment
|
|
void init(bool IsSPMD, KernelEnvironmentTy &KernelEnvironment);
|
|
|
|
/// Allocate \p Size bytes.
|
|
[[gnu::alloc_size(1), gnu::assume_aligned(ALIGNMENT), gnu::malloc]] void *
|
|
alloc(uint64_t Size);
|
|
|
|
/// Free the allocation pointed to by \p Ptr.
|
|
void free(void *Ptr);
|
|
|
|
} // namespace allocator
|
|
|
|
} // namespace ompx
|
|
|
|
extern "C" {
|
|
[[gnu::weak]] void *malloc(size_t Size);
|
|
[[gnu::weak]] void free(void *Ptr);
|
|
}
|
|
|
|
#endif
|