- Add target id support (https://clang.llvm.org/docs/ClangOffloadBundler.html#target-id) - Add code object v4 support (https://llvm.org/docs/AMDGPUUsage.html#elf-code-object) - Add kernarg_size to kernel descriptor - Change trap handler ABI to no longer move queue pointer into s[0:1] - Cleanup ELF definitions - Add V2, V3, V4 suffixes to make a clear distinction for code object version - Consolidate note names Differential Revision: https://reviews.llvm.org/D95638
12 lines
475 B
LLVM
12 lines
475 B
LLVM
; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx1010 -mattr=+wavefrontsize32,-wavefrontsize64 < %s | FileCheck -check-prefixes=GCN,GFX10-32 %s
|
|
; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx1010 -mattr=-wavefrontsize32,+wavefrontsize64 < %s | FileCheck -check-prefixes=GCN,GFX10-64 %s
|
|
|
|
; GCN: amdhsa.kernels:
|
|
; GCN: .name: wavefrontsize
|
|
; GFX10-32: .wavefront_size: 32
|
|
; GFX10-64: .wavefront_size: 64
|
|
define amdgpu_kernel void @wavefrontsize() {
|
|
entry:
|
|
ret void
|
|
}
|