This patch adds an NVPTX compilation path that enables JIT compilation on NVIDIA targets. It makes the following changes:
1. Adds a format field to the GPU object attribute, so that the translation attribute can select the correct runtime function to load the module. A dictionary attribute is also added to carry any extra options.
2. Adds the `createObject` method to `GPUTargetAttrInterface`; this method returns a GPU object from a binary string.
3. Adds the function `mgpuModuleLoadJIT`, which is only available for NVIDIA GPUs, as there is no equivalent for AMD.
4. Adds the CMake flag `MLIR_GPU_COMPILATION_TEST_FORMAT` to specify the format to use during testing.
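The role of the new format field can be illustrated with a minimal Python sketch. It is not the patch's implementation: the format names (`assembly`, `bin`, `fatbin`) and the dispatch table are assumptions made for illustration, and `mgpuModuleLoad` is assumed to be the pre-existing non-JIT loader in the runtime wrappers. Only `mgpuModuleLoadJIT` is named in the description above.

```python
# Hypothetical sketch: format names and the dispatch table are assumptions,
# not the code added by this patch.
def select_loader(object_format: str) -> str:
    """Return the name of the runtime function used to load a GPU object."""
    loaders = {
        # Textual PTX has to be JIT-compiled when the module is loaded.
        "assembly": "mgpuModuleLoadJIT",
        # Precompiled objects can be loaded directly (assumed loader name).
        "bin": "mgpuModuleLoad",
        "fatbin": "mgpuModuleLoad",
    }
    if object_format not in loaders:
        raise ValueError(f"unsupported GPU object format: {object_format!r}")
    return loaders[object_format]


if __name__ == "__main__":
    for fmt in ("assembly", "bin", "fatbin"):
        print(fmt, "->", select_loader(fmt))
```

Keeping the format on the object itself means the translation step needs no extra context to decide how the module should be loaded.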
if not config.enable_cuda_runner or not config.mlir_run_cuda_sm80_tests:
    config.unsupported = True

config.substitutions.append(("%gpu_compilation_format", config.gpu_compilation_format))
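As a rough illustration of what the `%gpu_compilation_format` substitution does, the sketch below expands a RUN line with plain string replacement. It is a simplification: lit's own substitution machinery is more involved, and the RUN line, the `format=` pass option, and the value `fatbin` are assumptions chosen for the example rather than text taken from the patch.

```python
# Simplified model of lit substitutions: each (pattern, replacement) pair is
# applied in order to the RUN line. Real lit does more than str.replace.
def apply_substitutions(run_line, substitutions):
    for pattern, replacement in substitutions:
        run_line = run_line.replace(pattern, replacement)
    return run_line


# Hypothetical value; in the test suite it would come from
# config.gpu_compilation_format, set via MLIR_GPU_COMPILATION_TEST_FORMAT.
substitutions = [("%gpu_compilation_format", "fatbin")]

# Hypothetical RUN line used only for illustration.
run_line = 'mlir-opt %s -gpu-module-to-binary="format=%gpu_compilation_format"'

expanded = apply_substitutions(run_line, substitutions)

if __name__ == "__main__":
    print(expanded)
```

With this in place, the same test file can be exercised under different compilation formats by changing only the CMake flag.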