clang-p2996

Author	SHA1	Message	Date
Shourya Goel	9f0758405b	reland: [libc] Added transitive bindings for OffsetType (#87680 ) Followup to issues addressed here: #87397	2024-04-05 12:11:44 -07:00
Gulfem Savrun Yeniceri	e8aaa3eaed	Revert "[libc] Added transitive bindings for OffsetType (#87397 )" This reverts commit `3ee93f4862` because it broke Fuchsia Clang toolchain builders: https://logs.chromium.org/logs/fuchsia/buildbucket/cr-buildbucket/8751633430491432833/+/u/clang/build/stdout	2024-04-04 04:12:36 +00:00
Shourya Goel	3ee93f4862	[libc] Added transitive bindings for OffsetType (#87397 ) Adding OffTType to fcntl.h and stdio.h 's Macro lists in libc/spec/posix.td as mentioned here: #87266	2024-04-03 14:16:57 -07:00
Joseph Huber	7327014b49	[libc] Implement temporary `printf` on the GPU (#85331 ) Summary: This patch adds a temporary implementation that uses a struct-based interface in lieu of varargs support. Once varargs support exists we will move this implementation to the "real" printf implementation. Conceptually, this patch has the client copy over its format string and arguments to the server. The server will then scan the format string searching for any specifiers that are actually a string. If it is a string then we will send the pointer back to the server to tell it to copy it back. This copied value will then replace the pointer when the final formatting is done. This will require a built-in extension to the varargs support to get access to the underlying struct. The varargs used on the GPU will simply be a struct wrapped in a varargs ABI.	2024-04-02 16:25:18 -05:00
Joseph Huber	6335de4a23	[libc] Disable '_exit' on the GPU build Summary: There are other dependencies to enable `unistd.h` on the GPU which prevented the header from being generated. This is a POSIX extension and isn't part of the core `libc`, so we can just disable this for now to get the bots gree.	2024-04-02 15:24:06 -05:00
aniplcc	82be6e186b	[libc][posix] implement _exit (#87185 ) Fixes #87126.	2024-04-02 10:37:42 -07:00
Joseph Huber	cf835b96b1	[libc] Remove fileno from GPU entrypoints	2024-03-18 14:52:46 -05:00
Nick Desaulniers	27d7bb8616	[libc] fix up fileno tests (#85660 ) Fixes #85628	2024-03-18 13:35:44 -04:00
Petr Hosek	d6722bcbd6	[libc] Move EOF macro to stdio-macros.h (#85159 ) libc++ char_traits.h assumes EOF is always available See #85158 for more details.	2024-03-15 10:56:39 -07:00
Nick Desaulniers	9a3000cf67	[libc] roll out rest of stdbit.h entrypoints to gpu,linux,baremetal (#84938 )	2024-03-13 08:36:49 -07:00
Joseph Huber	043a020688	[libc] Fix missing standard definitions in the GPU config Summary: Some dependencies on the standard C extensions are added transitively. This patch adds the new values.	2024-03-07 08:50:13 -06:00
Joseph Huber	c996023f9a	[libc] Provide an implementation of the 'stdint.h' header (#83353 ) Summary: I've noticed one problem is that the user includes `stdint.h` the compiler will do `#include_next <stdint.h>` potentially into a conflicting implementation on systems with multiple headers installed. The `clang` header is standards compliant and works with `clang` and `gcc` which are both of our targets, so I simply copied it here. This has the effect of including `stdint.h` on clang / LLVM libc behaving the same as `-ffreestanding`.	2024-03-04 12:23:11 -06:00
lntue	aa95aa69b9	[libc][math][c23] Add C23 math functions ilogbf128, logbf128, and llogb(f\|l\|f128). (#82144 )	2024-02-27 12:23:19 -05:00
Nick Desaulniers	646c7e5283	[libc] add more stdbit.h entrypoints to additional targets (#82440 ) stdbit.h isn't complete yet, but looking to turn these on on more targets for earlier feedback.	2024-02-20 16:29:17 -08:00
lntue	72ce629415	[libc] Add C23 limits.h header. (#78887 )	2024-01-24 16:08:56 -05:00
lntue	c80d68a676	[libc] Add float.h header. (#78737 )	2024-01-19 12:04:34 -05:00
Nishant Mittal	0504e93288	[libc][math] Implement nan(f\|l) functions (#76690 ) Specification: https://en.cppreference.com/w/c/numeric/math/nan	2024-01-05 08:23:23 -05:00
Nishant Mittal	0c49fc4c68	[libc][math] Implement nexttoward functions (#72763 ) Implements the `nexttoward`, `nexttowardf` and `nexttowardl` functions. Also, raise excepts required by the standard in `nextafter` functions. cc: @lntue	2023-11-21 09:02:51 -05:00
Joseph Huber	25bf1ae99b	[libc] Enable remaining string functions on the GPU (#68346 ) Summary: We previously had to disable these string functions because they were not compatible with the definitions coming from the GNU / host environment. The GPU, when exporting its declarations, has a very difficult requirement that it be compatible with the host environment as both sides of the compilation need to agree on definitions and what's present. This patch more or less gives up an just copies the definitions as expected by `glibc` if they are provided that way, otherwise we fall back to the accepted way. This is the alternative solution to an existing PR which instead disable's GCC's handling.	2023-10-23 13:16:20 -04:00
Joseph Huber	630037ede4	[libc] Partially implement 'rand' for the GPU (#66167 ) Summary: This patch partially implements the `rand` function on the GPU. This is partial because the GPU currently doesn't support thread local storage or static initializers. To implement this on the GPU. I use 1/8th of the local / shared memory quota to treak the shared memory as thread local storage. This is done by simply allocating enough storage for each thread in the block and indexing into this based off of the thread id. The downside to this is that it does not initialize `srand` correctly to be `1` as the standard says, it is also wasteful. In the future we should figure out a way to support TLS on the GPU so that this can be completely common and less resource intensive.	2023-10-19 17:01:43 -04:00
Anton Rydahl	c73ad025b1	[libc][libm][GPU] Add missing vendor entrypoints to the GPU version of `libm` (#66034 ) This patch populates the GPU version of `libm` with missing vendor entrypoints. The vendor math entrypoints are disabled by default but can be enabled with the CMake option `LIBC_GPU_VENDOR_MATH=ON`.	2023-10-19 12:24:50 -07:00
Joseph Huber	ddc30ff802	[libc] Implement the 'ungetc' function on the GPU (#69248 ) Summary: This function follows closely with the pattern of all the other functions. That is, making a new opcode and forwarding the call to the host. However, this also required modifying the test somewhat. It seems that not all `libc` implementations follow the same error rules as are tested here, and it is not explicit in the standard, so we simply disable these EOF checks when targeting the GPU.	2023-10-17 13:02:31 -05:00
Joseph Huber	cc2445589d	[libc] Fix wrapper headers for some ctype macros and C++ decls Summary: These wrapper headers need to work around things in the standard headers. The existing workarounds didn't correctly handle the macros for `iscascii` and `toascii`. Additionally, `memrchr` can't be used because it has a different declaration for C++ mode. Fix this so it can be compiled.	2023-09-28 10:00:34 -05:00
Joseph Huber	7ac8e26fc7	[libc] Implement `fseek`, `fflush`, and `ftell` on the GPU (#67160 ) Summary: This patch adds the necessary entrypoints to handle the `fseek`, `fflush`, and `ftell` functions. These are all very straightfoward, we simply make RPC calls to the associated function on the other end. Implementing it this way allows us to more or less borrow the state of the stream from the server as we intentionally maintain no internal state on the GPU device. However, this does not implement the `errno` functinality so that must be ignored.	2023-09-26 09:46:46 -05:00
Joseph Huber	59896c168a	[libc] Remove the 'rpc_reset' routine from the RPC implementation (#66700 ) Summary: This patch removes the `rpc_reset` function. This was previously used to initialize the RPC client on the device by setting up the pointers to communicate with the server. The purpose of this was to make it easier to initialize the device for testing. However, this prevented us from enforcing an invariant that the buffers are all read-only from the client side. The expected way to initialize the server is now to copy it from the host runtime. This will allow us to maintain that the RPC client is in the constant address space on the GPU, potentially through inference, and improving caching behaviour.	2023-09-21 11:07:09 -05:00
Joseph Huber	b8f64431ea	[libc] Add GPU config file using the new format (#66635 ) Summary: This patch copies a config file for the GPU similar to the baremetal/embedded implementation. This will configure the implementations of functions like `sprintf` and `snprintf` to be compiled into more simple versions that can be run on the GPU. These functions cannot be enabled yet as Vararg support hasn't landed, but it will be used then.	2023-09-18 08:06:59 -05:00
Joseph Huber	a1be5d69df	[libc] Implement more input functions on the GPU (#66288 ) Summary: This patch implements the `fgets`, `getc`, `fgetc`, and `getchar` functions on the GPU. Their implementations are straightforward enough. One thing worth noting is that the implementation of `fgets` will be extremely slow due to the high latency to read a single char. A faster solution would be to make a new RPC call to call `fgets` (due to the special rule that newline or null breaks the stream). But this is left out because performance isn't the primary concern here.	2023-09-14 15:39:29 -05:00
Joseph Huber	bf85f27370	[libc] Implement 'qsort' and 'bsearch' on the GPU (#66230 ) Summary: This patch simply adds the necessary config to enable qsort and bsearch on the GPU. It is highly unlikely that anyone will use these, as they are single threaded, but we may as well support all entrypoints that we can.	2023-09-13 12:06:34 -05:00
Joseph Huber	60c0d303d6	[libc] Implement stdio writing functions for the GPU port (#65809 ) Summary: This patch implements fwrite, putc, putchar, and fputc on the GPU. These are very straightforward, the main difference for the GPU implementation is that we are currently ignoring `errno`. This patch also introduces a minimal smoke test for `putc` that is an exact copy of the `puts` test except we print the string char by char. This also modifies the `fopen` test to use `fwrite` to mirror its use of `fread` so that it is tested as well.	2023-09-09 13:27:07 -05:00
Joseph Huber	533145c458	[libc] Support 'assert.h' on the GPU This patch adds the necessary support to provide `assert` functionality through the GPU `libc` implementation. This implementation creates a special-case GPU implementation rather than relying on the common version. This is because the GPU has special considerings for printing. The assertion is printed out in chunks with `write_to_stderr`, however when combined with the GPU execution model this causes 32+ threads to all execute in-lock step. Meaning that we'll get a horribly fragmented message. Furthermore, potentially thousands of threads could hit the assertion at once and try to print even if we had it all in one `printf`. This is solved by having a one-time lock that each thread group / wave / warp will attempt to claim. We only let one thread group pass through while the others simply stop executing. Finally only the first thread in that group will do the printing until we finally abort execution. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159296	2023-08-31 15:04:43 -05:00
Joseph Huber	07102a1194	[libc] Implement the 'abort' function on the GPU This function implements the `abort` function on the GPU. The implementation here closely mirros the `exit` call where we first synchornize with the RPC server to make sure it's listening and then we exit on the GPU. I was unsure if this should be a simple `__builtin_assert` on the GPU. I elected to go with an RPC approach to make this a more "true" `abort` call. That is, it should invoke some signal handlers and exit with the proper code according to the implemented C library on the server. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D159210	2023-08-31 08:40:15 -05:00
Joseph Huber	ca10bc4f41	[libc] Implement the 'nanosleep' function on the GPU The GPU has the ability to sleep for very short periods of time. We can map this to the existing `nanosleep` utility. This patch maps the nanosleep utility to the existing hardware instructions as best as possible. Depends on D159118 Reviewed By: JonChesterfield, sivachandra Differential Revision: https://reviews.llvm.org/D159225	2023-08-30 18:34:59 -05:00
Joseph Huber	30307a7bb7	[libc] Implement the 'clock()' function on the GPU This patch implements the `clock()` function on the GPU. This function is supposed to return a timestamp that can be converted into seconds using the `CLOCKS_PER_SEC` macro. The GPU has a fixed frequency timer that can be used for this purpose. However, there are some considerations. First is that AMDGPU does not have a statically known fixed frequency. I know internally that the gfx10xx and gfx11xx series use a 100 MHz clock which will probably remain for the future. Gfx9xx typically uses a 25 MHz clock except for the Vega 10 GPU. The only way to know for sure is to look it up from the runtime. For this purpose, I elected to default it to some known values and assign these to an exteranlly visible symbol that can be initialized if needed. If we do not have a good guess we just return zero. Second is that the `CLOCKS_PER_SEC` macro only gives about a microsecond of resolution. POSIX demands that it's 1,000,000 so it's best that we keep with this tradition as almost all targets seem to respect this. The reason this is important is because on the GPU we will almost assuredly be copying the host's macro value (see the wrapper header) so we should go with the POSIX version that's most likely to be set. (We could probably make a warning if the included header doesn't match the expected value). Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D159118	2023-08-30 16:16:34 -05:00
Joseph Huber	1e573f378c	[libc] Implement fopen, fclose, and fread on the GPU This patch implements the `fopen`, `fclose`, and `fread` functions on the GPU. These are pretty much re-implemented from what existed but using the new interface. Having this subset allows us to test the interface a bit more strenuously since we can write and read to a file. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D157622	2023-08-16 09:14:38 -05:00
Joseph Huber	d04494ccc9	[libc] Rework the file handling for the GPU The GPU has much tighter requirements for handling IO functions. Previously we attempted to define the GPU as one of the platform files. Using a common interface allowed us to easily define these functions without much extra work. However, it became more clear that this was a poor fit for the GPU. The file interface uses function pointers, which prevented inlining and caused bad perfromance and resource usage on the GPU. Further, using an actual `FILE` type rather than referring to it as a host stub prevented us from usin files coming from the host on the GPU device. After talking with @sivachandra, the approach now is to simply define GPU specific versions of the functions we intend to support. Also, we are ignoring `errno` for the time being as it is unlikely we will ever care about supporting it fully. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D157427	2023-08-09 14:42:20 -05:00
Anton Rydahl	53f5bfdb58	[libc][libm][GPU] Populating 'libmgpu.a' for math on the GPU This commit populates `libmgpu.a` with wrappers for the following built-ins - modf, modff - nearbyint, nearbyintf - remainder, remainderf - remquo, remquof - rint, rintf - scalbn, scalbnf - sqrt, sqrtf - tan, tanf - tanh, tanhf - trunc, truncf and wrappers the following vendor implementations - nextafter, nextafterf - sincos, sincosf - sinh, sinhf - sinf - tan, tanf - tanh, tanhf Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D153395	2023-08-01 13:34:43 -07:00
Joseph Huber	334bbc0d67	[libc] Add support for the 'fread' function on the GPU This patch adds support for `fread` on the GPU via the RPC mechanism. Here we simply pass the size of the read to the server and then copy it back to the client via the RPC channel. This should allow us to do the basic operations on files now. This will obviously be slow for large sizes due ot the number of RPC calls involved, this could be optimized further by having a special RPC call that can initiate a memcpy between the two pointers. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D155121	2023-07-26 13:51:35 -05:00
Ethan Luis McDonough	546c9b3f6a	[libc] Add math functions to AMD/NVPTX libm Related to D152486. The following functions are included in this revision: `acosf`, `acoshf`, `asinf`, `asinhf`, `atanf`, `atanhf`, `ceil`, `ceilf`, `copysign`, `copysignf`, `cos`, `cosf`, `cosh`, `coshf`, `exp10f`, `exp2f`, `expf`, `expm1f`, `fabs`, `fabsf`, `fdim`, `fdimf`, `floor`, `floorf`, `fma`, `fmaf`, `fmax`, `fmaxf`, `fmin`, `fminf`, `fmod`, `fmodf`, `frexp`, `frexpf`, `hypot`, `hypotf`, `ilogb`, `ilogbf`, `ldexp`, `ldexpf`, `llrint`, `llrintf`, `llround`, `llroundf`, `pow`, and `powf`. Reviewed By: jhuber6 Differential Revision: https://reviews.llvm.org/D152603	2023-07-26 03:02:24 -05:00
Joseph Huber	e537c83975	[libc] Add basic support for calling host functions from the GPU This patch adds the `rpc_host_call` function as a GPU extension. This is exported from the `libc` project to use the RPC interface to call a function pointer via RPC any copying the arguments by-value. The interface can only support a single void pointer argument much like pthreads. The function call here is the bare-bones version of what's required for OpenMP reverse offloading. Full support will require interfacing with the mapping table, nowait support, etc. I decided to test this interface in `libomptarget` as that will be the primary consumer and it would be more difficult to make a test in `libc` due to the testing infrastructure not really having a concept of the "host" as it runs directly on the GPU as if it were a CPU target. Reviewed By: jplehr Differential Revision: https://reviews.llvm.org/D155003	2023-07-19 10:11:46 -05:00
Joseph Huber	b454e7aa7c	[libc] Remove GPU string functions incompatible with C++ These functions have definitions differing between C and C++. GNU respects the C++ definitions while the LLVM libc does not. This causes many bugs and the current hack creates other issues. Rather than hack around this I'd rather temporarily disable these than regress with the integration into other offloading languages. We lose test support for them but we should be able to re-enable these once the `libc` headers provide these correctly. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D154850	2023-07-10 10:40:10 -05:00
Joseph Huber	c850ea1498	[libc] Support fopen / fclose on the GPU This patch adds the necessary support for the fopen and fclose functions to work on the GPU via RPC. I added a new test that enables testing this with the minimal features we have on the GPU. I will update it once we have `fread` and `fwrite` to actually check the outputted strings. For now I just relied on checking manually via the outpuot temp file. Reviewed By: JonChesterfield, sivachandra Differential Revision: https://reviews.llvm.org/D154519	2023-07-05 18:31:58 -05:00
Joseph Huber	7e88e26d38	[libc] Add GPU support for the 'inttypes.h' functions Another low hanging fruit we can put on the GPU, this ports the tests over to the hermetic framework so we can run them on the GPU. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D154540	2023-07-05 17:47:10 -05:00
Joseph Huber	b15ac1fd89	[libc] Enable the 'div' routines on the GPU This patch simply enables the `div`, `ldiv,` and, `lldiv` functions on the GPU. This should be straightforward enough. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D154143	2023-06-29 15:42:46 -05:00
Joseph Huber	dcdfc963d7	[libc] Export GPU extensions to `libc` for external use The GPU port of the LLVM C library needs to export a few extensions to the interface such that users can interface with it. This patch adds the necessary logic to define a GPU extension. Currently, this only exports a `rpc_reset_client` function. This allows us to use the server in D147054 to set up the RPC interface outside of `libc`. Depends on https://reviews.llvm.org/D147054 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152283	2023-06-15 11:02:24 -05:00
Joseph Huber	fd14f7adbe	[libc] Enable conversion functions on the GPU These functions were previously removed due to problems running the tests with `errno` in them. This was resolved previously by making the internal implementation of these functions use a global `errno` so that tests can still use `errno` functionality as long as they are run with a single thread. This allows us to re-enable these tests as a previous patch has also resolved the issue where the `stdlib` tests could not be hermetic due to the dependence on system rounding functions. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D153016	2023-06-15 09:38:12 -05:00
Joseph Huber	f205fbbb01	[libc] Add support for FMA in the GPU utilities This adds the generic FMA utilities for the GPU. We implement these through the builtins which map to the FMA instructions in the ISA. These may not have strict compliance with other assumptions in the the `libc` such as rounding modes. I've included the relevant information on how the GPU vendors map the behaviour. This should help make it easier to implement some future generic versions. Depends on D152486 Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D152923	2023-06-14 12:59:18 -05:00
Joseph Huber	8060d96aed	[libc] Begin implementing a 'libmgpu.a' for math on the GPU This patch adds an outline to begin adding a `libmgpu.a` file for provindg math on the GPU. Currently, this is most likely going to be wrapping around existing vendor libraries and placing them in a more usable format. Long term, we would like to provide our own implementations of math functions that can be used instead. This patch works by simply forwarding the calls to the standard C math library calls like `sin` to the appropriate vendor call like `__nv_sin`. Currently, we will use the vendor libraries directly and link them in via `-mlink-builtin-bitcode`. This is necessary because of bizarre interactions with the generic bitcode, `-mlink-builtin-bitcode` internalizes and only links in the used symbols, furthermore is propagates the target's default attributes and its the only "truly" correct way to pull in these vendor bitcode libraries without error. If the vendor libraries are not availible at build time, we will still create the `libmgpu.a`, but we will expect that the vendor library definitions will be provided by the user's compilation as is made possible by https://reviews.llvm.org/D152442. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152486	2023-06-14 12:59:15 -05:00
Joseph Huber	e6c401b5e8	[libc] Add initial support for 'puts' and 'fputs' to the GPU This patch adds the initial support required to support basic priting in `stdio.h` via `puts` and `fputs`. This is done using the existing LLVM C library `File` API. In this sense we can think of the RPC interface as our system call to dump the character string to the file. We carry a `uintptr_t` reference as our native "file descriptor" as it will be used as an opaque reference to the host's version once functions like `fopen` are supported. For some unknown reason the declaration of the `StdIn` variable causes both the AMDGPU and NVPTX backends to crash if I use the `READ` flag. This is not used currently as we only support output now, but it needs to be fixed Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D151282	2023-06-05 17:56:55 -05:00
Joseph Huber	632fa3798c	[libc] Enable running libc unit tests on AMDGPU The previous patches added the necessary support for global constructors used to register tests. This patch enables the AMDGPU target to build and run the unit tests on the GPU. Currently this only tests the `ctype` tests, but adding more should be straightforward from here on. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D149517	2023-05-04 06:32:52 -05:00
Joseph Huber	443d71527b	[libc] Implement `exit` for the GPU partially This patch implements the `exit` function on the GPU. This required breaking the entrypoints calling eachother on `linux` since this doesn't work with a non-aliased target. This is only partial support because full support requires a malloc / free implementation for the exit callbacks array. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D149363	2023-04-27 20:32:00 -05:00

1 2

62 Commits