clang-p2996

Author	SHA1	Message	Date
michaelrj-google	8180ea8694	[libc] Add bind function (#74014 ) This patch adds the bind function to go with the socket function. It also cleans up a lot of socket related data structures.	2023-12-12 13:36:11 -08:00
Schrodinger ZHU Yifan	81e3e7e5d4	[libc] [search] implement hcreate(_r)/hsearch(_r)/hdestroy(_r) (#73469 ) This patch implements `hcreate(_r)/hsearch(_r)/hdestroy(_r)` as specified in https://man7.org/linux/man-pages/man3/hsearch.3.html. Notice that `neon/asimd` extension is not yet added in this patch. - The implementation is largely simplified from rust's [`hashbrown`](https://github.com/rust-lang/hashbrown/blob/master/src/raw/mod.rs) as we only consider fix-sized insertion-only hashtables. Technical details are provided in code comments. - This patch also contains a portable string hash function, which is derived from [`aHash`](https://github.com/tkaitchuck/aHash)'s fallback routine. Not using any SIMD acceleration, it has a good enough quality (passing all SMHasher tests) and is not too bad in speed. - Some general functionalities are added, such as `memory_size`, `offset_to`(alignment), `next_power_of_two`, `is_power_of_two`. `ctz/clz` are extended to support shorter integers.	2023-11-28 21:02:25 -05:00
Joseph Huber	a39215768b	[libc] Rework the 'fgets' implementation on the GPU (#69635 ) Summary: The `fgets` function as implemented is not functional currently when called with multiple threads. This is because we rely on reapeatedly polling the character to detect EOF. This doesn't work when there are multiple threads that may with to poll the characters. this patch pulls out the logic into a standalone RPC call to handle this in a single operation such that calling it from multiple threads functions as expected. It also makes it less slow because we no longer make N RPC calls for N characters.	2023-10-19 17:00:01 -04:00
Joseph Huber	ddc30ff802	[libc] Implement the 'ungetc' function on the GPU (#69248 ) Summary: This function follows closely with the pattern of all the other functions. That is, making a new opcode and forwarding the call to the host. However, this also required modifying the test somewhat. It seems that not all `libc` implementations follow the same error rules as are tested here, and it is not explicit in the standard, so we simply disable these EOF checks when targeting the GPU.	2023-10-17 13:02:31 -05:00
Joseph Huber	6273b6d9dc	[libc] Change RPC opcode enum definition (#67439 ) Summary: This enum previously manually specified the value. This just made it unnecessarily difficult to add new ones without changing everything. This patch also makes it compatible with C by removing the `:` annotation and instead using the `LAST` method.	2023-09-26 15:24:28 -05:00
Joseph Huber	7ac8e26fc7	[libc] Implement `fseek`, `fflush`, and `ftell` on the GPU (#67160 ) Summary: This patch adds the necessary entrypoints to handle the `fseek`, `fflush`, and `ftell` functions. These are all very straightfoward, we simply make RPC calls to the associated function on the other end. Implementing it this way allows us to more or less borrow the state of the stream from the server as we intentionally maintain no internal state on the GPU device. However, this does not implement the `errno` functinality so that must be ignored.	2023-09-26 09:46:46 -05:00
Joseph Huber	791b279924	[libc] Change the `puts` implementation on the GPU (#67189 ) Summary: Normally, the implementation of `puts` simply writes a second newline charcter after printing the first string. However, because the GPU does everything in batches of the SIMT group size, this will end up with very poor output where you get the strings printed and then 1-64 newline characters all in a row. Optimizations like to turn `printf` calls into `puts` so it's a good idea to make this produce the expected output. The least invasive way I could do this was to add a new opcode. It's a little bloated, but it avoids an unneccessary and slow send operation to configure this.	2023-09-25 11:17:22 -05:00
Joseph Huber	f548d19fc8	[libc] Fix and simplify the implementation of 'fread' on the GPU (#66948 ) Summary: Previously, the `fread` operation was wrong in cases when we read less data than was requested. That is, if we tried to read N bytes while the file was in EOF, it would still copy N bytes of garbage. This is fixed by only copying over the sizes we got from locally opening it rather than just using the provided size. Additionally, this patch simplifies the interface. The output functions have special variants for writing to stdout / stderr. This is primarily an optimization for these common cases so we can avoid sending the stream as an argument which has a high delay. Because for input, we already need to start with a `send` to tell the server how much data to read, it costs us nothing to send the file along with it so this is redundant. Re-use the file encoding scheme from the other implementations, the one that stores the stream type in the LSBs of the FILE pointer.	2023-09-21 14:28:06 -05:00
Mikhail R. Gadelha	8d7ca08b9f	[libc] Update siginfo_t to match kernel definition (#66560 ) This patch updates the siginfo_t struct definition to match the definition from the kernel here: https://github.com/torvalds/linux/blob/master/include/uapi/asm-generic/siginfo.h In particular, there are two main changes: 1. swap position of si_code and si_errno: si_code show come after si_errno in all systems except MIPS. Since we don't MIPS, the order is fixed for now, but can be easily \#ifdef'd if MIPS support is implemented in the future. 2. We add a union of structs that are filled depending on the signal raised. This change was required for the fork and spawn integration tests in rv32, since they fork/clone the running process, call wait/waitid/waitpid, and read the status, which was wrong in rv32 because wait/waitid/waitpid are implemented in rv32 using SYS_waitid. SYS_waitid takes a pointer to a siginfo_t and fills the proper fields in the struct. The previous siginfo_t definition was being incorrectly filled due to not taking into account the signal raised.	2023-09-21 10:59:03 -04:00
Joseph Huber	a1be5d69df	[libc] Implement more input functions on the GPU (#66288 ) Summary: This patch implements the `fgets`, `getc`, `fgetc`, and `getchar` functions on the GPU. Their implementations are straightforward enough. One thing worth noting is that the implementation of `fgets` will be extremely slow due to the high latency to read a single char. A faster solution would be to make a new RPC call to call `fgets` (due to the special rule that newline or null breaks the stream). But this is left out because performance isn't the primary concern here.	2023-09-14 15:39:29 -05:00
Mikhail R. Gadelha	75398f28eb	[libc] Make time_t 64 bits long on all platforms but arm32 This patch changes the size of time_t to be an int64_t. This still follows the POSIX standard which only requires time_t to be an integer. Making time_t a 64-bit integer also fixes two cases in 32 bits platforms that use SYS_clock_nanosleep_time64 and SYS_clock_gettime64, as the name of these calls implies, they require a 64-bit time_t. For instance, in rv32, the 32-bit version of these syscalls is not available. We also follow glibc here, where time_t is still a 32-bit integer in arm32. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159125	2023-09-13 10:49:39 -03:00
Mikhail R. Gadelha	9ff0a447d0	[libc] Fix setrlimit/getrlimit on 32-bit systems libc uses SYS_prlimit64 (which takes a struct rlimit64) to implement setrlimt and getrlimit (which take a struct rlimit). In 64-bit bits systems this is not an issue since the members of struct rlimit64 and struct rlimit are 64 bits long, however, in 32-bit systems the members of struct rlimit are only 32 bits long, causing wrong values being passed to SYS_prlimit64. This patch changes rlim_t to be __UINT64_TYPE__ (which also changes rlimit as a side-effect), fixing the problem of mismatching types in the syscall. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D159104	2023-09-07 16:02:32 -03:00
Joseph Huber	07102a1194	[libc] Implement the 'abort' function on the GPU This function implements the `abort` function on the GPU. The implementation here closely mirros the `exit` call where we first synchornize with the RPC server to make sure it's listening and then we exit on the GPU. I was unsure if this should be a simple `__builtin_assert` on the GPU. I elected to go with an RPC approach to make this a more "true" `abort` call. That is, it should invoke some signal handlers and exit with the proper code according to the implemented C library on the server. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D159210	2023-08-31 08:40:15 -05:00
Alfred Persson Forsberg	1acbc21d46	[libc] Define __UTS_NAME_LENGTH for __APPLE__ Before `cae84d8acf` all __linux__ checks were incorrectly __unix__ checks. __unix__ being true on macOS systems therefore meant that macOS would use 65 as __UTS_NAME_LENGTH. This commit correctly specifices __UTS_NAME_LENGTH to match XNU as 256. https://opensource.apple.com/source/xnu/xnu-201/bsd/sys/utsname.h.auto.html Reviewed By: thesamesam Differential Revision: https://reviews.llvm.org/D157824	2023-08-14 01:56:32 +01:00
Joseph Huber	334bbc0d67	[libc] Add support for the 'fread' function on the GPU This patch adds support for `fread` on the GPU via the RPC mechanism. Here we simply pass the size of the read to the server and then copy it back to the client via the RPC channel. This should allow us to do the basic operations on files now. This will obviously be slow for large sizes due ot the number of RPC calls involved, this could be optimized further by having a special RPC call that can initiate a memcpy between the two pointers. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D155121	2023-07-26 13:51:35 -05:00
Joseph Huber	c381a94753	[libc] Remove test RPC opcodes from the exported header This patch does the noisy work of removing the test opcodes from the exported interface to an interface that is only visible in `libc`. The benefit of this is that we both test the exported RPC registration more directly, and we do not need to give this interface to users. I have decided to export any opcode that is not a "core" libc feature as having its MSB set in the opcode. We can think of these as non-libc "extensions". Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D154848	2023-07-21 15:36:36 -05:00
Joseph Huber	e537c83975	[libc] Add basic support for calling host functions from the GPU This patch adds the `rpc_host_call` function as a GPU extension. This is exported from the `libc` project to use the RPC interface to call a function pointer via RPC any copying the arguments by-value. The interface can only support a single void pointer argument much like pthreads. The function call here is the bare-bones version of what's required for OpenMP reverse offloading. Full support will require interfacing with the mapping table, nowait support, etc. I decided to test this interface in `libomptarget` as that will be the primary consumer and it would be more difficult to make a test in `libc` due to the testing infrastructure not really having a concept of the "host" as it runs directly on the GPU as if it were a CPU target. Reviewed By: jplehr Differential Revision: https://reviews.llvm.org/D155003	2023-07-19 10:11:46 -05:00
Joseph Huber	c850ea1498	[libc] Support fopen / fclose on the GPU This patch adds the necessary support for the fopen and fclose functions to work on the GPU via RPC. I added a new test that enables testing this with the minimal features we have on the GPU. I will update it once we have `fread` and `fwrite` to actually check the outputted strings. For now I just relied on checking manually via the outpuot temp file. Reviewed By: JonChesterfield, sivachandra Differential Revision: https://reviews.llvm.org/D154519	2023-07-05 18:31:58 -05:00
Alfred Persson Forsberg	cae84d8acf	[libc] Correct usage of __unix__ and __linux__ Reviewed By: michaelrj, thesamesam Differential Revision: https://reviews.llvm.org/D153729	2023-07-03 01:08:15 +01:00
Joseph Huber	dcdfc963d7	[libc] Export GPU extensions to `libc` for external use The GPU port of the LLVM C library needs to export a few extensions to the interface such that users can interface with it. This patch adds the necessary logic to define a GPU extension. Currently, this only exports a `rpc_reset_client` function. This allows us to use the server in D147054 to set up the RPC interface outside of `libc`. Depends on https://reviews.llvm.org/D147054 Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152283	2023-06-15 11:02:24 -05:00
Michael Jones	d3074f16a6	[libc] Add qsort_r This patch adds the reentrent qsort entrypoint, qsort_r. This is done by extending the qsort functionality and moving it to a shared utility header. For this reason the qsort_r tests focus mostly on the places where it differs from qsort, since they share the same sorting code. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D152467	2023-06-12 11:12:17 -07:00
Mikhail R. Gadelha	4c9c1a4e4f	[libc] Enable linux directory entries syscalls in riscv64 This patch updates the struct dirent to be on par with glibc (by adding a missing d_type member) and update the readdir call to use SYS_getdents64 instead of SYS_getdents. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D147738	2023-05-04 19:07:16 -03:00
Michael Jones	ee17fd7d46	[libc] add socket function This patch adds the function "socket" from the header "sys/socket". It's a simple syscall wrapper, and I plan on adding the related functions in a followup patch. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D149622	2023-05-03 11:01:11 -07:00
Joseph Huber	72bfe2c05a	[libc] Support the string conversion methods on the GPU This patch enables us to use the existing `libc` support for string conversion functions on the GPU. This required setting the `fenv_t` and long double configuration. As far as I am aware, long doubles are converted to doubles on the GPU and the floating point environment is just an `uint32_t`. This code is still untested as we are still working out how to run the unit tests on the GPU. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D149306	2023-04-27 20:31:58 -05:00
Noah Goldstein	0432b85d8e	[LIBC] Implement remainder of posix 'sched.h' minus `SCHED_SPORADIC` Includes macros: linux/SCHED_OTHER // posix req linux/SCHED_FIFO // posix req linux/SCHED_RR // posix req linux/SCHED_BATCH linux/SCHED_ISO linux/SCHED_IDLE linux/SCHED_DEADLINE Includes types: struct sched_param { int sched_priority; } Includes functions: sched_setparam sched_getparam sched_setscheduler sched_getscheduler sched_get_priority_max sched_get_priority_min sched_rr_get_interval Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D148069	2023-04-20 14:53:41 -05:00
Michael Jones	e0de24cb0d	[libc] Re-enable wctob with fixes The stdio test failures were due to headers potentially not being built in the correct order. This should set up the dependencies correctly. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D146551	2023-03-29 12:49:29 -07:00
Mikhail R. Gadelha	0f6fd1b704	[libc] Add support for setjmp and longjmp in riscv This patch implements setjmp and longjmp in riscv using inline asm. The following changes were required: * Omit frame pointer: otherwise gcc won't allow us to use s0 * Use __attribute__((naked)): otherwise both gcc and clang will generate function prologue and epilogue in both functions. This doesn't happen in x86_64, so we guard it to only riscv Furthermore, using __attribute__((naked)) causes two problems: we can't use `return 0` (both gcc and clang) and the function arguments in the function body (clang only), so we had to use a0 and a1 directly. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D145584	2023-03-24 16:16:31 -03:00
Michael Jones	46b5087227	[libc] add basic wide char functions This patch adds the wchar header, as well as the functions to convert to and from wide chars. The header also sets up the definitions for wint and wchar. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D145995	2023-03-20 16:36:21 -07:00
Mikhail R. Gadelha	e9be85da8b	[libc] Add fenv_t and signal macros in riscv This patch now enables full build. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D145594	2023-03-08 17:31:58 -03:00
Siva Chandra Reddy	439eebab81	[libc] Add fenv functions to arm32 baremetal config. Also, an "arm" subfolder for baremetal config has been added. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D145476	2023-03-07 18:11:20 +00:00
Siva Chandra Reddy	15ae08c1a1	[libc] Add definitions of a few missing macros and types.	2022-11-02 07:17:33 +00:00
Siva Chandra Reddy	3b82b4fbd5	[libc] Add x86_64 implementation of setjmp and longjmp. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D137147	2022-11-01 22:58:35 +00:00
Alex Brachet	5fd03c8176	[libc] Implement getopt Differential Revision: https://reviews.llvm.org/D133487	2022-10-31 16:55:53 +00:00
Alex Brachet	d6ac84bce8	Revert "[libc] Implement getopt" This reverts commit `a678f86351`.	2022-10-27 06:47:24 +00:00
Alex Brachet	a678f86351	[libc] Implement getopt Differential Revision: https://reviews.llvm.org/D133487	2022-10-27 06:23:33 +00:00
Siva Chandra Reddy	22ea0e5d9b	[libc] Add Linux implementations of time and clock functions. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D136666	2022-10-25 18:06:05 +00:00
Siva Chandra Reddy	be4e425758	[libc] Add select.h and the implementation of the select function for Linux. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D136375	2022-10-22 03:17:48 +00:00
Siva Chandra Reddy	67957368ae	[libc] Add implementation of sigaltstack for linux. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135949	2022-10-18 22:04:30 +00:00
Siva Chandra Reddy	b2a294bcf8	[libc] Add termios.h and the implementation of functions declared in it. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D136143	2022-10-18 20:53:00 +00:00
Siva Chandra Reddy	02a543db66	[libc] Add a simple implementation of the posix_spawn function. The implementation currently ignores all spawn attributes. Support for them will be added in future changes. A simple allocator for integration tests has been added so that the integration test for posix_spawn can use the posix_spawn_file_actions_add* functions. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135752	2022-10-13 18:47:47 +00:00
Siva Chandra Reddy	28943d617a	[libc] Add POSIX functions posix_spawn_file_actions_*. Namely, posix_spawn_file_actions_addclose, posix_spawn_file_actions_adddup2, posix_spawn_file_actions_addopen, posix_spawn_file_actions_destroy, posix_spawn_file_actions_init have been added. Reviewed By: michaelrj, lntue Differential Revision: https://reviews.llvm.org/D135603	2022-10-11 04:54:44 +00:00
Siva Chandra Reddy	438e59182b	[libc] Add implementation of pthread_atfork. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135432	2022-10-10 18:28:43 +00:00
Michael Jones	f2a9974666	[libc] fix futex type Previously the futex type was defined in terms of unsigned int, now it's uint32, which is more portable. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D135408	2022-10-06 15:19:43 -07:00
Siva Chandra Reddy	3f965818b6	[libc] Add POSIX execv and execve functions. The POSIX global variable environ has also been added. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135351	2022-10-06 19:50:23 +00:00
Siva Chandra Reddy	995105de1b	[libc] Add the POSIX waitpid function and the BSD wait4 function. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D135225	2022-10-05 07:38:55 +00:00
Siva Chandra Reddy	215c9fa4de	[libc] Re-enable functions from signal.h and re-enable abort. They were disabled because we were including linux/signal.h from our signal.h. Linux's signal.h is not designed to be included from user programs as it causes a lot of non-standard name pollution. Also, it is not self-contained. This change defines types and macros relevant for signal related syscalls within libc's headers and removes inclusion of Linux headers. This patch enables the funtions only for x86_64. They will be enabled for aarch64 also in a follow up patch after testing. Reviewed By: abrachet, lntue Differential Revision: https://reviews.llvm.org/D134567	2022-09-30 07:31:50 +00:00
Siva Chandra Reddy	545b954251	[libc] Add GNU extension functions sched_getaffinity and sched_setaffinity. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D134858	2022-09-29 20:31:46 +00:00
Michael Jones	b49d626cb4	[libc] add clock_gettime Add the clock_gettime syscall wrapper and tests. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D134773	2022-09-29 10:23:21 -07:00
Siva Chandra Reddy	3367539010	[libc] Add implementation of pthread_once. The existing thrd_once function has been refactored so that the implementation can be shared between thrd_once and pthread_once functions. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D134716	2022-09-28 06:54:48 +00:00
Raman Tenneti	8f1e362ee9	Implement nanosleep per https://pubs.opengroup.org/onlinepubs/009695399/basedefs/time.h.html Tested: Limited unit test: This makes a call and checks that no error was returned, but we currently don't have the ability to ensure that time has elapsed as expected. Co-authored-by: Jeff Bailey <jeffbailey@google.com> Reviewed By: sivachandra, jeffbailey Differential Revision: https://reviews.llvm.org/D134095	2022-09-24 00:13:58 +00:00

1 2

75 Commits