clang-p2996

Author	SHA1	Message	Date
Tue Ly	484319f497	[libc] Make expm1f correctly rounded when the targets have no FMA instructions. Add another exceptional value and fix the case when \|x\| is small. Performance tests with CORE-MATH project scripts: With FMA instructions on Ryzen 1700: ``` $ ./perf.sh expm1f LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a CORE-MATH reciprocal throughput : 15.362 System LIBC reciprocal throughput : 53.194 LIBC reciprocal throughput : 14.595 $ ./perf.sh expm1f --latency LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a CORE-MATH latency : 57.755 System LIBC latency : 147.020 LIBC latency : 60.269 ``` Without FMA instructions: ``` $ ./perf.sh expm1f LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a CORE-MATH reciprocal throughput : 15.362 System LIBC reciprocal throughput : 53.300 LIBC reciprocal throughput : 18.020 $ ./perf.sh expm1f --latency LIBC-location: /home/lnt/experiment/llvm/llvm-project/build/projects/libc/lib/libllvmlibc.a CORE-MATH latency : 57.758 System LIBC latency : 147.025 LIBC latency : 70.304 ``` Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D123440	2022-06-03 15:57:48 -04:00
Tue Ly	614567a7bf	[libc] Automatically add -mfma flag for architectures supporting FMA. Detect if the architecture supports FMA instructions and if the targets depend on fma. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D123615	2022-06-03 01:21:20 -04:00
Siva Chandra Reddy	70c8d12b79	[libc] Add pthread_create and pthread_join functions. They do not yet support all the feature/attributes in pthread_attr_t. Future changes will add such support. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D126718	2022-06-02 01:47:24 +00:00
Siva Chandra Reddy	ad89cf4e2d	[libc] Keep all thread state information separate from the thread structure. The state is now stored on the thread's stack memory. This enables implementing pthread API like pthread_detach which takes the pthread_t structure argument by value. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D126716	2022-06-01 17:36:58 +00:00
Guillaume Chatelet	b2a9ea4420	[libc] Apply no-builtin everywhere, remove unnecessary flags Note, this is a re-submission of D125894 with `features = ["-header_modules"]` added to the main BUILD.bazel file. Some functions like `stpncpy` are implemented in terms of `memset` but are not currently using `-fno-builtin-memset`. This is somewhat hidden by the fact that we use `-ffreestanding` globally and that `-ffreestanding` implies `-fno-builtin` for Clang. This patch also removes `-mllvm -combiner-global-alias-analysis` that is Clang specific and that does not bring substantial gains on modern processors. Also we keep `-mllvm --tail-merge-threshold=0` for aarch64 in CMakeLists.txt but we omit it in the Bazel config. This is because Bazel consumes the source files directly and so it can use PGO to take optimal decisions locally. Differential Revision: https://reviews.llvm.org/D126773	2022-06-01 13:34:36 +00:00
Guillaume Chatelet	4cbfd2e7eb	[libc][mem*] Address facility + test enum support This patch is a subpart of D125768 intented to make the review easier. The `Address` struct represents a pointer but also adds compile time knowledge like alignment or temporal/non-temporal that helps with downstream instruction selection. Differential Revision: https://reviews.llvm.org/D125966	2022-06-01 09:09:43 +00:00
Guillaume Chatelet	299baac64d	[libc] Add support for enum in EXPECT_EQ	2022-06-01 08:42:18 +00:00
Michael Jones	ba7e1cddda	[libc] add fprintf and file_writer This patch adds the file_writer header, which just provides a wrapper for File->write, as well as fprintf to use it. There are no unit tests for file_writer since it's too simple to need them, but fprintf does have a simple test of writing to a file. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D125939	2022-05-31 13:59:19 -07:00
Siva Chandra Reddy	9b8ca3c1f1	[libc] Add global stdout and stderr objects. They are added as entrypoint object targets. The header-gen infrastructure has been extended to enable handling standard required global objects. The libc-api-test has also been extended to verify the global object declarations. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D126329	2022-05-27 05:43:49 +00:00
Siva Chandra Reddy	2a5d5078d5	[libc] Add the pthread_mutex_t type. Simple implementations of the functions pthread_mutex_init, pthread_mutex_destroy, pthread_mutex_lock and pthread_mutex_unlock have have also been added. Future patches will extend these functions to add features required by the POSIX specification. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D126235	2022-05-24 22:48:14 +00:00
Guillaume Chatelet	0443bfabe7	Revert "[libc] Apply no-builtin everywhere, remove unnecessary flags" This reverts commit `94d6dd9057`.	2022-05-20 14:37:17 +00:00
Alex Brachet	b1183305f8	[libc] Add strlcat Differential Revision: https://reviews.llvm.org/D125978	2022-05-19 21:48:39 +00:00
Guillaume Chatelet	94d6dd9057	[libc] Apply no-builtin everywhere, remove unnecessary flags Some functions like `stpncpy` are implemented in terms of `memset` but are not currently using `-fno-builtin-memset`. This is somewhat hidden by the fact that we use `-ffreestanding` globally and that `-ffreestanding` implies `-fno-builtin` for Clang. This patch also removes `-mllvm -combiner-global-alias-analysis` that is Clang specific and that does not bring substantial gains on modern processors. Also we keep `-mllvm --tail-merge-threshold=0` for aarch64 in CMakeLists.txt but we omit it in the Bazel config. This is because Bazel consumes the source files directly and so it can use PGO to take optimal decisions locally. Differential Revision: https://reviews.llvm.org/D125894	2022-05-19 09:08:42 +00:00
Alex Brachet	fc2c8b2371	[libc] Add strlcpy Differential Revision: https://reviews.llvm.org/D125806	2022-05-18 17:45:05 +00:00
Michael Jones	9f1d905f39	[libc] add snprintf After adding sprintf, snprintf is simple. The functions are very similar. The tests only cover the behavior of the max length since the sprintf tests should cover the other behavior. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D125826	2022-05-17 13:32:59 -07:00
Michael Jones	ff6fe39eca	[libc] add sprintf This adds the sprintf entrypoint, as well as unit tests. Currently sprintf only supports %%, %s, and %c, but the other conversions are on the way. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D125573	2022-05-17 11:32:20 -07:00
Michael Jones	6a22b185d6	[libc] add printf converter This adds the main pieces of the last piece of printf, the converter. This takes the completed format section from the parser and then converts it to a string for the writer, which is why it was the last piece to be written. So far it supports chars and strings, but more pieces are coming. Additionally, it supports replacing all of the conversion functions with user supplied versions at compile time to allow for additional functionality. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D125327	2022-05-12 13:10:05 -07:00
Michael Jones	dd7f30464b	[libc] fix uint includes and libc bazel This patch fixes the includes for the new UInt class so that the api test now passes, additionally it fixes the bazel files to account for the new dependencies. Differential Revision: https://reviews.llvm.org/D125490	2022-05-12 11:40:52 -07:00
Michael Jones	1170951c73	[libc] add uint128 implementation Some platforms don't support proper 128 bit integers, but some algorithms use them, such as any that use long doubles. This patch modifies the existing UInt class to support the necessary operators. This does not put this new class into use, that will be in followup patches. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D124959	2022-05-12 11:16:53 -07:00
Michael Jones	945fa672c6	[libc][NFC] add index mode to printf parser This patch is a followup to the previous patch which implemented the main printf parsing logic as well as sequential mode. This patch adds index mode. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D123424	2022-05-06 12:06:08 -07:00
Michael Jones	e072a123d3	[libc] add printf writer The printf implmentation is made up of three main pieces, the parser, the converter, and the writer. This patch adds the implementation for the writer, as well as the function for writing to a string, along with tests. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D124421	2022-05-03 10:15:04 -07:00
Siva Chandra Reddy	9db0037bf1	[libc] Add implementations of feof, ferror and clearerr. The corresponding _unlocked functions have also been added. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D124311	2022-04-29 23:04:35 +00:00
Dominic Chen	ce6bfd102a	[libc] Support 32-bit ARM platform tests Set LONG_DOUBLE_IS_DOUBLE, add ifdefs for 128-bit integer types Differential Revision: https://reviews.llvm.org/D124204	2022-04-28 12:00:28 -07:00
Michael Jones	ff1374785f	[libc] Add Printf FormatSection Matcher This patch changes the printf parser tests to use a more robust matcher. This allows for better debugging of parsing issues. This does not affect the actual printf code at all, only the tests. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D124130	2022-04-22 14:21:39 -07:00
Siva Chandra Reddy	19a6dd33ee	[libc] Add the implementation of the GNU extension function fopencookie. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D124141	2022-04-22 08:02:25 +00:00
Dominic Chen	e8572aca0c	[libc] Use correct mnemonic for arm64_32 architecture arm64_32 is an ILP32 platform Differential Revision: https://reviews.llvm.org/D124134	2022-04-21 15:13:07 -07:00
Siva Chandra Reddy	22f9dca113	[libc] Add the implementation of the fflush function. Note that the underlying flush implementation does not yet fully implement the POSIX standard. It is complete with respect to the C standard however. A future change will add the POSIX behavior. It should not affect the implementation of the fflush function however as the POSIX behavior will be added in a lower layer. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D124073	2022-04-20 19:43:39 +00:00
Siva Chandra Reddy	945e0220fd	[libc] Add GNU extention functions fread_unlocked and fwrite_unlocked. POSIX locking and unlocking functions flockfile and funlockfile have also been added. The locking is not recursive yet. A future patch will make the underlying lock a recursive lock. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123986	2022-04-20 15:39:27 +00:00
Tue Ly	5c6db1dc9b	[libc] Fix nested namespace issues with multiply_add.h. The FMA header was included inside namespaces in multiply_add.h. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D123539	2022-04-11 17:30:02 -04:00
Siva Chandra Reddy	0258f56646	[libc] Add a definition of pthread_attr_t and its getters and setters. Not all attributes have been added to phtread_attr_t in this patch. They will be added gradually in future patches. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123423	2022-04-11 16:08:49 +00:00
Michael Jones	4f4752ee6f	[libc][NFC] implement printf parser This patch adds the sequential mode implementation of the printf parser, as well as unit tests for it. In addition it adjusts the surrounding files to accomodate changes in the design found in the implementation process. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D123339	2022-04-08 14:21:13 -07:00
Tue Ly	c5f8a0a1e9	[libc] Add support for x86-64 targets that do not have FMA instructions. Make FMA flag checks more accurate for x86-64 targets, and refactor polyeval to use multiply and add instead when FMA instructions are not available. Reviewed By: michaelrj, sivachandra Differential Revision: https://reviews.llvm.org/D123335	2022-04-08 14:12:24 -04:00
Siva Chandra Reddy	2ce09e680a	[libc] Add a linux Thread class in __support/threads. This change is essentially a mechanical change which moves the thread creation and join implementations from src/threads/linux to src/__support/threads/linux/thread.h. The idea being that, in future, a pthread implementation can reuse the common thread implementations in src/__support/threads. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D123287	2022-04-07 16:13:21 +00:00
Michael Jones	5561ab3495	[libc] Add holder class for va_lists This class is intended to be used in cases where a class is being used on a va_list. It provides destruction and copy semantics with small overhead. This is intended to be used in printf. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D123061	2022-04-05 11:39:57 -07:00
Siva Chandra Reddy	83f153ce34	[libc] Add pthread_mutexattr_t type and its setters and getters. A simple implementation of the getters and setters has been added. More logic can be added to them in future as required. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D122969	2022-04-04 18:11:12 +00:00
Siva Chandra Reddy	6a7cd4a1df	[libc][NFC] Do not call mmap and munmap from thread functions. Instead, memory is allocated and deallocated using mmap and munmap syscalls directly. Reviewed By: lntue, michaelrj Differential Revision: https://reviews.llvm.org/D122876	2022-04-02 05:12:08 +00:00
Michael Jones	c4a1b07d09	[libc][NFC] add outline of printf This patch adds the headers for printf. It contains minimal actual code, and is more intended to be used for design review. The code is not built yet, and may have minor errors. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D122773	2022-04-01 14:36:17 -07:00
Siva Chandra	97417e0300	[libc] Enable threads.h functions on aarch64. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122788	2022-03-31 08:42:07 -07:00
Tue Ly	a5466f0436	[libc] Improve the performance of expm1f. Improve the performance of expm1f: - Rearrange the selection logic for different cases to improve the overall throughput. - Use the same degree-4 polynomial for large inputs as `expf` (https://reviews.llvm.org/D122418), reduced from a degree-7 polynomial. Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master): Before this patch: ``` $ ./perf.sh expm1f CORE-MATH reciprocal throughput : 15.362 System LIBC reciprocal throughput : 53.288 LIBC reciprocal throughput : 54.572 $ ./perf.sh expm1f --latency CORE-MATH latency : 57.759 System LIBC latency : 147.146 LIBC latency : 118.057 ``` After this patch: ``` $ ./perf.sh expm1f CORE-MATH reciprocal throughput : 15.359 System LIBC reciprocal throughput : 53.188 LIBC reciprocal throughput : 14.600 $ ./perf.sh expm1f --latency CORE-MATH latency : 57.774 System LIBC latency : 147.119 LIBC latency : 60.280 ``` Reviewed By: michaelrj, santoshn, zimmermann6 Differential Revision: https://reviews.llvm.org/D122538	2022-03-30 19:23:25 -04:00
Michael Jones	9276074271	[libc][obvious] Add mfma to log2f In the previous patch adding -mfma to functions that need it for windows builds I missed log2f. Differential Revision: https://reviews.llvm.org/D122693	2022-03-29 16:34:52 -07:00
Michael Jones	2f8829aba3	[libc] Add mfma option to functions that use fma On Windows the functions that use fma don't properly include the fma intrinsics unless -mfma is added to the compile options. This patch adds the compile option to all of the functions that need it. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122689	2022-03-29 16:23:36 -07:00
Michael Jones	4ac3f7e41a	[libc][obvious] fix sqrt when long double is double Previously, the "fsqrt" instruction was used on all x86_64 platforms for finding the square root of long doubles. On long double is double platforms (e.g. windows) this created errors. This patch changes square root function for long doubles to be the same as the one for doubles if long doubles are doubles. Reviewed By: sivachandra, lntue Differential Revision: https://reviews.llvm.org/D122688	2022-03-29 16:05:40 -07:00
Tue Ly	6168b42225	[libc] Improve the performance of expf. Reduce the polynomial's degree from 7 down to 4. Currently we use a degree-7 minimax polynomial on an interval of length 2^-7 around 0 to compute `expf`. Based on the suggestion of @santoshn and the RLIBM project (https://github.com/rutgers-apl/rlibm-all/blob/main/source/float/exp.c) and the improvement we made with `exp2f` in https://reviews.llvm.org/D122346, it is possible to have a good polynomial of degree-4 on a subinterval of length 2^(-7) to approximate e^x. We did try to either reduce the degree of the polynomial down to 3 or increase the interval size to 2^(-6), but in both cases the number of exceptional values exploded. So we settle with using a degree-4 polynomial of the interval of size 2^(-7) around 0. Reviewed By: sivachandra, zimmermann6, santoshn Differential Revision: https://reviews.llvm.org/D122418	2022-03-25 12:20:20 -04:00
Tue Ly	b9d87d7466	[libc] Improve the performance of exp2f. Reduce the range-reduction table size from 128 entries down to 64 entries, and reduce the polynomial's degree from 6 down to 4. Currently we use a degree-6 minimax polynomial on an interval of length 2^-7 around 0 to compute exp2f. Based on the suggestion of @santoshn and the RLIBM project (https://github.com/rutgers-apl/rlibm-prog/blob/main/libm/float/exp2.c) it is possible to have a good polynomial of degree-4 on a subinterval of length 2^(-6) to approximate 2^x. We did try to either reduce the degree of the polynomial down to 3 or increase the interval size to 2^(-5), but in both cases the number of exceptional values exploded. So we settle with using a degree-4 polynomial of the interval of size 2^(-6) around 0. Reviewed By: michaelrj, sivachandra, zimmermann6, santoshn Differential Revision: https://reviews.llvm.org/D122346	2022-03-24 18:06:37 -04:00
Siva Chandra Reddy	441606f5ff	[libc] Add implementations of fopen, flose, fread, fwrite and fseek. A follow up patch will add feof and ferror. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122327	2022-03-24 04:20:12 +00:00
Michael Jones	6d0f5d95ad	[libc][obvious] add aligned_alloc as entrypoint This patch adds aligned_alloc as an entrypoint. Previously it was being included implicitly. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D122362	2022-03-23 16:44:15 -07:00
Michael Jones	805899e68a	[libc] Change FEnv to use MXCSR as source of truth This patch primarily fixes the fenv implementation on Windows, since Windows uses the MXCSR in place of the x87 status registers for storing information about the floating point environment. This allows FEnv to work correctly on Windows, and successfully build. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121839	2022-03-23 16:08:00 -07:00
Siva Chandra Reddy	a0f6d12cd4	[libc][File] Fix a bug under fseek(..., SEEK_CUR). Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D122284	2022-03-23 16:24:15 +00:00
Siva Chandra Reddy	b950a0d44d	[libc][Obvious] Remove an unnecessary dep and use inline_memcpy. An unnecessary dep of the getenv function is removed. From the x86_64 loader, a call to __llvm_libc::memcpy is replaced with call to __llvm_libc::inline_memcpy.	2022-03-22 07:08:57 +00:00
Siva Chandra Reddy	df4814d45d	[libc] Add a linux file implementation. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D121976	2022-03-21 07:07:22 +00:00

1 2 3 4 5 ...

408 Commits