This patch adds the malloc.h header, declaring Scudo's mallopt
entrypoint when built with LLVM_LIBC_INCLUDE_SCUDO, as well as two
constants that can be passed to it (M_PURGE and M_PURGE_ALL).
Due to limitations of the current build system, only the declaration
of mallopt is gated by LLVM_LIBC_INCLUDE_SCUDO, and the two new
constants are defined regardless of it. We may need to refine
this in the future.
Note that some allocators other than Scudo may offer a mallopt
implementation too (e.g. man 3 mallopt), albeit with different
supported input values. This patch only supports the specific case of
LLVM_LIBC_INCLUDE_SCUDO.
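For reference, a call site on a Scudo-backed build might look like this
(a minimal sketch; the exact semantics of the two parameters are defined
by Scudo, and the value argument is just a placeholder here):
```
#include <malloc.h>

int main(void) {
  /* With LLVM_LIBC_INCLUDE_SCUDO, ask the allocator to release unused
   * memory back to the OS; M_PURGE_ALL is the more aggressive variant. */
  mallopt(M_PURGE, 0);
  mallopt(M_PURGE_ALL, 0);
  return 0;
}
```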
- Added all variations of ffma and fdiv.
- Will add all the new headers to the YAML in the next patch.
- Only fsub is left; then all basic operations for float will be complete.
---------
Co-authored-by: OverMighty <its.overmighty@gmail.com>
- Removed fadd because I need to add versions for different input types.
- Finishing the rest of the basic operations.
- Noticed duplicates; will remove them.
---------
Co-authored-by: OverMighty <its.overmighty@gmail.com>
Division-free Newton iteration algorithm for cube roots.
1. **Range reduction**
For `x = (-1)^s * 2^e * (1.m)`, we get 2 reduced arguments `x_r` and `a`
as:
```
x_r = 1.m
a = (-1)^s * 2^(e % 3) * (1.m)
```
Then `cbrt(x) = x^(1/3)` can be computed as:
```
x^(1/3) = 2^(e / 3) * a^(1/3).
```
In order to avoid division, we compute `a^(-2/3)` using Newton's method
and then multiply the result by `a` (see the plain-double sketch after
step 5 below):
```
a^(1/3) = a * a^(-2/3).
```
2. **First approximation to a^(-2/3)**
First, we use a degree-7 minimax polynomial generated by Sollya to
approximate `x_r^(-2/3)` for `1 <= x_r < 2`.
```
p = P(x_r) ~ x_r^(-2/3),
```
with relative errors bounded by:
```
| p / x_r^(-2/3) - 1 | < 1.16 * 2^-21.
```
Then we multiply by `2^(-2*(e % 3)/3)`, taken from a small lookup table, to get:
```
x_0 = 2^(-2*(e % 3)/3) * p
~ 2^(-2*(e % 3)/3) * x_r^(-2/3)
= a^(-2/3)
```
with relative errors:
```
| x_0 / a^(-2/3) - 1 | < 1.16 * 2^-21.
```
This step is done in double precision.
3. **First Newton iteration**
We follow the method described in:
Sibidanov, A. and Zimmermann, P., "Correctly rounded cubic root evaluation
in double precision", https://core-math.gitlabpages.inria.fr/cbrt64.pdf
to derive multiplicative Newton iterations as below:
Let `x_n` be the n-th approximation to `a^(-2/3)`. Define the n-th error as:
```
h_n = x_n^3 * a^2 - 1
```
Then:
```
a^(-2/3) = x_n / (1 + h_n)^(1/3)
= x_n * (1 - (1/3) * h_n + (2/9) * h_n^2 - (14/81) * h_n^3 + ...)
```
using the Taylor series expansion of `(1 + h_n)^(-1/3)`.
Applying this to `x_0` above:
```
h_0 = x_0^3 * a^2 - 1
= a^2 * (x_0 - a^(-2/3)) * (x_0^2 + x_0 * a^(-2/3) + a^(-4/3)),
```
which is bounded by:
```
|h_0| < 4 * 3 * 1.16 * 2^-21 < 2^-17.
```
So in the first iteration step, we use:
```
x_1 = x_0 * (1 - (1/3) * h_0 + (2/9) * h_0^2 - (14/81) * h_0^3)
```
Its relative error is bounded by:
```
| x_1 / a^(-2/3) - 1 | < 35/243 * |h_0|^4 < 2^-70.
```
Then we perform Ziv's rounding test and check if the answer is exact.
This step is done in double-double precision.
4. **Second Newton iteration**
If Ziv's rounding test from the previous step fails, we define the error
term:
```
h_1 = x_1^3 * a^2 - 1,
```
and perform another iteration:
```
x_2 = x_1 * (1 - h_1 / 3)
```
with relative errors beyond the precision of double-double.
We then apply Ziv's accuracy test with relative error < 2^-102 to
compensate for rounding errors.
5. **Final iteration**
If Ziv's accuracy test from the previous step fails, we perform another
iteration in 128-bit precision and check for exact outputs.
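A minimal plain-double sketch of steps 1-3, in which `pow()` and `exp2()`
stand in for the degree-7 Sollya polynomial and the `2^(-2*(e % 3)/3)`
lookup table; the actual implementation keeps everything in double-double
and adds the accurate passes of steps 4-5 (the helper name is hypothetical):
```
#include <math.h>

/* Plain-double sketch of steps 1-3 of the cbrt algorithm above. */
double cbrt_sketch(double x) {
  if (x == 0.0 || !isfinite(x))
    return x; /* cbrt(+-0) = +-0, cbrt(+-inf) = +-inf, cbrt(NaN) = NaN. */

  /* Step 1: |x| = x_r * 2^e with x_r = 1.m in [1, 2). */
  int e;
  double x_r = 2.0 * frexp(fabs(x), &e);
  e -= 1;
  /* Split e = 3*q + r with 0 <= r < 3, so cbrt(|x|) = 2^q * a^(1/3)
   * where a = 2^r * x_r. */
  int q = (e < 0) ? -((2 - e) / 3) : e / 3;
  int r = e - 3 * q;
  double a = ldexp(x_r, r);

  /* Step 2: first approximation x_0 ~ a^(-2/3) = x_r^(-2/3) * 2^(-2*r/3). */
  double x_0 = pow(x_r, -2.0 / 3.0) * exp2(-2.0 * r / 3.0);

  /* Step 3: one multiplicative Newton step:
   *   h_0 = x_0^3 * a^2 - 1,
   *   x_1 = x_0 * (1 - h_0/3 + (2/9) h_0^2 - (14/81) h_0^3). */
  double h_0 = x_0 * x_0 * x_0 * a * a - 1.0;
  double x_1 =
      x_0 * (1.0 - h_0 * (1.0 / 3.0 - h_0 * (2.0 / 9.0 - h_0 * (14.0 / 81.0))));

  /* a^(1/3) = a * a^(-2/3); recombine the exponent and the sign. */
  return copysign(ldexp(a * x_1, q), x);
}
```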
Fixes https://github.com/llvm/llvm-project/issues/92874
Algorithm: Let `x = (-1)^s * 2^e * (1 + m)`.
- Step 1: Range reduction: reduce the exponent with:
```
y = cbrt(x) = (-1)^s * 2^(floor(e/3)) * 2^((e % 3)/3) * (1 + m)^(1/3)
```
- Step 2: Use the first 4 fractional bits of `m` to look up a degree-7
polynomial approximation to:
```
(1 + m)^(1/3) ~ 1 + m * P(m).
```
- Step 3: Perform the multiplication:
```
2^((e % 3)/3) * (1 + m)^(1/3).
```
- Step 4: Check for exact cases to prevent rounding them and to clear the
`FE_INEXACT` floating point exception.
- Step 5: Combine with the exponent and sign before converting down to
`float` and returning. (A small numeric check of the decomposition follows
this list.)
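As a concrete instance of steps 1 and 3 (a worked example, not from the
original patch): for `x = 24.0` we have `e = 4`, `m = 0.5`, `floor(e/3) = 1`
and `e % 3 = 1`, so `cbrt(24) = 2 * 2^(1/3) * 1.5^(1/3)`. A quick numeric
check:
```
#include <math.h>
#include <stdio.h>

int main(void) {
  double lhs = cbrt(24.0);                  /* cbrt(x) directly */
  double rhs = 2.0 * cbrt(2.0) * cbrt(1.5); /* 2^floor(e/3) * 2^((e%3)/3) * (1+m)^(1/3) */
  printf("%.9f %.9f\n", lhs, rhs);          /* both print ~2.884499141 */
  return 0;
}
```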
Using the same range reduction as `sin`, `cos`, and `sincos`:
1) Reduce `x = k*pi/128 + u`, with `|u| <= pi/256`, where `u` is kept in
double-double (a plain-double sketch of steps 1-3 follows this list).
2) Approximate `tan(u)` using a degree-9 Taylor polynomial.
3) Compute
```
tan(x) ~ (sin(k*pi/128) + tan(u) * cos(k*pi/128)) / (cos(k*pi/128) - tan(u) * sin(k*pi/128))
```
using the fast double-double division algorithm in [the CORE-MATH
project](https://gitlab.inria.fr/core-math/core-math/-/blob/master/src/binary64/tan/tan.c#L1855).
4) Perform a relative-error Ziv accuracy test.
5) If the accuracy test fails, redo the computations using 128-bit
precision `DyadicFloat`.
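A minimal plain-double sketch of steps 1-3 (assumptions: libm `sin`/`cos`
stand in for the `k*pi/128` lookup table, the double-double bookkeeping and
the CORE-MATH division algorithm are omitted, and the function name is
hypothetical):
```
#include <math.h>

double tan_sketch(double x) {
  const double pi = 3.14159265358979323846;
  /* Step 1: x = k*(pi/128) + u, |u| <= pi/256 (moderate x only; the real
   * code has a dedicated large range reduction). */
  double k = round(x * (128.0 / pi));
  double u = x - k * (pi / 128.0);
  /* Step 2: degree-9 Taylor polynomial
   * tan(u) = u + u^3/3 + 2u^5/15 + 17u^7/315 + 62u^9/2835. */
  double u2 = u * u;
  double tan_u =
      u * (1.0 + u2 * (1.0 / 3.0 +
           u2 * (2.0 / 15.0 + u2 * (17.0 / 315.0 + u2 * (62.0 / 2835.0)))));
  /* Step 3: reconstruct with the tangent addition formula; sin()/cos()
   * stand in for the sin(k*pi/128), cos(k*pi/128) table entries. */
  double sin_k = sin(k * (pi / 128.0));
  double cos_k = cos(k * (pi / 128.0));
  return (sin_k + tan_u * cos_k) / (cos_k - tan_u * sin_k);
}
```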
Fixes https://github.com/llvm/llvm-project/issues/96930
- Algorithm:
- Step 1 - Range reduction: for a double precision input `x`, return `k`
and `u` such that
- k is an integer
- u = x - k * pi / 128, and |u| < pi/256
- Step 2 - Calculate `sin(u)` and `cos(u)` in double-double using Taylor
polynomials with errors < 2^-70 with FMA or < 2^-66 w/o FMA.
- Step 3 - Calculate `sin(x) = sin(k*pi/128) * cos(u) + cos(k*pi/128) *
sin(u)` using look-up table for `sin(k*pi/128)` and `cos(k*pi/128)`.
- Step 4 - Use Ziv's rounding test to decide if the result is correctly
rounded.
- Step 4' - If Ziv's rounding test fails, redo steps 1-3 using 128-bit
precision (a plain-double sketch of steps 1-3 follows this list).
- Currently, without FMA instructions, the large range reduction only
works correctly for the default rounding mode (FE_TONEAREST).
- Provide the `LIBC_MATH` flag so that users can set `LIBC_MATH =
LIBC_MATH_SKIP_ACCURATE_PASS` to build the `sin` function without steps 4
and 4'.
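A minimal plain-double sketch of steps 1-3 (assumptions: libm `sin`/`cos`
stand in for the double-double `sin(k*pi/128)`/`cos(k*pi/128)` table, the
large range reduction and the accurate pass are omitted, and the function
name is hypothetical):
```
#include <math.h>

double sin_sketch(double x) {
  const double pi = 3.14159265358979323846;
  /* Step 1: x = k*(pi/128) + u, |u| <= pi/256 (moderate x only). */
  double k = round(x * (128.0 / pi));
  double u = x - k * (pi / 128.0);
  /* Step 2: short Taylor polynomials for sin(u), cos(u) on |u| <= pi/256. */
  double u2 = u * u;
  double sin_u = u * (1.0 - u2 / 6.0 * (1.0 - u2 / 20.0 * (1.0 - u2 / 42.0)));
  double cos_u = 1.0 - u2 / 2.0 * (1.0 - u2 / 12.0 * (1.0 - u2 / 30.0));
  /* Step 3: sin(x) = sin(k*pi/128) * cos(u) + cos(k*pi/128) * sin(u). */
  return sin(k * (pi / 128.0)) * cos_u + cos(k * (pi / 128.0)) * sin_u;
}
```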
Note: our baremetal arm configuration compiles this as
`--target=arm-none-eabi`, so this code is built in -marm mode. It could be
smaller with `--target=armv7-none-eabi -mthumb`. The assembly is valid
ARMv5 or Thumb-2, but not Thumb-1.
This PR implements a part of WG14 N2653:
- Define C23 char8_t
- Define C11 char16_t
- Define C11 char32_t
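The three definitions roughly take the following shape (a sketch of the
`<uchar.h>` declarations following the C standard, not the generated
header itself):
```
#include <stdint.h>

typedef unsigned char char8_t;   /* C23 (WG14 N2653) */
typedef uint_least16_t char16_t; /* C11 */
typedef uint_least32_t char32_t; /* C11 */
```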
Missing goals are:
- The type of UTF-8 character literals is changed from unsigned char to
char8_t. (Since UTF-8 character literals already have type unsigned
char, this is not a semantic change).
- New mbrtoc8() and c8rtomb() functions declared in <uchar.h> enable
conversions between multibyte characters and UTF-8.
- A new ATOMIC_CHAR8_T_LOCK_FREE macro.
- A new atomic_char8_t typedef name.
We compute atan2f(y, x) in 2 stages:
- Fast step: perform computations in double precision, with relative
errors < 2^-50
- Accurate step: if the result from the Fast step fails Ziv's rounding
test (sketched below), then we perform computations in double-double
precision, with relative errors < 2^-100.
On a Ryzen 5900X, the worst-case latency is ~200 clocks, compared to an
average latency of ~60 clocks and an average reciprocal throughput of ~20
clocks.
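The fast/accurate split hinges on Ziv's rounding test; a minimal sketch of
the idea (not the libc's internal helper) for a double intermediate result
rounded down to float:
```
#include <stdbool.h>

/* r approximates the exact value with |error| <= err; if both ends of the
 * error interval round to the same float, the rounding is already correct
 * and the accurate step can be skipped. */
bool ziv_test_passes(double r, double err) {
  return (float)(r - err) == (float)(r + err);
}
```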
Summary:
I've noticed one problem: when the user includes `stdint.h`, the
compiler will do `#include_next <stdint.h>`, potentially pulling in a
conflicting implementation on systems with multiple headers installed.
The `clang` header is standards compliant and works with `clang` and
`gcc`, which are both targets of ours, so I simply copied it here. This
has the effect of making `#include <stdint.h>` on clang / LLVM libc
behave the same as in `-ffreestanding` mode.
The epoll_wait functions are syscall wrappers that were requested by
upstream users. This patch adds them, as well as their header and types.
The tests are currently incomplete since they require epoll_create to
properly test epoll_wait. That will be added in a followup patch since
this one is already very large.
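For reference, a typical call site looks like this (a sketch; `epfd` is
assumed to come from `epoll_create`/`epoll_create1`, which are not part of
this patch, and the helper name is hypothetical):
```
#include <sys/epoll.h>

int wait_once(int epfd) {
  struct epoll_event events[8];
  /* Block for up to 1000 ms for events on the fds registered with epfd. */
  int n = epoll_wait(epfd, events, 8, 1000);
  return n; /* number of ready events, 0 on timeout, -1 on error */
}
```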