clang-p2996

Author	SHA1	Message	Date
OverMighty	560ed8ce3d	[libc][math][c23] Add expm1f16 C23 math function (#102387 ) Part of #95250.	2024-08-13 16:48:28 +02:00
Job Henandez Lara	ff1cc5b97c	[libc][math][c23] Add totalorderl function. (#102564 )	2024-08-09 10:18:00 -04:00
aaryanshukla	d0fe470fd2	[libc][math] Add scalbln{,f,l,f128} math functions (#102219 ) Co-authored-by: OverMighty <its.overmighty@gmail.com>	2024-08-08 14:33:50 -07:00
lntue	eddfd504f8	[libc][math][c23] Add ddivl C23 math function. (#102468 )	2024-08-08 11:41:17 -04:00
Job Henandez Lara	aae7c38827	[libc][math][c23] Add entrypoints and tests for setpayloadsig{,f,l,f128} and setpayloadl. (#102413 )	2024-08-08 00:17:20 -04:00
aaryanshukla	2c74237c0f	[libc][math][c23] Add fsub{,l,f128} and remainderf128 C23 math functions (#101576 ) Co-authored-by: OverMighty <its.overmighty@gmail.com>	2024-08-07 13:03:58 -07:00
OverMighty	59338ad8c5	[libc][math][c23] Add exp10f16 C23 math function (#101588 ) Part of #95250.	2024-08-07 15:54:06 +02:00
Job Henandez Lara	e3778a5d0e	[libc][math] Add getpayloadl function. (#102214 )	2024-08-07 08:27:45 -04:00
aaryanshukla	0395bf7636	[libc][math][c23] Add ffma{,l,f128} and fdiv{,l,f128} C23 math functions #101089 (#101253 ) - added all variations of ffma and fdiv - will add all new headers into yaml for next patch - only fsub is left then all basic operations for float is complete --------- Co-authored-by: OverMighty <its.overmighty@gmail.com>	2024-08-06 10:19:54 -07:00
OverMighty	936515c7a5	[libc][math][c23] Add exp2f16 C23 math function (#101217 ) Part of #95250.	2024-08-06 14:44:01 +02:00
lntue	9f6b440adf	[libc][math] Implement fast pass for double precision pow function with up to 1ULP error. (#101926 )	2024-08-05 13:08:59 -04:00
aaryanshukla	8f33f1d829	[libc][math][c23] Add dadd{l,f128} and ddiv{l,f128} C23 math functions (#100456 ) - fadd removed because I need to add for different input types - finishing rest of basic operations - noticed duplicates will remove --------- Co-authored-by: OverMighty <its.overmighty@gmail.com>	2024-08-01 13:36:50 -07:00
Job Henandez Lara	ed12f80ff0	[libc][math][c23] add entrypoints and tests for getpayload{,f,f128} (#101285 )	2024-07-31 23:16:42 -04:00
aaryanshukla	30b5d4a763	[libc][math][c23] Add dfma{l,f128} and dsub{l,f128} C23 math functions (#101089 ) Co-authored-by: OverMighty <its.overmighty@gmail.com>	2024-07-31 13:07:03 -07:00
Job Henandez Lara	546e4a210c	[libc][math][c23] Add entrypoints and tests for setpayload{,f,f128} (#101122 )	2024-07-30 19:52:47 -04:00
Job Henandez Lara	0813260aa8	[libc][math][c23] Add entrypoints and tests for totalorder{,f,f128} (#100593 )	2024-07-30 18:58:44 +02:00
OverMighty	971a1ac445	[libc][math][c23] Add expf16 C23 math function (#100632 ) Part of #95250.	2024-07-30 18:38:03 +02:00
lntue	ca8b14de51	[libc][math] Implement fast pass for double precision atan2 with 1 ULP errors. (#100648 )	2024-07-26 09:56:46 -04:00
Mikhail R. Gadelha	e90d552c77	[libc][NFC] Update riscv documentation (#100578 ) This adds linux-riscv32 to the documentation and fixes riscv's entrypoint broken link.	2024-07-25 13:25:09 -03:00
Job Henandez Lara	7b51777ed8	[libc][math][c23] add entrypoints and tests for totalordermag{f,l,f128} (#100159 ) Fixes https://github.com/llvm/llvm-project/issues/100139	2024-07-24 19:53:23 -04:00
Job Henandez Lara	c1562374c8	[libc][math][c23] Add entrypoints and tests for dsqrt{l,f128} (#99815 )	2024-07-21 15:55:11 -04:00
Job Henandez Lara	af0f58cf14	[libc][math][c23] Add entrypoints and tests for fsqrt{,l,f128} (#99669 )	2024-07-21 11:17:41 -04:00
aaryanshukla	a2f61ba08b	[libc][math]fadd implementation (#99694 ) - [libc] math fadd - [libc][math] implemented fadd	2024-07-19 14:40:34 -07:00
OverMighty	9fb049c8c6	[libc][math][c23] Add {f,d}mul{l,f128} and f16mul{,f,l,f128} C23 math functions (#98972 ) Part of #93566. Fixes #94833.	2024-07-18 19:50:49 +02:00
lntue	7fc9fb9f3f	[libc][math] Implement double precision cbrt correctly rounded to all rounding modes. (#99262 ) Division-less Newton iterations algorithm for cube roots. 1. Range reduction For `x = (-1)^s * 2^e * (1.m)`, we get 2 reduced arguments `x_r` and `a` as: ``` x_r = 1.m a = (-1)^s * 2^(e % 3) * (1.m) ``` Then `cbrt(x) = x^(1/3)` can be computed as: ``` x^(1/3) = 2^(e / 3) * a^(1/3). ``` In order to avoid division, we compute `a^(-2/3)` using Newton method and then multiply the results by a: ``` a^(1/3) = a * a^(-2/3). ``` 2. First approximation to a^(-2/3) First, we use a degree-7 minimax polynomial generated by Sollya to approximate `x_r^(-2/3)` for `1 <= x_r < 2`. ``` p = P(x_r) ~ x_r^(-2/3), ``` with relative errors bounded by: ``` \| p / x_r^(-2/3) - 1 \| < 1.16 * 2^-21. ``` Then we multiply with `2^(e % 3)` from a small lookup table to get: ``` x_0 = 2^(-2(e % 3)/3) p ~ 2^(-2(e % 3)/3) x_r^(-2/3) = a^(-2/3) ``` with relative errors: ``` \| x_0 / a^(-2/3) - 1 \| < 1.16 * 2^-21. ``` This step is done in double precision. 3. First Newton iteration We follow the method described in: Sibidanov, A. and Zimmermann, P., "Correctly rounded cubic root evaluation in double precision", https://core-math.gitlabpages.inria.fr/cbrt64.pdf to derive multiplicative Newton iterations as below: Let `x_n` be the nth approximation to `a^(-2/3)`. Define the n^th error as: ``` h_n = x_n^3 * a^2 - 1 ``` Then: ``` a^(-2/3) = x_n / (1 + h_n)^(1/3) = x_n * (1 - (1/3) * h_n + (2/9) * h_n^2 - (14/81) * h_n^3 + ...) ``` using the Taylor series expansion of `(1 + h_n)^(-1/3)`. Apply to `x_0` above: ``` h_0 = x_0^3 * a^2 - 1 = a^2 * (x_0 - a^(-2/3)) * (x_0^2 + x_0 * a^(-2/3) + a^(-4/3)), ``` it's bounded by: ``` \|h_0\| < 4 * 3 * 1.16 * 2^-21 * 4 < 2^-17. ``` So in the first iteration step, we use: ``` x_1 = x_0 * (1 - (1/3) * h_n + (2/9) * h_n^2 - (14/81) * h_n^3) ``` Its relative error is bounded by: ``` \| x_1 / a^(-2/3) - 1 \| < 35/242 * \|h_0\|^4 < 2^-70. ``` Then we perform Ziv's rounding test and check if the answer is exact. This step is done in double-double precision. 4. Second Newton iteration If the Ziv's rounding test from the previous step fails, we define the error term: ``` h_1 = x_1^3 * a^2 - 1, ``` And perform another iteration: ``` x_2 = x_1 * (1 - h_1 / 3) ``` with the relative errors exceed the precision of double-double. We then check the Ziv's accuracy test with relative errors < 2^-102 to compensate for rounding errors. 5. Final iteration If the Ziv's accuracy test from the previous step fails, we perform another iteration in 128-bit precision and check for exact outputs.	2024-07-17 12:23:14 -04:00
Joseph Huber	60ff9c2ea5	[libc] Add support for `powi` as an LLVM libc extension on the GPU (#98236 ) Summary: This function is used by the CUDA / HIP / OpenMP headers and exists as an NVIDIA extension basically. This function is implemented in the C23 standard as `pown`, but for now we need to provide `powi` for backwards compatibility. In the future this entrypoint will just be a redirect to `pown` once that is implemented.	2024-07-09 20:51:36 -05:00
lntue	c9ee6b1977	[libc][math] Implement cbrtf function correctly rounded to all rounding modes. (#97936 ) Fixes https://github.com/llvm/llvm-project/issues/92874 Algorithm: Let `x = (-1)^s * 2^e * (1 + m)`. - Step 1: Range reduction: reduce the exponent with: ``` y = cbrt(x) = (-1)^s * 2^(floor(e/3)) * 2^((e % 3)/3) * (1 + m)^(1/3) ``` - Step 2: Use the first 4 bit fractional bits of `m` to look up for a degree-7 polynomial approximation to: ``` (1 + m)^(1/3) ~ 1 + m * P(m). ``` - Step 3: Perform the multiplication: ``` 2^((e % 3)/3) * (1 + m)^(1/3). ``` - Step 4: Check for exact cases to prevent rounding and clear `FE_INEXACT` floating point exception. - Step 5: Combine with the exponent and sign before converting down to `float` and return.	2024-07-08 10:02:12 -04:00
Hendrik Hübner	f8834ed24b	[libc][C23][math] Implement cospif function correctly rounded for all rounding modes (#97464 ) I also fixed a comment in sinpif.cpp in the first commit. Should this be included in this PR? All tests were passed, including the exhaustive test. CC: @lntue	2024-07-06 09:24:05 -04:00
OverMighty	ac76ce2693	[libc][math][c23] Classify f16fma{,f,l} as LLVM libc extensions (#97728 )	2024-07-05 09:58:01 -04:00
lntue	7d68d9d2f2	[libc][math] Implement correctly rounded double precision tan (#97489 ) Using the same range reduction as `sin`, `cos`, and `sincos`: 1) Reducing `x = kpi/128 + u`, with `\|u\| <= pi/256`, and `u` is in double-double. 2) Approximate `tan(u)` using degree-9 Taylor polynomial. 3) Compute ``` tan(x) ~ (sin(kpi/128) + tan(u) * cos(kpi/128)) / (cos(kpi/128) - tan(u) * sin(k*pi/128)) ``` using the fast double-double division algorithm in [the CORE-MATH project](https://gitlab.inria.fr/core-math/core-math/-/blob/master/src/binary64/tan/tan.c#L1855). 4) Perform relative-error Ziv's accuracy test 5) If the accuracy tests failed, we redo the computations using 128-bit precision `DyadicFloat`. Fixes https://github.com/llvm/llvm-project/issues/96930	2024-07-03 18:05:24 -04:00
OverMighty	4e56724213	[libc][math][c23] Add f16{add,sub}{,l,f128} C23 math functions (#97072 ) Part of #93566.	2024-07-02 19:27:09 -04:00
OverMighty	12a1e6dd12	[libc][math][c23] Add f16{add,sub}f C23 math functions (#96787 ) Part of #93566.	2024-07-02 09:16:12 -04:00
Hendrik Hübner	ea93c538c7	[libc][math][c23] Implemented sinpif function correctly rounded for all rounding modes. (#97149 ) This implements the sinpif function. An exhaustive test shows it's correct for all rounding modes. Issue: #94895	2024-07-01 16:38:03 -04:00
OverMighty	6c1c451b86	[libc][math][c23] Add f16sqrt{,l,f128} C23 math functions (#96642 ) Part of #95250.	2024-06-30 19:20:39 -04:00
OverMighty	56ef6a2eb2	[libc][math][c23] Add f16div{,l,f128} C23 math functions (#97054 ) Part of #93566.	2024-06-29 18:48:12 -04:00
OverMighty	e34dbb127a	[libc][math][c23] Add f16fma{,l,f128} C23 math function (#96711 ) Part of #93566.	2024-06-27 14:44:19 -04:00
lntue	4080f174ab	[libc][math] Implement double precision sincos correctly rounded to all rounding modes. (#96719 ) Sharing the same algorithm as double precision sin: https://github.com/llvm/llvm-project/pull/95736 and cos: https://github.com/llvm/llvm-project/pull/96591	2024-06-27 10:15:22 -04:00
lntue	88f80aeb0c	[libc][math] Implement double precision cos correctly rounded to all rounding modes. (#96591 ) Sharing the same algorithm as double precision sin: https://github.com/llvm/llvm-project/pull/95736	2024-06-25 16:51:31 -04:00
OverMighty	edbe698ead	[libc][math][c23] Add f16divf C23 math function (#96131 ) Part of #93566.	2024-06-25 08:48:28 -04:00
lntue	16903ace18	[libc][math] Implement double precision sin correctly rounded to all rounding modes. (#95736 ) - Algorithm: - Step 1 - Range reduction: for a double precision input `x`, return `k` and `u` such that - k is an integer - u = x - k * pi / 128, and \|u\| < pi/256 - Step 2 - Calculate `sin(u)` and `cos(u)` in double-double using Taylor polynomials with errors < 2^-70 with FMA or < 2^-66 w/o FMA. - Step 3 - Calculate `sin(x) = sin(kpi/128) cos(u) + cos(kpi/128) sin(u)` using look-up table for `sin(kpi/128)` and `cos(kpi/128)`. - Step 4 - Use Ziv's rounding test to decide if the result is correctly rounded. - Step 4' - If the Ziv's rounding test failed, redo step 1-3 using 128-bit precision. - Currently, without FMA instructions, the large range reduction only works correctly for the default rounding mode (FE_TONEAREST). - Provide `LIBC_MATH` flag so that users can set `LIBC_MATH = LIBC_MATH_SKIP_ACCURATE_PASS` to build the `sin` function without step 4 and 4'.	2024-06-24 17:57:08 -04:00
OverMighty	b5efd21429	[libc][math][c23] Add {ldexp,scalbn,scalbln}f16 C23 math functions (#94797 ) Part of #93566.	2024-06-21 09:01:47 -04:00
OverMighty	1107575c95	[libc][math][c23] Add {getpayload,setpayload,setpayloadsig}f16 C23 math functions (#95159 ) Part of #93566.	2024-06-20 13:33:34 -04:00
OverMighty	f3aceeee8a	[libc][math][c23] Add f16fmaf C23 math function (#95483 ) Part of #93566.	2024-06-14 12:31:32 -04:00
OverMighty	a239343521	[libc][math][c23] Add f16sqrtf C23 math function (#95251 ) Part of #95250.	2024-06-13 12:57:24 -04:00
OverMighty	f5dcfb9968	[libc][math][c23] Add {totalorder,totalordermag}f16 C23 math functions (#95014 ) Part of #93566.	2024-06-11 11:04:48 -04:00
OverMighty	7683a16dbf	[libc][math][c23] Add {remainder,remquo}f16 C23 math functions (#94773 ) Part of #93566.	2024-06-10 11:02:09 -04:00
OverMighty	10cd96dd33	[libc][math][c23] Add {frexp,ilogb,llogb,logb,modf}f16 C23 math functions (#94758 ) Part of #93566.	2024-06-10 08:38:47 -04:00
OverMighty	cb1a727dea	[libc][math][c23] Add nanf16 C23 math function (#94767 ) Part of #93566.	2024-06-10 00:19:22 -04:00
Hendrik Hübner	44aecca020	[libc][math][C23] Implemented remquof128 function (#94809 ) Added remquof128 function. Closes #94312	2024-06-08 15:08:45 -04:00
Job Henandez Lara	263be9fb00	[libc][math][c23] fmul correcly rounded to all rounding modes (#91537 ) This is an implementation of floating point multiplication: It will consist of - `double x double -> float`	2024-06-08 15:07:27 -04:00

1 2 3

114 Commits