clang-p2996

Author	SHA1	Message	Date
Petr Hosek	5ff3ff33ff	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98597 ) This is a part of #97655.	2024-07-12 09:28:41 -07:00
Mehdi Amini	ce9035f5bd	Revert "[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration" (#98593 ) Reverts llvm/llvm-project#98075 bots are broken	2024-07-12 09:12:13 +02:00
Petr Hosek	3f30effe1b	[libc] Migrate to using LIBC_NAMESPACE_DECL for namespace declaration (#98075 ) This is a part of #97655.	2024-07-11 12:35:22 -07:00
lntue	c9ee6b1977	[libc][math] Implement cbrtf function correctly rounded to all rounding modes. (#97936 ) Fixes https://github.com/llvm/llvm-project/issues/92874 Algorithm: Let `x = (-1)^s * 2^e * (1 + m)`. - Step 1: Range reduction: reduce the exponent with: ``` y = cbrt(x) = (-1)^s * 2^(floor(e/3)) * 2^((e % 3)/3) * (1 + m)^(1/3) ``` - Step 2: Use the first 4 bit fractional bits of `m` to look up for a degree-7 polynomial approximation to: ``` (1 + m)^(1/3) ~ 1 + m * P(m). ``` - Step 3: Perform the multiplication: ``` 2^((e % 3)/3) * (1 + m)^(1/3). ``` - Step 4: Check for exact cases to prevent rounding and clear `FE_INEXACT` floating point exception. - Step 5: Combine with the exponent and sign before converting down to `float` and return.	2024-07-08 10:02:12 -04:00
Hendrik Hübner	f8834ed24b	[libc][C23][math] Implement cospif function correctly rounded for all rounding modes (#97464 ) I also fixed a comment in sinpif.cpp in the first commit. Should this be included in this PR? All tests were passed, including the exhaustive test. CC: @lntue	2024-07-06 09:24:05 -04:00
OverMighty	12a1e6dd12	[libc][math][c23] Add f16{add,sub}f C23 math functions (#96787 ) Part of #93566.	2024-07-02 09:16:12 -04:00
Job Henandez Lara	6f60d2b807	[libc] Add mpfr tests for fmul. (#97376 ) Fixes https://github.com/llvm/llvm-project/issues/94834	2024-07-02 00:38:15 -04:00
Hendrik Hübner	ea93c538c7	[libc][math][c23] Implemented sinpif function correctly rounded for all rounding modes. (#97149 ) This implements the sinpif function. An exhaustive test shows it's correct for all rounding modes. Issue: #94895	2024-07-01 16:38:03 -04:00
OverMighty	6c1c451b86	[libc][math][c23] Add f16sqrt{,l,f128} C23 math functions (#96642 ) Part of #95250.	2024-06-30 19:20:39 -04:00
OverMighty	56ef6a2eb2	[libc][math][c23] Add f16div{,l,f128} C23 math functions (#97054 ) Part of #93566.	2024-06-29 18:48:12 -04:00
OverMighty	e34dbb127a	[libc][math][c23] Add f16fma{,l,f128} C23 math function (#96711 ) Part of #93566.	2024-06-27 14:44:19 -04:00
OverMighty	ef05b03223	[libc][math][c23] Add MPFR exhaustive test for fmodf16 (#94656 )	2024-06-25 16:44:47 -04:00
OverMighty	edbe698ead	[libc][math][c23] Add f16divf C23 math function (#96131 ) Part of #93566.	2024-06-25 08:48:28 -04:00
OverMighty	f3aceeee8a	[libc][math][c23] Add f16fmaf C23 math function (#95483 ) Part of #93566.	2024-06-14 12:31:32 -04:00
OverMighty	a239343521	[libc][math][c23] Add f16sqrtf C23 math function (#95251 ) Part of #95250.	2024-06-13 12:57:24 -04:00
OverMighty	f50656c509	[libc][math][c23] Add MPFR unit tests for {rint,lrint,llrint,lround,llround}f16 (#94473 )	2024-06-10 17:30:18 -04:00
OverMighty	65310f34d7	Reapply "[libc][math][c23] Add MPFR unit tests for {ceil,floor,round,roundeven,trunc}f16 (#94383 )" (#94807 ) This reverts commit `cbe97e959d`.	2024-06-10 15:48:19 -04:00
OverMighty	cbe97e959d	Revert "[libc][math][c23] Add MPFR unit tests for {ceil,floor,round,roundeven,trunc}f16 (#94383 )" (#94505 ) This reverts commit `fda1e4b01f`. The commit caused Buildbot failures: - https://lab.llvm.org/buildbot/#/builders/256/builds/14331 - https://lab.llvm.org/buildbot/#/builders/229/builds/27009	2024-06-05 13:16:09 -04:00
OverMighty	fda1e4b01f	[libc][math][c23] Add MPFR unit tests for {ceil,floor,round,roundeven,trunc}f16 (#94383 )	2024-06-05 12:38:15 -04:00
Job Henandez Lara	49561181bd	[libc] Add proxy header for fenv.h macro constants. #87863 (#87896 ) Hello, this addresses #87863.	2024-04-09 12:55:10 -04:00
lntue	5748ad84e5	[libc] Add proxy header math_macros.h. (#87598 ) Context: https://github.com/llvm/llvm-project/pull/87017 - Add proxy header `libc/hdr/math_macros.h` that will: - include `<math.h>` in overlay mode, - include `"include/llvm-libc-macros/math-macros.h"` in full build mode. - Its corresponding CMake target `libc.hdr.math_macros` will only depend on `libc.include.math` and `libc.include.llvm-libc-macros.math_macros` in full build mode. - Replace all `#include "include/llvm-libc-macros/math-macros.h"` with `#include "hdr/math_macros.h"`. - Add dependency to `libc.hdr.math_macros` CMake target when using `add_fp_unittest`. - Update the remaining dependency. - Update bazel overlay: add `libc:hdr_math_macros` target, and replacing all dependency on `libc:llvm_libc_macros_math_macros` with `libc:hdr_math_macros`.	2024-04-05 18:21:16 -04:00
Vinayak Dev	3b961d113e	[libc] Implement roundeven C23 math functions (#87678 ) Implements the functions `roundeven()`, `roundevenf()`, `roundevenl()` from the roundeven family of functions introduced in C23. Also implements `roundevenf128()`.	2024-04-05 08:36:12 -04:00
OverMighty	a8c59750d9	[libc][math][c23] Add exp2m1f C23 math function (#86996 ) Fixes #86502. cc @lntue	2024-04-04 08:22:45 -04:00
lntue	2be722587f	[libc][math] Implement atan2f correctly rounded to all rounding modes. (#86716 ) We compute atan2f(y, x) in 2 stages: - Fast step: perform computations in double precision , with relative errors < 2^-50 - Accurate step: if the result from the Fast step fails Ziv's rounding test, then we perform computations in double-double precision, with relative errors < 2^-100. On Ryzen 5900X, worst-case latency is ~ 200 clocks, compared to average latency ~ 60 clocks, and average reciprocal throughput ~ 20 clocks.	2024-04-01 13:31:07 -04:00
Michael Jones	5d56b34807	[libc] Remove direct math.h includes (#85324 ) Reland of #84991 A downstream overlay mode user ran into issues with the isnan macro not working in our sources with a specific libc configuration. This patch replaces the last direct includes of math.h with our internal math_macros.h, along with the necessary build system changes.	2024-03-18 14:19:33 -07:00
Guillaume Chatelet	2856db0d3b	[libc][NFC] Remove `FPBits` cast operator (#79142 ) The semantics for casting can range from "bitcast" (same representation) to "different representation", to "type promotion". Here we remove the cast operator and force usage of `get_val` as the only function to get the floating point value, making the intent clearer and more consistent.	2024-01-23 17:30:19 +01:00
Guillaume Chatelet	6b02d2f863	[reland][libc] Remove unnecessary `FPBits` functions and properties (#79128 ) - reland #79113 - Fix aarch64 RISC-V build	2024-01-23 13:48:03 +01:00
Guillaume Chatelet	b524eed925	Revert "[libc] Remove unnecessary `FPBits` functions and properties" (#79118 ) Reverts llvm/llvm-project#79113 It broke aarch64 build bot machines.	2024-01-23 11:51:18 +01:00
Guillaume Chatelet	3bc86bf3bf	[libc] Remove unnecessary `FPBits` functions and properties (#79113 ) This patch reduces the surface of `FPBits`.	2024-01-23 11:48:28 +01:00
Guillaume Chatelet	14f0c06f48	[libc] Fix is_subnormal for Intel Extended Precision (#78592 ) Also turn a set of `get_biased_exponent() == 0` into `is_subnormal()` which is clearer.	2024-01-19 09:36:03 +01:00
Guillaume Chatelet	c09e690556	[libc][NFC] Remove `FloatProperties` (#76508 ) Access is now done through `FPBits` exclusively. This patch also renames a few internal structs and uses `T` instead of `FP` as a template parameter.	2024-01-03 09:51:58 +01:00
Guillaume Chatelet	c23991478a	[libc][NFC] Integrate `FloatProperties` into `FPBits` (#76506 ) `FloatProperties` is always included when `FPBits` is. This will help further refactoring.	2023-12-28 15:42:47 +01:00
Guillaume Chatelet	3546f4da19	[libc][NFC] Rename `MANTISSA_WIDTH` in `FRACTION_LEN` (#75489 ) This one might be a bit controversial since the terminology has been introduced from the start but I think `FRACTION_LEN` is a better name here. AFAICT it really is "the number of bits after the decimal dot when the number is in normal form." `MANTISSA_WIDTH` is less precise as it's unclear whether we take the leading bit into account. This patch also renames most of the properties to use the `_LEN` suffix and fixes useless casts or variables.	2023-12-15 13:57:35 +01:00
Guillaume Chatelet	493cc71d72	[libc][NFC] Remove MantissaWidth traits (#75458 ) Same as #75362, the traits does not bring a lot of value over `FloatProperties::MANTISSA_WIDTH` (or `FPBits::MANTISSA_WIDTH`).	2023-12-14 15:07:09 +01:00
Guillaume Chatelet	7b387d2758	[libc][NFC] Fix mixed up biased/unbiased exponent (#75037 ) According to [wikipedia](https://en.wikipedia.org/wiki/Exponent_bias) the "biased exponent" is the encoded form that is always positive whereas the unbiased form is the actual "real" exponent that can be positive or negative. `FPBits` seems to be using `unbiased_exponent` to describe the encoded form (unsigned). This patch simply use `biased` instead of `unbiased`.	2023-12-11 17:06:48 +01:00
Guillaume Chatelet	d924c5d721	[libc][NFC] Sink "PlatformDefs.h" into "FloatProperties.h" (#73226 ) `PlatformDefs.h` does not bring a lot of value as a separate file. It is transitively included in `FloatProperties.h` and `FPBits.h`. This patch sinks it into `FloatProperties.h` and removes the associated build targets.	2023-11-23 11:23:18 +01:00
lntue	bc7a3bd864	[libc][math] Implement powf function correctly rounded to all rounding modes. (#71188 ) We compute `pow(x, y)` using the formula ``` pow(x, y) = x^y = 2^(y * log2(x)) ``` We follow similar steps as in `log2f(x)` and `exp2f(x)`, by breaking down into `hi + mid + lo` parts, in which `hi` parts are computed using the exponent field directly, `mid` parts will use look-up tables, and `lo` parts are approximated by polynomials. We add some speedup for common use-cases: ``` pow(2, y) = exp2(y) pow(10, y) = exp10(y) pow(x, 2) = x * x pow(x, 1/2) = sqrt(x) pow(x, -1/2) = rsqrt(x) - to be added ```	2023-11-06 16:54:25 -05:00
Mikhail R. Gadelha	8fc87f54a8	[libc][NFC] Couple of small warning fixes (#67847 ) This patch fixes a couple of warnings when compiling with gcc 13: * CPP/type_traits_test.cpp: 'apply' overrides a member function but is not marked 'override' * UnitTest/LibcTest.cpp:98: control reaches end of non-void function * MPFRWrapper/MPFRUtils.cpp:75: control reaches end of non-void function * smoke/FrexpTest.h:92: backslash-newline at end of file * __support/float_to_string.h:118: comparison of unsigned expression in ‘>= 0’ is always true * test/src/__support/CPP/bitset_test.cpp:197: comparison of unsigned expression in ‘>= 0’ is always true --------- Signed-off-by: Mikhail R. Gadelha <mikhail@igalia.com>	2023-10-02 19:29:26 -04:00
Guillaume Chatelet	b6bc9d72f6	[libc] Mass replace enclosing namespace (#67032 ) This is step 4 of https://discourse.llvm.org/t/rfc-customizable-namespace-to-allow-testing-the-libc-when-the-system-libc-is-also-llvms-libc/73079	2023-09-26 11:45:04 +02:00
Michael Jones	cfbcbc8f88	[libc] fix MPFR rounding problems in fuzz test The accuracy for the MPFR numbers in the strtofloat fuzz test was set too high, causing rounding issues when rounding to a smaller final result. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D154150	2023-07-05 10:53:40 -07:00
Tue Ly	f320fefc4a	[libc][math] Implement erff function correctly rounded to all rounding modes. Implement correctly rounded `erff` functions. For `x >= 4`, `erff(x) = 1` for `FE_TONEAREST` or `FE_UPWARD`, `0x1.ffffep-1` for `FE_DOWNWARD` or `FE_TOWARDZERO`. For `0 <= x < 4`, we divide into 32 sub-intervals of length `1/8`, and use a degree-15 odd polynomial to approximate `erff(x)` in each sub-interval: ``` erff(x) ~ x * (c0 + c1 * x^2 + c2 * x^4 + ... + c7 * x^14). ``` For `x < 0`, we can use the same formula as above, since the odd part is factored out. Performance tested with `perf.sh` tool from the CORE-MATH project on AMD Ryzen 9 5900X: Reciprocal throughput (clock cycles / op) ``` $ ./perf.sh erff --path2 GNU libc version: 2.35 GNU libc release: stable -- CORE-MATH reciprocal throughput -- with -march=native (with FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 11.790 + 0.182 clc/call; Median-Min = 0.154 clc/call; Max = 12.255 clc/call; -- CORE-MATH reciprocal throughput -- with -march=x86-64-v2 (without FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 14.205 + 0.151 clc/call; Median-Min = 0.159 clc/call; Max = 15.893 clc/call; -- System LIBC reciprocal throughput -- [####################] 100 % Ntrial = 20 ; Min = 45.519 + 0.445 clc/call; Median-Min = 0.552 clc/call; Max = 46.345 clc/call; -- LIBC reciprocal throughput -- with -mavx2 -mfma (with FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 9.595 + 0.214 clc/call; Median-Min = 0.220 clc/call; Max = 9.887 clc/call; -- LIBC reciprocal throughput -- with -msse4.2 (without FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 10.223 + 0.190 clc/call; Median-Min = 0.222 clc/call; Max = 10.474 clc/call; ``` and latency (clock cycles / op): ``` $ ./perf.sh erff --path2 GNU libc version: 2.35 GNU libc release: stable -- CORE-MATH latency -- with -march=native (with FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 38.566 + 0.391 clc/call; Median-Min = 0.503 clc/call; Max = 39.170 clc/call; -- CORE-MATH latency -- with -march=x86-64-v2 (without FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 43.223 + 0.667 clc/call; Median-Min = 0.680 clc/call; Max = 43.913 clc/call; -- System LIBC latency -- [####################] 100 % Ntrial = 20 ; Min = 111.613 + 1.267 clc/call; Median-Min = 1.696 clc/call; Max = 113.444 clc/call; -- LIBC latency -- with -mavx2 -mfma (with FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 40.138 + 0.410 clc/call; Median-Min = 0.536 clc/call; Max = 40.729 clc/call; -- LIBC latency -- with -msse4.2 (without FMA instructions) [####################] 100 % Ntrial = 20 ; Min = 44.858 + 0.872 clc/call; Median-Min = 0.814 clc/call; Max = 46.019 clc/call; ``` Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D153683	2023-06-28 13:58:37 -04:00
Tue Ly	37458f6693	[libc][math] Move str method from FPBits class to testing utils. str method of FPBits class is only used for pretty printing its objects in tests. It brings cpp::string dependency to FPBits class, which is not ideal for embedded use case. We move str method to a free function in test utils and remove this dependency of FPBits class. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D152607	2023-06-10 02:50:58 -04:00
Michael Jones	ae3b59e623	[libc] Use MPFR for strtofloat fuzzing The previous string to float tests didn't check correctness, but due to the atof differential test proving unreliable the strtofloat fuzz test has been changed to use MPFR for correctness checking. Some minor bugs have been found and fixed as well. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D150905	2023-05-22 11:04:53 -07:00
Siva Chandra Reddy	00bd8e9011	[libc] Add a str() method to FPBits which returns a string representation. Unit tests for the str() method have also been added. Previously, a separate test only helper function was being used by the test matchers which has regressed over many cleanups. Moreover, being a test only utility, it was not tested separately (and hence the regression). Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D150906	2023-05-19 06:20:41 +00:00
Siva Chandra Reddy	dcf296b541	[libc][NFC] Remove the StreamWrapper class and use the new test logger. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D148452	2023-04-17 15:48:18 +00:00
Siva Chandra Reddy	af1315c28f	[libc][NFC] Move UnitTest and IntegrationTest to the 'test' directory. This part of the effort to make all test related pieces into the `test` directory. This helps is excluding test related pieces in a straight forward manner if LLVM_INCLUDE_TESTS is OFF. Future patches will also move the MPFR wrapper and testutils into the 'test' directory.	2023-02-07 19:45:51 +00:00
Tue Ly	9b30f6b6d7	[libc][math] Implement acoshf function correctly rounded to all rounding modes. Implement acoshf function correctly rounded to all rounding modes. Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D142781	2023-02-01 11:35:15 -05:00
Tue Ly	46b15fd19e	[libc][math] Implement asinhf function correctly rounded for all rounding modes. Implement asinhf function correctly rounded for all rounding modes. Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D142681	2023-01-27 11:12:27 -05:00
Tue Ly	a752460d73	[libc][math] Implement exp10f function correctly rounded to all rounding modes. Implement exp10f function correctly rounded to all rounding modes. Algorithm: perform range reduction to reduce ``` 10^x = 2^(hi + mid) * 10^lo ``` where: ``` hi is an integer, 0 <= mid * 2^5 < 2^5 -log10(2) / 2^6 <= lo <= log10(2) / 2^6 ``` Then `2^mid` is stored in a table of 32 entries and the product `2^hi * 2^mid` is performed by adding `hi` into the exponent field of `2^mid`. `10^lo` is then approximated by a degree-5 minimax polynomials generated by Sollya with: ``` > P = fpminimax((10^x - 1)/x, 4, [\|D...\|], [-log10(2)/64. log10(2)/64]); ``` Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp10f GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 10.215 System LIBC reciprocal throughput : 7.944 LIBC reciprocal throughput : 38.538 LIBC reciprocal throughput : 12.175 (with `-msse4.2` flag) LIBC reciprocal throughput : 9.862 (with `-mfma` flag) $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh exp10f --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 40.744 System LIBC latency : 37.546 BEFORE LIBC latency : 48.989 LIBC latency : 44.486 (with `-msse4.2` flag) LIBC latency : 40.221 (with `-mfma` flag) ``` This patch relies on https://reviews.llvm.org/D134002 Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D134104	2022-09-19 10:01:40 -04:00
Tue Ly	463dcc8749	[libc][math] Implement acosf function correctly rounded for all rounding modes. Implement acosf function correctly rounded for all rounding modes. We perform range reduction as follows: - When `\|x\| < 2^(-10)`, we use cubic Taylor polynomial: ``` acos(x) = pi/2 - asin(x) ~ pi/2 - x - x^3 / 6. ``` - When `2^(-10) <= \|x\| <= 0.5`, we use the same approximation that is used for `asinf(x)` when `\|x\| <= 0.5`: ``` acos(x) = pi/2 - asin(x) ~ pi/2 - x - x^3 * P(x^2). ``` - When `0.5 < x <= 1`, we use the double angle formula: `cos(2y) = 1 - 2 * sin^2 (y)` to reduce to: ``` acos(x) = 2 * asin( sqrt( (1 - x)/2 ) ) ``` - When `-1 <= x < -0.5`, we reduce to the positive case above using the formula: ``` acos(x) = pi - acos(-x) ``` Performance benchmark using perf tool from the CORE-MATH project on Ryzen 1700: ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh acosf GNU libc version: 2.35 GNU libc release: stable CORE-MATH reciprocal throughput : 28.613 System LIBC reciprocal throughput : 29.204 LIBC reciprocal throughput : 24.271 $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh asinf --latency GNU libc version: 2.35 GNU libc release: stable CORE-MATH latency : 55.554 System LIBC latency : 76.879 LIBC latency : 62.118 ``` Reviewed By: orex, zimmermann6 Differential Revision: https://reviews.llvm.org/D133550	2022-09-09 09:55:30 -04:00

1 2 3

111 Commits