diff options
| author | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-11-18 08:45:16 -0300 |
|---|---|---|
| committer | Adhemerval Zanella <adhemerval.zanella@linaro.org> | 2024-12-18 17:24:43 -0300 |
| commit | a357d6273f796b384b7e36d58a8de947ebde9668 (patch) | |
| tree | 60c206ae0acd2465638b5c35a12d95b29796e1cc /SHARED-FILES | |
| parent | ed608a40e2903351c64a3cea51ce8bd7c62996ef (diff) | |
| download | glibc-a357d6273f796b384b7e36d58a8de947ebde9668.tar.xz glibc-a357d6273f796b384b7e36d58a8de947ebde9668.zip | |
math: Use atanf from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows slight better performance to the generic atanf.
The code was adapted to glibc style and to use the definition of
math_config.h (to handle errno, overflow, and underflow).
Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1,
gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1):
Latency master patched improvement
x86_64 56.8265 53.6842 5.53%
x86_64v2 54.8177 53.6842 2.07%
x86_64v3 46.2915 48.7034 -5.21%
i686 158.3760 108.9560 31.20%
aarch64 (Neoverse) 21.687 20.5893 5.06%
power10 13.1903 13.5012 -2.36%
reciprocal-throughput master patched improvement
x86_64 16.6787 16.7601 -0.49%
x86_64v2 16.6983 16.7601 -0.37%
x86_64v3 16.2268 12.1391 25.19%
i686 138.6840 36.0640 74.00%
aarch64 (Neoverse) 11.8012 10.3565 12.24%
power10 5.3212 4.2894 19.39%
Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
Diffstat (limited to 'SHARED-FILES')
| -rw-r--r-- | SHARED-FILES | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/SHARED-FILES b/SHARED-FILES index 18b3244e44..b9627afdfe 100644 --- a/SHARED-FILES +++ b/SHARED-FILES @@ -310,3 +310,7 @@ sysdeps/ieee754/flt-32/s_asinhf.c: (src/binary32/asinh/asinhf.c in CORE-MATH) - The code was adapted to use glibc code style and internal functions to handle errno, overflow, and underflow. +sysdeps/ieee754/flt-32/s_atanf.c: + (src/binary32/atan/atanf.c in CORE-MATH) + - The code was adapted to use glibc code style and internal + functions to handle errno, overflow, and underflow. |
