glibc/sysdeps/ieee754
Adhemerval Zanella 5fa89852fa math: Use exp10m1f from CORE-MATH
The CORE-MATH implementation is correctly rounded (for any rounding mode)
and shows better performance compared to the generic exp10m1f.

The code was adapted to glibc style and to use the definition of
math_config.h (to handle errno, overflow, and underflow).  I mostly
fixed some small issues in corner cases (sNaN handling, -INFINITY,
a specific overflow check).

Benchtest on x64_64 (Ryzen 9 5900X, gcc 14.2.1), aarch64 (Neoverse-N1,
gcc 13.3.1), and powerpc (POWER10, gcc 13.2.1):

Latency                      master        patched   improvement
x86_64                      45.4690        49.5845        -9.05%
x86_64v2                    46.1604        36.2665        21.43%
x86_64v3                    37.8442        31.0359        17.99%
i686                        121.367        93.0079        23.37%
aarch64                     21.1126        15.0165        28.87%
power10                     12.7426        8.4929         33.35%

reciprocal-throughput        master        patched   improvement
x86_64                      19.6005        17.4005        11.22%
x86_64v2                    19.6008        11.1977        42.87%
x86_64v3                    17.5427        10.2898        41.34%
i686                        59.4215        60.9675        -2.60%
aarch64                     13.9814        7.9173         43.37%
power10                      6.7814        6.4258          5.24%

The generic implementation calls __ieee754_exp10f which has an
optimized version, although it is not correctly rounded, which is
the main culprit of the the latency difference for x86_64 and
throughp for i686.

Signed-off-by: Alexei Sibidanov <sibid@uvic.ca>
Signed-off-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>
Signed-off-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
Reviewed-by: DJ Delorie <dj@redhat.com>
2024-11-01 11:27:26 -03:00
..
dbl-64 replace tgammaf by the CORE-MATH implementation 2024-10-11 11:12:32 +02:00
float128 Implement C23 logp1 2024-06-17 13:47:09 +00:00
flt-32 math: Use exp10m1f from CORE-MATH 2024-11-01 11:27:26 -03:00
ldbl-64-128 math: Provide missing math symbols on libc.a (BZ 31781) 2024-05-23 09:36:08 -03:00
ldbl-96 Implement C23 logp1 2024-06-17 13:47:09 +00:00
ldbl-128 Implement C23 logp1 2024-06-17 13:47:09 +00:00
ldbl-128ibm Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
ldbl-128ibm-compat Implement C23 exp2m1, exp10m1 2024-06-17 16:31:49 +00:00
ldbl-opt Convert to autoconf 2.72 (vanilla release, no distribution patches) 2024-06-17 21:15:28 +02:00
soft-fp Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
ieee754.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
k_standard.c Use copysign functions not __copysign functions in glibc libm. 2018-09-27 20:04:48 +00:00
k_standardf.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
k_standardl.c Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
libm-alias-finite.h Update copyright dates with scripts/update-copyrights 2024-01-01 10:53:40 -08:00
Makefile
s_lib_version.c
s_matherr.c
s_signgam.c Remove unnecessary math_private.h includes. 2018-09-28 21:53:33 +00:00