glibc/sysdeps/ieee754
H.J. Lu a8ecb126d4 x86_64: Add log1p with FMA
On Skylake, it changes log1p bench performance by:

        Before       After     Improvement
max     63.349       58.347       8%
min     4.448        5.651        -30%
mean    12.0674      10.336       14%

The minimum code path is

 if (hx < 0x3FDA827A)                          /* x < 0.41422  */
    {
      if (__glibc_unlikely (ax >= 0x3ff00000))           /* x <= -1.0 */
        {
	   ...
        }
      if (__glibc_unlikely (ax < 0x3e200000))           /* |x| < 2**-29 */
        {
          math_force_eval (two54 + x);          /* raise inexact */
          if (ax < 0x3c900000)                  /* |x| < 2**-54 */
            {
	      ...
            }
          else
            return x - x * x * 0.5;

FMA and non-FMA code sequences look similar.  Non-FMA version is slightly
faster.  Since log1p is called by asinh and atanh, it improves asinh
performance by:

        Before       After     Improvement
max     75.645       63.135       16%
min     10.074       10.071       0%
mean    15.9483      14.9089      6%

and improves atanh performance by:

        Before       After     Improvement
max     91.768       75.081       18%
min     15.548       13.883       10%
mean    18.3713      16.8011      8%
2023-08-21 10:44:26 -07:00
..
dbl-64 x86_64: Add log1p with FMA 2023-08-21 10:44:26 -07:00
float128 Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
flt-32 aarch64: Add vector implementations of cos routines 2023-06-30 09:04:10 +01:00
ldbl-64-128 Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
ldbl-96 Fix misspellings in sysdeps/ -- BZ 25337 2023-05-30 23:02:29 +00:00
ldbl-128 Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
ldbl-128ibm Fix misspellings in sysdeps/ -- BZ 25337 2023-05-30 23:02:29 +00:00
ldbl-128ibm-compat sysdeps/ieee754/ldbl-128ibm-compat: Fix warn unused result 2023-07-05 16:59:48 +02:00
ldbl-opt configure: Use autoconf 2.71 2023-07-17 10:08:10 -04:00
soft-fp math: Suppress -O0 warnings for soft-fp fsqrt [BZ #19444] 2023-01-11 17:50:51 -03:00
ieee754.h Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
k_standard.c Use copysign functions not __copysign functions in glibc libm. 2018-09-27 20:04:48 +00:00
k_standardf.c Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
k_standardl.c Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
libm-alias-finite.h Update copyright dates with scripts/update-copyrights 2023-01-06 21:14:39 +00:00
Makefile
s_lib_version.c Simplify math-svid-compat code. 2017-08-28 15:19:52 +00:00
s_matherr.c Obsolete matherr, _LIB_VERSION, libieee.a. 2017-08-21 17:45:10 +00:00
s_signgam.c Remove unnecessary math_private.h includes. 2018-09-28 21:53:33 +00:00