glibc/sysdeps/i386/fpu
H.J. Lu 5313581cb5 i386: Replace assembly versions of e_powf with generic e_powf.c
This patch replaces i386 assembly versions of e_powf with generic
e_powf.c.  For workload-spec2017.wrf, on Nehalem, it improves
performance by:

                           Before            After     Improvement
reciprocal-throughput      230.855          78.3358       194%
latency                    231.685          94.1259       146%

On Skylake, it improves performance by:

                           Before            After     Improvement
reciprocal-throughput      239.858          47.4713       405%
latency                    247.57           93.8798       163%

On IvyBridge with --disable-multi-arch, it improves performance by:

                           Before            After     Improvement
reciprocal-throughput      269.078          63.3758       324%
latency                    271.473          102.091       165%

	* sysdeps/i386/fpu/e_powf.S: Removed.
	* sysdeps/i386/fpu/e_powf_log2_data.c: Likewise.
	* sysdeps/i386/fpu/w_powf.c: Likewise.
	* sysdeps/i386/fpu/libm-test-ulps: Updated for generic e_powf.c.
	* sysdeps/i386/i686/fpu/multiarch/libm-test-ulps: Likewise.
	* sysdeps/i386/i686/fpu/multiarch/Makefile (libm-sysdep_routines):
	Add e_powf-sse2.
	(CFLAGS-e_powf-sse2.c): New.
	* sysdeps/i386/i686/fpu/multiarch/e_powf-sse2.c: New file.
	* sysdeps/i386/i686/fpu/multiarch/e_powf.c: Likewise.
2017-10-22 08:12:41 -07:00
..
doasin.c
e_acos.S
e_acosf.S
e_acosh.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_acoshf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_acoshl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_acosl.c
e_asin.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
e_asinf.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
e_atan2.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
e_atan2f.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
e_atan2l.c
e_atanh.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_atanhf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_atanhl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_exp2.S Avoid excess range in results from i386 exp, hypot, pow functions (bug 18980). 2015-09-18 21:53:22 +00:00
e_exp2l.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
e_exp10.S Avoid excess range in results from i386 exp, hypot, pow functions (bug 18980). 2015-09-18 21:53:22 +00:00
e_exp10f.S Avoid excess range in results from i386 exp, hypot, pow functions (bug 18980). 2015-09-18 21:53:22 +00:00
e_exp10l.S
e_exp.S Avoid excess range in results from i386 exp, hypot, pow functions (bug 18980). 2015-09-18 21:53:22 +00:00
e_expl.S Fix i386/x86_64 expl, exp10l, expm1l for sNaN input (bug 20226). 2016-06-08 21:55:06 +00:00
e_fmod.S
e_fmodf.S
e_fmodl.c
e_hypot.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_hypotf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_ilogb.S
e_ilogbf.S
e_ilogbl.S
e_log2.S Fix log2 (1) in round-downward mode (bug 17042). 2014-06-10 12:07:15 +00:00
e_log2l.S Fix i386/x86_64 log2l (sNaN) (bug 20235). 2016-06-09 18:04:30 +00:00
e_log10.S Fix log10 (1) in round-downward mode (bug 16977). 2014-05-23 12:07:50 +00:00
e_log10f.S Fix log10 (1) in round-downward mode (bug 16977). 2014-05-23 12:07:50 +00:00
e_log10l.S Fix i386/x86_64 log10l (sNaN) (bug 20228). 2016-06-08 22:59:18 +00:00
e_log.S Fix i386/x86_64 log* (1) zero sign for -ffinite-math-only (bug 19213). 2015-11-05 21:56:31 +00:00
e_logl.S Fix i386/x86_64 logl (sNaN) (bug 20227). 2016-06-08 22:24:06 +00:00
e_pow.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_powl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
e_rem_pio2.c
e_remainder.S
e_remainderf.S
e_remainderl.S
e_scalb.S Avoid excess range in results from i386 scalb functions (bug 18981). 2015-09-18 20:34:59 +00:00
e_scalbf.S Avoid excess range in results from i386 scalb functions (bug 18981). 2015-09-18 20:34:59 +00:00
e_scalbl.S Fix i386/x86_64 scalbl with sNaN input (bug 20296). 2016-06-23 22:17:41 +00:00
e_sqrt.S Fix x86 sqrt rounding (bug 14032). 2013-11-29 16:31:16 +00:00
e_sqrtf.S
e_sqrtl.c
fclrexcpt.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fedisblxcpt.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
feenablxcpt.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fegetenv.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fegetexcept.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
fegetmode.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fegetround.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
feholdexcpt.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fenv_private.h Add float128 support for x86_64, x86. 2017-06-26 22:02:24 +00:00
fesetenv.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fesetexcept.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
fesetmode.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fesetround.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
feupdateenv.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fgetexcptflg.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
fraiseexcpt.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
fsetexcptflg.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
ftestexcept.c Check if SSE is available with HAS_CPU_FEATURE 2017-04-07 07:44:59 -07:00
halfulp.c
i386-math-asm.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
Implies
libm-test-ulps i386: Replace assembly versions of e_powf with generic e_powf.c 2017-10-22 08:12:41 -07:00
libm-test-ulps-name Do not hardcode platform names in manual/libm-err-tab.pl (bug 14139). 2016-11-04 16:49:06 +00:00
math_private.h Fix math_private.h multiple include guards. 2015-11-20 23:46:23 +00:00
math-tests.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
mpatan2.c
mpatan.c
mpexp.c
mplog.c
mpsqrt.c
s_asinh.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_asinhf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_asinhl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_atan.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
s_atanf.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
s_atanl.c
s_cbrt.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_cbrtf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_cbrtl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_ceil.S Avoid "inexact" exceptions in i386/x86_64 ceil functions (bug 15479). 2016-06-27 17:24:30 +00:00
s_ceilf.S Avoid "inexact" exceptions in i386/x86_64 ceil functions (bug 15479). 2016-06-27 17:24:30 +00:00
s_ceill.S Avoid "inexact" exceptions in i386/x86_64 ceil functions (bug 15479). 2016-06-27 17:24:30 +00:00
s_copysign.S
s_copysignf.S
s_copysignl.S
s_expm1.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_expm1f.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_expm1l.S
s_fabs.S
s_fabsf.S
s_fabsl.S
s_fdim.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_finite.S
s_finitef.S
s_finitel.S
s_floor.S Avoid "inexact" exceptions in i386/x86_64 floor functions (bug 15479). 2016-06-27 17:25:47 +00:00
s_floorf.S Avoid "inexact" exceptions in i386/x86_64 floor functions (bug 15479). 2016-06-27 17:25:47 +00:00
s_floorl.S Avoid "inexact" exceptions in i386/x86_64 floor functions (bug 15479). 2016-06-27 17:25:47 +00:00
s_fmax.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fmaxf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fmaxl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fmin.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fminf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fminl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_fpclassifyl.c Consistently use uintN_t not u_intN_t in libm. 2017-08-03 19:55:04 +00:00
s_frexp.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_frexpf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_frexpl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_isinfl.c
s_isnanl.c Consistently use uintN_t not u_intN_t in libm. 2017-08-03 19:55:04 +00:00
s_llrint.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_llrintf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_llrintl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_log1p.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
s_log1pf.S Refactor i386 libm code forcing underflow exceptions. 2015-09-24 21:41:00 +00:00
s_log1pl.S Fix i386/x86_64 log1pl (sNaN) (bug 20229). 2016-06-08 23:11:42 +00:00
s_logb.S
s_logbf.S
s_logbl.c
s_lrint.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_lrintf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_lrintl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_nearbyint.S Simplify x86 nearbyint functions. 2016-06-22 15:40:30 +00:00
s_nearbyintf.S Simplify x86 nearbyint functions. 2016-06-22 15:40:30 +00:00
s_nearbyintl.S Simplify x86 nearbyint functions. 2016-06-22 15:40:30 +00:00
s_nextafterl.c Consistently use uintN_t not u_intN_t in libm. 2017-08-03 19:55:04 +00:00
s_nexttoward.c Consistently use uintN_t not u_intN_t in libm. 2017-08-03 19:55:04 +00:00
s_nexttowardf.c Consistently use uintN_t not u_intN_t in libm. 2017-08-03 19:55:04 +00:00
s_remquo.S
s_remquof.S
s_remquol.S
s_rint.S
s_rintf.S
s_rintl.c
s_scalbln.c
s_scalblnf.c
s_scalblnl.c
s_scalbn.S Avoid excess range in results from i386 scalb functions (bug 18981). 2015-09-18 20:34:59 +00:00
s_scalbnf.S Avoid excess range in results from i386 scalb functions (bug 18981). 2015-09-18 20:34:59 +00:00
s_scalbnl.S Make scalbn set errno (bug 6803). 2015-09-16 21:11:00 +00:00
s_significand.S
s_significandf.S
s_significandl.c
s_trunc.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_truncf.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
s_truncl.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
slowexp.c
slowpow.c
t_exp.c
Versions
w_sqrt_compat.c Move wrappers to libm-compat-calls-auto 2017-01-04 16:25:04 -02:00
w_sqrt.c Prefer new libm function wrappers for !LIBM_SVID_COMPAT. 2017-09-05 23:35:55 +00:00