glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-12-24 11:41:07 +00:00

Author	SHA1	Message	Date
Adhemerval Zanella	41e8926aa4	PowerPC: Set/restore rounding mode only when needed This patch helps some math functions performance by adding the libc_fexxx variant of inline functions to handle both FPU round and exception set/restore and by using them on the libc_fexxx_ctx functions. It is based on already coded fexxx family functions for PPC with fpu. Here is the summary of performance improvements due this patch (measured on a POWER7 machine): Before: cos(): ITERS:9.5895e+07: TOTAL:5116.03Mcy, MAX:77.6cy, MIN:49.792cy, 18744 calls/Mcy exp(): ITERS:2.827e+07: TOTAL:5187.15Mcy, MAX:494.018cy, MIN:38.422cy, 5450.01 calls/Mcy pow(): ITERS:6.1705e+07: TOTAL:5144.26Mcy, MAX:171.95cy, MIN:29.935cy, 11994.9 calls/Mcy sin(): ITERS:8.6898e+07: TOTAL:5117.06Mcy, MAX:83.841cy, MIN:46.582cy, 16982 calls/Mcy tan(): ITERS:2.9473e+07: TOTAL:5115.39Mcy, MAX:191.017cy, MIN:172.352cy, 5761.63 calls/Mcy After: cos(): ITERS:2.05265e+08: TOTAL:5111.37Mcy, MAX:78.754cy, MIN:24.196cy, 40158.5 calls/Mcy exp(): ITERS:3.341e+07: TOTAL:5170.84Mcy, MAX:476.317cy, MIN:15.574cy, 6461.23 calls/Mcy pow(): ITERS:7.6153e+07: TOTAL:5129.1Mcy, MAX:147.5cy, MIN:30.916cy, 14847.2 calls/Mcy sin(): ITERS:1.58816e+08: TOTAL:5115.11Mcy, MAX:1490.39cy, MIN:22.341cy, 31048.4 calls/Mcy tan(): ITERS:3.4964e+07: TOTAL:5114.18Mcy, MAX:177.422cy, MIN:146.115cy, 6836.68 calls/Mcy	2013-11-25 06:34:41 -06:00
Adhemerval Zanella	7a2ad8cf39	PowerPC: Fix __fe_mask_env export This patch does not export __fe_mask_env anymore, only providing a compatibility symbol. It fixes BZ#14143.	2013-11-13 06:45:19 -06:00
Alan Modra	9c008155b7	PowerPC floating point little-endian [11 of 15] http://sourceware.org/ml/libc-alpha/2013-07/msg00202.html Another little-endian fix. * sysdeps/powerpc/fpu_control.h (_FPU_GETCW): Rewrite using 64-bit int/double union. (_FPU_SETCW): Likewise. * sysdeps/powerpc/fpu/tst-setcontext-fpscr.c (_GET_DI_FPSCR): Likewise. (_SET_DI_FPSCR, _GET_SI_FPSCR, _SET_SI_FPSCR): Likewise.	2013-10-04 10:35:21 +09:30
Alan Modra	da13146da1	PowerPC floating point little-endian [10 of 15] http://sourceware.org/ml/libc-alpha/2013-07/msg00201.html These two functions oddly test x+1>0 when a double x is >= 0.0, and similarly when x is negative. I don't see the point of that since the test should always be true. I also don't see any need to convert x+1 to integer rather than simply using xr+1. Note that the standard allows these functions to return any value when the input is outside the range of long long, but it's not too hard to prevent xr+1 overflowing so that's what I've done. (With rounding mode FE_UPWARD, x+1 can be a lot more than what you might naively expect, but perhaps that situation was covered by the x - xrf < 1.0 test.) * sysdeps/powerpc/fpu/s_llround.c (__llround): Rewrite. * sysdeps/powerpc/fpu/s_llroundf.c (__llroundf): Rewrite.	2013-10-04 10:35:10 +09:30
Alan Modra	603e84104c	PowerPC floating point little-endian [9 of 15] http://sourceware.org/ml/libc-alpha/2013-07/msg00200.html This works around the fact that vsx is disabled in current little-endian gcc. Also, float constants take 4 bytes in memory vs. 16 bytes for vector constants, and we don't need to write one lot of masks for double (register format) and another for float (mem format). * sysdeps/powerpc/fpu/s_float_bitwise.h (__float_and_test28): Don't use vector int constants. (__float_and_test24, __float_and8, __float_get_exp): Likewise.	2013-10-04 10:34:57 +09:30
Anton Blanchard	4a28b3ca4b	PowerPC floating point little-endian [8 of 15] http://sourceware.org/ml/libc-alpha/2013-07/msg00199.html Corrects floating-point environment code for little-endian. * sysdeps/powerpc/fpu/fenv_libc.h (fenv_union_t): Replace int array with long long. * sysdeps/powerpc/fpu/e_sqrt.c (__slow_ieee754_sqrt): Adjust. * sysdeps/powerpc/fpu/e_sqrtf.c (__slow_ieee754_sqrtf): Adjust. * sysdeps/powerpc/fpu/fclrexcpt.c (__feclearexcept): Adjust. * sysdeps/powerpc/fpu/fedisblxcpt.c (fedisableexcept): Adjust. * sysdeps/powerpc/fpu/feenablxcpt.c (feenableexcept): Adjust. * sysdeps/powerpc/fpu/fegetexcept.c (__fegetexcept): Adjust. * sysdeps/powerpc/fpu/feholdexcpt.c (feholdexcept): Adjust. * sysdeps/powerpc/fpu/fesetenv.c (__fesetenv): Adjust. * sysdeps/powerpc/fpu/feupdateenv.c (__feupdateenv): Adjust. * sysdeps/powerpc/fpu/fgetexcptflg.c (__fegetexceptflag): Adjust. * sysdeps/powerpc/fpu/fraiseexcpt.c (__feraiseexcept): Adjust. * sysdeps/powerpc/fpu/fsetexcptflg.c (__fesetexceptflag): Adjust. * sysdeps/powerpc/fpu/ftestexcept.c (fetestexcept): Adjust.	2013-10-04 10:34:26 +09:30
Alan Modra	765714cafc	PowerPC floating point little-endian [3 of 15] http://sourceware.org/ml/libc-alpha/2013-08/msg00083.html Further replacement of ieee854 macros and unions. These files also have some optimisations for comparison against 0.0L, infinity and nan. Since the ABI specifies that the high double of an IBM long double pair is the value rounded to double, a high double of 0.0 means the low double must also be 0.0. The ABI also says that infinity and nan are encoded in the high double, with the low double unspecified. This means that tests for 0.0L, +/-Infinity and +/-NaN need only check the high double. * sysdeps/ieee754/ldbl-128ibm/e_atan2l.c (__ieee754_atan2l): Rewrite all uses of ieee854 long double macros and unions. Simplify tests for long doubles that are fully specified by the high double. * sysdeps/ieee754/ldbl-128ibm/e_gammal_r.c (__ieee754_gammal_r): Likewise. * sysdeps/ieee754/ldbl-128ibm/e_ilogbl.c (__ieee754_ilogbl): Likewise. Remove dead code too. * sysdeps/ieee754/ldbl-128ibm/e_jnl.c (__ieee754_jnl): Likewise. (__ieee754_ynl): Likewise. * sysdeps/ieee754/ldbl-128ibm/e_log10l.c (__ieee754_log10l): Likewise. * sysdeps/ieee754/ldbl-128ibm/e_logl.c (__ieee754_logl): Likewise. * sysdeps/ieee754/ldbl-128ibm/e_powl.c (__ieee754_powl): Likewise. Remove dead code too. * sysdeps/ieee754/ldbl-128ibm/k_tanl.c (__kernel_tanl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_expm1l.c (__expm1l): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_frexpl.c (__frexpl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_isinf_nsl.c (__isinf_nsl): Likewise. Simplify. * sysdeps/ieee754/ldbl-128ibm/s_isinfl.c (___isinfl): Likewise. Simplify. * sysdeps/ieee754/ldbl-128ibm/s_log1pl.c (__log1pl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_modfl.c (__modfl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_nextafterl.c (__nextafterl): Likewise. Comment on variable precision. * sysdeps/ieee754/ldbl-128ibm/s_nexttoward.c (__nexttoward): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_nexttowardf.c (__nexttowardf): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_remquol.c (__remquol): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_scalblnl.c (__scalblnl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_scalbnl.c (__scalbnl): Likewise. * sysdeps/ieee754/ldbl-128ibm/s_tanhl.c (__tanhl): Likewise. * sysdeps/powerpc/fpu/libm-test-ulps: Adjust tan_towardzero ulps.	2013-10-04 10:32:36 +09:30
Adhemerval Zanella	ddf985af47	Update powerpc-fpu ULPs.	2013-09-11 07:38:57 -05:00
Adhemerval Zanella	3f16954315	Update powerpc-fpu ULPs.	2013-07-04 07:14:44 -05:00
Joseph Myers	9c84384cc1	Remove trailing whitespace.	2013-06-05 20:44:03 +00:00
Joseph Myers	fab7ce3f5b	Link extra-libs consistently with libc and ld.so.	2013-05-31 16:16:33 +00:00
Adhemerval Zanella	5e056687df	Update powerpc libm-test ULPs.	2013-05-28 08:39:00 -05:00
Joseph Myers	b50a71810b	Don't include expected results in libm-test test names.	2013-05-22 11:49:36 +00:00
Joseph Myers	db62a90753	Handle sincos with generic libm-test logic.	2013-05-19 14:45:41 +00:00
Adhemerval Zanella	13d3b41a36	PowerPC: fix hypot/hypotf check for -INF	2013-05-17 08:15:59 -05:00
Ryan S. Arnold	e054f49430	Add #include <stdint.h> for uint[32\|64]_t usage (except installed headers).	2013-05-16 11:32:54 -05:00
Adhemerval Zanella	e9a5e0fd54	Update powerpc libm-test ULPs	2013-05-08 15:06:56 -05:00
Adhemerval Zanella	16e616a72f	PowerPC: fix hypot/hypof FP exceptions This patch fixes hypot/hypotf spurious floating-point exceptions generate by internal operations.	2013-05-06 14:40:17 -05:00
Adhemerval Zanella	8237f48c90	Update powerpc libm-test ULPs	2013-05-03 07:33:49 -05:00
Adhemerval Zanella	cb4d54147e	Update powerpc libm-test ULPs	2013-04-30 08:51:02 -05:00
Adhemerval Zanella	7e7b6f36a9	Update powerpc libm-test ULPs	2013-04-29 12:10:03 -05:00
Adhemerval Zanella	fbbe2b9a1f	Fix e_logl (128ibm) spurious underflow	2013-03-28 09:52:48 -03:00
Adhemerval Zanella	fce14d4e9c	PowerPC: fix libm ABI issue for llroundl	2013-03-26 10:01:57 -03:00
Adhemerval Zanella	b5784d95bb	PowerPC: fix sqrtl ABI issue This patch fixes a sqrtl ABI issue when building for powerpc64.	2013-03-21 14:15:45 -03:00
Thomas Schwinge	495ded2c8c	Promote a math test for sNaN handling to the top-level.	2013-03-15 19:12:10 +01:00
Thomas Schwinge	777b0332c0	Use GCC's builtins for generating NaNs.	2013-03-15 19:10:20 +01:00
Thomas Schwinge	67e971f18f	Better distinguish between NaN/qNaN/sNaN.	2013-03-15 19:06:02 +01:00
Adhemerval Zanella	edf66e57fc	PowerPC: unify math_ldbl.h implementations This patch removes redudant definition from PowerPC specific math_ldbl, using the definitions from ieee754 math_ldbl.h.	2013-03-08 11:07:15 -03:00
Joseph Myers	6850863319	Use same installed powerpc headers for hard and soft float.	2013-03-01 22:53:27 +00:00
Joseph Myers	e97ed6ddbe	Remove bp-sym.h and BP_SYM uses from C code.	2013-02-14 13:12:02 +00:00
Joseph Myers	59981e9b0d	Adapt installed powerpc headers better for soft-float / no-FPRs.	2013-01-17 02:08:51 +00:00
Siddhesh Poyarekar	47cf227845	Update powerpc ULPs	2013-01-09 21:01:48 +05:30
Anton Blanchard	2ccdea26f2	Fix spelling errors in sysdeps/powerpc files.	2013-01-07 11:20:53 -06:00
Andreas Schwab	d643bac1f7	Fix warnings in test-powerpc-snan.c	2013-01-04 19:04:08 +01:00
Andreas Schwab	e47686e960	Update powerpc libm ULPs	2013-01-04 18:53:13 +01:00
Joseph Myers	568035b787	Update copyright notices with scripts/update-copyrights.	2013-01-02 19:05:09 +00:00
Andreas Schwab	4e6e34e6ca	Update powerpc libm-test ULPs	2012-11-23 16:57:39 +01:00
Joseph Myers	5b5b04d628	Make fma use of Dekker and Knuth algorithms use round-to-nearest (bug 14796).	2012-11-03 19:48:53 +00:00
Andreas Schwab	1818fcb781	Update powerpc libm ULPs	2012-10-31 22:20:21 +01:00
Adhemerval Zanella	28cfe84316	Fix ctan, ctanh of subnormals in round-upwards mode (bug 14328). IBM long double fixes and POWER ulps update.	2012-07-11 09:19:27 -03:00
Andreas Schwab	0abaf3e48a	Fix float range reduction problems (#14283 )	2012-07-06 23:57:20 +02:00
Adhemerval Zanella	73a68f94d6	PowerPC: Fix for POWER7 sinf/cosf This patch fixes some sinf/cosf calculations that generated unexpected underflows exceptions.	2012-06-01 10:10:18 -03:00
Andreas Schwab	e77253266a	Sort sysdeps/powerpc/fpu/libm-test-ulps	2012-06-01 01:26:27 +02:00
Joseph Myers	795405f956	Don't include exceptions in libm-test-ulps test names.	2012-05-24 12:14:03 +00:00
Adhemerval Zanella	478143fa17	PowerPC: ULPs update Adjustments for libm ulps added with commit `d8b82cad1b`, `495fd99f3a`, and `5ba3cc691c`. I also adjusted some exp10 ulps definition that was higher than needed.	2012-05-21 14:28:51 -03:00
Adhemerval Zanella	a6f1845d45	Update powerpc ULPs for ccos, csin, ccosh, csinh tests.	2012-05-19 15:46:20 +00:00
Adhemerval Zanella	31dc8730af	Fix for ldbl-128ibm acosl/asinl inaccuracies 2012-05-02 Adhemerval Zanella <azanella@linux.vnet.ibm.com> * sysdeps/ieee754/ldbl-128ibm/e_acosl.c (__ieee754_acosl): Fix long double comparison inaccuracies. * sysdeps/ieee754/ldbl-128ibm/e_asinl.c (__ieee754_asinl): * Likewise. * sysdeps/powerpc/fpu/libm-test-ulps: Update.	2012-05-04 13:06:32 +02:00
Adhemerval Zanella	0ac229c819	Fix ctan, ctanh overflow for ldbl-128ibm (bug 11521).	2012-04-26 11:18:11 -05:00
Adhemerval Zanella	94e02fc410	Correct powerpc64 s_floorl edge cases (bug 13886). [BZ #13886] Remove powerpc64/fpu/s_floorl. Use fully correct ldbl-128bim/s_floorl.c.	2012-04-24 14:21:45 -05:00
Andreas Schwab	c876e002a2	Update powerpc libm test ULPs	2012-03-26 14:20:28 +02:00

1 2 3 4

184 Commits