Joseph Myers
3b1004624e
Fix makefile/configure problems with sse2avx changes.
2012-01-30 19:55:15 +00:00
Ulrich Drepper
96bc5b45a6
Optimize x86-64 math inline header a bit
2012-01-28 21:20:06 -05:00
Ulrich Drepper
56f6f6a240
Use -msse2avx option for x86-64 libm functions
2012-01-28 14:48:46 -05:00
Ulrich Drepper
73139a7628
Simplify use of AVX instructions in internal math macros
2012-01-28 11:19:06 -05:00
Ulrich Drepper
08cf777f9e
Really fix AVX tests
...
There is no problem with strcmp, it doesn't use the YMM registers.
The math routines might since gcc perhaps generates such code.
Introduce bit_YMM_USBALE and use it in the math routines.
2012-01-26 09:45:54 -05:00
Ulrich Drepper
a784e50247
Remove pre-ISO C support
...
No more __const.
2012-01-07 23:57:22 -05:00
Ulrich Drepper
21eaf3a5f9
Use __REDIRECT_NTH for __feraiseexcept_renamed
2011-12-22 08:05:21 -05:00
Rafael Ávila de Espíndola
d2daaa1eb6
Define x86_64 feraiseexcept inline only under __USE_EXTERN_INLINES.
2011-12-21 13:27:09 -08:00
Ulrich Drepper
aff2453df7
Fix more warnings
2011-12-03 21:49:35 -05:00
Ulrich Drepper
a5b81e1fb7
Remove code without too much effects
...
Some of the AVX-specific code is not giving enough speed-up to
justify the extra code.
2011-10-28 16:55:01 -04:00
Ulrich Drepper
e0016b11d6
Add AVX optimized versions for some x86-64 math functions
2011-10-25 21:34:55 -04:00
Ulrich Drepper
31ea014d8b
Use VEX encoding in inline math functions on x86-64 when possible
2011-10-25 08:17:57 -04:00
Ulrich Drepper
31d3cc00b0
Cleanup FMA4 patch
...
Move the FMA4 code into its own section. Avoid some of the duplication
of data resulting from the double use of source files.
2011-10-25 00:56:33 -04:00
Ulrich Drepper
202c9deb15
Better DLA_FMS
...
It's better to use __builtin_fma if it works. Use it for gcc 4.6 and
higher. Move the x86-64 dla.h to the correct place.
2011-10-24 22:11:21 -04:00
Ulrich Drepper
af968f62f2
Optimize accurate 64-bit routines for FMA4 on x86-64
2011-10-24 20:19:17 -04:00
Ulrich Drepper
fb24de5932
Fix typo in last change
2011-10-22 20:09:58 -04:00
Ulrich Drepper
0d355eb7c7
Update ULPs for x86-64
2011-10-22 20:06:23 -04:00
Ulrich Drepper
1a97a8c78f
Don't use NULL in last s_fma{,f} change
2011-10-21 07:39:28 -04:00
Ulrich Drepper
ed72b6545f
Check for FMA4 support and generate appropriate fma functions
2011-10-20 22:43:15 -04:00
Ulrich Drepper
8d4f46c613
Move fma routines to right place
2011-10-20 21:55:41 -04:00
Andreas Schwab
8f3b1ffefa
Fix PLT use for feraiseexcept on x86_64
2011-10-19 13:03:31 +02:00
Ulrich Drepper
d9a8d0abcc
Use new internal libc_fe* interfaces in more functions
2011-10-18 15:11:31 -04:00
Ulrich Drepper
4855e3ddf5
Provide combined internal feholdexcept/fesetround interface
2011-10-18 09:59:04 -04:00
Ulrich Drepper
23ce562780
Pretty print last change to x86-64 mathinline.h
2011-10-18 09:38:47 -04:00
Ulrich Drepper
581d30e386
Add optimized nearbyint{,f} for x86-64
2011-10-18 09:13:23 -04:00
Ulrich Drepper
d38f1dba00
Start optimizing the use of the fenv interfaces in libm itself
2011-10-18 09:00:46 -04:00
Andreas Schwab
83c7615c2d
Fix last change
2011-10-18 14:11:29 +02:00
Andreas Schwab
caa6c9d845
Fix linkage conflict with feraiseexcept
2011-10-18 11:46:51 +02:00
Ulrich Drepper
228a984d54
Relax asm requirements for recently added x86-64 math interfaces
2011-10-17 20:30:52 -04:00
Ulrich Drepper
c8553a6a6f
Makr x86-64 math_private.h more robust
2011-10-17 16:00:39 -04:00
Ulrich Drepper
ed22dcf691
Provide internal optimizations on x86-64 with SSE4.1
...
Provide macros so that the internal users can, if possible, directly use
the new instructions.
Also fix up the mathinline.h header when compiling with SSE4.1 enabled.
2011-10-17 11:23:40 -04:00
Ulrich Drepper
b171c13768
Fix last x86-64 mathinline change
...
Use correct function names.
2011-10-17 10:37:00 -04:00
Ulrich Drepper
ad0f5cad15
Use rounds{s,d} for x86 rint, ceil, floor
2011-10-16 20:58:17 -04:00
Andreas Schwab
6b1f68c91f
Fix lost feraiseexcept symbol
2011-10-14 11:21:23 +02:00
Andreas Schwab
714fad23c6
Fix PLT use in feupdateenv on x86_64
2011-10-13 15:26:45 +02:00
Ulrich Drepper
0ac5ae2335
Optimize libm
...
libm is now somewhat integrated with gcc's -ffinite-math-only option
and lots of the wrapper functions have been optimized.
2011-10-12 11:27:51 -04:00
Ulrich Drepper
7edb55ce06
Optimize use of isnan, isinf, finite
2011-10-08 10:18:26 -04:00
Andreas Schwab
bf972c9dfc
Fix parse error in bits/mathinline.h with --std=c99
2011-09-26 14:01:30 +02:00
Ulrich Drepper
4c1a1f71c0
Add fmax and fmin inlines for x86-64
2011-09-15 13:11:08 -04:00
Ulrich Drepper
ee4d03150a
Use correct section to allow merging
2011-09-14 13:43:24 -04:00
Ulrich Drepper
cd20565401
Optimized lrint and llrint for x86-64
2011-09-14 12:58:43 -04:00
Petr Baudis
1248c1c415
Fix jn precision
2011-09-09 22:16:10 -04:00
H.J. Lu
8db736347c
Fix typo in x86-64 powl
2011-05-18 19:50:48 -04:00
Jakub Jelinek
9ff8d36f27
Correct implementation of fmaf.
2010-10-11 09:27:05 -04:00
Andreas Schwab
7eb22e757e
Avoid PLT call to fegetenv on s390
2010-02-09 22:34:17 -08:00
Jakub Jelinek
240441038f
Fix x86_64 bits/mathinline.h for -m32 compilation.
2009-09-01 15:30:12 -07:00
Andreas Schwab
c2735e958a
Fix parse error in bits/mathinline.h with --std=c99
2009-08-31 17:26:14 +02:00
Ulrich Drepper
65b14bcee2
Optimize out duplicated scalbln code for x86-64.
2009-08-25 16:46:34 -07:00
Ulrich Drepper
7423a3456a
Optimized signbit{,f} for x86-64.
2009-08-25 14:54:12 -07:00
Ulrich Drepper
cf00cc00bc
Add ceil implementation for 64-bit machines.
...
On 64-bit machines we should not split doubles into two 32 bit
integer and handle the words separately. We have wide registers.
This patch implements a 64-bit ceil version. Ideally all other
functions will be converted over time.
2009-08-24 18:05:48 -07:00