Ulrich Drepper
618280a192
Optimize x86-64 SSE4.2+ strcmp a bit more
2011-10-25 14:50:31 -04:00
Ulrich Drepper
d7826aa149
Use math_force_eval in more places
2011-10-25 10:52:45 -04:00
Ulrich Drepper
31ea014d8b
Use VEX encoding in inline math functions on x86-64 when possible
2011-10-25 08:17:57 -04:00
Ulrich Drepper
31d3cc00b0
Cleanup FMA4 patch
...
Move the FMA4 code into its own section. Avoid some of the duplication
of data resulting from the double use of source files.
2011-10-25 00:56:33 -04:00
Ulrich Drepper
202c9deb15
Better DLA_FMS
...
It's better to use __builtin_fma if it works. Use it for gcc 4.6 and
higher. Move the x86-64 dla.h to the correct place.
2011-10-24 22:11:21 -04:00
Ulrich Drepper
a0cf1edd4c
Use inline asm for DLA_FMS because of broken old compilers
2011-10-24 21:17:10 -04:00
Ulrich Drepper
af968f62f2
Optimize accurate 64-bit routines for FMA4 on x86-64
2011-10-24 20:19:17 -04:00
Ulrich Drepper
58985aa92f
Finish renamed DLA_FMA -> DLA_FMS
2011-10-24 14:21:18 -04:00
Ulrich Drepper
bb3129bd25
Optimized remquo for 64-bit platforms
2011-10-23 21:31:21 -04:00
Ulrich Drepper
0275fff8a1
Optimize f_mod
...
Branch prediction for the 32-bit implementation and a new optimized
64-bit implementation.
2011-10-23 20:55:26 -04:00
Ulrich Drepper
2fa2ae85ca
Fix strnlen change
2011-10-23 16:30:40 -04:00
Liubov Dmitrieva
fc2ee42abe
Add optimized wcslen and strnlen for x86-32
2011-10-23 15:17:23 -04:00
Ulrich Drepper
09229f3e1b
Fix WS
2011-10-23 14:57:28 -04:00
Liubov Dmitrieva
ce7dd29f28
Optimized strnlen and wcscmp for x86-64
2011-10-23 14:56:04 -04:00
Michael Zolotukhin
979c70a3b1
Improve x86-32 SSSE3 memcpy
2011-10-23 14:28:26 -04:00
Ulrich Drepper
f17424ed53
Fix WS
2011-10-23 13:35:24 -04:00
Liubov Dmitrieva
95584d3b33
Fix signedness in wcscmp comparison
2011-10-23 13:34:15 -04:00
Ulrich Drepper
774a2669af
Clean up FMA use
...
The macro's name should reflect that subtraction is being done. And
use __builtin_fma, it seems to work after all.
2011-10-23 13:31:01 -04:00
Ulrich Drepper
c8b3296bbe
Clean up last dla.h change
2011-10-23 12:50:28 -04:00
Andreas Schwab
246ad57ad6
Add missing _finite aliases
2011-10-23 15:32:05 +02:00
Ulrich Drepper
fb24de5932
Fix typo in last change
2011-10-22 20:09:58 -04:00
Ulrich Drepper
0d355eb7c7
Update ULPs for x86-64
2011-10-22 20:06:23 -04:00
Ulrich Drepper
fa36182bda
Improve DLA_FMA for FMA4
2011-10-22 19:53:33 -04:00
Ulrich Drepper
a1a8716924
Start using fma in the libm implementation
2011-10-22 19:02:20 -04:00
Andreas Schwab
ee2aafe08e
Fix compilation due to __nan defines
2011-10-22 14:03:26 +02:00
Ulrich Drepper
bc62c2fb15
Optimization to some complex math functions
...
Also, change last reference to __sqrt into __sqrt to easily.
2011-10-22 00:32:39 -04:00
Ulrich Drepper
c196fed8f0
Fix compilation problems in x86-64 init-arch
2011-10-21 20:47:20 -04:00
Ulrich Drepper
8ec250a484
Add some __builtin_expect to generic __ieee754_log
2011-10-21 12:28:42 -04:00
Ulrich Drepper
1a97a8c78f
Don't use NULL in last s_fma{,f} change
2011-10-21 07:39:28 -04:00
Ulrich Drepper
ed72b6545f
Check for FMA4 support and generate appropriate fma functions
2011-10-20 22:43:15 -04:00
Ulrich Drepper
8d4f46c613
Move fma routines to right place
2011-10-20 21:55:41 -04:00
Ulrich Drepper
855d156018
Optimize x86-64 rawmemchr and add test
2011-10-19 22:22:29 -04:00
Ulrich Drepper
d9a4d2ab27
Add optimized str{,n}casecmp for AVX on x86-64
2011-10-19 12:42:38 -04:00
Andreas Schwab
8f3b1ffefa
Fix PLT use for feraiseexcept on x86_64
2011-10-19 13:03:31 +02:00
Ulrich Drepper
d9a8d0abcc
Use new internal libc_fe* interfaces in more functions
2011-10-18 15:11:31 -04:00
Ulrich Drepper
4855e3ddf5
Provide combined internal feholdexcept/fesetround interface
2011-10-18 09:59:04 -04:00
Ulrich Drepper
23ce562780
Pretty print last change to x86-64 mathinline.h
2011-10-18 09:38:47 -04:00
Ulrich Drepper
581d30e386
Add optimized nearbyint{,f} for x86-64
2011-10-18 09:13:23 -04:00
Ulrich Drepper
d38f1dba00
Start optimizing the use of the fenv interfaces in libm itself
2011-10-18 09:00:46 -04:00
Andreas Schwab
83c7615c2d
Fix last change
2011-10-18 14:11:29 +02:00
Andreas Schwab
caa6c9d845
Fix linkage conflict with feraiseexcept
2011-10-18 11:46:51 +02:00
Ulrich Drepper
99ce7b04ed
Small optimization of generic exp2
2011-10-17 23:16:34 -04:00
Ulrich Drepper
1004d1821e
Add optimized 64-bit frexp
2011-10-17 22:30:44 -04:00
Ulrich Drepper
228a984d54
Relax asm requirements for recently added x86-64 math interfaces
2011-10-17 20:30:52 -04:00
Ulrich Drepper
c8553a6a6f
Makr x86-64 math_private.h more robust
2011-10-17 16:00:39 -04:00
Ulrich Drepper
ed22dcf691
Provide internal optimizations on x86-64 with SSE4.1
...
Provide macros so that the internal users can, if possible, directly use
the new instructions.
Also fix up the mathinline.h header when compiling with SSE4.1 enabled.
2011-10-17 11:23:40 -04:00
Ulrich Drepper
b171c13768
Fix last x86-64 mathinline change
...
Use correct function names.
2011-10-17 10:37:00 -04:00
Ulrich Drepper
ad0f5cad15
Use rounds{s,d} for x86 rint, ceil, floor
2011-10-16 20:58:17 -04:00
Ulrich Drepper
79b195b55a
No need for boundary case handling in x86-32 __ieee_log
2011-10-15 22:21:53 -04:00
Ulrich Drepper
d4a285692a
Optimized floor for 64-bit architectures
2011-10-15 21:44:45 -04:00