H.J. Lu
b416a90085
Missing comma in last commit.
2010-08-26 13:18:46 -07:00
Roland McGrath
8b2b771538
Clean up warnings in new x86_64/multiarch code.
2010-08-25 12:13:08 -07:00
H.J. Lu
e73015f2d6
Unroll 32bit SSE strlen and handle slow bsf
2010-08-25 10:07:37 -07:00
Ulrich Drepper
1cdfe7242f
Add missing copyright year updated and pretty printing.
2010-08-24 11:42:19 -07:00
Richard Henderson
73f27d5e72
Clean up SSE variable shifts
2010-08-24 11:35:01 -07:00
Ulrich Drepper
9da4bb316f
Fix two typos in x86-64 SSE4.2 strncasecmp implementation.
2010-08-19 09:20:44 -07:00
Ulrich Drepper
1feccb6caf
Fix fourth parameter of SSE4.2 strcmp for x86-64.
2010-08-15 20:46:09 -07:00
Ulrich Drepper
28c90b2cf5
Use correct register for fourth parameter of x86-64 strncasecmp_l.
2010-08-15 17:42:12 -07:00
Ulrich Drepper
25244f174f
Undo inccorect change.
2010-08-15 10:34:33 -07:00
Ulrich Drepper
e9f82e0d1d
Add optimized strncasecmp versions for x86-64.
2010-08-14 22:04:01 -07:00
Ulrich Drepper
ca6bb004eb
Fix x86-64 build without multiarch.
2010-08-14 14:56:32 -07:00
Andi Kleen
d22e4cc939
x86: Add support for frame pointer less mcount
2010-08-07 21:24:05 -07:00
Ulrich Drepper
73507d3ae0
Add support for SSSE3 and SSE4.2 versions of strcasecmp on x86-64.
2010-07-31 21:41:09 -07:00
Ulrich Drepper
66f6765a47
Pretty printing x86-64 SSE4.3 strcmp.
2010-07-30 12:54:37 -07:00
Ulrich Drepper
42e08a5438
Implement optimized strcaecmp for x86-64.
2010-07-30 00:14:04 -07:00
Ulrich Drepper
fe36dd025e
Fix tolower operation in strcasestr.
2010-07-30 00:09:07 -07:00
Ulrich Drepper
880113d91e
Avoid compiling unneeded file in ld.so.
2010-07-27 21:12:59 -07:00
Ulrich Drepper
24fb0f88ed
Add optimized x86-64 implementation of strnlen.
...
While at it, beef up the test suite for strnlen and add performance
tests for it, too.
2010-07-26 08:37:08 -07:00
Ulrich Drepper
8e96b93aa7
Speed up x86-64 strcasestr a bit moew.
...
Using the new SSE4.2 instructions is cool but not really the fastest.
Some older SSE instructions can do the trick faster.
2010-07-24 08:34:44 -07:00
Andreas Schwab
f6a31e0eb6
Add strcasestr-nonascii to i386 build
2010-07-21 07:26:18 -07:00
Ulrich Drepper
d02dc4ba08
Fix non-ASCII case of SSE4.2 strcasstr.
2010-07-16 16:00:22 -07:00
Ulrich Drepper
cc9f2e47a0
Speed up SSE4.2 strcasestr by avoiding indirect function call.
2010-07-16 15:37:38 -07:00
H.J. Lu
6fb8cbcb58
Improve 64bit memcpy/memmove for Atom, Core 2 and Core i7
...
This patch includes optimized 64bit memcpy/memmove for Atom, Core 2 and
Core i7. It improves memcpy by up to 3X on Atom, up to 4X on Core 2 and
up to 1X on Core i7. It also improves memmove by up to 3X on Atom, up to
4X on Core 2 and up to 2X on Core i7.
2010-06-30 08:26:11 -07:00
H.J. Lu
3c88fe1e3a
Incorrect x86 CPU family and model check.
2010-05-27 11:14:18 -07:00
Ulrich Drepper
94a27fabeb
Whitespace fix.
2010-04-14 22:29:51 -07:00
H.J. Lu
a11ec63713
Add x86-32 FMA support
2010-04-14 22:27:59 -07:00
H.J. Lu
df87f54923
Check DATA_CACHE_SIZE_HALF
2010-04-14 22:18:27 -07:00
H.J. Lu
dd37cd1a12
Optimie x86-64 SSE4 memcmp for unaligned data.
2010-04-14 17:53:44 -07:00
H.J. Lu
404a6e3201
x86-64 SSE4 optimized memcmp
...
This is 64bit SSE4 optimized memcmp. It improves memcmp by upto 3X
on Intel Core i7.
2010-04-14 00:12:53 -07:00
Ulrich Drepper
bbbdd77809
Update x86-64 cpu multiarch selection header.
2010-04-13 19:17:10 -07:00
Ulrich Drepper
22f4f44b67
Fix concurrent handling of __cpu_features.
2010-04-04 00:25:46 -07:00
H.J. Lu
7d9335ecd7
Don't define __strpbrk_sse42 in static library
2010-03-24 12:16:24 -07:00
Richard Guenther
e39acb1f16
Fix R_X86_64_PC32 overflow detection
2010-03-04 19:33:41 -08:00
Ulrich Drepper
4a1297d761
We can use the 64-bit register versions of the double functions.
2010-02-24 20:00:30 -08:00
Andreas Schwab
7eb22e757e
Avoid PLT call to fegetenv on s390
2010-02-09 22:34:17 -08:00
Ulrich Drepper
f69190e74a
Prevent silent errors should x86-64 strncmp be needed outside libc.
2010-01-14 08:09:32 -08:00
H.J. Lu
5a7af22fbb
Unroll the loop x86-64 SSE4.2 strlen.
2010-01-13 07:51:48 -08:00
H.J. Lu
3af48cbdfa
Optimize 32bit memset/memcpy with SSE2/SSSE3.
2010-01-12 11:22:03 -08:00
H.J. Lu
2510d01ddb
Define bit_SSE2 and index_SSE2.
2009-12-13 15:23:02 -08:00
H.J. Lu
51ddd2c01e
Define bit_XXX and index_XXX.
...
This patch defines bit_XXX and index_XXX and use them to check processor
feature in assembly code. It can prevent typos in processor feature
check.
2009-12-13 09:47:02 -08:00
Ulrich Drepper
823bc6da65
Fix whitespaces.
2009-10-22 22:50:00 -07:00
H.J. Lu
001659f4d5
Implement SSE4.2 optimized strchr and strrchr.
2009-10-22 22:47:12 -07:00
Roland McGrath
b0f3a2e43f
Clean up unnecessary libc_hidden_builtin_def fiddling in x86 multiarch definitions.
2009-10-06 20:01:23 -07:00
Roland McGrath
9d6982d5d2
Clean up x86 multiarch HAS_FOO macros.
2009-10-06 19:59:03 -07:00
Roland McGrath
7967983fd4
configure tweaks, support $libc_add_on_config_subdirs
2009-09-15 14:14:42 -07:00
Jakub Jelinek
22bb992d51
Fix strstr/strcasestr/fma/fmaf on x86_64.
2009-09-02 19:43:04 -07:00
Jakub Jelinek
240441038f
Fix x86_64 bits/mathinline.h for -m32 compilation.
2009-09-01 15:30:12 -07:00
Andreas Schwab
c2735e958a
Fix parse error in bits/mathinline.h with --std=c99
2009-08-31 17:26:14 +02:00
H.J. Lu
5a4eb7282e
Remove ENABLE_SSSE3_ON_ATOM.
...
It turns that SSSE3 isn't slow on Atom. The problem is bsf. This patch
removes ENABLE_SSSE3_ON_ATOM.
2009-08-28 14:54:46 -07:00
Ulrich Drepper
65b14bcee2
Optimize out duplicated scalbln code for x86-64.
2009-08-25 16:46:34 -07:00