glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-30 00:31:08 +00:00

Author	SHA1	Message	Date
Chris Metcalf	3c87c6167e	tilegx: enable wordsize-64 support for ieee754 dbl-64. I missed this during the initial port. Some testing shows that enabling this mode does, unsurprisingly, yield some nice speedups on the math functions in question.	2014-12-23 14:04:35 -05:00
Chris Metcalf	0dacd7a3b9	tilegx: remove implicit boolean conversion in strstr. [BZ #17746] The __builtin_expect() truncated a uint64_t to a 32-bit long in ILP32 mode, discarding the high 32 bits, and potentially missing the NUL terminator that we were searching for with SIMD operations. Explicitly compare to zero to fix the problem.	2014-12-22 14:50:26 -05:00
Chris Metcalf	95dee05f17	tilegx: fix strstr to build and link better The two_way_short_needle() routine included from str-two-way.h is not used, so mark it so to avoid compiler warnings. Calling strnlen() breaks linknamespace tests, so change it to __strnlen().	2014-12-19 22:54:35 -05:00
Chris Metcalf	f627ca82fb	tile: add inhibit_loop_to_libcall to string functions Without this, on gcc 4.8.2 the built glibc crashes when memcpy or memset are invoked, since they call themselves recursively. See commit `85c2e6110c` for the generic inhibit_loop_to_libcall.	2014-12-11 15:13:48 -05:00
Torvald Riegel	1ea339b697	Add arch-specific configuration for C11 atomics support. This sets __HAVE_64B_ATOMICS if provided. It also sets USE_ATOMIC_COMPILER_BUILTINS to true if the existing atomic ops use the __atomic* builtins (aarch64, mips partially) or if this has been tested (x86_64); otherwise, this is set to false so that C11 atomics will be based on the existing atomic operations.	2014-11-20 11:57:38 +01:00
Chris Metcalf	563a74d86c	tile: fix copyright header blocks in just-committed files I accidentally committed versions not following the conventions.	2014-10-06 13:47:02 -04:00
Chris Metcalf	c86f7b80f4	tilegx: provide optimized strnlen, strstr, and strcasestr strnlen() is based on the existing tile strlen() with length checking added. It speeds up by up to 5x, but on average across the benchtest corpus by around 35%. No regressions are seen. strstr() does 8-byte aligned loads and compares using a 2-byte filter on the first two bytes of the needle and then testing the remaining bytes in needle using memcmp(). It speeds up about 5x in the best case (for "found" needles), about 2x looking at benchtest as a whole, with some slowdowns as much as 45% on a few cases (including the "fail" case for 128KB search). strcasestr() is based on strstr() but uses a SIMD tolower routine to convert 8 bytes to lower case in 5 instructions. It also uses a 2-byte filter and then strncasecmp() for the remaining bytes. strncasecmp() is not optimized for SIMD, so there is further room for improvement. However, it is still up to 16x faster for "found" needles, averaging 2x faster on the whole corpus of benchtests. It does slow down by up to 35% on a few cases, similarly to strstr().	2014-10-06 11:19:18 -04:00
Chris Metcalf	1c4c1a6f4d	tilegx: optimize string copy_byte() internal function We can use one "shufflebytes" instruction instead of 3 "bfins" instructions to optimize the string functions.	2014-10-06 11:18:41 -04:00
Siddhesh Poyarekar	64df73c2ea	Fix Wundef warning for MEMCPY_OK_FOR_FWD_MEMMOVE Define MEMCPY_OK_FOR_FWD_MEMMOVE in memcopy.h and let arch-specific implementations of that file override the value if necessary. This override is only useful for tile and moving this macro to memcopy.h allows us to remove the tile-specific memmove.c.	2014-06-28 06:05:24 +05:30
Chris Metcalf	4372980f58	Move tilegx, tilepro, and linux-generic from ports to libc. I've moved the TILE-Gx and TILEPro ports to the main sysdeps hierarchy, along with the linux-generic ports infrastructure. Beyond the README update, the move was just `git mv ports/sysdeps/tile sysdeps/tile`, `git mv ports/sysdeps/unix/sysv/linux/tile sysdeps/unix/sysv/linux/tile`, and `git mv ports/sysdeps/unix/sysv/linux/generic sysdeps/unix/sysv/linux/generic`. I updated the relevant ChangeLogs along the lines of the ARM move in commit `c6bfe5c4d7` and tested the 64-bit tilegx build to confirm that there were no changes in "objdump -dr" output in the shared objects.	2014-02-10 11:04:39 -05:00
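
The truncation described in commit `0dacd7a3b9` (BZ #17746) can be illustrated with a small portable sketch. This is not the tilegx source: the type `ilp32_long` and the two function names are hypothetical, and `uint32_t` stands in for the 32-bit width that `long` has in ILP32 mode so the effect is visible on any host. The point is only that narrowing a 64-bit match mask before testing it loses any bits set in the upper half, while comparing to zero at full width does not.

```c
#include <stdint.h>

/* Hypothetical stand-in for the 32-bit width of `long` on an ILP32 target. */
typedef uint32_t ilp32_long;

/* Buggy pattern: the 64-bit mask is narrowed to 32 bits before it is
   tested, which is what happens when a uint64_t is passed directly to a
   builtin taking `long` arguments on ILP32. High-half bits are lost. */
static int has_match_truncated(uint64_t mask)
{
    ilp32_long narrowed = (ilp32_long)mask;
    return narrowed != 0;
}

/* Fixed pattern: compare to zero at full 64-bit width, so only the
   resulting 0/1 value is ever narrowed. */
static int has_match_explicit(uint64_t mask)
{
    return mask != 0;
}
```

With a mask whose only set bit lies in the upper 32 bits, such as `0x0000008000000000`, the first function reports no match and the second reports a match, which mirrors how the NUL terminator could be missed.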

10 Commits
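
Commit `c86f7b80f4` mentions a SIMD tolower routine that lower-cases 8 bytes at once. The tilegx version uses that architecture's byte-wise SIMD instructions, which are not reproduced here. As a rough illustration of the same idea, the sketch below uses plain 64-bit integer arithmetic (a "SIMD within a register" approach). The function name `tolower8` is hypothetical, it handles ASCII only, and it makes no claim to match the instruction count quoted in the commit message.

```c
#include <stdint.h>

/* Lower-case each of the 8 bytes packed in w, ASCII only.
   Bytes outside 'A'..'Z', including bytes with the high bit set,
   are returned unchanged. */
static uint64_t tolower8(uint64_t w)
{
    const uint64_t ones = 0x0101010101010101ULL;

    /* Clear bit 7 of every byte so the per-byte additions below
       cannot carry into the neighbouring byte. */
    uint64_t low7 = w & (0x7f * ones);

    /* Bit 7 of each byte becomes set iff that byte is >= 'A'. */
    uint64_t ge_A = low7 + (uint64_t)(0x80 - 'A') * ones;

    /* Bit 7 of each byte becomes set iff that byte is > 'Z'. */
    uint64_t gt_Z = low7 + (uint64_t)(0x80 - 'Z' - 1) * ones;

    /* Keep bit 7 only where 'A' <= byte <= 'Z' and the original
       byte had bit 7 clear (that is, it was plain ASCII). */
    uint64_t is_upper = ge_A & ~gt_Z & ~w & (0x80 * ones);

    /* 0x80 >> 2 == 0x20, the ASCII case bit, and it stays in the same byte. */
    return w | (is_upper >> 2);
}
```

A caller would typically load 8 bytes from the haystack with `memcpy` into a `uint64_t`, apply `tolower8`, and then compare the result against a pre-lowered needle prefix.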