glibc/sysdeps
Siddhesh Poyarekar 70c97f8493 aarch64,falkor: Ignore prefetcher hints for memmove tail
The tail of the copy loops are unable to train the falkor hardware
prefetcher because they load from a different base compared to the hot
loop.  In this case avoid serializing the instructions by loading them
into different registers.  Also peel the last iteration of the loop
into the tail (and have them use different registers) since it gives
better performance for medium sizes.

This results in performance improvements of between 3% and 20% over
the current falkor implementation for sizes between 128 bytes and 1K
on the memmove-walk benchmark, thus mostly covering the regressions
seen against the generic memmove.

	* sysdeps/aarch64/multiarch/memmove_falkor.S
	(__memmove_falkor): Use multiple registers to move data in
	loop tail.
2018-05-11 00:08:02 +05:30
..
aarch64 aarch64,falkor: Ignore prefetcher hints for memmove tail 2018-05-11 00:08:02 +05:30
alpha Move math_opt_barrier, math_force_eval to separate math-barriers.h. 2018-05-09 19:45:47 +00:00
arm Drop fpregset unused symbol exposition 2018-04-20 01:27:13 +02:00
generic Move math_check_force_underflow macros to separate math-underflow.h. 2018-05-10 00:53:04 +00:00
gnu Increase robustness of internal dlopen() by using RTLD_NOW [BZ #22766] 2018-04-26 10:41:43 -03:00
hppa Update hppa libm-test-ulps 2018-04-20 15:36:41 -03:00
htl hurd: Avoid exposing all <sched.h> symbols from sys/types.h 2018-04-19 20:24:36 +02:00
hurd hurd: Fix hurd installed headers test 2018-04-20 00:16:40 +02:00
i386 Move math_opt_barrier, math_force_eval to separate math-barriers.h. 2018-05-09 19:45:47 +00:00
ia64 elf: Unify symbol address run-time calculation [BZ #19818] 2018-04-04 23:09:37 +01:00
ieee754 Move math_check_force_underflow macros to separate math-underflow.h. 2018-05-10 00:53:04 +00:00
init_array sysdeps/init_array: Add PREINIT_FUNCTION to crti.S 2018-01-29 10:22:26 -08:00
m68k Move math_opt_barrier, math_force_eval to separate math-barriers.h. 2018-05-09 19:45:47 +00:00
mach Ignore absolute symbols in ABI tests. 2018-05-04 15:46:32 +00:00
microblaze elf: Unify symbol address run-time calculation [BZ #19818] 2018-04-04 23:09:37 +01:00
mips Drop fpregset unused symbol exposition 2018-04-20 01:27:13 +02:00
nios2 Update sysdeps/nios2/libm-test-ulps 2018-04-19 12:00:58 -07:00
nptl Fix comment typo 2018-05-08 14:59:13 +02:00
posix Consolidate Linux readdir{64}{_r} implementation 2018-04-06 17:49:02 -03:00
powerpc Move math_check_force_underflow macros to separate math-underflow.h. 2018-05-10 00:53:04 +00:00
pthread hurd: fix sigevent's sigev_notify_attributes field type 2018-04-19 21:43:44 +02:00
riscv elf: Unify symbol address run-time calculation [BZ #19818] 2018-04-04 23:09:37 +01:00
s390 S390: Regenerate ULPs. 2018-04-05 16:24:06 +02:00
sh elf: Unify symbol address run-time calculation [BZ #19818] 2018-04-04 23:09:37 +01:00
sparc Update SPARC libm-test-ulps. 2018-04-05 17:10:03 -03:00
unix Ignore absolute symbols in ABI tests. 2018-05-04 15:46:32 +00:00
wordsize-32 Use libc_hidden_* for strtoumax (bug 15105). 2018-02-28 14:16:21 +00:00
wordsize-64 Use libc_hidden_* for strtoumax (bug 15105). 2018-02-28 14:16:21 +00:00
x86 Move math_check_force_underflow macros to separate math-underflow.h. 2018-05-10 00:53:04 +00:00
x86_64 x86-64/memset: Mark the debugger symbol as hidden 2018-05-07 11:01:48 -07:00