glibc/sysdeps/aarch64
Wilco Dijkstra f46ef33ad1 AArch64: Improve strlen_asimd performance (bug 25824)
Optimize strlen using a mix of scalar and SIMD code.  On modern micro
architectures large strings are 2.6 times faster than existing
strlen_asimd and 35% faster than the new MTE version of strlen.

On a random strlen benchmark using small sizes the speedup is 7% vs
strlen_asimd and 40% vs the MTE strlen.  This fixes the main strlen
regressions on Cortex-A53 and other cores with a simple Neon unit.

Rename __strlen_generic to __strlen_mte, and select strlen_asimd when
MTE is not enabled (this is waiting on support for a HWCAP_MTE bit).

This fixes big-endian bug 25824. Passes GLIBC regression tests.

Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>
2020-07-17 15:07:23 +01:00
..
bits Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
fpu aarch64: Remove fpu Makefile 2020-06-22 11:09:50 -03:00
multiarch AArch64: Improve strlen_asimd performance (bug 25824) 2020-07-17 15:07:23 +01:00
nptl nptl: add missing pthread-offsets.h 2020-02-10 17:01:21 +01:00
sys Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
__longjmp.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
abort-instr.h
atomic-machine.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
bsd-_setjmp.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
bsd-setjmp.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
configure aarch64: configure check for pac-ret code generation 2020-07-08 15:02:38 +01:00
configure.ac aarch64: configure check for pac-ret code generation 2020-07-08 15:02:38 +01:00
crti.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
crtn.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-bti.c aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
dl-dtprocnum.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
dl-irel.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
dl-link.sym
dl-machine.h aarch64: fix RTLD_START for BTI 2020-07-08 15:02:37 +01:00
dl-prop.h aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
dl-sysdep.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
dl-tls.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
dl-tlsdesc.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
dl-tlsdesc.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-trampoline.S aarch64: Add pac-ret support to assembly files 2020-07-08 15:02:38 +01:00
dl-tunables.list Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
e_sqrtl.c Add libm_alias_finite for _finite symbols 2020-01-03 10:02:04 -03:00
elf-initfini.h Introduce <elf-initfini.h> and ELF_INITFINI for all architectures 2020-02-18 15:12:25 +01:00
Implies Remove sysdeps/aarch64/soft-fp directory. 2018-05-22 17:23:34 +00:00
jmpbuf-offsets.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
jmpbuf-unwind.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
ldsodefs.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
libc-tls.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
libm-test-ulps Update aarch64 libm-test-ulps 2020-04-08 13:52:44 -03:00
libm-test-ulps-name Do not hardcode platform names in manual/libm-err-tab.pl (bug 14139). 2016-11-04 16:49:06 +00:00
linkmap.h aarch64: enable BTI at runtime 2020-07-08 15:02:37 +01:00
machine-gmon.h aarch64: fix pac-ret support in _mcount 2020-07-08 15:02:38 +01:00
Makefile aarch64: ensure objects are BTI compatible 2020-07-08 15:02:38 +01:00
math-tests-trap.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
mcount.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
memchr.S aarch64: MTE compatible memchr 2020-06-23 17:55:39 +01:00
memcmp.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
memcpy.S [AArch64] Improve integer memcpy 2020-03-11 17:15:25 +00:00
memmove.c aarch64: Rename place holder .S files to .c 2020-07-08 15:02:37 +01:00
memrchr.S aarch64: MTE compatible memrchr 2020-06-23 17:55:39 +01:00
memset-reg.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
memset.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
memusage.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
preconfigure Remove relro configure test. 2014-06-27 16:51:22 +00:00
rawmemchr.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
setjmp.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
sfp-machine.h Break more lines before not after operators. 2019-02-25 13:19:19 +00:00
sotruss-lib.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
stackinfo.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
start.S aarch64: Add BTI support to assembly files 2020-07-08 15:02:37 +01:00
stpcpy.S Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
strchr.S aarch64: MTE compatible strchr 2020-06-09 09:20:27 +01:00
strchrnul.S aarch64: MTE compatible strchrnul 2020-06-09 09:20:27 +01:00
strcmp.S aarch64: MTE compatible strcmp 2020-06-23 17:55:39 +01:00
strcpy.S aarch64: MTE compatible strcpy 2020-06-23 17:55:39 +01:00
string_private.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
strlen.S aarch64: MTE compatible strlen 2020-06-09 09:21:11 +01:00
strncmp.S aarch64: MTE compatible strncmp 2020-06-23 17:55:39 +01:00
strnlen.S aarch64: fix strcpy and strnlen for big-endian [BZ #25824] 2020-05-15 12:15:56 +01:00
strrchr.S aarch64: MTE compatible strrchr 2020-06-23 17:55:39 +01:00
sysdep.h AArch64: Align ENTRY to a cacheline 2020-07-15 16:50:02 +01:00
tls-macros.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
tlsdesc.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
tlsdesc.sym aarch64: Use explicit offsets in _dl_tlsdesc_dynamic 2016-12-02 16:52:57 +01:00
tst-audit.h Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
tst-ifunc-arg-1.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
tst-ifunc-arg-2.c Update copyright dates with scripts/update-copyrights. 2020-01-01 00:14:33 +00:00
Versions