glibc/sysdeps/aarch64/multiarch
Anton Youdkevitch 32e902a94e aarch64: thunderx2 memmove performance improvements
The performance improvement is about 20%-30% for
larger cases and about 1%-5% for smaller cases.

Used SIMD load/store instead of GPR for large
overlapping forward moves.

Reused existing memcpy implementation for smaller
or overlapping backward moves.

Fixed the existing memcpy implementation to allow it
to deal with the overlapping case.

Simplified loop tails in the memcpy implementation -
use branchless overlapping sequence of fixed length
load/stores instead of branching depending on the
size.

A cleanup/optimization converting str's to stp's.

Added __memmove_thunderx2 to the list of the
available implementations.
2019-05-03 11:01:34 -07:00
..
ifunc-impl-list.c aarch64: thunderx2 memmove performance improvements 2019-05-03 11:01:34 -07:00
init-arch.h Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
Makefile aarch64: Optimized memchr specific to AmpereComputing emag 2019-02-01 08:14:21 -05:00
memchr_generic.S aarch64: Optimized memchr specific to AmpereComputing emag 2019-02-01 08:14:21 -05:00
memchr_nosimd.S aarch64: Optimized memchr specific to AmpereComputing emag 2019-02-01 08:14:21 -05:00
memchr.c aarch64: Optimized memchr specific to AmpereComputing emag 2019-02-01 08:14:21 -05:00
memcpy_falkor.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memcpy_generic.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memcpy_thunderx2.S aarch64: thunderx2 memmove performance improvements 2019-05-03 11:01:34 -07:00
memcpy_thunderx.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memcpy.c [AArch64] Add ifunc support for Ares 2019-01-09 10:35:34 +00:00
memmove_falkor.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memmove.c aarch64: thunderx2 memmove performance improvements 2019-05-03 11:01:34 -07:00
memset_base64.S aarch64: Optimized memset specific to AmpereComputing emag 2019-02-01 07:59:18 -05:00
memset_emag.S aarch64: Optimized memset specific to AmpereComputing emag 2019-02-01 07:59:18 -05:00
memset_falkor.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memset_generic.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
memset.c aarch64: Optimized memset specific to AmpereComputing emag 2019-02-01 07:59:18 -05:00
rtld-memset.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
strlen_asimd.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
strlen_generic.S Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00
strlen.c Update copyright dates with scripts/update-copyrights. 2019-01-01 00:11:28 +00:00