glibc/sysdeps/powerpc/powerpc64
Adhemerval Zanella c9cd7b0ce5 powerpc: POWER8 memcpy optimization for cached memory
On POWER8, unaligned memory accesses to cached memory has little impact
on performance as opposed to its ancestors.

It is disabled by default and will only be available when the tunable
glibc.tune.cached_memopt is set to 1.

                 __memcpy_power8_cached      __memcpy_power7
============================================================
    max-size=4096:     33325.70 ( 12.65%)        38153.00
    max-size=8192:     32878.20 ( 11.17%)        37012.30
   max-size=16384:     33782.20 ( 11.61%)        38219.20
   max-size=32768:     33296.20 ( 11.30%)        37538.30
   max-size=65536:     33765.60 ( 10.53%)        37738.40

	* manual/tunables.texi (Hardware Capability Tunables): Document
	glibc.tune.cached_memopt.
	* sysdeps/powerpc/cpu-features.c: New file.
	* sysdeps/powerpc/cpu-features.h: New file.
	* sysdeps/powerpc/dl-procinfo.c [!IS_IN(ldconfig)]: Add
	_dl_powerpc_cpu_features.
	* sysdeps/powerpc/dl-tunables.list: New file.
	* sysdeps/powerpc/ldsodefs.h: Include cpu-features.h.
	* sysdeps/powerpc/powerpc32/power4/multiarch/init-arch.h
	(INIT_ARCH): Initialize use_aligned_memopt.
	* sysdeps/powerpc/powerpc64/dl-machine.h [defined(SHARED &&
	IS_IN(rtld))]: Restrict dl_platform_init availability and
	initialize CPU features used by tunables.
	* sysdeps/powerpc/powerpc64/multiarch/Makefile (sysdep_routines):
	Add memcpy-power8-cached.
	* sysdeps/powerpc/powerpc64/multiarch/ifunc-impl-list.c: Add
	__memcpy_power8_cached.
	* sysdeps/powerpc/powerpc64/multiarch/memcpy.c: Likewise.
	* sysdeps/powerpc/powerpc64/multiarch/memcpy-power8-cached.S:
	New file.

Reviewed-by: Rajalakshmi Srinivasaraghavan  <raji@linux.vnet.ibm.com>
2017-12-11 17:39:42 -02:00
..
970 powerpc: Re-work the Implies structure 2010-06-30 09:57:38 -07:00
a2 PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
bits Define wordsize.h macros everywhere 2016-11-04 09:37:44 -07:00
cell PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
fpu Use libm_alias_float for powerpc. 2017-12-05 00:26:26 +00:00
multiarch powerpc: POWER8 memcpy optimization for cached memory 2017-12-11 17:39:42 -02:00
power4 Do not use __ptr_t. 2017-08-08 17:14:49 +00:00
power5 PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
power5+ Use libm_alias_float for powerpc. 2017-12-05 00:26:26 +00:00
power6 Use libm_alias_float for powerpc. 2017-12-05 00:26:26 +00:00
power6x Use libm_alias_float for powerpc. 2017-12-05 00:26:26 +00:00
power7 powerpc: Replace lxvd2x/stxvd2x with lvx/stvx in P7's memcpy/memmove 2017-10-25 13:14:30 -02:00
power8 Use libm_alias_float for powerpc. 2017-12-05 00:26:26 +00:00
power9 PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
__longjmp-common.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
__longjmp.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
addmul_1.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
atomic-machine.h Optimize generic spinlock code and use C11 like atomic macros. 2017-06-06 09:41:56 +02:00
backtrace.c Rename struct ucontext tag (bug 21457). 2017-06-26 22:03:58 +00:00
bsd-_setjmp.S PowerPC64 ABI fixes 2010-08-12 09:19:19 -07:00
bsd-setjmp.S PowerPC64 ABI fixes 2010-08-12 09:19:19 -07:00
bzero.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
configure powerpc64 configure message 2015-03-25 15:45:36 +10:30
configure.ac powerpc64 configure message 2015-03-25 15:45:36 +10:30
crti.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
crtn.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
dl-dtprocnum.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
dl-irel.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
dl-machine.c PowerPC64 ELFv2 PPC64_OPT_LOCALENTRY 2017-06-14 10:47:25 +09:30
dl-machine.h powerpc: POWER8 memcpy optimization for cached memory 2017-12-11 17:39:42 -02:00
dl-trampoline.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
entry.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
ffsll.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
hp-timing.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
Implies Revert "Use ieee754/dbl-64/wordsize-64 on powerpc64" 2013-01-10 10:44:05 +01:00
lshift.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
Makefile Remove HP_TIMING_DIFF_INIT and dl_hp_timing_overhead 2014-07-03 08:38:25 -07:00
memcpy.S Do not use __ptr_t. 2017-08-08 17:14:49 +00:00
memset.S Do not use __ptr_t. 2017-08-08 17:14:49 +00:00
mul_1.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
ppc-mcount.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
register-dump.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
rtld-memset.c powerpc: Use generic memset for RTLD for ppc32/64 2010-09-29 12:21:14 -04:00
setjmp-common.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
setjmp.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
stackguard-macros.h PowerPC: Fix POINTER_CHK_GUARD thread register for PPC64 2013-09-25 13:43:04 -05:00
start.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
strchr.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
strcmp.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
strlen.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
strncmp.S PowerPC64 ENTRY_TOCLESS 2017-06-14 10:45:50 +09:30
submul_1.S Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
sysdep.h Add elision tunables 2017-12-05 17:48:48 -02:00
tls-macros.h tst-tlsopt-powerpc as a shared lib 2017-08-03 15:39:21 +09:30
tst-audit.h Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00