glibc/sysdeps/powerpc/powerpc64/power8
Adhemerval Zanella 71ae86478e PowerPC: memset optimization for POWER8/PPC64
This patch adds an optimized memset implementation for POWER8.  For
sizes from 0 to 255 bytes, a word/doubleword algorithm similar to
the POWER7-optimized one is used.

For sizes larger than 255 bytes, two strategies are used:

1. If the constant is nonzero, the memory is written with
   Altivec vector instructions;

2. If the constant is 0, dcbz instructions are used.  The loop is
   unrolled to clear 512 bytes at a time.
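
The size and value dispatch described above can be sketched in portable C.
This is an illustration only, not the code in memset.S: the names
sketch_memset, store_16 and zero_cache_line are hypothetical, the helpers
stand in for a 16-byte vector store and for dcbz, and the 128-byte cache
block size is an assumption about the target.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define SMALL_LIMIT  255  /* threshold taken from the commit message */
#define CACHE_LINE   128  /* assumption: dcbz clears one 128-byte block */
#define UNROLL_BYTES 512  /* commit message: 512 bytes cleared per iteration */

/* Hypothetical stand-in for dcbz: zero one cache-line-aligned block. */
static void zero_cache_line(unsigned char *p)
{
    memset(p, 0, CACHE_LINE);
}

/* Hypothetical stand-in for one 16-byte vector store. */
static void store_16(unsigned char *p, const unsigned char *pattern)
{
    memcpy(p, pattern, 16);
}

void *sketch_memset(void *dst, int c, size_t n)
{
    unsigned char *p = dst;
    unsigned char byte = (unsigned char)c;

    if (n <= SMALL_LIMIT) {
        /* Small sizes: replicate the byte into a doubleword and store it. */
        uint64_t dw = 0x0101010101010101ULL * byte;
        while (n >= 8) { memcpy(p, &dw, 8); p += 8; n -= 8; }
    } else if (byte != 0) {
        /* Large, nonzero value: align to 16 bytes, then vector-sized stores. */
        unsigned char pattern[16];
        memset(pattern, byte, sizeof pattern);
        while (((uintptr_t)p & 15) != 0) { *p++ = byte; n--; }
        while (n >= 16) { store_16(p, pattern); p += 16; n -= 16; }
    } else {
        /* Large, zero value: align to a cache line, then clear four
           lines (512 bytes) per iteration of the unrolled loop. */
        while (((uintptr_t)p & (CACHE_LINE - 1)) != 0) { *p++ = 0; n--; }
        while (n >= UNROLL_BYTES) {
            zero_cache_line(p);
            zero_cache_line(p + CACHE_LINE);
            zero_cache_line(p + 2 * CACHE_LINE);
            zero_cache_line(p + 3 * CACHE_LINE);
            p += UNROLL_BYTES;
            n -= UNROLL_BYTES;
        }
        while (n >= CACHE_LINE) { zero_cache_line(p); p += CACHE_LINE; n -= CACHE_LINE; }
    }

    /* Remaining tail bytes for every path. */
    while (n > 0) { *p++ = byte; n--; }
    return dst;
}
```

The alignment loops are safe because both large paths are entered only
when n is greater than 255, which exceeds the largest possible alignment
adjustment (127 bytes).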

Using vector instructions increases throughput considerably, doubling
performance for sizes larger than 1024 bytes.  Unrolling the dcbz loop
also improves performance, doubling throughput for sizes larger than
8192 bytes.
2014-09-10 07:39:46 -04:00
..
fpu PowerPC: Fix --disable-multi-arch builds 2014-04-09 06:22:53 -05:00
multiarch PowerPC: Adjust multiarch Implies for PowerPC64 2013-12-13 14:29:27 -05:00
Implies PowerPC: Enable POWER8 platform sans hwcap bits. 2013-06-24 15:33:32 -05:00
memset.S PowerPC: memset optimization for POWER8/PPC64 2014-09-10 07:39:46 -04:00