glibc/sysdeps/x86_64/multiarch
Noah Goldstein 330881763e x86: Optimize memchr-evex.S and implement with VMM headers
Optimizations are:

1. Use the fact that tzcnt(0) -> VEC_SIZE for memchr to save a branch
   in short string case.
2. Restructure code so that small strings are given the hot path.
	- This is a net-zero on the benchmark suite but in general makes
      sense as smaller sizes are far more common.
3. Use more code-size efficient instructions.
	- tzcnt ...     -> bsf ...
	- vpcmpb $0 ... -> vpcmpeq ...
4. Align labels less aggressively, especially if it doesn't save fetch
   blocks / causes the basic-block to span extra cache-lines.

The optimizations (especially for point 2) make the memchr and
rawmemchr code essentially incompatible so split rawmemchr-evex
to a new file.

Code Size Changes:
memchr-evex.S       : -107 bytes
rawmemchr-evex.S    :  -53 bytes

Net perf changes:

Reported as geometric mean of all improvements / regressions from N=10
runs of the benchtests. Value as New Time / Old Time so < 1.0 is
improvement and 1.0 is regression.

memchr-evex.S       : 0.928
rawmemchr-evex.S    : 0.986 (Less targets cross cache lines)

Full results attached in email.

Full check passes on x86-64.
2022-10-19 17:31:03 -07:00
..
scripts x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
dl-symbol-redir-ifunc.h elf: Remove -fno-tree-loop-distribute-patterns usage on dl-support 2022-10-10 10:32:28 -03:00
ifunc-avx2.h x86-64: Require BMI1/BMI2 for AVX2 strrchr and wcsrchr implementations 2022-10-03 23:46:11 +02:00
ifunc-evex.h x86: Fix backwards Prefer_No_VZEROUPPER check in ifunc-evex.h 2022-06-27 08:35:51 -07:00
ifunc-impl-list.c x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
ifunc-memcmp.h x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
ifunc-memcmpeq.h x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
ifunc-memmove.h x86: Add support for building {w}memmove{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
ifunc-memset.h x86: Add support for building {w}memset{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
ifunc-sse4_2.h x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
ifunc-strcasecmp.h x86-64: Require BMI2 for AVX2 str(n)casecmp implementations 2022-10-03 23:46:11 +02:00
ifunc-strcpy.h x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
ifunc-strncpy.h x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
ifunc-wcslen.h x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
ifunc-wmemset.h x86: Add support for building {w}memset{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
Makefile x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
memchr-avx2-rtm.S x86: Shrink code size of memchr-avx2.S 2022-06-07 13:10:31 -07:00
memchr-avx2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
memchr-evex512.S x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
memchr-evex-base.S x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
memchr-evex-rtm.S x86: Add EVEX optimized memchr family not safe for RTM 2021-05-08 16:26:30 -04:00
memchr-evex.S x86: Optimize memchr-evex.S and implement with VMM headers 2022-10-19 17:31:03 -07:00
memchr-sse2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
memchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memcmp-avx2-movbe-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
memcmp-avx2-movbe.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmp-evex-movbe.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmp-sse2.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmp.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memcmpeq-avx2-rtm.S x86_64: Add avx2 optimized __memcmpeq in memcmpeq-avx2.S 2021-10-27 13:03:46 -05:00
memcmpeq-avx2.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmpeq-evex.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmpeq-sse2.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
memcmpeq.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memcpy_chk-nonshared.S Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memcpy_chk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memcpy.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memmove_chk-nonshared.S Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memmove_chk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memmove-avx512-no-vzeroupper.S x86: Add support for building {w}memmove{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
memmove-avx512-unaligned-erms.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove-avx-unaligned-erms-rtm.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove-avx-unaligned-erms.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove-erms.S x86: Move mem{p}{mov|cpy}_{chk_}erms to its own file 2022-06-29 19:47:52 -07:00
memmove-evex-unaligned-erms.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove-shlib-compat.h x86: Add support for building {w}memmove{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
memmove-sse2-unaligned-erms.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove-ssse3.S x86: Add support for building {w}memmove{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
memmove-vec-unaligned-erms.S x86: Update memmove to use new VEC macros 2022-10-14 21:21:58 -07:00
memmove.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
mempcpy_chk-nonshared.S Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
mempcpy_chk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
mempcpy.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memrchr-avx2-rtm.S x86: Optimize memrchr-avx2.S 2022-06-07 13:10:27 -07:00
memrchr-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
memrchr-evex.S x86: Update memrchr to use new VEC macros 2022-10-14 21:21:58 -07:00
memrchr-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
memrchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memset_chk-nonshared.S Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memset_chk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
memset-avx2-unaligned-erms-rtm.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset-avx2-unaligned-erms.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset-avx512-no-vzeroupper.S x86: Add support for building {w}memset{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
memset-avx512-unaligned-erms.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset-erms.S x86: Move and slightly improve memset_erms 2022-06-29 19:47:52 -07:00
memset-evex-unaligned-erms.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset-sse2-unaligned-erms.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset-vec-unaligned-erms.S x86: Update memset to use new VEC macros 2022-10-14 21:21:58 -07:00
memset.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
rawmemchr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
rawmemchr-avx2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
rawmemchr-evex512.S x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
rawmemchr-evex-rtm.S x86: Optimize memchr-evex.S and implement with VMM headers 2022-10-19 17:31:03 -07:00
rawmemchr-evex.S x86: Optimize memchr-evex.S and implement with VMM headers 2022-10-19 17:31:03 -07:00
rawmemchr-sse2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
rawmemchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
reg-macros.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
rtld-memchr.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
rtld-memcmp.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
rtld-memcmpeq.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
rtld-memmove.S x86: Add support for building {w}memmove{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
rtld-memset.S x86: Add support for building {w}memset{_chk} with explicit ISA level 2022-07-05 16:42:42 -07:00
rtld-rawmemchr.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
rtld-stpcpy.S x86: Move strcpy SSE2 implementation to multiarch/strcpy-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strchr.S x86: Move strchr SSE2 implementation to multiarch/strchr-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strchrnul.S x86: Move strchr SSE2 implementation to multiarch/strchr-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strcmp.S x86: Move strcmp SSE2 implementation to multiarch/strcmp-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strcpy.S x86: Fix -Os build (BZ #29576) 2022-10-05 18:04:13 -03:00
rtld-strcspn.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
rtld-strlen.S x86: Move strlen SSE2 implementation to multiarch/strlen-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strncmp.S x86: Move strcmp SSE2 implementation to multiarch/strcmp-sse2.S 2022-07-13 14:55:31 -07:00
rtld-strnlen.S x86: Move strlen SSE2 implementation to multiarch/strlen-sse2.S 2022-07-13 14:55:31 -07:00
stpcpy-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
stpcpy-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpcpy-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpcpy-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpcpy-sse2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpcpy.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
stpncpy-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
stpncpy-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpncpy-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpncpy-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
stpncpy.c x86: Remove generic strncat, strncpy, and stpncpy implementations 2022-07-12 11:44:12 -07:00
strcasecmp_l-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcasecmp_l-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcasecmp_l-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcasecmp_l-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcasecmp_l-sse4_2.S x86: Move strcmp SSE42 implementation to multiarch/strcmp-sse4_2.S 2022-07-13 14:55:31 -07:00
strcasecmp_l.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strcasecmp.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strcat-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strcat-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcat-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcat-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcat-sse2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcat.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strchr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strchr-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchr-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchr-sse2-no-bsf.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchr-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchr.c x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchrnul-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strchrnul-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchrnul-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchrnul-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strchrnul.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strcmp-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp-naming.h x86: Move strcmp SSE2 implementation to multiarch/strcmp-sse2.S 2022-07-13 14:55:31 -07:00
strcmp-sse2-unaligned.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp-sse4_2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strcmp.c x86-64: Require BMI2 for AVX2 strcmp implementation 2022-10-03 23:46:11 +02:00
strcpy-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strcpy-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcpy-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcpy-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strcpy-sse2.S x86: Fix #define STRCPY guard in strcpy-sse2.S 2022-08-09 17:00:03 +08:00
strcpy.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strcspn-generic.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strcspn-sse4.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strcspn.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strlen-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strlen-avx2.S x86: Fix wcsnlen-avx2 page cross length comparison [BZ #29591] 2022-09-28 20:15:16 -07:00
strlen-evex512.S x86: Update strlen-evex-base to use new reg/vec macros. 2022-10-14 21:21:58 -07:00
strlen-evex-base.S x86: Update strlen-evex-base to use new reg/vec macros. 2022-10-14 21:21:58 -07:00
strlen-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strlen-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strlen.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strncase_l-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncase_l-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncase_l-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncase_l-sse2.S x86: Move strcmp SSE2 implementation to multiarch/strcmp-sse2.S 2022-07-13 14:55:31 -07:00
strncase_l-sse4_2.S x86: Move strcmp SSE42 implementation to multiarch/strcmp-sse4_2.S 2022-07-13 14:55:31 -07:00
strncase_l.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strncase.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strncat-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strncat-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncat-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncat-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncat.c x86: Remove generic strncat, strncpy, and stpncpy implementations 2022-07-12 11:44:12 -07:00
strncmp-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncmp-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncmp-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strncmp-sse2.S x86: Move strcmp SSE2 implementation to multiarch/strcmp-sse2.S 2022-07-13 14:55:31 -07:00
strncmp-sse4_2.S x86: Move strcmp SSE42 implementation to multiarch/strcmp-sse4_2.S 2022-07-13 14:55:31 -07:00
strncmp.c x86-64: Require BMI2 for AVX2 strncmp implementation 2022-10-03 23:46:11 +02:00
strncpy-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strncpy-avx2.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncpy-evex.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncpy-sse2-unaligned.S x86: Add support to build st{p|r}{n}{cpy|cat} with explicit ISA level 2022-07-16 03:07:59 -07:00
strncpy.c x86: Remove generic strncat, strncpy, and stpncpy implementations 2022-07-12 11:44:12 -07:00
strnlen-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strnlen-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strnlen-evex512.S x86_64: Implement evex512 version of strlen, strnlen, wcslen and wcsnlen 2022-05-26 13:11:36 -07:00
strnlen-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strnlen-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strnlen.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strpbrk-generic.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strpbrk-sse4.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strpbrk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strrchr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
strrchr-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strrchr-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strrchr-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strrchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strspn-generic.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strspn-sse4.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
strspn.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
strstr-avx512.c x86: Remove __mmask intrinsics in strstr-avx512.c 2022-07-12 15:41:14 -07:00
strstr-sse2-unaligned.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
strstr.c x86: Rename strstr_sse2 to strstr_generic as it uses string/strstr.c 2022-06-27 08:35:51 -07:00
varshift.c x86: Add support for building str{c|p}{brk|spn} with explicit ISA level 2022-07-05 16:42:42 -07:00
varshift.h x86: Align varshift table to 32-bytes 2022-06-09 12:50:26 -07:00
wcschr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wcschr-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcschr-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcschr-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcschr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wcscmp-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscmp-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscmp-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscmp-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscmp.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wcscpy-generic.c x86: Add support to build wcscpy with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscpy-ssse3.S x86: Add support to build wcscpy with explicit ISA level 2022-07-16 03:07:59 -07:00
wcscpy.c x86: Add support to build wcscpy with explicit ISA level 2022-07-16 03:07:59 -07:00
wcslen-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wcslen-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcslen-evex512.S x86_64: Implement evex512 version of strlen, strnlen, wcslen and wcsnlen 2022-05-26 13:11:36 -07:00
wcslen-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcslen-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcslen-sse4_1.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcslen.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wcsncmp-avx2-rtm.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsncmp-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsncmp-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsncmp-generic.c x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsncmp.c x86: Rename generic functions with unique postfix for clarity 2022-06-16 20:17:45 -07:00
wcsnlen-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wcsnlen-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsnlen-evex512.S x86_64: Implement evex512 version of strlen, strnlen, wcslen and wcsnlen 2022-05-26 13:11:36 -07:00
wcsnlen-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsnlen-generic.c x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsnlen-sse4_1.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsnlen.c x86: Rename generic functions with unique postfix for clarity 2022-06-16 20:17:45 -07:00
wcsrchr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wcsrchr-avx2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsrchr-evex.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsrchr-sse2.S x86: Add support to build strcmp/strlen/strchr with explicit ISA level 2022-07-16 03:07:59 -07:00
wcsrchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wmemchr-avx2-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wmemchr-avx2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
wmemchr-evex512.S x86_64: Implement evex512 version of memchr, rawmemchr and wmemchr 2022-10-18 13:26:33 -07:00
wmemchr-evex-rtm.S x86: Add EVEX optimized memchr family not safe for RTM 2021-05-08 16:26:30 -04:00
wmemchr-evex.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
wmemchr-sse2.S x86: Add support for compiling {raw|w}memchr with high ISA level 2022-06-22 19:41:35 -07:00
wmemchr.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wmemcmp-avx2-movbe-rtm.S x86-64: Add AVX optimized string/memory functions for RTM 2021-03-29 07:40:17 -07:00
wmemcmp-avx2-movbe.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
wmemcmp-evex-movbe.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
wmemcmp-sse2.S x86: Add support for building {w}memcmp{eq} with explicit ISA level 2022-07-05 16:42:42 -07:00
wmemcmp.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wmemset_chk-nonshared.S Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wmemset_chk.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wmemset.c Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
x86-avx-rtm-vecs.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-avx-vecs.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-evex256-vecs.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-evex512-vecs.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-evex-vecs-common.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-sse2-vecs.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00
x86-vec-macros.h x86: Update VEC macros to complete API for evex/evex512 impls 2022-10-14 21:21:58 -07:00