Go to file
Wilco Dijkstra 612fba2fe9 Improve performance of memmem
This patch significantly improves performance of memmem using a novel
modified Horspool algorithm.  Needles up to size 256 use a bad-character
table indexed by hashed pairs of characters to quickly skip past mismatches.
Long needles use a self-adapting filtering step to avoid comparing the whole
needle repeatedly.

By limiting the needle length to 256, the shift table only requires 8 bits
per entry, lowering preprocessing overhead and minimizing cache effects.
This limit also implies worst-case performance is linear.

Small needles up to size 2 use a dedicated linear search.  Very long needles
use the Two-Way algorithm (to avoid increasing stack size or slowing down
the common case, inlining is disabled).

The performance gain is 6.6 times on English text on AArch64 using random
needles with average size 8.

Tested against GLIBC testsuite and randomized tests.

Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>

	* string/memmem.c (__memmem): Rewrite to improve performance.

(cherry picked from commit 680942b016)
2019-09-13 16:41:34 +01:00
argp Remove __need macros from errno.h (__need_Emath, __need_error_t). 2017-06-14 08:14:34 -04:00
assert Fix position of tests-unsupported definition in assert/Makefile. 2017-09-04 11:31:43 +02:00
benchtests Improve strstr performance 2019-09-13 16:39:12 +01:00
bits Factor out shared definitions from bits/signum.h. 2017-06-20 20:32:50 -04:00
catgets Update copyright dates not handled by scripts/update-copyrights. 2017-01-01 00:26:24 +00:00
conform conform/conformtest.pl: Escape literal braces in regular expressions 2018-07-06 16:41:11 +02:00
crypt crypt: Use NSPR header files in addition to NSS header files [BZ #17956] 2017-11-18 19:26:57 +01:00
csu powerpc: Fix float128 IFUNC relocations [BZ #21707] 2017-07-17 17:49:26 -03:00
ctype Use locale_t, not __locale_t, throughout glibc 2017-06-20 20:30:06 -04:00
debug libio: Avoid _allocate_buffer, _free_buffer function pointers [BZ #23236] 2018-06-01 11:24:58 +02:00
dev Rename xlocale.h to bits/types/__locale_t.h. 2017-06-20 20:28:11 -04:00
dirent support: Prevent multiple deletion of temporary files 2017-05-08 16:20:40 +02:00
dlfcn Miscellaneous low-risk changes preparing for _ISOMAC testsuite. 2017-03-01 20:32:50 -05:00
elf aarch64: add STO_AARCH64_VARIANT_PCS and DT_AARCH64_VARIANT_PCS 2019-07-12 10:50:12 +01:00
gmon Assume that O_NOFOLLOW is always defined 2017-04-13 21:28:18 +02:00
gnulib Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
grp Fix cast-after-dereference 2017-07-19 13:17:03 -04:00
gshadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
hesiod Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
hurd Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
iconv Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
iconvdata Increase some test timeouts. 2017-07-06 17:01:03 +00:00
include <array_length.h>: New array_length and array_end macros 2017-12-16 12:57:38 +01:00
inet __inet6_scopeid_pton: Remove attribute_hidden, internal_function 2017-08-22 14:50:56 +02:00
intl intl: Do not return NULL on asprintf failure in gettext [BZ #24018] 2019-01-02 20:01:05 +01:00
io linux: make getcwd(3) fail if it cannot obtain an absolute path [BZ #22679] 2018-01-12 14:49:49 +00:00
libidn Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
libio Fix crash in _IO_wfile_sync (bug 20568) 2019-05-16 10:10:04 +02:00
locale Minor improvements to new az_IR locale 2017-07-27 16:11:04 +02:00
localedata Fix country name in title of mai_NP locale 2017-07-27 16:24:07 +02:00
login Remove check for NULL buffer passed to `ptsname_r' 2017-06-07 17:37:59 +02:00
mach Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
malloc Fix tcache count maximum (BZ #24531) 2019-05-22 15:41:24 +01:00
manual [AArch64] Add ifunc support for Ares 2019-09-06 18:58:34 +01:00
math Fix parameter type in C++ version of iseqsig (bug 23171) 2018-06-19 14:16:36 -03:00
mathvec Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
misc utmp: Avoid -Wstringop-truncation warning 2018-10-22 14:00:13 +02:00
nis Include shlib-compat.h in many sunrpc/nis source files. 2017-06-04 11:31:28 -04:00
nptl Add compiler barriers around modifications of the robust mutex list for pthread_mutex_trylock. [BZ #24180] 2019-02-07 15:49:36 +01:00
nptl_db Narrowing the visibility of libc-internal.h even further. 2017-03-01 20:33:46 -05:00
nscd Fix nscd readlink argument aliasing (bug 22446). 2018-10-22 14:08:12 +02:00
nss nss_files: Avoid large buffers with many host addresses [BZ #22078] 2017-10-19 10:44:31 +02:00
po Update translations 2017-10-10 15:47:10 +05:30
posix posix: Fix large mmap64 offset for mips64n32 (BZ#24699) 2019-07-12 19:02:04 +00:00
pwd Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
resolv Add an additional test to resolv/tst-resolv-network.c 2018-11-09 14:43:45 +01:00
resource Define struct rusage in sys/wait.h when required (bug 21575). 2017-06-19 11:59:19 +00:00
rt Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
scripts Synchronize support/ infrastructure with master 2018-01-15 15:23:35 +01:00
setjmp Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
shadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
signal Factor out shared definitions from bits/signum.h. 2017-06-20 20:32:50 -04:00
socket Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
soft-fp Narrowing the visibility of libc-internal.h even further. 2017-03-01 20:33:46 -05:00
stdio-common stdio-common/tst-printf.c: Remove part under a non-free license [BZ #23363] 2018-07-03 18:34:26 +02:00
stdlib Fix path length overflow in realpath [BZ #22786] 2018-05-17 14:10:16 +02:00
streams Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
string Improve performance of memmem 2019-09-13 16:41:34 +01:00
sunrpc Replace all internal uses of __bzero with memset. This removes the need 2017-06-12 14:56:53 +01:00
support Synchronize support/ infrastructure with master 2018-07-03 18:13:05 +02:00
sysdeps [AArch64] Add ifunc support for Ares 2019-09-06 18:58:34 +01:00
sysvipc Fix test-sysvsem on some platforms 2017-01-02 18:53:50 -02:00
termios Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
time Use _STRUCT_TIMESPEC as guard in <bits/types/struct_timespec.h> [BZ #23349] 2018-06-28 13:22:34 +02:00
timezone timezone: pacify GCC -Wstringop-truncation 2018-10-22 14:00:17 +02:00
wcsmbs Use locale_t, not __locale_t, throughout glibc 2017-06-20 20:30:06 -04:00
wctype Use locale_t, not __locale_t, throughout glibc 2017-06-20 20:30:06 -04:00
.gitattributes Assume __NR_openat is always defined 2016-03-23 23:35:08 +01:00
.gitignore Add *.pyc to .gitignore 2015-05-18 15:26:26 +05:30
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4 Work even with compilers which enable -fstack-protector by default [BZ #7065] 2016-12-26 10:10:58 +01:00
BUGS [BZ #5222] 2007-10-28 08:24:07 +00:00
ChangeLog Improve performance of memmem 2019-09-13 16:41:34 +01:00
ChangeLog.1 * Makefile (distribute): Add ChangeLog.[0-9]. 1995-04-14 03:52:54 +00:00
ChangeLog.2 * Makefile (distribute): Add ChangeLog.[0-9]. 1995-04-14 03:52:54 +00:00
ChangeLog.3 * Makefile (distribute): Add ChangeLog.[0-9]. 1995-04-14 03:52:54 +00:00
ChangeLog.4 * Makefile (distribute): Add ChangeLog.[0-9]. 1995-04-14 03:52:54 +00:00
ChangeLog.5 * sysdeps/posix/getaddrinfo.c: Implement configuration file 2006-05-04 06:38:07 +00:00
ChangeLog.6 Revert "ChangeLogs: convert to utf-8" 2016-02-12 16:35:27 -05:00
ChangeLog.7 Revert "ChangeLogs: convert to utf-8" 2016-02-12 16:35:27 -05:00
ChangeLog.8 ChangeLog: change Winblowz to Windows 2016-08-10 00:49:28 +08:00
ChangeLog.9 Update. 2000-04-28 06:14:43 +00:00
ChangeLog.10 Revert "ChangeLogs: convert to utf-8" 2016-02-12 16:35:27 -05:00
ChangeLog.11 ChangeLog: change Winblowz to Windows 2016-08-10 00:49:28 +08:00
ChangeLog.12 Revert "ChangeLogs: convert to utf-8" 2016-02-12 16:35:27 -05:00
ChangeLog.13 Update. 2002-10-03 16:37:04 +00:00
ChangeLog.14 Revert "ChangeLogs: convert to utf-8" 2016-02-12 16:35:27 -05:00
ChangeLog.15 Split out ChangeLog.15 at 2.3 branch point 2005-02-16 07:34:17 +00:00
ChangeLog.16 Fix typo in name 2012-06-21 16:45:27 +02:00
ChangeLog.17 Revert "Sun agreed to a change of the license for the RPC code to a BSD-like license." 2010-06-27 19:34:03 -07:00
ChangeLog.old-ports Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-aarch64 Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-aix Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-alpha ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-am33 Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-arm Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-cris Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-hppa ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-ia64 Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-linux-generic Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-m68k Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-microblaze Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-mips ChangeLog: fix BZ style to be consistent and match majority of existing code 2017-04-03 15:18:07 -04:00
ChangeLog.old-ports-powerpc Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
ChangeLog.old-ports-tile Move ports/ChangeLog* files to ChangeLog.old-ports*, remove ports/ directory. 2014-04-30 10:40:29 -07:00
config.h.in Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
config.make.in Add per-thread cache to malloc 2017-07-06 13:37:30 -04:00
configure crypt: Use NSPR header files in addition to NSS header files [BZ #17956] 2017-11-18 19:26:57 +01:00
configure.ac crypt: Use NSPR header files in addition to NSS header files [BZ #17956] 2017-11-18 19:26:57 +01:00
CONFORMANCE Move __STDC_* predefined macros from features.h to stdc-predef.h. 2012-02-22 12:53:04 +00:00
COPYING Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
COPYING.LIB Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk Split locale generation snippet into a separate file 2015-05-13 13:05:28 +05:30
INSTALL Update contributors and latest gcc and binutils versions 2017-08-02 18:22:58 +05:30
libc-abis A few more archs have IFUNC support. 2010-03-17 02:43:12 -07:00
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES stdio-common/tst-printf.c: Remove part under a non-free license [BZ #23363] 2018-07-03 18:34:26 +02:00
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig Polish the treatment of dl-tunable-list.h in Makeconfig. 2017-06-09 09:35:31 -04:00
Makefile Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
Makefile.in New make target to only build benchmark binaries 2016-04-20 10:23:28 +05:30
Makerules Place $(elf-objpfx)sofini.os last [BZ #22051] 2017-09-07 08:27:30 -07:00
NAMESPACE Add and update many more entries. 2000-03-20 00:42:58 +00:00
NEWS Fix crash in _IO_wfile_sync (bug 20568) 2019-05-16 10:10:04 +02:00
o-iterator.mk Fri Mar 17 12:58:37 1995 Roland McGrath <roland@churchy.gnu.ai.mit.edu> 1995-03-17 18:42:51 +00:00
README Require Linux kernel 3.2 or later on x86 / x86_64. 2017-05-08 10:45:20 +00:00
README.pretty-printers Fix mutex pretty printer test and pretty printer output. 2017-01-20 14:56:39 +01:00
README.tunables tunables: Clean up hooks to get and set tunables 2017-06-07 11:11:36 +05:30
Rules Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
shlib-versions Extend NSS test suite 2017-07-17 15:52:44 -04:00
test-skeleton.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
version.h Update for 2.26 release 2017-08-02 18:27:16 +05:30
WUR-REPORT * posix/unistd.h (setuid, setreuid, seteuid, setresuid): 2012-08-01 18:12:58 +02:00

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.  The current
GNU/Hurd support requires out-of-tree patches that will eventually be
incorporated into an official GNU C Library release.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arm-*-linux-gnueabi
	hppa-*-linux-gnu	Not currently functional without patches.
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu
	tilegx-*-linux-gnu
	tilepro-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see http://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at http://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see http://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.