glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-30 16:50:07 +00:00

Author	SHA1	Message	Date
Florian Weimer	fada901819	dlfcn: dlerror needs to call free from the base namespace [BZ #24773 ] Calling free directly may end up freeing a pointer allocated by the dynamic loader using malloc from libc.so in the base namespace using the allocator from libc.so in a secondary namespace, which results in crashes. This commit redirects the free call through GLRO and the dynamic linker, to reach the correct namespace. It also cleans up the dlerror handling along the way, so that pthread_setspecific is no longer needed (which avoids triggering bug 24774).	2021-04-21 19:49:51 +02:00
Florian Weimer	b2964eb1d9	dlfcn: Failures after dlmopen should not terminate process [BZ #24772 ] Commit `9e78f6f6e7` ("Implement _dl_catch_error, _dl_signal_error in libc.so [BZ #16628]") has the side effect that distinct namespaces, as created by dlmopen, now have separate implementations of the rtld exception mechanism. This means that the call to _dl_catch_error from libdl in a secondary namespace does not actually install an exception handler because the thread-local variable catch_hook in the libc.so copy in the secondary namespace is distinct from that of the base namepace. As a result, a dlsym/dlopen/... failure in a secondary namespace terminates the process with a dynamic linker error because it looks to the exception handler mechanism as if no handler has been installed. This commit restores GLRO (dl_catch_error) and uses it to set the handler in the base namespace. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:51 +02:00
Florian Weimer	66d99dc53a	nptl: Invoke the set_robust_list system call directly in fork This removes one of the pthread forwarder functions. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:51 +02:00
Florian Weimer	75376a3fb8	nptl: Move pthread_setcanceltype into libc No new symbol version is required because there was a forwarder. The symbol has been moved using scripts/move-symbol-to-libc.py. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	93d78ec1cb	nptl: Move pthread_setcancelstate into libc No new symbol version is required because there was a forwarder. The symbol has been moved using scripts/move-symbol-to-libc.py. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	c62cef023c	nptl: Move pthread_exit into libc The pthread_exit symbol was moved using scripts/move-symbol-to-libc.py. No new symbol version is needed because there was a forwarder. The new tests nptl/tst-pthread_exit-nothreads and nptl/tst-pthread_exit-nothreads-static exercise the scenario that pthread_exit is called without libpthread having been linked in. This is not possible for the generic code, so these tests do not live in sysdeps/pthread for now. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	2cfef0b042	nptl: Move __nptl_deallocate_tsd into libc This prepares moving pthread_exit, and later the pthread_key_create infrastructure. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	43fe356d18	nptl: Move internal __nptl_nthreads variable into libc Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	130fca173f	csu: Move calling main out of __libc_start_main_impl This code depends on whether glibc has unwinding support for a particular port. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	1d95b035c7	nptl: Move __pthread_unwind_next into libc It's necessary to stub out __libc_disable_asynccancel and __libc_enable_asynccancel via rtld-stubbed-symbols because the new direct references to the unwinder result in symbol conflicts when the rtld exception handling from libc is linked in during the construction of librtld.map. unwind-forcedunwind.c is merged into unwind-resume.c. libc now needs the functions that were previously only used in libpthread. The GLIBC_PRIVATE exports of __libc_longjmp and __libc_siglongjmp are no longer needed, so switch them to hidden symbols. The symbol __pthread_unwind_next has been moved using scripts/move-symbol-to-libc.py. Reviewed-by: Adhemerva Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	3fec7f18bf	nptl: Move pthread_once and __pthread_once into libc And also the fork generation counter, __fork_generation. This eliminates the need for __fork_generation_pointer. call_once remains in libpthread and calls the exported __pthread_once symbol. pthread_once and __pthread_once have been moved using scripts/move-symbol-to-libc.py. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	4647ce82c7	nptl: Move __pthread_cleanup_upto into libc This internal symbol is used as part of the longjmp implementation. Rename the file from nptl/pt-cleanup.c to nptl/pthread_cleanup_upto.c so that the pt-* files remain restricted to libpthread. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Adhemerval Zanella	5a3140b489	x86: Restore compile-time check for shadow stack pointer in longjmp	2021-04-21 19:49:50 +02:00
Florian Weimer	81dfc6694c	nptl: Remove longjmp, siglongjmp from libpthread The definitions in libc are sufficient, the forwarders are no longer needed. The symbols have been moved using scripts/move-symbol-to-libc.py. s390-linux-gnu and s390x-linux-gnu need a new version placeholder to keep the GLIBC_2.19 symbol version in libpthread. Tested on i386-linux-gnu, powerpc64le-linux-gnu, s390x-linux-gnu, x86_64-linux-gnu. Built with build-many-glibcs.py. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	1f2e5bfe48	nptl: Move legacy cancelation handling into libc as compat symbols This affects _pthread_cleanup_pop, _pthread_cleanup_pop_restore, _pthread_cleanup_push, _pthread_cleanup_push_defer. The symbols have been moved using scripts/move-symbol-to-libc.py. No new symbol versions are added because the symbols are turned into compatibility symbols at the same time. __pthread_cleanup_pop and __pthread_cleanup_push are added as GLIBC_PRIVATE symbols because they are also used internally, for glibc's own cancellation handling. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	f79f206581	nptl: Move legacy unwinding implementation into libc It is still used internally. Since unwinding is now available unconditionally, avoid indirect calls through function pointers loaded from the stack by inlining the non-cancellation cleanup code. This avoids a regression in security hardening. The out-of-line __libc_cleanup_routine implementation is no longer needed because the inline definition is now static __always_inline. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	5715c29e91	nptl: Move __pthread_cleanup_routine into libc Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Florian Weimer	f03b78fae4	nptl: Move pthread_mutex_consistent into libc And deprecated pthread_mutex_consistent_np, its old name. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 19:49:50 +02:00
Szabolcs Nagy	2208066603	elf: Remove lazy tlsdesc relocation related code Remove generic tlsdesc code related to lazy tlsdesc processing since lazy tlsdesc relocation is no longer supported. This includes removing GL(dl_load_lock) from _dl_make_tlsdesc_dynamic which is only called at load time when that lock is already held. Added a documentation comment too. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-21 14:35:53 +01:00
Noah Goldstein	aaa23c3507	x86: Optimize strlen-avx2.S No bug. This commit optimizes strlen-avx2.S. The optimizations are mostly small things but they add up to roughly 10-30% performance improvement for strlen. The results for strnlen are bit more ambiguous. test-strlen, test-strnlen, test-wcslen, and test-wcsnlen are all passing. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2021-04-19 18:03:49 -07:00
Noah Goldstein	4ba6558684	x86: Optimize strlen-evex.S No bug. This commit optimizes strlen-evex.S. The optimizations are mostly small things but they add up to roughly 10-30% performance improvement for strlen. The results for strnlen are bit more ambiguous. test-strlen, test-strnlen, test-wcslen, and test-wcsnlen are all passing. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2021-04-19 18:03:49 -07:00
Noah Goldstein	f53790272c	x86: Optimize less_vec evex and avx512 memset-vec-unaligned-erms.S No bug. This commit adds optimized cased for less_vec memset case that uses the avx512vl/avx512bw mask store avoiding the excessive branches. test-memset and test-wmemset are passing. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2021-04-19 15:08:04 -07:00
H.J. Lu	83c5b36822	x86-64: Require BMI2 for strchr-avx2.S Since strchr-avx2.S updated by commit `1f745ecc21` Author: noah <goldstein.w.n@gmail.com> Date: Wed Feb 3 00:38:59 2021 -0500 x86-64: Refactor and improve performance of strchr-avx2.S uses sarx: c4 e2 72 f7 c0 sarx %ecx,%eax,%eax for strchr-avx2 family functions, require BMI2 in ifunc-impl-list.c and ifunc-avx2.h.	2021-04-19 11:01:45 -07:00
H.J. Lu	55bf411b45	x86-64: Require BMI2 for __strlen_evex and __strnlen_evex Since __strlen_evex and __strnlen_evex added by commit `1fd8c163a8` Author: H.J. Lu <hjl.tools@gmail.com> Date: Fri Mar 5 06:24:52 2021 -0800 x86-64: Add ifunc-avx2.h functions with 256-bit EVEX use sarx: c4 e2 6a f7 c0 sarx %edx,%eax,%eax require BMI2 for __strlen_evex and __strnlen_evex in ifunc-impl-list.c. ifunc-avx2.h already requires BMI2 for EVEX implementation.	2021-04-19 07:51:33 -07:00
noah	1a8605b6cd	x86: Update large memcpy case in memmove-vec-unaligned-erms.S No Bug. This commit updates the large memcpy case (no overlap). The update is to perform memcpy on either 2 or 4 contiguous pages at once. This 1) helps to alleviate the affects of false memory aliasing when destination and source have a close 4k alignment and 2) In most cases and for most DRAM units is a modestly more efficient access pattern. These changes are a clear performance improvement for VEC_SIZE =16/32, though more ambiguous for VEC_SIZE=64. test-memcpy, test-memccpy, test-mempcpy, test-memmove, and tst-memmove-overflow all pass. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2021-04-16 10:06:56 -07:00
Matheus Castanho	5d61fc2021	powerpc: Add missing registers to clobbers list for syscalls [BZ #27623 ] Some registers that can be clobbered by the kernel during a syscall are not listed on the clobbers list in sysdeps/unix/sysv/linux/powerpc/sysdep.h. For syscalls using sc: - XER is zeroed by the kernel on exit For syscalls using scv: - XER is zeroed by the kernel on exit - Different from the sc case, most CR fields can be clobbered (according to the ELF ABI and the Linux kernel's syscall ABI for powerpc (linux/Documentation/powerpc/syscall64-abi.rst) The same should apply to vsyscalls, which effectively execute a function call but are not currently adding these registers as clobbers either. These are likely not causing issues today, but they should be added to the clobbers list just in case things change on the kernel side in the future. Reported-by: Nicholas Piggin <npiggin@gmail.com> Reviewed-by: Nicholas Piggin <npiggin@gmail.com> Reviewed-by: Raphael M Zinsly <rzinsly@linux.ibm.com>	2021-04-16 08:40:37 -03:00
Adhemerval Zanella	ded3cef361	misc: syslog: Assume MSG_NOSIGNAL support (BZ #17144 ) MSG_NOSIGNAL was added on POSIX 2008 and Hurd seems to support it. The SIGPIPE handling also makes the implementation not thread-safe (due the sigaction usage). Checked on x86_64-linux-gnu.	2021-04-15 11:32:40 -03:00
Adhemerval Zanella	243339d055	io: Move file timestamps tests out of Linux Now that libsupport abstract Linux possible missing support (either due FS limitation that can't handle 64 bit timestamp or architectures that do not handle values larger than unsigned 32 bit values) the tests can be turned generic. Checked on x86_64-linux-gnu and i686-linux-gnu. I also built the tests for i686-gnu. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2021-04-15 09:39:43 -03:00
Stefan Liebler	07c245a76b	s390: Update ulps Required after `9acda61d94` "Fix the inaccuracy of j0f/j1f/y0f/y1f [BZ #14469, #14470, #14471, #14472]".	2021-04-15 11:05:43 +02:00
Szabolcs Nagy	a75a02a696	i386: Remove lazy tlsdesc relocation related code Like in commit e75711ebfa976d5468ec292282566a18b07e4d67 for x86_64, remove unused lazy tlsdesc relocation processing code: _dl_tlsdesc_resolve_abs_plus_addend _dl_tlsdesc_resolve_rel _dl_tlsdesc_resolve_rela _dl_tlsdesc_resolve_hold	2021-04-15 09:47:59 +01:00
Szabolcs Nagy	55c9f32380	x86_64: Remove lazy tlsdesc relocation related code _dl_tlsdesc_resolve_rela and _dl_tlsdesc_resolve_hold are only used for lazy tlsdesc relocation processing which is no longer supported.	2021-04-15 09:47:47 +01:00
Szabolcs Nagy	ddcacd91cc	i386: Avoid lazy relocation of tlsdesc [BZ #27137 ] Lazy tlsdesc relocation is racy because the static tls optimization and tlsdesc management operations are done without holding the dlopen lock. This similar to the commit `b7cf203b5c` for aarch64, but it fixes a different race: bug 27137. On i386 the code is a bit more complicated than on x86_64 because both rel and rela relocs are supported.	2021-04-15 09:47:43 +01:00
Szabolcs Nagy	8f7e09f4db	x86_64: Avoid lazy relocation of tlsdesc [BZ #27137 ] Lazy tlsdesc relocation is racy because the static tls optimization and tlsdesc management operations are done without holding the dlopen lock. This similar to the commit `b7cf203b5c` for aarch64, but it fixes a different race: bug 27137. Another issue is that ld auditing ignores DT_BIND_NOW and thus tries to relocate tlsdesc lazily, but that does not work in a BIND_NOW module due to missing DT_TLSDESC_PLT. Unconditionally relocating tlsdesc at load time fixes this bug 27721 too.	2021-04-15 09:47:37 +01:00
Vineet Gupta	aecbe50c9d	ARC: Update ulps Needed after `43576de04a` Signed-off-by: Vineet Gupta <vgupta@synopsys.com>	2021-04-14 09:24:45 -07:00
Szabolcs Nagy	f4596d9540	Remove PR_TAGGED_ADDR_ENABLE from sys/prctl.h The value of PR_TAGGED_ADDR_ENABLE was incorrect in the installed headers and the prctl command macros were missing that are needed for it to be useful (PR_SET_TAGGED_ADDR_CTRL). Linux headers have the definitions since 5.4 so it's widely available, we don't need to repeat these definitions. The remaining definitions are from Linux 5.10. To build glibc with --enable-memory-tagging, Linux 5.4 headers and binutils 2.33.1 or newer is needed. Reviewed-by: DJ Delorie <dj@redhat.com>	2021-04-14 08:45:21 +01:00
Adhemerval Zanella	bdc12a77b7	linux: sysconf: Use a more explicit maximum_ARG_MAX	2021-04-13 17:45:14 -03:00
Michal Nazarewicz	a9880586ee	linux: sysconf: limit _SC_MAX_ARG to 6 MiB (BZ #25305 ) Since Linux 4.13, kernel limits the maximum command line arguments length to 6 MiB [1]. Normally the limit is still quarter of the maximum stack size but if that limit exceeds 6 MiB it's clamped down. glibc's __sysconf implementation for Linux platform is not aware of this limitation and for stack sizes of over 24 MiB it returns higher ARG_MAX than Linux will actually accept. This can be verified by executing the following application on Linux 4.13 or newer: #include <stdio.h> #include <string.h> #include <sys/resource.h> #include <sys/time.h> #include <unistd.h> int main(void) { const struct rlimit rlim = { 40 * 1024 * 1024, 40 * 1024 * 1024 }; if (setrlimit(RLIMIT_STACK, &rlim) < 0) { perror("setrlimit: RLIMIT_STACK"); return 1; } printf("ARG_MAX : %8ld\n", sysconf(_SC_ARG_MAX)); printf("63 * 100 KiB: %8ld\n", 63L * 100 * 1024); printf("6 MiB : %8ld\n", 6L * 1024 * 1024); char str[100 * 1024], argv[64], envp[1]; memset(&str, 'A', sizeof str); str[sizeof str - 1] = '\0'; for (size_t i = 0; i < sizeof argv / sizeof argv - 1; ++i) { argv[i] = str; } argv[sizeof argv / sizeof argv - 1] = envp[0] = 0; execve("/bin/true", argv, envp); perror("execve"); return 1; } On affected systems the program will report ARG_MAX as 10 MiB but despite that executing /bin/true with a bit over 6 MiB of command line arguments will fail with E2BIG error. Expected result is that ARG_MAX is reported as 6 MiB. Update the __sysconf function to clamp ARG_MAX value to 6 MiB if it would otherwise exceed it. This resolves bug #25305 which was market WONTFIX as suggested solution was to cap ARG_MAX at 128 KiB. As an aside and point of comparison, bionic (a libc implementation for Android systems) decided to resolve this issue by always returning 128 KiB ignoring any potential xargs regressions [2]. On older kernels this results in returning overly conservative value but that's a safer option than being aggressive and returning invalid value on recent systems. It's also worth noting that at this point all supported Linux releases have the 6 MiB barrier so only someone running an unsupported kernel version would get incorrectly truncated result. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> [1] See https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=da029c11e6b12f321f36dac8771e833b65cec962 [2] See `baed51ee3a`	2021-04-13 17:10:02 -03:00
Adhemerval Zanella	58137d00ba	s390: Update ulps Required after `43576de04a` "Improve the accuracy of tgamma (BZ #26983)"	2021-04-13 16:33:27 -03:00
Adhemerval Zanella	30c2a0e41b	i386: Update ulps Required after `43576de04a` "Improve the accuracy of tgamma (BZ #26983)"	2021-04-13 16:33:27 -03:00
Adhemerval Zanella	cedbf6d5f3	linux: always update select timeout (BZ #27706 ) The timeout should be updated even on failure for time64 support. Checked on i686-linux-gnu.	2021-04-12 18:38:37 -03:00
Adhemerval Zanella	9d7c5cc38e	linux: Normalize and return timeout on select (BZ #27651 ) The commit `2433d39b69`, which added time64 support to select, changed the function to use __NR_pselect6 (or __NR_pelect6_time64) on all architectures. However, on architectures where the symbol was implemented with __NR_select the kernel normalizes the passed timeout instead of return EINVAL. For instance, the input timeval { 0, 5000000 } is interpreted as { 5, 0 }. And as indicated by BZ #27651, this semantic seems to be expected and changing it results in some performance issues (most likely the program does not check the return code and keeps issuing select with unormalized tv_usec argument). To avoid a different semantic depending whether which syscall the architecture used to issue, select now always normalize the timeout input. This is a slight change for some ABIs (for instance aarch64). Checked on x86_64-linux-gnu and i686-linux-gnu.	2021-04-12 18:38:37 -03:00
Szabolcs Nagy	8d4d77f6c8	arm: Fix an incorrect check in ____longjmp_chk [BZ #27709 ] An incorrect check in __longjmp_chk could fail on valid code causing FAIL: debug/tst-longjmp_chk2 The original check was altstack_sp + altstack_size - setjmp_sp > altstack_size i.e. sp at setjmp was outside of the altstack range. Here we know that longjmp is called from a signal handler on the altstack (SS_ONSTACK), and that it jumps in the wrong direction (sp decreases), so the check wants to ensure the jump goes to another stack. The check is wrong when altstack_sp == setjmp_sp which can happen when the altstack is a local buffer in the function that calls setjmp, so the patch allows == too. This fixes bug 27709. Note that the generic __longjmp_chk check seems to be different. (it checks if longjmp was on the altstack but does not check setjmp, so it would not catch incorrect longjmp use within the signal handler).	2021-04-12 14:28:07 +01:00
Samuel Thibault	0385d5fff8	hurd: Export _hurd_libc_proc_init hurd's libdiskfs needs to be able to call _hurd_init + _hurd_libc_proc_init for bootstrap initialization.	2021-04-12 00:23:36 +02:00
Tulio Magno Quites Machado Filho	667d9c8d55	powerpc: Update libm test ulps Update after commit `43576de04a`.	2021-04-09 17:41:22 -03:00
Szabolcs Nagy	2d690bbb17	arm: update libm test ulps Updated after commits `9acda61d94` and `43576de04a`.	2021-04-08 09:55:33 +01:00
Szabolcs Nagy	e06e6554c3	aarch64: update libm test ulps Update after commit `43576de04a`.	2021-04-08 08:24:30 +01:00
Paul Zimmermann	43576de04a	Improve the accuracy of tgamma (BZ #26983 ) With this patch, the maximal known error for tgamma is now reduced to 9 ulps for dbl-64, for all rounding modes. Since exhaustive testing is not possible for dbl-64, it might be that there are still cases with an error larger than 9 ulps, but all known cases are fixed (intensive tests were done to find cases with large errors). Tested on x86_64 and powerpc (and by Adhemerval Zanella on aarch64, arm, s390x, sparc, and i686). Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-07 13:23:39 +02:00
John David Anglin	e9eeeb3a58	Update hppa libm-test-ulps	2021-04-06 18:55:58 +00:00
Adhemerval Zanella	5f6ff07dbf	m68: Fix build after `9acda61d94` The j0f/j1f/y0f/y1f now uses __inv_pio4.	2021-04-06 15:10:31 -03:00
Szabolcs Nagy	69499bb6ee	aarch64: free tlsdesc data on dlclose [BZ #27403 ] DL_UNMAP_IS_SPECIAL and DL_UNMAP were not defined. The definitions are now copied from arm, since the same is needed on aarch64. The cleanup of tlsdesc data is handled by the custom _dl_unmap. Fixes bug 27403.	2021-04-06 14:35:05 +01:00
Adhemerval Zanella	edb0ba79a1	ia64: Update ulps Required after `9acda61d94` "Fix the inaccuracy of j0f/j1f/y0f/y1f [BZ #14469, #14470, #14471, #14472]" and `db3f7bb558` "math: Remove slow paths from asin and acos [BZ #15267]".	2021-04-05 10:11:09 -03:00
Adhemerval Zanella	52c512bc56	ia64: Fix build after `9acda61d94` The j0f/j1f/y0f/y1f now uses __inv_pio4 and call roundf (which turns to __roundf on ia64).	2021-04-05 10:07:42 -03:00
Adhemerval Zanella	1d64e962ab	i386: Update ulps Required after `9acda61d94` "Fix the inaccuracy of j0f/j1f/y0f/y1f [BZ #14469, #14470, #14471, #14472]".	2021-04-05 10:02:15 -03:00
Paul Zimmermann	9acda61d94	Fix the inaccuracy of j0f/j1f/y0f/y1f [BZ #14469 , #14470 , #14471 , #14472 ] For j0f/j1f/y0f/y1f, the largest error for all binary32 inputs is reduced to at most 9 ulps for all rounding modes. The new code is enabled only when there is a cancellation at the very end of the j0f/j1f/y0f/y1f computation, or for very large inputs, thus should not give any visible slowdown on average. Two different algorithms are used: * around the first 64 zeros of j0/j1/y0/y1, approximation polynomials of degree 3 are used, computed using the Sollya tool (https://www.sollya.org/) * for large inputs, an asymptotic formula from [1] is used [1] Fast and Accurate Bessel Function Computation, John Harrison, Proceedings of Arith 19, 2009. Inputs yielding the new largest errors are added to auto-libm-test-in, and ulps are regenerated for various targets (thanks Adhemerval Zanella). Tested on x86_64 with --disable-multi-arch and on powerpc64le-linux-gnu. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-04-02 06:15:48 +02:00
Sunil K Pandey	595c22ecd8	x86-64: Fix ifdef indentation in strlen-evex.S Fix some indentations of ifdef in file strlen-evex.S which are off by 1 and confusing to read.	2021-04-01 16:13:33 -07:00
Joseph Myers	e21b7c87e8	Update Nios II libm-test-ulps.	2021-04-01 19:41:40 +00:00
Adhemerval Zanella	be60d70166	Update arm libm-tests-ulps Required after `db3f7bb558` "math: Remove slow paths from asin and acos [BZ #15267]".	2021-04-01 14:02:05 -03:00
H.J. Lu	b1ec623ed5	x86_64: Correct THREAD_SETMEM/THREAD_SETMEM_NC for movq [BZ #27591 ] config/i386/constraints.md in GCC has (define_constraint "e" "32-bit signed integer constant, or a symbolic reference known to fit that range (for immediate operands in sign-extending x86-64 instructions)." (match_operand 0 "x86_64_immediate_operand")) Since movq takes a signed 32-bit immediate or a register source operand, use "er", instead of "nr"/"ir", constraint for 32-bit signed integer constant or register on movq. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2021-04-01 07:00:22 -07:00
Andreas Schwab	5ccea9a011	powerpc64le: Use ifunc for _Float128 functions also in libc This fixes missing definition of math functions in libc in a static link that are no longer built for libm after commit `4898d9712b` ("Avoid adding duplicated symbols into static libraries").	2021-04-01 10:55:42 +02:00
Stefan Liebler	01e0451175	S390: Allow "v" constraint for long double math_opt_barrier and math_force_eval with GCC 11. Starting with GCC 11, long double values can also be processed in vector registers if build with -march >= z14. Then GCC defines the __LONG_DOUBLE_VX__ macro. FYI: GCC commit "IBM Z: Introduce __LONG_DOUBLE_VX__ macro" https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=f47df2af313d2ce7f9149149010a142c2237beda	2021-04-01 09:14:20 +02:00
Stefan Liebler	18f0afa848	Fix conform linknamespace tests due to gnu_dev_makedev If building on s390 / i686 with -Os, various conformance tests are failing with e.g. conform/ISO/assert.h/linknamespace.out: [initial] __assert_fail -> [libc.a(assert.o)] __dcgettext -> [libc.a(dcgettext.o)] __dcigettext -> [libc.a(dcigettext.o)] __getcwd -> [libc.a(getcwd.o)] __fstatat64 -> [libc.a(fstatat64.o)] gnu_dev_makedev The usage of gnu_dev_makedev was recently introduced by usage of the makedev makro in commit: `5b980d4809` linux: Use statx for MIPSn64 This patch is now linking against __gnu_dev_makedev as also done in commit: `8b4a118222` Fix -Os gnu_dev_* linknamespace, localplt issues (bug 15105, bug 19463).	2021-03-31 16:10:14 +02:00
Adhemerval Zanella	42624c7dc7	Update sparc libm-tests-ulps Required after `db3f7bb558` "math: Remove slow paths from asin and acos [BZ #15267]".	2021-03-30 14:04:11 -03:00
Siddhesh Poyarekar	abadbef5c8	Move __isnanf128 to libc.so All of the isnan functions are in libc.so due to printf_fp, so move __isnanf128 there too for consistency. Reviewed-by: Tulio Magno Quites Machado Filho <tuliom@ascii.art.br> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2021-03-30 14:58:19 +05:30
Samuel Thibault	64786a7090	fork.h: replace with register-atfork.h UNREGISTER_ATFORK is now defined for all ports in register-atfork.h, so most previous includes of fork.h actually only need register-atfork.h now, and cxa_finalize.c does not need an ifdef UNREGISTER_ATFORK any more. The nptl-specific fork generation counters can then go to pthreadP.h, and fork.h be removed. Checked on x86_64-linux-gnu and i686-gnu. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-03-29 21:41:09 +02:00
H.J. Lu	e4fda46310	x86-64: Use ZMM16-ZMM31 in AVX512 memmove family functions Update ifunc-memmove.h to select the function optimized with AVX512 instructions using ZMM16-ZMM31 registers to avoid RTM abort with usable AVX512VL since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	4e2d8f3527	x86-64: Use ZMM16-ZMM31 in AVX512 memset family functions Update ifunc-memset.h/ifunc-wmemset.h to select the function optimized with AVX512 instructions using ZMM16-ZMM31 registers to avoid RTM abort with usable AVX512VL and AVX512BW since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	4bd660be40	x86: Add string/memory function tests in RTM region At function exit, AVX optimized string/memory functions have VZEROUPPER which triggers RTM abort. When such functions are called inside a transactionally executing RTM region, RTM abort causes severe performance degradation. Add tests to verify that string/memory functions won't cause RTM abort in RTM region.	2021-03-29 07:40:17 -07:00
H.J. Lu	7ebba91361	x86-64: Add AVX optimized string/memory functions for RTM Since VZEROUPPER triggers RTM abort while VZEROALL won't, select AVX optimized string/memory functions with xtest jz 1f vzeroall ret 1: vzeroupper ret at function exit on processors with usable RTM, but without 256-bit EVEX instructions to avoid VZEROUPPER inside a transactionally executing RTM region.	2021-03-29 07:40:17 -07:00
H.J. Lu	91264fe357	x86-64: Add memcmp family functions with 256-bit EVEX Update ifunc-memcmp.h to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL, AVX512BW and MOVBE since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	1b968b6b9b	x86-64: Add memset family functions with 256-bit EVEX Update ifunc-memset.h/ifunc-wmemset.h to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL and AVX512BW since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	63ad43566f	x86-64: Add memmove family functions with 256-bit EVEX Update ifunc-memmove.h to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	525bc2a32c	x86-64: Add strcpy family functions with 256-bit EVEX Update ifunc-strcpy.h to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL and AVX512BW since VZEROUPPER isn't needed at function exit.	2021-03-29 07:40:17 -07:00
H.J. Lu	1fd8c163a8	x86-64: Add ifunc-avx2.h functions with 256-bit EVEX Update ifunc-avx2.h, strchr.c, strcmp.c, strncmp.c and wcsnlen.c to select the function optimized with 256-bit EVEX instructions using YMM16-YMM31 registers to avoid RTM abort with usable AVX512VL, AVX512BW and BMI2 since VZEROUPPER isn't needed at function exit. For strcmp/strncmp, prefer AVX2 strcmp/strncmp if Prefer_AVX2_STRCMP is set.	2021-03-29 07:40:17 -07:00
H.J. Lu	1da50d4bda	x86: Set Prefer_No_VZEROUPPER and add Prefer_AVX2_STRCMP 1. Set Prefer_No_VZEROUPPER if RTM is usable to avoid RTM abort triggered by VZEROUPPER inside a transactionally executing RTM region. 2. Since to compare 2 32-byte strings, 256-bit EVEX strcmp requires 2 loads, 3 VPCMPs and 2 KORDs while AVX2 strcmp requires 1 load, 2 VPCMPEQs, 1 VPMINU and 1 VPMOVMSKB, AVX2 strcmp is faster than EVEX strcmp. Add Prefer_AVX2_STRCMP to prefer AVX2 strcmp family functions.	2021-03-29 07:40:17 -07:00
Adhemerval Zanella	f8466cc504	linux: Add y2106 support on utimensat tests The tests are refactored to use a common skeleton that handles whether the underlying filesystem supports 64 bit time, skips 64 bit time tests when the TU only supports 32 bit, and also skip 64 bit time tests larger than 32 unsigned int (y2106) if the system does not support it (MIPSn64 on kernels without statx support). Checked on x86_64-linux-gnu and i686-linux-gnu. I also checked on a mips64el-linux-gnu with 4.1.4 and 5.10.0-4-5kc-malta kernel to verify if the y2106 are indeed skipped.	2021-03-29 10:22:13 -03:00
Adhemerval Zanella	5b980d4809	linux: Use statx for MIPSn64 MIPSn64 kernel ABI for legacy stat uses unsigned 32 bit for second timestamp, which limits the maximum value to y2106. This patch make mips64 use statx as for 32-bit architectures. Thie __cp_stat64_t64_statx is open coded, its usage is solely on fstatat64 and it avoid the need to redefine the name for mips64 (which will call __cp_stat64_statx since its does not use __stat64_t64 internally).	2021-03-29 10:22:13 -03:00
Adhemerval Zanella	1fbffbda36	linux: Disable fstatat64 fallback if __ASSUME_STATX is defined If the minimum kernel supports statx there is no need to call the fallback stat legacy syscalls. The statx is also called on compat xstat syscall, but different than the fstatat it calls no fallback and it is assumed to be always present. Checked on powerpc-linux-gnu (with and without --enable-kernel=4.11) and on powerpc64-linux-gnu.	2021-03-29 10:22:13 -03:00
Adhemerval Zanella	4c4e90ccf8	linux: Implement fstatat with __fstatat64_time64 It makes fstatat use __NR_statx, which fix the s390 issue with missing nanoxsecond support on compat stat syscalls (at least on recent kernels) and limits the statx call to only one function (which simplifies the __ASSUME_STATX support). Checked on i686-linux-gnu and on powerpc-linux-gnu.	2021-03-29 10:22:13 -03:00
H.J. Lu	27f7463675	x86: Properly disable XSAVE related features [BZ #27605 ] 1. Support GLIBC_TUNABLES=glibc.cpu.hwcaps=-XSAVE. 2. Disable all features which depend on XSAVE: a. If OSXSAVE is disabled by glibc tunables. Or b. If both XSAVE and XSAVEC aren't usable.	2021-03-29 06:04:17 -07:00
Adhemerval Zanella	09ce31eddf	nptl: Remove __libc_allocate_rtsig, __libc_current_sigrtmax, and __libc_current_sigrtmin The libc version is identical and built with same flags. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	70a1e36cbe	nptl: Move sigaction to libc The libc version is identical and built with same flags. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	ff1e342cd1	nptl: Remove pthread raise implementation The Linux version already target the current thread by using tgkill along with getpid and gettid. For arm, libpthread does not do a intra PLT since it will call the raise from libc. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	b76658451c	nptl: Move pthread_kill to libc A new 2.34 version is also provided. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	4c8cb283ec	nptl: Remove pwrite from libpthread The libc version is identical and built with same flags, it is also uses as the default version. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	dd795c6c24	nptl: Remove pread from libpthread The libc version is identical and built with same flags, it is also uses as the default version. Checked on x86_64-linux-gnu.	2021-03-26 13:37:18 -03:00
Adhemerval Zanella	40873cdd38	nptl: Remove open from libpthread The libc version is identical and built with same flags. The libc version is set as the default version. Checked on x86_64-linux-gnu.	2021-03-26 13:37:14 -03:00
Adhemerval Zanella	c5c3588475	nptl: Remove lseek from libpthread The libc version is identical and built with same flags. The libc version is set as the default version. The libpthread compat symbol requires to mask it when building the loader object otherwise ld might complain about a missing versioned symbol (as for alpha). Checked on x86_64-linux-gnu.	2021-03-26 13:36:17 -03:00
Adhemerval Zanella	78d1724d53	nptl: Remove send from libpthread The libc version is identical and built with same flags. Both aarch64 and nios2 also requires to export __send and tt was done previously with the HAVE_INTERNAL_SEND_SYMBOL (which forced the symbol creation). All __send callers are internal to libc and the original issue that required the symbol export was due a missing libc_hidden_def. So a compat symbol is added for __send and the libc_hidden_def is defined regardless. Checked on x86_64-linux-gnu and i686-linux-gnu.	2021-03-26 13:36:17 -03:00
Szabolcs Nagy	1dc17ea8f8	aarch64: Optimize __libc_mtag_tag_zero_region This is a target hook for memory tagging, the original was a naive implementation. Uses the same algorithm as __libc_mtag_tag_region, but with instructions that also zero the memory. This was not benchmarked on real cpu, but expected to be faster than the naive implementation.	2021-03-26 11:03:06 +00:00
Szabolcs Nagy	23fd760add	aarch64: Optimize __libc_mtag_tag_region This is a target hook for memory tagging, the original was a naive implementation. The optimized version relies on "dc gva" to tag 64 bytes at a time for large allocations and optimizes small cases without adding too many branches. This was not benchmarked on real cpu, but expected to be faster than the naive implementation.	2021-03-26 11:03:06 +00:00
Szabolcs Nagy	383bc24028	aarch64: inline __libc_mtag_new_tag This is a common operation when heap tagging is enabled, so inline the instructions instead of using an extern call.	2021-03-26 11:03:06 +00:00
Szabolcs Nagy	40dc773f92	aarch64: inline __libc_mtag_address_get_tag This is a common operation when heap tagging is enabled, so inline the instruction instead of using an extern call. The .inst directive is used instead of the name of the instruction (or acle intrinsics) because malloc.c is not compiled for armv8.5-a+memtag architecture, runtime cpu support detection is used. Prototypes are removed from the comments as they were not always correct.	2021-03-26 11:03:06 +00:00
Szabolcs Nagy	c076a0bc69	malloc: Only support zeroing and not arbitrary memset with mtag The memset api is suboptimal and does not provide much benefit. Memory tagging only needs a zeroing memset (and only for memory that's sized and aligned to multiples of the tag granule), so change the internal api and the target hooks accordingly. This is to simplify the implementation of the target hook. Reviewed-by: DJ Delorie <dj@redhat.com>	2021-03-26 11:03:06 +00:00
Szabolcs Nagy	e865dcbb7b	malloc: Ensure the generic mtag hooks are not used Use inline functions instead of macros, because macros can cause unused variable warnings and type conversion issues. We assume these functions may appear in the code but only in dead code paths (hidden by a runtime check), so it's important that they can compile with correct types, but if they are actually used that should be an error. Currently the hooks are only used when USE_MTAG is true which only happens on aarch64 and then the aarch64 specific code is used not this generic header. However followup refactoring will allow the hooks to be used with !USE_MTAG. Note: the const qualifier in the comment was wrong: changing tags is a write operation. Reviewed-by: DJ Delorie <dj@redhat.com>	2021-03-26 11:03:06 +00:00
Stefan Liebler	7759be2593	S390: Also check vector support in memmove ifunc-selector [BZ #27511 ] The arch13 memmove variant is currently selected by the ifunc selector if the Miscellaneous-Instruction-Extensions Facility 3 facility bit is present, but the function is also using vector instructions. If the vector support is not present, one is receiving an operation exception. Therefore this patch also checks for vector support in the ifunc selector and in ifunc-impl-list.c. Just to be sure, the configure check is now also testing an arch13 vector instruction and an arch13 Miscellaneous-Instruction-Extensions Facility 3 instruction.	2021-03-26 10:51:31 +01:00
Florian Weimer	0923f74ada	Support for multiple versions in versioned_symbol, compat_symbol This essentially folds compat_symbol_unique functionality into compat_symbol. This change eliminates the need for intermediate aliases for defining multiple symbol versions, for both compat_symbol and versioned_symbol. Some binutils versions do not suport multiple versions per symbol on some targets, so aliases are automatically introduced, similar to what compat_symbol_unique did. To reduce symbol table sizes, a configure check is added to avoid these aliases if they are not needed. The new mechanism works with data symbols as well as function symbols, due to the way an assembler-level redirect is used. It is not compatible with weak symbols for old binutils versions, which is why the definition of __malloc_initialize_hook had to be changed. This is not a loss of functionality because weak symbols do not matter to dynamic linking. The placeholder symbol needs repeating in nptl/libpthread-compat.c now that compat_symbol is used, but that seems more obvious than introducing yet another macro. A subtle difference was that compat_symbol_unique made the symbol global automatically. compat_symbol does not do this, so static had to be removed from the definition of __libpthread_version_placeholder. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-03-25 12:33:02 +01:00
Florian Weimer	3a24ddeab5	Change how the symbol_version_reference macro is defined A subsequent change will require including <config.h> for defining symbol_version_reference. <libc-symbol.h> should not include <config.h> for _ISOMAC, so it cannot define symbol_version_reference anymore, but symbol_version_reference is needed <shlib-compat.h> even for _ISOMAC. Moving the definition of symbol_version_reference to a separate file <libc-symver.h> makes it possible to use a single definition for both cases. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-03-25 11:06:56 +01:00
Samuel Thibault	16b597807d	elf: Fix not compiling ifunc tests that need gcc ifunc support	2021-03-24 01:52:46 +01:00
Samuel Thibault	14beab5321	htl: Add missing fork.h `2b47727c68` ("posix: Consolidate register-atfork") introduced a fork.h header to declare the atfork unregister hook, but was missing adding it for htl. This fixes tst-atfork2.	2021-03-24 00:18:17 +00:00
Samuel Thibault	c3b287be74	hurd: handle EINTR during critical sections During critical sections, signal handling is deferred and thus RPCs return EINTR, even if SA_RESTART is set. We thus have to restart the whole critical section in that case. This also adds HURD_CRITICAL_UNLOCK in the cases where one wants to break the section in the middle.	2021-03-23 22:40:10 +00:00

1 2 3 4 5 ...

13974 Commits