glibc

mirror of https://sourceware.org/git/glibc.git synced 2025-01-06 01:21:08 +00:00

Author	SHA1	Message	Date
Joseph Myers	af16a59ee1	Fix Hurd getcwd build with GCC >= 13 The build of glibc for i686-gnu has been failing for a while with GCC mainline / GCC 13: ../sysdeps/mach/hurd/getcwd.c: In function '__hurd_canonicalize_directory_name_internal': ../sysdeps/mach/hurd/getcwd.c:242:48: error: pointer 'file_name' may be used after 'realloc' [-Werror=use-after-free] 242 \| file_namep = &buf[file_namep - file_name + size / 2]; \| ~~~~~~~~~~~^~~~~~~~~~~ ../sysdeps/mach/hurd/getcwd.c:236:25: note: call to 'realloc' here 236 \| buf = realloc (file_name, size); \| ^~~~~~~~~~~~~~~~~~~~~~~~~ Fix by doing the subtraction before the reallocation. Tested with build-many-glibcs.py for i686-gnu. [samuel.thibault@ens-lyon.rg: Removed mention of this being a bug] Message-Id: <18587337-7815-4056-ebd0-724df262d591@codesourcery.com>	2023-04-27 01:27:28 +02:00
Joseph Myers	bcca5ae804	Regenerate sysdeps/mach/hurd/bits/errno.h This file was out of date, as shown by build-many-glibcs.py runs resulting in a modified source directory.	2023-04-26 17:11:41 +00:00
Joe Simmons-Talbott	a3461d4923	if_index: Remove unneeded alloca.h include Nothing is being used from this header. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-04-26 08:06:49 -04:00
Joe Simmons-Talbott	19fdc3542b	gethostid: Do not include alloca.h Nothing from alloca.h is being used here. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-04-26 08:06:44 -04:00
Samuel Thibault	323fe6a1a9	hurd: Do not take any flag from the CMSG_DATA As fixed in `0822e3552a` ("hurd: Don't pass FD_CLOEXEC in CMSG_DATA"), senders currently don't have any flag to pass. We shouldn't blindly take random flags that senders could be erroneously giving us.	2023-04-25 00:14:58 +02:00
Sergey Bugaev	5fa8945605	hurd: Implement MSG_CMSG_CLOEXEC This is a new flag that can be passed to recvmsg () to make it atomically set the CLOEXEC flag on all the file descriptors received using the SCM_RIGHTS mechanism. This is useful for all the same reasons that the other XXX_CLOEXEC flags are useful: namely, it provides atomicity with respect to another thread of the same process calling (fork and then) exec at the same time. This flag is already supported on Linux and FreeBSD. The flag's value, 0x40000, is choosen to match FreeBSD's. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230423160548.126576-2-bugaevc@gmail.com>	2023-04-24 23:09:50 +02:00
Samuel Thibault	0822e3552a	hurd: Don't pass FD_CLOEXEC in CMSG_DATA The flags are used by _hurd_intern_fd, which takes O_* flags, not FD_*. Also, it is of no concern to the receiving process whether or not the sender process wants to close its copy of sent file descriptor upon exec, and it should not influence whether or not the received file descriptor gets the FD_CLOEXEC flag set in the receiving process. The latter should in fact be dependent on the MSG_CMSG_CLOEXEC flag being passed to the recvmsg () call, which is going to be implemented in the following commit. Fixes `344e755248` "hurd: Support sending file descriptors over Unix sockets" Signed-off-by: Samuel Thibault <samuel.thibault@ens-lyon.org> Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-24 23:05:15 +02:00
Sergey Bugaev	c02b26455b	hurd: Implement prefer_map_32bit_exec tunable This makes the prefer_map_32bit_exec tunable no longer Linux-specific. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230423215526.346009-4-bugaevc@gmail.com>	2023-04-24 22:48:35 +02:00
Sergey Bugaev	35b7bf2fe0	hurd: Don't attempt to deallocate MACH_PORT_DEAD ...in some more places. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230423215526.346009-2-bugaevc@gmail.com>	2023-04-24 22:44:53 +02:00
Sergey Bugaev	4c39333050	hurd: Only deallocate addrport when it's valid Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230423160548.126576-3-bugaevc@gmail.com>	2023-04-24 22:44:18 +02:00
Sergey Bugaev	70b9173caa	hurd: Implement MAP_32BIT This is a flag that can be passed to mmap () to request that the mapping being established should be located in the lower 2 GB area of the address space, so only the lower 31 (not 32) bits can be set in its address, and the address can be represented as a 32-bit integer without truncating it. This flag is intended to be compatible with Linux, FreeBSD, and Darwin flags of the same name. Out of those systems, it appears Linux and FreeBSD take MAP_32BIT to mean "map 31 bit", whereas Darwin allows the 32nd bit to be set in the address as well. The Hurd follows Linux and FreeBSD behavior. Unlike on those systems, on the Hurd MAP_32BIT is defined on all supported architectures (which currently are only i386 and x86_64). Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230423215526.346009-1-bugaevc@gmail.com>	2023-04-24 22:42:12 +02:00
Sergey Bugaev	533deafbdf	Use O_CLOEXEC in more places (BZ #15722 ) When opening a temporary file without O_CLOEXEC we risk leaking the file descriptor if another thread calls (fork and then) exec while we have the fd open. Fix this by consistently passing O_CLOEXEC everywhere where we open a file for internal use (and not to return it to the user, in which case the API defines whether or not the close-on-exec flag shall be set on the returned fd). Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230419160207.65988-4-bugaevc@gmail.com>	2023-04-22 13:50:14 +02:00
Sergey Bugaev	8e78a2e1d1	hurd: Don't migrate reply port into __init1_tcbhead Properly differentiate between setting up the real TLS with TLS_INIT_TP, and setting up the early TLS (__init1_tcbhead) in static builds. In the latter case, don't yet migrate the reply port into the TCB, and don't yet set __libc_tls_initialized to 1. This also lets us move the __init1_desc assignment inside _hurd_tls_init (). Fixes `cd019ddd89` "hurd: Don't leak __hurd_reply_port0" Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-21 03:02:04 +02:00
Sergey Bugaev	88cc282a9a	hurd: Make dl-sysdep's open () cope with O_IGNORE_CTTY Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230419160207.65988-6-bugaevc@gmail.com>	2023-04-20 23:05:54 +02:00
Cupertino Miranda	b630be0922	Created tunable to force small pages on stack allocation. Created tunable glibc.pthread.stack_hugetlb to control when hugepages can be used for stack allocation. In case THP are enabled and glibc.pthread.stack_hugetlb is set to 0, glibc will madvise the kernel not to use allow hugepages for stack allocations. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-04-20 13:54:24 -03:00
Adhemerval Zanella	320768a664	linux: Re-flow and sort multiline Makefile definitions	2023-04-20 10:40:54 -03:00
Sergey Bugaev	8895a99c10	hurd: Microoptimize sigreturn Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-18 16:20:09 +02:00
Sergey Bugaev	e411e31b7b	hurd: Fix restoring reply port in sigreturn We must not use the user's reply port (scp->sc_reply_port) for any of our own RPCs, otherwise various things break. So, use MACH_PORT_DEAD as a reply port when destroying our reply port, and make sure to do this after _hurd_sigstate_unlock (), which may do a gsync_wake () RPC. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-17 21:00:02 +02:00
Wilco Dijkstra	76d0f094dd	math: Improve fmod(f) performance Optimize the fast paths (x < y) and (x/y < 2^12). Delay handling of special cases to reduce the number of instructions executed before the fast paths. Performance improvements for fmod: Skylake Zen2 Neoverse V1 subnormals 11.8% 4.2% 11.5% normal 3.9% 0.01% -0.5% close-exponents 6.3% 5.6% 19.4% Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-04-17 13:03:10 +01:00
Sergey Bugaev	e275690332	hurd: Only check for TLS initialization inside rtld or in static builds When glibc is built as a shared library, TLS is always initialized by the call of TLS_INIT_TP () macro made inside the dynamic loader, prior to running the main program (see dl-call_tls_init_tp.h). We can take advantage of this: we know for sure that __LIBC_NO_TLS () will evaluate to 0 in all other cases, so let the compiler know that explicitly too. Also, only define _hurd_tls_init () and TLS_INIT_TP () under the same conditions (either !SHARED or inside rtld), to statically assert that this is the case. Other than a microoptimization, this also helps with avoiding awkward sharing of the __libc_tls_initialized variable between ld.so and libc.so that we would have to do otherwise -- we know for sure that no sharing is required, simply because __libc_tls_initialized would always be set to true inside libc.so. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-25-bugaevc@gmail.com>	2023-04-14 10:31:22 +00:00
Sergey Bugaev	ba00d787f3	hurd: Remove __hurd_local_reply_port Now that the signal code no longer accesses it, the only real user of it was mig-reply.c, so move the logic for managing the port there. If we're in SHARED and outside of rtld, we know that __LIBC_NO_TLS () always evaluates to 0, and a TLS reply port will always be used, not __hurd_reply_port0. Still, the compiler does not see that __hurd_reply_port0 is never used due to its address being taken. To deal with this, explicitly compile out __hurd_reply_port0 when we know we won't use it. Also, instead of accessing the port via THREAD_SELF->reply_port, this uses THREAD_GETMEM and THREAD_SETMEM directly, avoiding possible miscompilations. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-14 10:31:22 +00:00
Adhemerval Zanella	05fe3ecfff	malloc: Assure that THP mode read do write OOB end of stringt	2023-04-14 08:22:40 -03:00
Adhemerval Zanella	801deb07f6	malloc: Assure that THP mode is always null terminated	2023-04-13 17:18:04 -03:00
Samuel Thibault	decf02d382	hurd: Mark two tests as unsupported They make the whole testsuite hang/crash.	2023-04-13 02:02:38 +02:00
Samuel Thibault	6538a288be	hurd: Restore destroying receive rights on sigreturn Just subtracting a ref is making signal/tst-signal signal/tst-raise signal/tst-minsigstksz-5 htl/tst-raise1 fail.	2023-04-13 00:49:16 +02:00
Samuel Thibault	5473a1747a	Revert "hurd: Only check for TLS initialization inside rtld or in static builds" This reverts commit `b37899d34d`. Apparently we load libc.so (and thus start using its functions) before calling TLS_INIT_TP, so libc.so functions should not actually assume that TLS is always set up.	2023-04-11 18:45:47 +00:00
Sergey Bugaev	cd019ddd89	hurd: Don't leak __hurd_reply_port0 Previously, once we set up TLS, we would implicitly switch from using __hurd_reply_port0 to reply_port inside the TCB, leaving the former unused. But we never deallocated it, so it got leaked. Instead, migrate the port into the new TCB's reply_port slot. This avoids both the port leak and an extra syscall to create a new reply port for the TCB. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-28-bugaevc@gmail.com>	2023-04-11 00:24:40 +02:00
Sergey Bugaev	747812349d	hurd: Improve reply port handling when exiting signal handlers If we're doing signals, that means we've already got the signal thread running, and that implies TLS having been set up. So we know that __hurd_local_reply_port will resolve to THREAD_SELF->reply_port, and can access that directly using the THREAD_GETMEM and THREAD_SETMEM macros. This avoids potential miscompilations, and should also be a tiny bit faster. Also, use mach_port_mod_refs () and not mach_port_destroy () to destroy the receive right. mach_port_destroy () should never be used on mach_task_self (); this can easily lead to port use-after-free vulnerabilities if the task has any other references to the same port. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-26-bugaevc@gmail.com>	2023-04-10 23:54:28 +02:00
Sergey Bugaev	b37899d34d	hurd: Only check for TLS initialization inside rtld or in static builds When glibc is built as a shared library, TLS is always initialized by the call of TLS_INIT_TP () macro made inside the dynamic loader, prior to running the main program (see dl-call_tls_init_tp.h). We can take advantage of this: we know for sure that __LIBC_NO_TLS () will evaluate to 0 in all other cases, so let the compiler know that explicitly too. Also, only define _hurd_tls_init () and TLS_INIT_TP () under the same conditions (either !SHARED or inside rtld), to statically assert that this is the case. Other than a microoptimization, this also helps with avoiding awkward sharing of the __libc_tls_initialized variable between ld.so and libc.so that we would have to do otherwise -- we know for sure that no sharing is required, simply because __libc_tls_initialized would always be set to true inside libc.so. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-25-bugaevc@gmail.com>	2023-04-10 23:33:30 +02:00
Sergey Bugaev	4644fb9c4c	elf: Stop including tls.h in ldsodefs.h Nothing in there needs tls.h Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-24-bugaevc@gmail.com>	2023-04-10 23:26:28 +02:00
Sergey Bugaev	60f9bf9746	hurd: Port trampoline.c to x86_64 Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230403115621.258636-3-bugaevc@gmail.com>	2023-04-10 20:44:43 +02:00
Sergey Bugaev	645da826bb	hurd: Do not declare local variables volatile These are just regular local variables that are not accessed in any funny ways, not even though a pointer. There's absolutely no reason to declare them volatile. It only ends up hurting the quality of the generated machine code. If anything, it would make sense to decalre sigsp as pointing to volatile memory (volatile void *sigsp), but evidently that's not needed either. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230403115621.258636-2-bugaevc@gmail.com>	2023-04-10 20:42:28 +02:00
Sergey Bugaev	892f702827	hurd: Implement x86_64/intr-msg.h Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-18-bugaevc@gmail.com>	2023-04-10 20:39:28 +02:00
Sergey Bugaev	57df0f16b4	hurd: Add sys/ucontext.h and sigcontext.h for x86_64 This is based on the Linux port's version, but laid out to match Mach's struct i386_thread_state, much like the i386 version does. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-04-10 20:11:43 +02:00
Flavio Cruz	f7f7dd8009	hurd: Stop depending on the default_pager stubs provided by gnumach The hurd source tree already provides the same stubs and they are only needed there. Message-Id: <ZDN3rDdjMowtUWf7@jupiter.tail36e24.ts.net>	2023-04-10 19:01:52 +02:00
H.J. Lu	81a3cc956e	<sys/platform/x86.h>: Add PREFETCHI support Add PREFETCHI support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	b05521c916	<sys/platform/x86.h>: Add AMX-COMPLEX support Add AMX-COMPLEX support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	609b7b2d3c	<sys/platform/x86.h>: Add AVX-NE-CONVERT support Add AVX-NE-CONVERT support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	4c120c88a6	<sys/platform/x86.h>: Add AVX-VNNI-INT8 support Add AVX-VNNI-INT8 support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	b39741b45f	<sys/platform/x86.h>: Add MSRLIST support Add MSRLIST support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	96037c697d	<sys/platform/x86.h>: Add AVX-IFMA support Add AVX-IFMA support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	8b4cc05eab	<sys/platform/x86.h>: Add AMX-FP16 support Add AMX-FP16 support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	227983551d	<sys/platform/x86.h>: Add WRMSRNS support Add WRMSRNS support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	a00db8305d	<sys/platform/x86.h>: Add ArchPerfmonExt support Add Architectural Performance Monitoring Extended Leaf (EAX = 23H) support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	2f02d0d8e1	<sys/platform/x86.h>: Add CMPCCXADD support Add CMPCCXADD support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	aa528a579b	<sys/platform/x86.h>: Add LASS support Add Linear Address Space Separation (LASS) support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	231bf916ce	<sys/platform/x86.h>: Add RAO-INT support Add RAO-INT support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	fb90dc8513	<sys/platform/x86.h>: Add LBR support Add architectural LBR support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	f47b7d96fb	<sys/platform/x86.h>: Add RTM_FORCE_ABORT support Add RTM_FORCE_ABORT support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	f6790a489d	<sys/platform/x86.h>: Add SGX-KEYS support Add SGX-KEYS support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	09cc5fee21	<sys/platform/x86.h>: Add BUS_LOCK_DETECT support Add Bus lock debug exceptions (BUS_LOCK_DETECT) support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	8c8e391166	<sys/platform/x86.h>: Add LA57 support Add 57-bit linear addresses and five-level paging (LA57) support to <sys/platform/x86.h>. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
H.J. Lu	2d8c590a5e	<bits/platform/x86.h>: Rename to x86_cpu_INDEX_7_ECX_15 Rename x86_cpu_INDEX_7_ECX_1 to x86_cpu_INDEX_7_ECX_15 for the unused bit 15 in ECX from CPUID with EAX == 0x7 and ECX == 0. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-05 14:46:10 -07:00
John David Anglin	c4468cd399	hppa: Update struct __pthread_rwlock_arch_t comment. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2023-04-05 18:54:47 +00:00
John David Anglin	e9327e8584	hppa: Revise __TIMESIZE define to use __WORDSIZE Handle both 32 and 64-bit ABIs. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2023-04-05 18:35:38 +00:00
Guy-Fleury Iteriteka	5476f8cd2e	htl: move pthread_self info libc. Signed-off-by: Guy-Fleury Iteriteka <gfleury@disroot.org> Message-Id: <20230318095826.1125734-4-gfleury@disroot.org>	2023-04-05 01:26:36 +02:00
Guy-Fleury Iteriteka	f987e9b7a3	htl: move ___pthread_self into libc. sysdeps/mach/hurd/htl/pt-pthread_self.c: New file. htl/Makefile: .. Add it to libc routine. sysdeps/mach/hurd/htl/pt-sysdep.c(__pthread_self): Remove it. sysdeps/mach/hurd/htl/pt-sysdep.h(__pthread_self): Add hidden propertie. htl/Versions(__pthread_self) Version it as private symbol. Signed-off-by: Guy-Fleury Iteriteka <gfleury@disroot.org> Message-Id: <20230318095826.1125734-3-gfleury@disroot.org>	2023-04-05 01:26:34 +02:00
Andreas Schwab	856bab7717	x86/dl-cacheinfo: remove unsused parameter from handle_amd Also replace an unreachable assert with __builtin_unreachable.	2023-04-04 16:16:21 +02:00
Adhemerval Zanella	59db5735e6	powerpc: Disable stack protector in early static initialization Similar to `fb95c31638`, also disable for string-ppc64.c (pulled on rltd as the default string implementation). Checked on powerpc64-linux-gnu.	2023-04-03 17:42:08 -03:00
Adhemerval Zanella	370da8a121	nptl: Fix tst-cancel30 on sparc64 As indicated by sparc kernel-features.h, even though sparc64 defines __NR_pause, it is not supported (ENOSYS). Always use ppoll or the 64 bit time_t variant instead.	2023-04-03 17:41:59 -03:00
Adhemerval Zanella Netto	16439f419b	math: Remove the error handling wrapper from fmod and fmodf The error handling is moved to sysdeps/ieee754 version with no SVID support. The compatibility symbol versions still use the wrapper with SVID error handling around the new code. There is no new symbol version nor compatibility code on !LIBM_SVID_COMPAT targets (e.g. riscv). The ia64 is unchanged, since it still uses the arch specific __libm_error_region on its implementation. For both i686 and m68k, which provive arch specific implementation, wrappers are added so no new symbol are added (which would require to change the implementations). It shows an small improvement, the results for fmod: Architecture \| Input \| master \| patch -----------------\|-----------------\|----------\|-------- x86_64 (Ryzen 9) \| subnormals \| 12.5049 \| 9.40992 x86_64 (Ryzen 9) \| normal \| 296.939 \| 296.738 x86_64 (Ryzen 9) \| close-exponents \| 16.0244 \| 13.119 aarch64 (N1) \| subnormal \| 6.81778 \| 4.33313 aarch64 (N1) \| normal \| 155.620 \| 152.915 aarch64 (N1) \| close-exponents \| 8.21306 \| 5.76138 armhf (N1) \| subnormal \| 15.1083 \| 14.5746 armhf (N1) \| normal \| 244.833 \| 241.738 armhf (N1) \| close-exponents \| 21.8182 \| 22.457 Checked on x86_64-linux-gnu, i686-linux-gnu, and aarch64-linux-gnu. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2023-04-03 16:45:27 -03:00
Adhemerval Zanella Netto	cf9cf33199	math: Improve fmodf This uses a new algorithm similar to already proposed earlier [1]. With x = mx * 2^ex and y = my * 2^ey (mx, my, ex, ey being integers), the simplest implementation is: mx * 2^ex == 2 * mx * 2^(ex - 1) while (ex > ey) { mx = 2; --ex; mx %= my; } With mx/my being mantissa of double floating pointer, on each step the argument reduction can be improved 8 (which is sizeof of uint32_t minus MANTISSA_WIDTH plus the signal bit): while (ex > ey) { mx << 8; ex -= 8; mx %= my; } / The implementation uses builtin clz and ctz, along with shifts to convert hx/hy back to doubles. Different than the original patch, this path assume modulo/divide operation is slow, so use multiplication with invert values. I see the following performance improvements using fmod benchtests (result only show the 'mean' result): Architecture \| Input \| master \| patch -----------------\|-----------------\|----------\|-------- x86_64 (Ryzen 9) \| subnormals \| 17.2549 \| 12.0318 x86_64 (Ryzen 9) \| normal \| 85.4096 \| 49.9641 x86_64 (Ryzen 9) \| close-exponents \| 19.1072 \| 15.8224 aarch64 (N1) \| subnormal \| 10.2182 \| 6.81778 aarch64 (N1) \| normal \| 60.0616 \| 20.3667 aarch64 (N1) \| close-exponents \| 11.5256 \| 8.39685 I also see similar improvements on arm-linux-gnueabihf when running on the N1 aarch64 chips, where it a lot of soft-fp implementation (for modulo, and multiplication): Architecture \| Input \| master \| patch -----------------\|-----------------\|----------\|-------- armhf (N1) \| subnormal \| 11.6662 \| 10.8955 armhf (N1) \| normal \| 69.2759 \| 34.1524 armhf (N1) \| close-exponents \| 13.6472 \| 18.2131 Instead of using the math_private.h definitions, I used the math_config.h instead which is used on newer math implementations. Co-authored-by: kirill <kirill.okhotnikov@gmail.com> [1] https://sourceware.org/pipermail/libc-alpha/2020-November/119794.html Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2023-04-03 16:45:18 -03:00
Adhemerval Zanella Netto	34b9f8bc17	math: Improve fmod This uses a new algorithm similar to already proposed earlier [1]. With x = mx * 2^ex and y = my * 2^ey (mx, my, ex, ey being integers), the simplest implementation is: mx * 2^ex == 2 * mx * 2^(ex - 1) while (ex > ey) { mx = 2; --ex; mx %= my; } With mx/my being mantissa of double floating pointer, on each step the argument reduction can be improved 11 (which is sizeo of uint64_t minus MANTISSA_WIDTH plus the signal bit): while (ex > ey) { mx << 11; ex -= 11; mx %= my; } / The implementation uses builtin clz and ctz, along with shifts to convert hx/hy back to doubles. Different than the original patch, this path assume modulo/divide operation is slow, so use multiplication with invert values. I see the following performance improvements using fmod benchtests (result only show the 'mean' result): Architecture \| Input \| master \| patch -----------------\|-----------------\|----------\|-------- x86_64 (Ryzen 9) \| subnormals \| 19.1584 \| 12.5049 x86_64 (Ryzen 9) \| normal \| 1016.51 \| 296.939 x86_64 (Ryzen 9) \| close-exponents \| 18.4428 \| 16.0244 aarch64 (N1) \| subnormal \| 11.153 \| 6.81778 aarch64 (N1) \| normal \| 528.649 \| 155.62 aarch64 (N1) \| close-exponents \| 11.4517 \| 8.21306 I also see similar improvements on arm-linux-gnueabihf when running on the N1 aarch64 chips, where it a lot of soft-fp implementation (for modulo, clz, ctz, and multiplication): Architecture \| Input \| master \| patch -----------------\|-----------------\|----------\|-------- armhf (N1) \| subnormal \| 15.908 \| 15.1083 armhf (N1) \| normal \| 837.525 \| 244.833 armhf (N1) \| close-exponents \| 16.2111 \| 21.8182 Instead of using the math_private.h definitions, I used the math_config.h instead which is used on newer math implementations. Co-authored-by: kirill <kirill.okhotnikov@gmail.com> [1] https://sourceware.org/pipermail/libc-alpha/2020-November/119794.html Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2023-04-03 16:36:24 -03:00
H.J. Lu	743113d42e	x86: Set FSGSBASE to active if enabled by kernel Linux kernel uses AT_HWCAP2 to indicate if FSGSBASE instructions are enabled. If the HWCAP2_FSGSBASE bit in AT_HWCAP2 is set, FSGSBASE instructions can be used in user space. Define dl_check_hwcap2 to set the FSGSBASE feature to active on Linux when the HWCAP2_FSGSBASE bit is set. Add a test to verify that FSGSBASE is active on current kernels. NB: This test will fail if the kernel doesn't set the HWCAP2_FSGSBASE bit in AT_HWCAP2 while fsgsbase shows up in /proc/cpuinfo. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2023-04-03 11:36:48 -07:00
Florian Weimer	5d1ccdda7b	x86_64: Fix asm constraints in feraiseexcept (bug 30305) The divss instruction clobbers its first argument, and the constraints need to reflect that. Fortunately, with GCC 12, generated code does not actually change, so there is no externally visible bug. Suggested-by: Jakub Jelinek <jakub@redhat.com> Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-04-03 18:40:52 +02:00
Sergey Bugaev	17841fa7d4	hurd: Add vm_param.h for x86_64 Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-30-bugaevc@gmail.com>	2023-04-03 01:24:13 +02:00
Sergey Bugaev	20427b8f23	hurd: Implement _hurd_longjmp_thread_state for x86_64 Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-29-bugaevc@gmail.com>	2023-04-03 01:23:30 +02:00
Sergey Bugaev	e0bbae0062	htl: Implement thread_set_pcsptp for x86_64 Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-23-bugaevc@gmail.com>	2023-04-03 01:18:27 +02:00
Sergey Bugaev	8d873a4904	x86_64: Add rtld-stpncpy & rtld-strncpy Just like the other existing rtld-str* files, this provides rtld with usable versions of stpncpy and strncpy. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-22-bugaevc@gmail.com>	2023-04-03 01:17:56 +02:00
Sergey Bugaev	fb9e7f6732	htl: Add tcb-offsets.sym for x86_64 The source code is the same as sysdeps/i386/htl/tcb-offsets.sym, but of course the produced tcb-offsets.h will be different. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-21-bugaevc@gmail.com>	2023-04-03 01:15:30 +02:00
Sergey Bugaev	d8b69e89d8	hurd: Move a couple of signal-related files to x86 These do not need any changes to be used on x86_64. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-20-bugaevc@gmail.com>	2023-04-03 01:14:51 +02:00
Sergey Bugaev	a1fbae7527	hurd: Use uintptr_t for register values in trampoline.c This is more correct, if only because these fields are defined as having the type unsigned int in the Mach headers, so casting them to a signed int and then back is suboptimal. Also, remove an extra reassignment of uesp -- this is another remnant of the ecx kludge. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-16-bugaevc@gmail.com>	2023-04-03 01:13:28 +02:00
Sergey Bugaev	b43cb67457	hurd: Move rtld-strncpy-c.c out of mach/hurd/ There's nothing Mach- or Hurd-specific about it; any port that ends up with rtld pulling in strncpy will need this. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-15-bugaevc@gmail.com>	2023-04-03 01:10:23 +02:00
Sergey Bugaev	0001a23f7a	hurd: More 64-bit integer casting fixes Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-13-bugaevc@gmail.com>	2023-04-03 01:03:06 +02:00
Sergey Bugaev	af2942cc62	mach, hurd: Drop __libc_lock_self0 This was used for the value of libc-lock's owner when TLS is not yet set up, so THREAD_SELF can not be used. Since the value need not be anything specific -- it just has to be non-NULL -- we can just use a plain constant, such as (void *) 1, for this. This avoids accessing the symbol through GOT, and exporting it from libc.so in the first place. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-12-bugaevc@gmail.com>	2023-04-03 01:02:44 +02:00
Sergey Bugaev	71232da3b3	hurd: Remove __hurd_threadvar_stack_{offset,mask} Noone is or should be using __hurd_threadvar_stack_{offset,mask}, we have proper TLS now. These two remaining variables are never set to anything other than zero, so any code that would try to use them as described would just dereference a zero pointer and crash. So remove them entirely. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Message-Id: <20230319151017.531737-6-bugaevc@gmail.com>	2023-04-03 00:53:25 +02:00
Adhemerval Zanella	113549d9fd	Update arm libm-tests-ulps For the next test from `cf7ffdd8a5`.	2023-03-30 10:50:50 -03:00
Andreas Schwab	0d83b349fa	getlogin_r: fix missing fallback if loginuid is unset (bug 30235) When /proc/self/loginuid is not set, we should still fall back to using the traditional utmp lookup, instead of failing right away.	2023-03-30 13:58:27 +02:00
Adhemerval Zanella Netto	33237fe83d	Remove --enable-tunables configure option And make always supported. The configure option was added on glibc 2.25 and some features require it (such as hwcap mask, huge pages support, and lock elisition tuning). It also simplifies the build permutations. Changes from v1: * Remove glibc.rtld.dynamic_sort changes, it is orthogonal and needs more discussion. * Cleanup more code. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-03-29 14:33:06 -03:00
Joe Simmons-Talbott	868506eb42	system: Add "--" after "-c" for sh (BZ #28519 ) Prevent sh from interpreting a user string as shell options if it starts with '-' or '+'. Since the version of /bin/sh used for testing system() is different from the full-fledged system /bin/sh add support to it for handling "--" after "-c". Add a testcase to ensure the expected behavior. Signed-off-by: Joe Simmons-Talbott <josimmon@redhat.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-28 10:12:30 -03:00
caiyinyu	07dd75589e	LoongArch: ldconfig: Add comments for using EF_LARCH_OBJABI_V1 We added Adhemerval Zanella's comment to explain the reason for using EF_LARCH_OBJABI_V1.	2023-03-28 09:55:48 +08:00
Adhemerval Zanella Netto	f65ca70dea	libio: Do not autogenerate stdio_lim.h Instead define the required fields in system dependend files. The only system dependent definition is FILENAME_MAX, which should match POSIX PATH_MAX, and it is obtained from either kernel UAPI or mach headers. Currently set pre-defined value from current kernels. It avoids a circular dependendy when including stdio.h in gen-as-const-headers files. Checked on x86_64-linux-gnu and i686-linux-gnu Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-03-27 13:57:55 -03:00
Adhemerval Zanella Netto	88677348b4	Move libc_freeres_ptrs and libc_subfreeres to hidden/weak functions They are both used by __libc_freeres to free all library malloc allocated resources to help tooling like mtrace or valgrind with memory leak tracking. The current scheme uses assembly markers and linker script entries to consolidate the free routine function pointers in the RELRO segment and to be freed buffers in BSS. This patch changes it to use specific free functions for libc_freeres_ptrs buffers and call the function pointer array directly with call_function_static_weak. It allows the removal of both the internal macros and the linker script sections. Checked on x86_64-linux-gnu, i686-linux-gnu, and aarch64-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-03-27 13:57:55 -03:00
Joe Ramsay	e4d336f1ac	benchtests: Move libmvec benchtest inputs to benchtests directory This allows other targets to use the same inputs for their own libmvec microbenchmarks without having to duplicate them in their own subdirectory. Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>	2023-03-27 17:04:03 +01:00
Xi Ruoyao	952b7630c7	LoongArch: ldconfig: Ignore EF_LARCH_OBJABI_V1 in shared objects Binutils 2.40 sets EF_LARCH_OBJABI_V1 for shared objects: $ ld --version \| head -n1 GNU ld (GNU Binutils) 2.40 $ echo 'int dummy;' > dummy.c $ cc dummy.c -shared $ readelf -h a.out \| grep Flags Flags: 0x43, DOUBLE-FLOAT, OBJ-v1 We need to ignore it in ldconfig or ldconfig will consider all shared objects linked by Binutils 2.40 "unsupported". Maybe we should stop setting EF_LARCH_OBJABI_V1 for shared objects, but Binutils 2.40 is already released and we cannot change it.	2023-03-27 16:57:56 +08:00
John David Anglin	ab991a3d1b	hppa: Drop 16-byte pthread lock alignment Linux threads were removed about 12 years ago and the current nptl implementation only requires 4-byte alignment for pthread locks. The 16-byte alignment causes various issues. For example in building ignition-msgs, we have: /usr/include/google/protobuf/map.h:124:37: error: static assertion failed 124 \| static_assert(alignof(value_type) <= 8, ""); \| ~~~~~~~~~~~~~~~~~~~~^~~~ This is caused by the 16-byte pthread lock alignment. Signed-off-by: John David Anglin <dave.anglin@bell.net>	2023-03-26 21:16:22 +00:00
DJ Delorie	db9b47e9f9	x86: Don't check PREFETCHWT1 in tst-cpu-features-cpuinfo.c Don't check PREFETCHWT1 against /proc/cpuinfo since kernel doesn't report PREFETCHWT1 in /proc/cpuinfo. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-03-21 17:49:49 -04:00
Pavel Kozlov	447273e0bf	ARC: run child from the separate start block in __clone For better debug experience use separate code block with extra cfi_* directives to run child (same as in __clone3). Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-13 18:12:32 +04:00
Pavel Kozlov	3681cdb8f8	ARC: Add the clone3 wrapper Use the clone3 wrapper on ARC. It doesn't care about stack alignment. All callers should provide an aligned stack. It follows the internal signature: extern int clone3 (struct clone_args __cl_args, size_t __size, int (__func) (void __arg), void __arg); Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-13 18:12:32 +04:00
caiyinyu	90233f113c	LoongArch: Add get_rounding_mode.	2023-03-13 09:20:42 +08:00
caiyinyu	f0d33cbdce	LoongArch: Add support for ldconfig.	2023-03-13 09:20:28 +08:00
Kacper Piwiński	e78c4c4983	linux: fix ntp_gettime abi break (BZ# 30156) Between versions v2.11 and v2.12 struct ntptimeval got new fields. That wasn't a problem because new function ntp_gettimex was created (and made default) to support new struct. Old ntp_gettime was not using new fields so it was safe to call with old struct definition. Then commits `5613afe9e3` and `b6ad64b907` (added for 64 bit time_t support), ntp_gettime start setting new fields. Sets fields manually to maintain compatibility with v2.11 struct definition. Resolves #30156 Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-10 15:07:55 -03:00
abushwang	0b7bf0e0a4	rt: fix shm_open not set ENAMETOOLONG when name exceeds {_POSIX_PATH_MAX} according to man-pages-posix-2017, shm_open() function may fail if the length of the name argument exceeds {_POSIX_PATH_MAX} and set ENAMETOOLONG Signed-off-by: abushwang <abushwangs@gmail.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-08 10:11:54 -03:00
caiyinyu	4c721f24fc	x86: Fix bug about glibc.cpu.hwcaps. Recorded in [BZ #30183]: 1. export GLIBC_TUNABLES=glibc.cpu.hwcaps=-AVX512 2. Add _dl_printf("p -- %s\n", p); just before switch(nl) in sysdeps/x86/cpu-tunables.c 3. compiled and run ./testrun.sh /usr/bin/ls you will get: p -- -AVX512 p -- LC_ADDRESS=en_US.UTF-8 p -- LC_NUMERIC=C ... The function, TUNABLE_CALLBACK (set_hwcaps) (tunable_val_t *valp), checks far more than it should and it should stop at end of "-AVX512".	2023-03-07 21:42:25 +08:00
Adam Yi	436a604b7d	posix: Fix system blocks SIGCHLD erroneously [BZ #30163 ] Fix bug that SIGCHLD is erroneously blocked forever in the following scenario: 1. Thread A calls system but hasn't returned yet 2. Thread B calls another system but returns SIGCHLD would be blocked forever in thread B after its system() returns, even after the system() in thread A returns. Although POSIX does not require, glibc system implementation aims to be thread and cancellation safe. This bug was introduced in `5fb7fc9635` when we moved reverting signal mask to happen when the last concurrently running system returns, despite that signal mask is per thread. This commit reverts this logic and adds a test. Signed-off-by: Adam Yi <ayi@janestreet.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-03-07 09:54:50 -03:00
Joseph Myers	8390014c23	Update kernel version to 6.2 in header constant tests This patch updates the kernel version in the tests tst-mman-consts.py, tst-mount-consts.py and tst-pidfd-consts.py to 6.2. (There are no new constants covered by these tests in 6.2 that need any other header changes, and the removed MAP_VARIABLE for hppa was addressed separately.) Tested with build-many-glibcs.py.	2023-03-06 15:13:22 +00:00
Adhemerval Zanella Netto	3a4781c16c	arm: Remove __builtin_arm_uqsub8 usage on string-fza.h The __builtin_arm_uqsub8 is an internal GCC builtin which might change in future release (the correct way is to include "arm_acle.h" and use __uqsub8 ()). Since not all compilers support it, just use the inline assembler instead. Checked on armv7a-linux-gnueabihf. Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>	2023-03-02 16:41:43 -03:00
Adhemerval Zanella Netto	90ae3bc393	alpha: Remove strncmp optimization The generic implementation already cover word access along with cmpbge for both aligned and unaligned, so use it instead. Checked qemu static for alpha-linux-gnu.	2023-03-02 16:41:43 -03:00
Adhemerval Zanella Netto	92fdb11ae7	powerpc: Remove powerpc64 strncmp variants The default, and power7 implementation just adds word aligned access when inputs have the same aligment. The unaligned case is still done by byte operations. This is already covered by the generic implementation, which also add the unaligned input optimization. Checked on powerpc64-linux-gnu built without multi-arch for powerpc64, power7, power8, and power9 (build for le). Reviewed-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>	2023-03-02 16:41:43 -03:00
Adhemerval Zanella Netto	a46bb1523d	powerpc: Remove strncmp variants The default, power4, and power7 implementation just adds word aligned access when inputs have the same aligment. The unaligned case is still done by byte operations. This is already covered by the generic implementation, which also add the unaligned input optimization. Checked on powerpc-linux-gnu built without multi-arch for powerpc, power4, and power7. Reviewed-by: Rajalakshmi Srinivasaraghavan <rajis@linux.ibm.com>	2023-03-02 16:41:43 -03:00

1 2 3 4 5 ...

15677 Commits