glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-22 13:00:06 +00:00

Author	SHA1	Message	Date
Joe Ramsay	4a9392ffc2	aarch64: Add vector implementations of exp routines Optimised implementations for single and double precision, Advanced SIMD and SVE, copied from Arm Optimized Routines. As previously, data tables are used via a barrier to prevent overly aggressive constant inlining. Special-case handlers are marked NOINLINE to avoid incurring the penalty of switching call standards unnecessarily. Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>	2023-06-30 09:04:26 +01:00
Joe Ramsay	78c01a5cbe	aarch64: Add vector implementations of log routines Optimised implementations for single and double precision, Advanced SIMD and SVE, copied from Arm Optimized Routines. Log lookup table added as HIDDEN symbol to allow it to be shared between AdvSIMD and SVE variants. As previously, data tables are used via a barrier to prevent overly aggressive constant inlining. Special-case handlers are marked NOINLINE to avoid incurring the penalty of switching call standards unnecessarily. Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>	2023-06-30 09:04:22 +01:00
Joe Ramsay	3bb1af2051	aarch64: Add vector implementations of sin routines Optimised implementations for single and double precision, Advanced SIMD and SVE, copied from Arm Optimized Routines. As previously, data tables are used via a barrier to prevent overly aggressive constant inlining. Special-case handlers are marked NOINLINE to avoid incurring the penalty of switching call standards unnecessarily. Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>	2023-06-30 09:04:16 +01:00
Joe Ramsay	aed39a3aa3	aarch64: Add vector implementations of cos routines Replace the loop-over-scalar placeholder routines with optimised implementations from Arm Optimized Routines (AOR). Also add some headers containing utilities for aarch64 libmvec routines, and update libm-test-ulps. Data tables for new routines are used via a pointer with a barrier on it, in order to prevent overly aggressive constant inlining in GCC. This allows a single adrp, combined with offset loads, to be used for every constant in the table. Special-case handlers are marked NOINLINE in order to confine the save/restore overhead of switching from vector to normal calling standard. This way we only incur the extra memory access in the exceptional cases. NOINLINE definitions have been moved to math_private.h in order to reduce duplication. AOR exposes a config option, WANT_SIMD_EXCEPT, to enable selective masking (and later fixing up) of invalid lanes, in order to trigger fp exceptions correctly (AdvSIMD only). This is tested and maintained in AOR, however it is configured off at source level here for performance reasons. We keep the WANT_SIMD_EXCEPT blocks in routine sources to greatly simplify the upstreaming process from AOR to glibc. Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>	2023-06-30 09:04:10 +01:00
Joseph Myers	1a21693e16	Update syscall lists for Linux 6.4 Linux 6.4 adds the riscv_hwprobe syscall on riscv and enables memfd_secret on s390. Update syscall-names.list and regenerate the arch-syscall.h headers with build-many-glibcs.py update-syscalls. Tested with build-many-glibcs.py.	2023-06-28 21:22:14 +00:00
Adhemerval Zanella	d35fbd3e68	linux: Return unsupported if procfs can not be mount on tst-ttyname-namespace Trying to mount procfs can fail due multiples reasons: proc is locked due the container configuration, mount syscall is filtered by a Linux Secuirty Module, or any other security or hardening mechanism that Linux might eventually add. The tests does require a new procfs without binding to parent, and to fully fix it would require to change how the container was created (which is out of the scope of the test itself). Instead of trying to foresee any possible scenario, if procfs can not be mount fail with unsupported. Checked on aarch64-linux-gnu. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-28 09:19:11 -03:00
Adhemerval Zanella	a9fed5ea81	linux: Split tst-ttyname The tst-ttyname-direct.c checks the ttyname with procfs mounted in bind mode (MS_BIND\|MS_REC), while tst-ttyname-namespace.c checks with procfs mount with MS_NOSUID\|MS_NOEXEC\|MS_NODEV in a new namespace. Checked on x86_64-linux-gnu and aarch64-linux-gnu. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-28 09:18:23 -03:00
Adhemerval Zanella	b29e70657d	x86: Adjust Linux x32 dl-cache inclusion path It fixes the x32 build failure introduced by `45e2483a6c`. Checked on a x86_64-linux-gnu-x32 build.	2023-06-26 16:51:30 -03:00
Joe Simmons-Talbott	9a17a193b4	check_native: Get rid of alloca Use malloc rather than alloca to avoid potential stack overflow. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-26 10:17:47 -03:00
Joe Simmons-Talbott	48170127d9	ifaddrs: Get rid of alloca Use scratch_buffer and malloc rather than alloca to avoid potential stack overflows. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-26 10:17:39 -03:00
Sergey Bugaev	45e2483a6c	x86: Make dl-cache.h and readelflib.c not Linux-specific These files could be useful to any port that wants to use ld.so.cache. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-26 10:04:31 -03:00
Frederic Berat	99f9ae4ed0	benchtests: fix warn unused result Few tests needed to properly check for asprintf and system calls return values with _FORTIFY_SOURCE enabled. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-22 00:21:19 -04:00
Frederic Berat	d636339306	sysdeps/powerpc/fpu/tst-setcontext-fpscr.c: Fix warn unused result The fread routine return value needs to be checked when fortification is enabled, hence use xfread helper. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-22 00:21:17 -04:00
Frederic Berat	1bc85effd5	sysdeps/{i386, x86_64}/mempcpy_chk.S: fix linknamespace for __mempcpy_chk On i386 and x86_64, for libc.a specifically, __mempcpy_chk calls mempcpy which leads POSIX routines to call non-POSIX mempcpy indirectly. This leads the linknamespace test to fail when glibc is built with __FORTIFY_SOURCE=3. Since calling mempcpy doesn't bring any benefit for libc.a, directly call __mempcpy instead. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-22 00:20:52 -04:00
Joe Simmons-Talbott	9e6863a537	hurd: readv: Get rid of alloca Replace alloca with a scratch_buffer to avoid potential stack overflows. Checked on i686-gnu and x86_64-linux-gnu Message-Id: <20230619144334.2902429-1-josimmon@redhat.com>	2023-06-20 19:15:10 +02:00
Joe Simmons-Talbott	c6957bddb9	hurd: writev: Add back cleanup handler There is a potential memory leak for large writes due to writev being a "shall occur" cancellation point. Add back the cleanup handler removed in `cf30aa43a5`. Checked on i686-gnu and x86_64-linux-gnu. Message-Id: <20230619143842.2901522-1-josimmon@redhat.com>	2023-06-20 18:37:04 +02:00
Paul Pluzhnikov	4290aed051	Fix misspellings -- BZ 25337	2023-06-19 21:58:33 +00:00
Frédéric Bérat	20b6b8e8a5	tests: replace read by xread With fortification enabled, read calls return result needs to be checked, has it gets the __wur macro enabled. Note on read call removal from sysdeps/pthread/tst-cancel20.c and sysdeps/pthread/tst-cancel21.c: It is assumed that this second read call was there to overcome the race condition between pipe closure and thread cancellation that could happen in the original code. Since this race condition got fixed by `d0e3ffb7a5` the second call seems superfluous. Hence, instead of checking for the return value of read, it looks reasonable to simply remove it. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-19 09:14:56 -04:00
Joe Simmons-Talbott	cf30aa43a5	hurd: writev: Get rid of alloca Use a scratch_buffer rather than alloca to avoid potential stack overflows. Checked on i686-gnu and x86_64-linux-gnu Message-Id: <20230608155844.976554-1-josimmon@redhat.com>	2023-06-19 02:45:19 +02:00
Joe Simmons-Talbott	01dd2875f8	grantpt: Get rid of alloca Replace alloca with a scratch_buffer to avoid potential stack overflows. Message-Id: <20230613191631.1080455-1-josimmon@redhat.com>	2023-06-18 01:08:04 +02:00
Florian Weimer	388ae538dd	hurd: Add strlcpy, strlcat, wcslcpy, wcslcat to libc.abilist	2023-06-15 10:05:25 +02:00
Florian Weimer	b54e5d1c92	Add the wcslcpy, wcslcat functions These functions are about to be added to POSIX, under Austin Group issue 986. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-14 18:10:24 +02:00
Florian Weimer	454a20c875	Implement strlcpy and strlcat [BZ #178 ] These functions are about to be added to POSIX, under Austin Group issue 986. The fortified strlcat implementation does not raise SIGABRT if the destination buffer does not contain a null terminator, it just inherits the non-failing regular strlcat behavior. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-14 18:10:08 +02:00
Frederic Berat	7ba426a111	tests: replace fgets by xfgets With fortification enabled, fgets calls return result needs to be checked, has it gets the __wur macro enabled. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-13 19:59:08 -04:00
Dridi Boukelmoune	658f601f2a	posix: Handle success in gai_strerror() Signed-off-by: Dridi Boukelmoune <dridi.boukelmoune@gmail.com> Reviewed-by: Arjun Shankar <arjun@redhat.com>	2023-06-13 20:54:49 +02:00
caiyinyu	eaa5b1cce8	LoongArch: Add support for dl_runtime_profile This commit can fix the FAIL item: elf/tst-sprof-basic.	2023-06-13 10:27:45 +08:00
Noah Goldstein	180897c161	x86: Make the divisor in setting `non_temporal_threshold` cpu specific Different systems prefer a different divisors. From benchmarks[1] so far the following divisors have been found: ICX : 2 SKX : 2 BWD : 8 For Intel, we are generalizing that BWD and older prefers 8 as a divisor, and SKL and newer prefers 2. This number can be further tuned as benchmarks are run. [1]: https://github.com/goldsteinn/memcpy-nt-benchmarks Reviewed-by: DJ Delorie <dj@redhat.com>	2023-06-12 11:33:39 -05:00
Noah Goldstein	f193ea20ed	x86: Refactor Intel `init_cpu_features` This patch should have no affect on existing functionality. The current code, which has a single switch for model detection and setting prefered features, is difficult to follow/extend. The cases use magic numbers and many microarchitectures are missing. This makes it difficult to reason about what is implemented so far and/or how/where to add support for new features. This patch splits the model detection and preference setting stages so that CPU preferences can be set based on a complete list of available microarchitectures, rather than based on model magic numbers. Reviewed-by: DJ Delorie <dj@redhat.com>	2023-06-12 11:33:39 -05:00
Noah Goldstein	af992e7abd	x86: Increase `non_temporal_threshold` to roughly `sizeof_L3 / 4` Current `non_temporal_threshold` set to roughly '3/4 * sizeof_L3 / ncores_per_socket'. This patch updates that value to roughly 'sizeof_L3 / 4` The original value (specifically dividing the `ncores_per_socket`) was done to limit the amount of other threads' data a `memcpy`/`memset` could evict. Dividing by 'ncores_per_socket', however leads to exceedingly low non-temporal thresholds and leads to using non-temporal stores in cases where REP MOVSB is multiple times faster. Furthermore, non-temporal stores are written directly to main memory so using it at a size much smaller than L3 can place soon to be accessed data much further away than it otherwise could be. As well, modern machines are able to detect streaming patterns (especially if REP MOVSB is used) and provide LRU hints to the memory subsystem. This in affect caps the total amount of eviction at 1/cache_associativity, far below meaningfully thrashing the entire cache. As best I can tell, the benchmarks that lead this small threshold where done comparing non-temporal stores versus standard cacheable stores. A better comparison (linked below) is to be REP MOVSB which, on the measure systems, is nearly 2x faster than non-temporal stores at the low-end of the previous threshold, and within 10% for over 100MB copies (well past even the current threshold). In cases with a low number of threads competing for bandwidth, REP MOVSB is ~2x faster up to `sizeof_L3`. The divisor of `4` is a somewhat arbitrary value. From benchmarks it seems Skylake and Icelake both prefer a divisor of `2`, but older CPUs such as Broadwell prefer something closer to `8`. This patch is meant to be followed up by another one to make the divisor cpu-specific, but in the meantime (and for easier backporting), this patch settles on `4` as a middle-ground. Benchmarks comparing non-temporal stores, REP MOVSB, and cacheable stores where done using: https://github.com/goldsteinn/memcpy-nt-benchmarks Sheets results (also available in pdf on the github): https://docs.google.com/spreadsheets/d/e/2PACX-1vS183r0rW_jRX6tG_E90m9qVuFiMbRIJvi5VAE8yYOvEOIEEc3aSNuEsrFbuXw5c3nGboxMmrupZD7K/pubhtml Reviewed-by: DJ Delorie <dj@redhat.com> Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-06-12 11:33:39 -05:00
Florian Weimer	7d42120928	pthreads: Use _exit to terminate the tst-stdio1 test Previously, the exit function was used, but this causes the test to block (until the timeout) once exit is changed to lock stdio streams during flush.	2023-06-06 11:39:06 +02:00
Adhemerval Zanella	d4963a844d	linux: Fail as unsupported if personality call is filtered Container management default seccomp filter [1] only accepts personality(2) with PER_LINUX, (0x0), UNAME26 (0x20000), PER_LINUX32 (0x8), UNAME26 \| PER_LINUX32, and 0xffffffff (to query current personality) Although the documentation only state it is blocked to prevent 'enabling BSD emulation' (PER_BSD, not implemented by Linux), checking on repository log the real reason is to block ASLR disable flag (ADDR_NO_RANDOMIZE) and other poorly support emulations. So handle EPERM and fail as UNSUPPORTED if we can really check for BZ#19408. Checked on aarch64-linux-gnu. [1] https://github.com/moby/moby/blob/master/profiles/seccomp/default.json Reviewed-by: Florian Weimer <fweimer@redhat.com>	2023-06-05 12:51:48 -03:00
Joseph Myers	be9b883ddd	Remove MAP_VARIABLE from hppa bits/mman.h As suggested in <https://sourceware.org/pipermail/libc-alpha/2023-February/145890.html>, remove the MAP_VARIABLE define from the hppa bits/mman.h, for consistency with Linux 6.2 which removed the define there. Tested with build-many-glibcs.py for hppa-linux-gnu.	2023-06-05 14:35:25 +00:00
Sergey Bugaev	67f704ab69	hurd: Fix x86_64 sigreturn restoring bogus reply_port Since the area of the user's stack we use for the registers dump (and otherwise as __sigreturn2's stack) can and does overlap the sigcontext, we have to be very careful about the order of loads and stores that we do. In particular we have to load sc_reply_port before we start clobbering the sigcontext. Signed-off-by: Sergey Bugaev <bugaevc@gmail.com>	2023-06-04 19:05:51 +02:00
Paul Pluzhnikov	2cbeda847b	Fix a few more typos I missed in previous round -- BZ 25337	2023-06-02 23:46:32 +00:00
Alejandro Colomar	5013f6fc6c	Use __nonnull for the epoll_wait(2) family of syscalls Signed-off-by: Alejandro Colomar <alx@kernel.org> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-01 14:50:42 -03:00
Alejandro Colomar	cc5372806a	Fix invalid use of NULL in epoll_pwait2(2) test epoll_pwait2(2)'s second argument should be nonnull. We're going to add __nonnull to the prototype, so let's fix the test accordingly. We can use a dummy variable to avoid passing NULL. Reported-by: Adhemerval Zanella Netto <adhemerval.zanella@linaro.org> Signed-off-by: Alejandro Colomar <alx@kernel.org>	2023-06-01 14:50:35 -03:00
Joe Simmons-Talbott	884012db20	getipv4sourcefilter: Get rid of alloca Use a scratch_buffer rather than alloca to avoid potential stack overflows. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-01 14:47:12 -03:00
Joe Simmons-Talbott	d1eaab5a79	getsourcefilter: Get rid of alloca. Use a scratch_buffer rather than alloca to avoid potential stack overflows. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-06-01 14:46:09 -03:00
Frédéric Bérat	29e25f6f13	tests: fix warn unused results With fortification enabled, few function calls return result need to be checked, has they get the __wur macro enabled. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-01 13:01:32 -04:00
Frédéric Bérat	026a84a54d	tests: replace write by xwrite Using write without cheks leads to warn unused result when __wur is enabled. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2023-06-01 12:40:05 -04:00
H.J. Lu	a8c8889978	x86-64: Use YMM registers in memcmpeq-evex.S Since the assembly source file with -evex suffix should use YMM registers, not ZMM registers, include x86-evex256-vecs.h by default to use YMM registers in memcmpeq-evex.S Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2023-06-01 09:21:14 -07:00
Adhemerval Zanella	5f828ff824	io: Fix F_GETLK, F_SETLK, and F_SETLKW for powerpc64 Different than other 64 bit architectures, powerpc64 defines the LFS POSIX lock constants with values similar to 32 ABI, which are meant to be used with fcntl64 syscall. Since powerpc64 kABI does not have fcntl, the constants are adjusted with the FCNTL_ADJUST_CMD macro. The `4d0fe291ae` changed the logic of generic constants LFS value are equal to the default values; which is now wrong for powerpc64. Fix the value by explicit define the previous glibc constants (powerpc64 does not need to use the 32 kABI value, but it simplifies the FCNTL_ADJUST_CMD which should be kept as compatibility). Checked on powerpc64-linux-gnu and powerpc-linux-gnu.	2023-05-31 15:31:02 -03:00
Paul Pluzhnikov	65cc53fe7c	Fix misspellings in sysdeps/ -- BZ 25337	2023-05-30 23:02:29 +00:00
Adhemerval Zanella	4d0fe291ae	io: Fix record locking contants on 32 bit arch with 64 bit default time_t (BZ#30477) For architecture with default 64 bit time_t support, the kernel does not provide LFS and non-LFS values for F_GETLK, F_GETLK, and F_GETLK (the default value used for 64 bit architecture are used). This is might be considered an ABI break, but the currenct exported values is bogus anyway. The POSIX lockf is not affected since it is aliased to lockf64, which already uses the LFS values. Checked on i686-linux-gnu and the new tests on a riscv32. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2023-05-30 08:53:07 -03:00
caiyinyu	3eed5f3a1e	LoongArch: Fix inconsistency in SHMLBA macro values between glibc and kernel The LoongArch glibc was using the value of the SHMLBA macro from common code, which is __getpagesize() (16k), but this was inconsistent with the value of the SHMLBA macro in the kernel, which is SZ_64K (64k). This caused several shmat-related tests in LTP (Linux Test Project) to fail. This commit fixes the issue by ensuring that the glibc's SHMLBA macro value matches the value used in the kernel like other architectures.	2023-05-30 14:13:06 +08:00
Adhemerval Zanella	a1950a0758	riscv: Add the clone3 wrapper It follows the internal signature: extern int clone3 (struct clone_args __cl_args, size_t __size, int (__func) (void __arg), void __arg); Checked on riscv64-linux-gnu-rv64imafdc-lp64d. Reviewed-by: Palmer Dabbelt <palmer@rivosinc.com>	2023-05-29 17:39:57 -03:00
Dridi Boukelmoune	33d7c0e1cb	posix: Add error message for EAI_OVERFLOW Signed-off-by: Dridi Boukelmoune <dridi.boukelmoune@gmail.com> Reviewed-by: Arjun Shankar <arjun@redhat.com>	2023-05-29 15:30:14 +02:00
Joe Simmons-Talbott	d9055634a3	setsourcefilter: Replace alloca with a scratch_buffer. Use a scratch_buffer rather than either alloca or malloc to reduce the possibility of a stack overflow. Suggested-by: Adhemerval Zanella Netto <adhemerval.zanella@linaro.org> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2023-05-29 09:16:00 -04:00
Noah Goldstein	ed2f9dc942	x86: Use 64MB as nt-store threshold if no cacheinfo [BZ #30429 ] If `non_temporal_threshold` is below `minimum_non_temporal_threshold`, it almost certainly means we failed to read the systems cache info. In this case, rather than defaulting the minimum correct value, we should default to a value that gets at least reasonable performance. 64MB is chosen conservatively to be at the very high end. This should never cause non-temporal stores when, if we had read cache info, we wouldn't have otherwise. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2023-05-27 21:32:57 -05:00
Samuel Thibault	9ffdcf5b79	hurd: Fix setting up signal thread stack alignment x86_64 needs special alignment when calling functions, so we have to use MACHINE_THREAD_STATE_SETUP_CALL for the signal thread when forking.	2023-05-28 00:30:26 +02:00

1 2 3 4 5 ...

15748 Commits