The byte 0xfe as input to the EUC-KR conversion denotes a user-defined
area and is not allowed. The from_euc_kr function used to skip two bytes
when told to skip over the unknown designation, potentially running over
the buffer end.
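As an illustration, the corrected skip logic looks roughly like this (a
compilable sketch with made-up names, not the actual iconvdata/euc-kr.c
code):

  #include <stdio.h>

  /* Sketch, not glibc's code: how many bytes to consume when skipping
     an invalid sequence.  A lone 0xfe is a one-byte error; skipping
     two bytes here could step past the end of the input buffer.  */
  static int
  skip_len (unsigned char ch)
  {
    if (ch == 0xfe)   /* user-defined area: rejected as illegal input */
      return 1;       /* the buggy code skipped 2 */
    return 2;         /* lead byte of a known two-byte sequence */
  }

  int
  main (void)
  {
    unsigned char buf[] = { 0xfe };   /* truncated input */
    unsigned char *p = buf, *end = buf + sizeof buf;
    p += skip_len (*p);
    printf ("%s\n", p <= end ? "in bounds" : "overran the buffer");
    return 0;
  }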
(cherry picked from commit ee7a3144c9)
Previously, the UCS4 conversion routines limited the number of
characters they examined to the minimum of the number of characters in
the input and the number of characters in the output. This is not the
correct behavior when __GCONV_IGNORE_ERRORS is set, because we do not
consume an output character when we skip a code unit. Instead, track
the input and output pointers and terminate the loop when either
reaches its limit.
This resolves assertion failures when resetting the input buffer in a step of
iconv, which assumes that the input will be fully consumed given sufficient
output space.
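The corrected loop shape is roughly the following (a compilable sketch
with assumed names, not glibc's internal macros):

  #include <stdbool.h>
  #include <stddef.h>
  #include <stdint.h>

  /* Sketch, not glibc's code: copy UCS-4 code points, optionally
     skipping invalid ones.  The loop tests both limits on every
     iteration instead of iterating min(in, out) times, because a
     skipped code unit advances inptr without advancing outptr.  */
  static size_t
  ucs4_copy (const uint32_t *inptr, const uint32_t *inend,
             uint32_t *outptr, const uint32_t *outend, bool ignore_errors)
  {
    uint32_t *outstart = outptr;
    while (inptr < inend && outptr < outend)
      {
        uint32_t cp = *inptr++;
        if (cp > 0x10FFFF)          /* not a valid code point */
          {
            if (ignore_errors)
              continue;             /* input consumed, no output produced */
            break;                  /* report the error to the caller */
          }
        *outptr++ = cp;
      }
    return (size_t) (outptr - outstart);
  }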
(cherry picked from commit 228edd356f)
This new variable allows various subsystems in glibc to run all or
some of their tests with MALLOC_CHECK_=3. This patch adds
infrastructure support for this variable as well as an implementation
in malloc/Makefile to allow running some of the tests with
MALLOC_CHECK_=3.
At present some tests in malloc/ have been excluded from the mcheck
tests, either because they specifically test MALLOC_CHECK_ or because
they fail in master even without the memory tagging patches that
prompted this work. Some tests were reviewed and found to need
specific error points that MALLOC_CHECK_ defeats by terminating early,
but a thorough review of all tests is needed to bring them into mcheck
coverage.
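For context, this is the kind of defect such runs are meant to catch
(an illustrative program, not one of the malloc/ tests); with
MALLOC_CHECK_=3 set in the environment, glibc should diagnose the
corruption and abort:

  #include <stdlib.h>
  #include <string.h>

  /* Run with MALLOC_CHECK_=3: the one-byte heap overflow below should
     be detected when the block is freed, aborting the process.  */
  int
  main (void)
  {
    char *p = malloc (8);
    if (p == NULL)
      return 1;
    memset (p, 'x', 9);   /* off-by-one heap overflow */
    free (p);             /* MALLOC_CHECK_=3 aborts here */
    return 0;
  }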
Backported from 4f969166ce.
The IBM1364, IBM1371, IBM1388, IBM1390 and IBM1399 character sets
share converter logic (iconvdata/ibm1364.c) which would reject
redundant shift sequences when processing input in these character
sets. This led to a hang in the iconv program (CVE-2020-27618).
This commit adjusts the converter to ignore redundant shift sequences
and adds test cases for iconv_prog hangs that would be triggered upon
their rejection. This brings the implementation in line with other
converters that also ignore redundant shift sequences (e.g. IBM930
etc., fixed in commit 692de4b396).
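A sketch of the adjusted behaviour (illustrative code, not the actual
converter): a shift byte that selects the mode already in effect is
consumed and ignored rather than rejected:

  #define SI 0x0f   /* shift in: single-byte mode */
  #define SO 0x0e   /* shift out: double-byte mode */

  /* Sketch, not the converter's code: consume one shift byte.  Before
     the fix, a redundant shift (e.g. SO while already in double-byte
     mode) was rejected as illegal input without being consumed, so the
     conversion loop stopped making progress and iconv hung.  */
  static void
  handle_shift (const unsigned char **inptr, int *double_byte_mode)
  {
    unsigned char ch = **inptr;
    if (ch == SO)
      *double_byte_mode = 1;   /* no-op if already set: ignored, not rejected */
    else if (ch == SI)
      *double_byte_mode = 0;
    ++*inptr;                  /* always make progress */
  }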
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 9a99c68214)
The commit 605f38177d (sh: Split BE/LE abilist) did not take the SH4
FPU support into consideration.
Checked with a build for sh4-linux-gnu and manually checked that
the implementations at sysdeps/sh/sh4/fpu/ are selected.
John Paul Adrian Glaubitz also confirmed it fixes the build issues
he encountered.
(cherry-picked from 9ff2674ef8)
The variant PCS support was ineffective because in the common case
linkmap->l_mach.plt == 0, and then the symbol table flags were ignored
and normal lazy binding was used instead of resolving the relocs early.
(This was a misunderstanding about how GOT[1] is set up by the linker.)
In practice this mainly affects SVE calls when the vector length is
more than 128 bits: the top bits of the argument registers get
clobbered during lazy binding.
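For illustration, a call of this shape is affected (a sketch; building
it requires SVE support, e.g. -march=armv8.2-a+sve, and the compiler
marks such functions with STO_AARCH64_VARIANT_PCS):

  #include <arm_sve.h>

  /* Sketch: an SVE argument is passed in a z register, but the lazy
     resolution path preserves only the low 128 bits of it.  With a
     vector length above 128 bits, the first (lazily bound) call would
     silently corrupt x unless the relocation is resolved early.  */
  svfloat64_t callee (svfloat64_t x);

  svfloat64_t
  caller (svfloat64_t x)
  {
    return callee (x);
  }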
Fixes bug 26798.
(cherry picked from commit 558251bd87)
This change modifies the shareable cache '__x86_shared_cache_size',
which is a factor in computing the non-temporal threshold parameter
'__x86_shared_non_temporal_threshold', to optimize memcpy for AMD Zen
architectures.
In the existing implementation, the shareable cache is computed as 'L3
per thread, L2 per core'. Recomputing it as 'L3 per CCX (core
complex)' has brought performance gains.
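In outline, the new computation is (a sketch with assumed names, not
the actual cacheinfo.c variables):

  /* Sketch (assumed names): the shareable-cache value that feeds the
     non-temporal threshold.  Previously derived from L3 per thread
     plus L2 per core; on Zen it is now the L3 slice one CCX shares.  */
  static long int
  shared_cache_size_zen (long int l3_total, int ccx_count)
  {
    return l3_total / ccx_count;   /* L3 per CCX (core complex) */
  }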
As per the large bench variant results, this patch also addresses the
regression problem on AMD Zen architectures.
Backport of commit 59803e81f9 upstream,
with the fix from cb3a749a22 ("x86:
Restore processing of cache size tunables in init_cacheinfo") applied.
Reviewed-by: Premachandra Mallappa <premachandra.mallappa@amd.com>
Co-Authored-by: Florian Weimer <fweimer@redhat.com>
The __x86_shared_non_temporal_threshold determines when memcpy on x86
uses non-temporal stores to avoid pushing other data out of the last
level cache.
This patch proposes to revert the calculation change made by H.J. Lu's
patch of June 2, 2017.
H.J. Lu's patch selected a threshold suitable for a single thread
getting maximum performance. It was tuned using the single-threaded
large memcpy micro benchmark on an 8-core processor. That change moved
the threshold from using 3/4 of one thread's share of the cache to
using 3/4 of the entire cache of a multi-threaded system before
switching to non-temporal stores. Multi-threaded systems with
more than a few threads are server-class and typically have many
active threads. If one thread consumes 3/4 of the available cache for
all threads, it will cause other active threads to have data removed
from the cache. Two examples show the range of the effect. John
McCalpin's widely parallel Stream benchmark, which runs in parallel
and fetches data sequentially, saw a 20% slowdown with the 2017 change
on an internal system test of 128 threads. This regression was discovered
when comparing OL8 performance to OL7. An example that compares
normal stores to non-temporal stores may be found at
https://vgatherps.github.io/2018-09-02-nontemporal/. A simple test
shows a performance loss of 400 to 500% due to a failure to use
non-temporal stores. These performance losses are most likely to occur
when the system load is heaviest and good performance is critical.
The tunable x86_non_temporal_threshold can be used to override the
default for the knowledgeable user who really wants maximum cache
allocation to a single thread in a multi-threaded system.
The manual entry for the tunable has been expanded to provide
more information about its purpose.
modified: sysdeps/x86/cacheinfo.c
modified: manual/tunables.texi
(cherry picked from commit d3c5702747)
(Conflicts in sysdeps/x86/cacheinfo.c due to missing
rep_movsb_threshold, x86_rep_stosb_threshold tunables.)
Add CPU detection of Neoverse N2 and Neoverse V1, and select __memcpy_simd as
the memcpy/memmove ifunc.
Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
(cherry picked from commit e11ed9d2b4)
Further optimize integer memcpy. Small cases now include copies up
to 32 bytes. 64-128 byte copies are split into two cases to improve
performance of 64-96 byte copies. Comments have been rewritten.
(cherry picked from commit 7000651327)
On some microarchitectures performance of the backwards memmove improves if
the stores use STR with decreasing addresses. So change the memmove loop
in memcpy_advsimd.S to use 2x STR rather than STP.
Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>
(cherry picked from commit bd394d131c)
Add a new memcpy using 128-bit Q registers - this is faster on modern
cores and reduces codesize. Similar to the generic memcpy, small cases
include copies up to 32 bytes. 64-128 byte copies are split into two
cases to improve performance of 64-96 byte copies. Large copies align
the source rather than the destination.
bench-memcpy-random is ~9% faster than memcpy_falkor on Neoverse N1,
so make this memcpy the default on N1 (on Centriq it is 15% faster than
memcpy_falkor).
Passes GLIBC regression tests.
Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>
(cherry picked from commit 4a733bf375)
Given almost all uses of ENTRY are for string/memory functions,
align ENTRY to a cacheline to simplify things.
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 34f0d01d5e)
Commit 91927b7c76 (Rewrite iconv option parsing [BZ #19519]) did not
handle cases where the output codeset for translations (via the `gettext'
family of functions) might have a caller-specified encoding suffix such as
TRANSLIT or IGNORE. This led to a regression where translations did not
work when the codeset had a suffix.
This commit fixes the above issue by parsing any suffixes passed to
__dcigettext and adds two new test-cases to intl/tst-codeset.c to
verify correct behaviour. The iconv-internal function __gconv_create_spec
and the static iconv-internal function gconv_destroy_spec are now visible
internally within glibc and used in intl/dcigettext.c.
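For example, a lookup of this shape was affected (the domain name and
path below are placeholders):

  #include <libintl.h>
  #include <locale.h>
  #include <stdio.h>

  int
  main (void)
  {
    setlocale (LC_ALL, "");
    bindtextdomain ("example", "/usr/share/locale");   /* placeholders */
    /* Before the fix, the //TRANSLIT suffix made lookups fail.  */
    bind_textdomain_codeset ("example", "ASCII//TRANSLIT");
    textdomain ("example");
    puts (gettext ("Hello"));
    return 0;
  }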
(cherry picked from commit 7d4ec75e11)
This commit replaces string manipulation during `iconv_open' and iconv_prog
option parsing with a structured, flag-based conversion specification. In
doing so, it alters the internal `__gconv_open' interface and adjusts its
uses accordingly.
This change fixes several hangs in the iconv program and therefore includes
a new test to exercise iconv_prog options that originally led to these hangs.
It also includes a new regression test for option handling in the iconv
function.
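For instance, suffix combinations such as the following are now parsed
into flags rather than by string manipulation (usage sketch):

  #include <iconv.h>
  #include <stdio.h>

  int
  main (void)
  {
    /* TRANSLIT and IGNORE become flags in the conversion spec.  */
    iconv_t cd = iconv_open ("ASCII//TRANSLIT//IGNORE", "UTF-8");
    if (cd == (iconv_t) -1)
      {
        perror ("iconv_open");
        return 1;
      }
    iconv_close (cd);
    return 0;
  }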
Reviewed-by: Florian Weimer <fweimer@redhat.com>
Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 91927b7c76)
__GLRO loaded the word after the requested variable on big-endian
PowerPC, where LOWORD is 4. This can cause the memset implementation
to go wrong because the masking with the cache line size produces
wrong results, particularly if the loaded value happens to be 1.
The __GLRO macro is not used in any place where loading the lower
32-bit word of a 64-bit value is desired, so the +4 offset is always
wrong.
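A minimal model of the failure (an illustrative layout, not the real
rtld_global_ro):

  #include <stdint.h>
  #include <stdio.h>
  #include <string.h>

  /* Model only: a 32-bit variable followed by unrelated data.  Loading
     at offset +4, as the big-endian __GLRO did, reads the next slot
     instead of the requested variable.  */
  struct model { int32_t cache_line_size; int32_t unrelated; };

  int
  main (void)
  {
    struct model g = { 128, 1 };
    const unsigned char *base = (const unsigned char *) &g;
    int32_t good, bad;
    memcpy (&good, base + 0, 4);   /* correct load */
    memcpy (&bad, base + 4, 4);    /* what the buggy macro loaded */
    printf ("good=%d bad=%d\n", (int) good, (int) bad);
    return 0;
  }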
Fixes commit 18363b4f01
("powerpc: Move cache line size to rtld_global_ro") and bug 26332.
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 7650321ce0)
nptl has

  /* Opcodes and data types for communication with the signal handler to
     change user/group IDs.  */
  struct xid_command
  {
    int syscall_no;
    long int id[3];
    volatile int cntr;
    volatile int error;
  };

  /* This must be last, otherwise the current thread might not have
     permissions to send SIGSETXID syscall to the other threads.  */
  result = INTERNAL_SYSCALL_NCS (cmdp->syscall_no, 3,
                                 cmdp->id[0], cmdp->id[1], cmdp->id[2]);
But the second argument of the setgroups syscall is a pointer:

  int setgroups (size_t size, const gid_t *list);

On x32, pointers passed to a syscall must have pointer type so that
they will be zero-extended. The kernel XID arguments are unsigned and
do not require sign extension. Change xid_command to
  struct xid_command
  {
    int syscall_no;
    unsigned long int id[3];
    volatile int cntr;
    volatile int error;
  };
so that all arguments are zero-extended. A test case is added for x32;
without the fix, setgroups returned EFAULT when running as root.
(cherry picked from commit 0ad926f349)
The SELinux API deprecated several symbols in its 3.1 release, including
security_context_t, matchpathcon, avc_init, and sidput, which are used in
makedb and nscd. While the usage of these should eventually be replaced by
newer interfaces, this commit disables GCC warnings due to the use of the
above symbols.
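The suppression pattern is essentially the following (generic GCC
pragmas shown for illustration; glibc wraps them in its DIAG_* macros,
and the snippet assumes the libselinux headers are installed):

  #include <selinux/selinux.h>

  /* Sketch: silence the libselinux 3.1 deprecation warnings around the
     old interfaces that makedb and nscd still use.  */
  #pragma GCC diagnostic push
  #pragma GCC diagnostic ignored "-Wdeprecated-declarations"

  static void
  free_context (security_context_t ctx)   /* deprecated typedef */
  {
    freecon (ctx);
  }

  #pragma GCC diagnostic pop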
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
Tested-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 04726be814)
Use unsigned branch instructions on r2 to fix the wrong behavior when
a negative length is passed to memcpy.
This commit fixes the armv7 version.
(cherry picked from commit beea361050)
Use unsigned branch instructions on r2 to fix the wrong behavior when
a negative length is passed to memcpy and memmove.
This commit fixes the generic arm implementation of memcpy and memmove.
(cherry picked from commit 79a4fa341b)
strcmp-avx2.S: In the AVX2 strncmp function, strings are compared in
chunks of 4 vectors (i.e. 32x4 = 128 bytes for AVX2). After the first
4-vector comparison, the code must check whether it has already passed
the given offset. This patch implements the AVX2 offset check for
strncmp for the case where both strings compare equal over the first
4 vectors.
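In C terms, the missing check was roughly this (a model only; the real
code is assembly and NUL handling is omitted here):

  #include <stddef.h>
  #include <string.h>

  enum { VEC_SIZE = 32, LOOP_BYTES = 4 * VEC_SIZE };   /* AVX2: 128 bytes */

  /* Model of the fixed control flow: after a 4-vector block compares
     equal, stop once the bytes examined reach the caller's limit n.
     The bug was looping on without that check.  */
  static int
  strncmp_model (const char *s1, const char *s2, size_t n)
  {
    size_t offset = 0;
    while (offset < n)   /* the offset check the patch adds */
      {
        size_t chunk = n - offset < LOOP_BYTES ? n - offset : LOOP_BYTES;
        int r = memcmp (s1 + offset, s2 + offset, chunk);
        if (r != 0)
          return r;
        offset += chunk;
      }
    return 0;
  }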
(cherry picked from commit 75870237ff)
During cleanup, before returning from get*_r functions, the end*ent
calls must not change errno. Otherwise, an ERANGE error from the
underlying implementation can be hidden, causing unexpected lookup
failures. This commit introduces an internal_end*ent_noerror
function which saves and restores errno, and marks the original
internal_end*ent function with warn_unused_result, so that it is used
only in contexts where errors from it can be handled explicitly.
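The new wrapper follows the usual save-and-restore pattern (a sketch;
the callee declaration is illustrative):

  #include <errno.h>

  extern void internal_endent (void);   /* illustrative declaration */

  /* Sketch of the noerror variant: cleanup must not hide an ERANGE (or
     any other error) that the lookup already stored in errno.  */
  static void
  internal_endent_noerror (void)
  {
    int saved_errno = errno;
    internal_endent ();
    errno = saved_errno;   /* restore what the lookup reported */
  }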
Reviewed-by: DJ Delorie <dj@redhat.com>
(cherry picked from commit 790b8dda44)
This patch fixes the optimized implementation of strcpy and strnlen
on a big-endian arm64 machine.
The optimized method uses NEON, which can process 128 bits with one
instruction. On a big-endian machine, the byte order should be
reversed for the whole 128-bit double word. But the instruction

  rev64 datav.16b, datav.16b

reverses the 64 bits in each of the two halves rather than reversing
the whole 128 bits. There is no rev128 instruction to reverse the
128 bits, but we can fix this by loading the data registers accordingly.
Fixes 0237b61526e7 ("aarch64: Optimized implementation of strcpy") and
2911cb68ed3d ("aarch64: Optimized implementation of strnlen").
Signed-off-by: Lexi Shao <shaolexi@huawei.com>
Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com>
(cherry picked from commit 59b64f9cbb)
When using outline atomics (-moutline-atomics, the default for ARMv8-A
starting with GCC 10), libgcc contains an ELF constructor which calls
__getauxval. This code is built outside of glibc, so none of its
internal PLT avoidance schemes can be applied to it. This change
suppresses the elf/check-localplt failure.
(cherry picked from commit 16536e98e3)
(cherry picked from commit 587a332b6fadc4d9f1035ecaa52ba32ee41cd300)
Since __x86_shared_non_temporal_threshold is defined as

  long int __x86_shared_non_temporal_threshold;

and long int is 4 bytes for x32, use RDX_LP to compare against
__x86_shared_non_temporal_threshold in assembly code.
(cherry picked from commit 55c7bcc71b)
Confirmed by CLDR and a native speaker: "abril" is more often used even
if "abrial" is also correct. Both nominative (alt_mon) and genitive (mon)
cases are updated.
Add a C wrapper to pass arguments in

  /* Control process execution.  */
  extern int prctl (int __option, ...) __THROW;

to the prctl syscall:

  extern int prctl (int, unsigned long int, unsigned long int,
                    unsigned long int, unsigned long int);
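Usage is unchanged for callers, for example (standard prctl
operations):

  #include <stdio.h>
  #include <sys/prctl.h>

  int
  main (void)
  {
    char name[16];

    /* Each variadic argument now reaches the kernel as a zero-extended
       unsigned long int, which matters on x32.  */
    if (prctl (PR_SET_NAME, "worker") != 0)
      perror ("prctl (PR_SET_NAME)");
    if (prctl (PR_GET_NAME, name) == 0)
      printf ("thread name: %s\n", name);
    return 0;
  }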
(cherry picked from commit ff026950e2)
LOADARGS_N in powerpc/sysdep.h uses argN as local variables. It breaks
when argN is also a function argument. Rename argN to _argN to avoid
the conflict.
(cherry picked from commit 14f43dd34d)
Since the U marker can only be applied to 2 unsigned long arguments
in syscalls.list files, add a C wrapper for the process_vm_readv and
process_vm_writev syscalls, which have more than 2 unsigned long
arguments.
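Usage sketch, reading this process's own memory (arguments 3, 5 and 6
are unsigned long):

  #define _GNU_SOURCE
  #include <stdio.h>
  #include <sys/uio.h>
  #include <unistd.h>

  int
  main (void)
  {
    char src[] = "hello";
    char dst[sizeof src] = "";
    struct iovec local = { .iov_base = dst, .iov_len = sizeof dst };
    struct iovec remote = { .iov_base = src, .iov_len = sizeof src };

    ssize_t n = process_vm_readv (getpid (), &local, 1, &remote, 1, 0);
    if (n < 0)
      perror ("process_vm_readv");
    else
      printf ("copied %zd bytes: %s\n", n, dst);
    return 0;
  }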
(cherry picked from commit ad9fd65d71)
Mark unsigned long arguments in mmap, read, recv, recvfrom, send, sendto,
write, ioperm, sendfile64, setxattr, lsetxattr, fsetxattr, getxattr,
lgetxattr, fgetxattr, listxattr, llistxattr and flistxattr with U in
syscalls.list files.
(cherry picked from commit 86f4f2263b)
Add a test to pass 64-bit long arguments to syscall with undefined upper
32 bits on x32.
Tested on i386, x86-64 and x32 as well as with build-many-glibcs.py.
(cherry picked from commit 781dacc4f4)
X32 has 32-bit long and pointer types with a 64-bit off_t. Since the
x32 psABI requires that pointers passed in registers be zero-extended
to 64 bits, x32 can share many syscall interfaces with LP64. When an
LP64 syscall with long and unsigned long int arguments is used for x32,
these arguments must be properly extended to 64 bits. Otherwise, if the
upper 32 bits of the register have an undefined value, such a syscall
will be rejected by the kernel.
For syscalls implemented in assembly code, 'U' is added to the syscall
signature key letters for unsigned long int arguments that must be
zero-extended to 64 bits. SYSCALL_ULONG_ARG_1 and SYSCALL_ULONG_ARG_2
are passed to syscall-template.S for the first and the second unsigned
long int arguments if PSEUDOS_HAVE_ULONG_INDICES is defined. They are
used by x32 to zero-extend 32-bit arguments to 64 bits.
Tested on i386, x86-64 and x32 as well as with build-many-glibcs.py.
(cherry picked from commit 2ad5d0845d)
X32 has 32-bit long and pointer types with a 64-bit off_t. Since the
x32 psABI requires that pointers passed in registers be zero-extended
to 64 bits, x32 can share many syscall interfaces with LP64. When an
LP64 syscall with long and unsigned long arguments is used for x32,
these arguments must be properly extended to 64 bits. Otherwise, if the
upper 32 bits of the register have an undefined value, such a syscall
will be rejected by the kernel.
Enforce zero-extension for pointers and array system call arguments.
For integer types, extend to int64_t (the full register) using a
regular cast, resulting in zero or sign extension based on the
signedness of the original type.
For

  void *mmap (void *addr, size_t length, int prot, int flags,
              int fd, off_t offset);

we now generate
   0:  41 f7 c1 ff 0f 00 00   test   $0xfff,%r9d
   7:  75 1f                  jne    28 <__mmap64+0x28>
   9:  48 63 d2               movslq %edx,%rdx
   c:  89 f6                  mov    %esi,%esi
   e:  4d 63 c0               movslq %r8d,%r8
  11:  4c 63 d1               movslq %ecx,%r10
  14:  b8 09 00 00 40         mov    $0x40000009,%eax
  19:  0f 05                  syscall
That is:
1. addr is unchanged.
2. length is zero-extended to 64 bits.
3. prot is sign-extended to 64 bits.
4. flags is sign-extended to 64 bits.
5. fd is sign-extended to 64 bits.
6. offset is unchanged.
For int arguments, since the kernel uses only the lower 32 bits and
ignores the upper 32 bits of 64-bit registers, these work correctly.
Tested on x86-64 and x32. There are no code changes on x86-64.
(cherry picked from commit df76ff3a44)
On platforms where long double has the same ABI as double, glibc
defines long double functions as aliases for the corresponding double
functions. The declarations of those functions in <math.h> are
disabled to avoid problems with aliases having incompatible types, but
GCC 10 now gives errors for incompatible types when the long double
function is known to GCC as a built-in function, not just when there
is an incompatible header declaration.
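A minimal model of the error class (illustrative; the real definitions
go through glibc's aliasing macros): when long double has the same ABI
as double, the long double name ends up defined with the double type,
which GCC 10 rejects for a known built-in unless that built-in is
disabled:

  /* Model only: with GCC 10 this triggers "conflicting types for
     built-in function 'expl'" (-Wbuiltin-declaration-mismatch, an
     error with -Werror); compiling with -fno-builtin-expl accepts it.  */
  double
  expl (double x)
  {
    return x;
  }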
This patch fixes those errors by using appropriate
-fno-builtin-<function> options to compile the double functions. The
list of CFLAGS-* settings is an appropriately adapted version of that
in sysdeps/ieee754/ldbl-opt/Makefile used there for building nldbl-*.c
files; in particular, the options are used even if GCC does not
currently have a built-in function of a given function, so that adding
such a built-in function in future will not break the glibc build.
Thus, various of the CFLAGS-* settings are only for future-proofing
and may not currently be needed (and it's possible some could be
irrelevant for other reasons).
Tested with build-many-glibcs.py for arm-linux-gnueabi (compilers and
glibcs builds), where it fixes the build that previously failed.
(cherry picked from commit 49348beafe)
This addresses an issue that is present mainly on SMP machines running
threaded code. In a typical indirect call or PLT import stub, the
target address is loaded first. Then the global pointer is loaded into
the PIC register in the delay slot of a branch to the target address.
During lazy binding, the target address is a trampoline which transfers
to _dl_runtime_resolve().
_dl_runtime_resolve() uses the relocation offset stored in the global
pointer and the linkage map stored in the trampoline to find the
relocation. Then, the function descriptor is updated.
In a multi-threaded application, it is possible for the global pointer
to be updated between the load of the target address and the load of
the global pointer. When this happens, the relocation offset has been replaced
by the new global pointer. The function pointer has probably been
updated as well but there is no way to find the address of the function
descriptor and to transfer to the target. So, _dl_runtime_resolve()
typically crashes.
HP-UX addressed this problem by adding an extra pc-relative branch to
the trampoline. The descriptor is initially set up to point to the
branch. The branch then transfers to the trampoline. This allowed
the trampoline code to figure out which descriptor was being used
without any modification to user code. I didn't use this approach
as it is more complex and changes function pointer canonicalization.
The order of loading the target address and global pointer in
indirect calls was not consistent with the order used in import stubs.
In particular, $$dyncall and some inline versions of it loaded the
global pointer first. This was inconsistent with the global pointer
being updated first in dl-machine.h. Assuming the accesses are
ordered, we want elf_machine_fixup_plt() to store the global pointer
first and calls to load it last. Then, the global pointer will be
correct when the target function is entered.
However, just to make things more fun, HP added support for
out-of-order execution of accesses in PA 2.0. The accesses used by
calls are weakly ordered. So, it's possible that under some
circumstances a function might be entered with the wrong global pointer.
However, HP uses weakly ordered accesses in 64-bit HP-UX, so I assume
that loading the global pointer in the delay slot of the branch must
work consistently.
The basic fix for the race is a combination of modifying user code to
preserve the address of the function descriptor in register %r22 and
setting the least-significant bit in the relocation offset. The
latter was suggested by Carlos as a way to distinguish relocation
offsets from global pointer values. Conventionally, %r22 is used
as the address of the function descriptor in calls to $$dyncall.
So, it wasn't hard to preserve the address in %r22.
I have updated gcc trunk and gcc-9 branch to not clobber %r22 in
$$dyncall and inline indirect calls. I have also modified the import
stubs in binutils trunk and the 2.33 branch to preserve %r22. This
required making the stubs one instruction longer but we save one
relocation. I also modified binutils to align the .plt section on
an 8-byte boundary. This allows descriptors to be updated atomically
with a floating-point store.
With these changes, _dl_runtime_resolve() can fall back to an alternate
mechanism to find the relocation offset when it has been clobbered.
There's just one additional instruction in the fast path. I tested
the fallback function, _dl_fix_reloc_arg(), by changing the branch to
always use the fallback. Old code still runs as it did before.
Fixes bug 23296.
Reviewed-by: Carlos O'Donell <carlos@redhat.com>
(cherry picked from commit 1a044511a3)
Commit 06436acf81 created a kernel-features.h
file with '#undef __ASSUME_SYSVIPC_DEFAULT_IPC_64', which is wrong for Nios II.
Deleting the header.
(cherry picked from commit 38c6788818)
The MIPS fallback code handles a frame whose FDE cannot be obtained
(for instance a signal frame) by reading the kernel-allocated signal frame
and adding '2' to the value of 'sc_pc' [1]. The added value is used to
recognize the end of an EH region on mips16 [2].
The fix adjusts the obtained signal frame value and removes the value
added by libgcc by checking whether the previous frame is a signal frame.
Checked with the backtrace and tst-sigcontext-get_pc tests on
mips-linux-gnu and mips64-linux-gnu.
[1] libgcc/config/mips/linux-unwind.h from gcc code.
[2] gcc/config/mips/mips.h from gcc code.
(cherry picked from commit 6e05978f0c)