glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-30 08:40:07 +00:00

Author	SHA1	Message	Date
Adhemerval Zanella	a61933fe27	sparc: Remove bzero optimization The symbol is not present in current POSIX specification and compiler already generates memset call.	2022-02-23 14:18:18 -03:00
Adhemerval Zanella	c0d215f162	ia64: Remove bzero optimization The symbol is not present current POSIX specification and compiler already generates memset call. The arch specific implementation is just to avoid the __bzero symbol creation (which ia64 abi does not export).	2022-02-23 14:18:17 -03:00
Adhemerval Zanella	f883dbaf1f	alpha: Remove bzero optimization The symbols is not present in current POSIX specification and compiler already generates memmove call.	2022-02-23 14:06:49 -03:00
Adhemerval Zanella	bf92893a14	x86_64: Remove bcopy optimizations The symbols is not present in current POSIX specification and compiler already generates memmove call.	2022-02-23 14:06:49 -03:00
Adhemerval Zanella	8bad328203	i386: Remove bcopy optimizations The symbols is not present in current POSIX specification and compiler already generates memmove call.	2022-02-23 14:06:49 -03:00
Adhemerval Zanella	86a82cd57c	powerpc: Remove bcopy optimizations The symbols is not present in current POSIX specification and compiler already generates memmove call.	2022-02-23 14:06:49 -03:00
Adhemerval Zanella	80b85f92f4	ia64: Remove bcopy It just call memmove as the generic implementation.	2022-02-23 14:06:45 -03:00
John David Anglin	d2224ffbdd	hppa: Fix warnings from _dl_lookup_address This change fixes two warnings from _dl_lookup_address. The first warning comes from dropping the volatile keyword from desc in the call to _dl_read_access_allowed. We now have a full atomic barrier between loading desc[0] and the access check, so desc no longer needs to be declared as volatile. The second warning comes from the implicit declaration of _dl_fix_reloc_arg. This is fixed by including dl-runtime.h and declaring _dl_fix_reloc_arg in dl-runtime.h.	2022-02-22 18:51:35 +00:00
John David Anglin	9e7e5fda38	hppa: Revise gettext trampoline design The current getcontext return trampoline is overly complex and it unnecessarily clobbers several registers. By saving the context pointer (r26) in the context, __getcontext_ret can restore any registers not restored by setcontext. This allows getcontext to save and restore the entire register context present when getcontext is entered. We use the unused oR0 context slot for the return from __getcontext_ret. While this is not directly useful in C, it can be exploited in assembly code. Registers r20, r23, r24 and r25 are not clobbered in the call path to getcontext. This allows a small simplification of swapcontext. It also allows saving and restoring the 6-bit SAR register in the LSB of the oSAR context slot. The getcontext flag value can be stored in the MSB of the oSAR slot.	2022-02-22 17:28:46 +00:00
Joseph Myers	fdc1ae67fe	Add SOL_MPTCP, SOL_MCTP from Linux 5.16 to bits/socket.h Linux 5.16 adds constants SOL_MPTCP and SOL_MCTP to the getsockopt / setsockopt levels; add these constants to bits/socket.h. Tested for x86_64.	2022-02-21 22:49:36 +00:00
Noah Goldstein	b98d0bbf74	x86: Fix TEST_NAME to make it a string in tst-strncmp-rtm.c Previously TEST_NAME was passing a function pointer. This didn't fail because of the -Wno-error flag (to allow for overflow sizes passed to strncmp/wcsncmp) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-18 15:24:50 -08:00
Noah Goldstein	7835d611af	x86: Test wcscmp RTM in the wcsncmp overflow case [BZ #28896 ] In the overflow fallback strncmp-avx2-rtm and wcsncmp-avx2-rtm would call strcmp-avx2 and wcscmp-avx2 respectively. This would have not checks around vzeroupper and would trigger spurious aborts. This commit fixes that. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass on AVX2 machines with and without RTM. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-18 16:35:18 -06:00
John David Anglin	71b108d7eb	hppa: Fix swapcontext This change fixes the failure of stdlib/tst-setcontext2 and stdlib/tst-setcontext7 on hppa. The implementation of swapcontext in C is broken. C saves the return pointer (rp) and any non call-clobbered registers (in this case r3, r4 and r5) on the stack. However, the setcontext call in swapcontext pops the stack and subsequent calls clobber the saved registers. When the context in oucp is restored, both tests fault. Here we rewrite swapcontext in assembly code to avoid using the stack for register values that need to be used after restoration. The getcontext and setcontext routines are revised to save and restore register ret1 for normal returns. We copy the oucp pointer to ret1. This allows access to the old context after calling getcontext and setcontext.	2022-02-18 20:38:25 +00:00
Noah Goldstein	c627209832	x86: Fallback {str\|wcs}cmp RTM in the ncmp overflow case [BZ #28896 ] In the overflow fallback strncmp-avx2-rtm and wcsncmp-avx2-rtm would call strcmp-avx2 and wcscmp-avx2 respectively. This would have not checks around vzeroupper and would trigger spurious aborts. This commit fixes that. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass on AVX2 machines with and without RTM. Co-authored-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-17 15:43:05 -06:00
Adhemerval Zanella	bbe199b27a	microblaze: Use the correct select syscall (BZ #28883 ) On Microblaze only __NR_newselect is implemented, even though kernel advertise __NR_select on asm/unistd.h. Since microblaze is the only architecture that undef __ASSUME_PSELECT, the generic code change is simpler than chaging the architecture syscall number. Acked-by: Mark Hatle <mark.hatle@xilinx.com>	2022-02-16 16:26:44 -03:00
Joseph Myers	790a607e23	Update kernel version to 5.16 in tst-mman-consts.py This patch updates the kernel version in the test tst-mman-consts.py to 5.16. (There are no new MAP_* constants covered by this test in 5.16 that need any other header changes.) Tested with build-many-glibcs.py.	2022-02-16 14:19:24 +00:00
Adhemerval Zanella	894755e16e	pthread: Use 64 bit time_t stat internally for sem_open (BZ #28880 ) The __sem_check_add_mapping internal stat calls fails with EOVERFLOW if system time is larger than 32 bit. It is a missing spot from `52a5fe70a2` fix to use 64 bit stat internally. Checked on x86_64-linux-gnu and i686-linux-gnu.	2022-02-16 10:20:56 -03:00
Noah Goldstein	e108c02a5e	x86: Fix bug in strncmp-evex and strncmp-avx2 [BZ #28895 ] Logic can read before the start of `s1` / `s2` if both `s1` and `s2` are near the start of a page. To avoid having the result contimated by these comparisons the `strcmp` variants would mask off these comparisons. This was missing in the `strncmp` variants causing the bug. This commit adds the masking to `strncmp` so that out of range comparisons don't affect the result. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass as well a full xcheck on x86_64 linux. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-16 02:11:16 -06:00
H.J. Lu	a5659cf27d	x86-64: Define __memcmpeq in ld.so Define __memcmpeq in ld.so so that compiler can generate __memcmpeq call when compiling for ld.so.	2022-02-14 17:57:07 -08:00
Samuel Thibault	06dbfcced3	htl: Fix initializing the key lock The static pthread_once_t in the pt-key.h header was creating one pthread_once_t per includer. We have to use a shared common pthread_once_t instead.	2022-02-14 19:29:02 +01:00
Samuel Thibault	315c9e794a	htl: Make pthread_[gs]etspecific not check for key validity Since __pthread_key_create might be concurrently reallocating the __pthread_key_destructors array, it's not safe to access it without the mutex held. Posix explicitly says we are allowed to prefer performance over error detection.	2022-02-14 19:29:02 +01:00
H.J. Lu	0fb8800029	x86-64: Remove bzero weak alias in SS2 memset commit `3d9f171bfb` Author: H.J. Lu <hjl.tools@gmail.com> Date: Mon Feb 7 05:55:15 2022 -0800 x86-64: Optimize bzero added the optimized bzero. Remove bzero weak alias in SS2 memset to avoid undefined __bzero in memset-sse2-unaligned-erms.	2022-02-14 10:16:02 -08:00
John David Anglin	17c57d70bd	hppa: Fix typo	2022-02-14 17:41:59 +00:00
Adhemerval Zanella	fee62d6c62	linux: Use socket-constants-time64.h on tst-socket-timestamp-compat The kernel header might not define the SO_TIMESTAMP{NS}_OLD or SO_TIMESTAMP{NS}_NEW if it older than v5.1. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Reviewed-by: Tulio Magno Quites Machado Filho <tuliom@linux.ibm.com>	2022-02-14 14:05:57 -03:00
H.J. Lu	f9db5433f3	x86/configure.ac: Define PI_STATIC_AND_HIDDEN/SUPPORT_STATIC_PIE Move PI_STATIC_AND_HIDDEN and SUPPORT_STATIC_PIE to sysdeps/x86/configure.ac.	2022-02-14 07:34:54 -08:00
John David Anglin	2e20cd63c9	Fix elf/tst-audit2 on hppa The test elf/tst-audit2 fails on hppa with a segmentation fault in the long branch stub used to call malloc from calloc. This occurs because the test is not a PIC executable and calloc is called from the dynamic linker before the dp register is initialized in _dl_start_user. The fix is to move the dp register initialization into elf_machine_runtime_setup. Since the address of $global$ can't be loaded directly, we continue to use the DT_PLTGOT value from the the main_map to initialize dp.	2022-02-14 15:14:49 +00:00
H.J. Lu	6229aa74fb	x86: Use CHECK_FEATURE_PRESENT on PCONFIG PCONFIG is a privileged instruction. Use CHECK_FEATURE_PRESENT, instead of CHECK_FEATURE_ACTIVE, on PCONFIG in tst-cpu-features-supports.c.	2022-02-14 05:53:03 -08:00
H.J. Lu	61a4425dd4	x86: Don't check PTWRITE in tst-cpu-features-cpuinfo.c Don't check PTWRITE against /proc/cpuinfo since kernel doesn't report PTWRITE in /proc/cpuinfo.	2022-02-14 05:53:03 -08:00
Noah Goldstein	7912236f4a	x86: Set .text section in memset-vec-unaligned-erms commit `3d9f171bfb` Author: H.J. Lu <hjl.tools@gmail.com> Date: Mon Feb 7 05:55:15 2022 -0800 x86-64: Optimize bzero Remove setting the .text section for the code. This commit adds that back.	2022-02-12 04:25:19 -06:00
Florian Weimer	098c795e85	Linux: Include <dl-auxv.h> in dl-sysdep.c only for SHARED Otherwise, <dl-auxv.h> on POWER ends up being included twice, once in dl-sysdep.c, once in dl-support.c. That leads to a linker failure due to multiple definitions of _dl_cache_line_size. Fixes commit `d96d2995c1` ("Revert "Linux: Consolidate auxiliary vector parsing").	2022-02-11 19:50:58 +01:00
Florian Weimer	d96d2995c1	Revert "Linux: Consolidate auxiliary vector parsing" This reverts commit `8c8510ab27`. The revert is not perfect because the commit included a bug fix for _dl_sysdep_start with an empty argv, introduced in commit `2d47fa6862` ("Linux: Remove DL_FIND_ARG_COMPONENTS"), and this bug fix is kept. The revert is necessary because the reverted commit introduced an early memset call on aarch64, which leads to crash due to lack of TCB initialization.	2022-02-11 17:10:59 +01:00
Adhemerval Zanella	144761540a	elf: Remove LD_USE_LOAD_BIAS It is solely for prelink with PIE executables [1]. [1] https://sourceware.org/legacy-ml/libc-hacker/2003-11/msg00127.html Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2022-02-10 09:18:15 -03:00
Adhemerval Zanella	6628c742b2	elf: Remove prelink support Prelinked binaries and libraries still work, the dynamic tags DT_GNU_PRELINKED, DT_GNU_LIBLIST, DT_GNU_CONFLICT just ignored (meaning the process is reallocated as default). The loader environment variable TRACE_PRELINKING is also removed, since it used solely on prelink. Checked on x86_64-linux-gnu, i686-linux-gnu, and aarch64-linux-gnu. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2022-02-10 09:16:12 -03:00
Florian Weimer	8c8510ab27	Linux: Consolidate auxiliary vector parsing And optimize it slightly. The large switch statement in _dl_sysdep_start can be replaced with a large array. This reduces source code and binary size. On i686-linux-gnu: Before: text data bss dec hex filename 7791 12 0 7803 1e7b elf/dl-sysdep.os After: text data bss dec hex filename 7135 12 0 7147 1beb elf/dl-sysdep.os Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-10 11:51:55 +01:00
Florian Weimer	f19fc997a5	Linux: Assume that NEED_DL_SYSINFO_DSO is always defined The definition itself is still needed for generic code. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-10 11:51:46 +01:00
Florian Weimer	2d47fa6862	Linux: Remove DL_FIND_ARG_COMPONENTS The generic definition is always used since the Native Client port has been removed. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-10 11:51:33 +01:00
Florian Weimer	b9c3d3382f	Linux: Remove HAVE_AUX_SECURE, HAVE_AUX_XID, HAVE_AUX_PAGESIZE They are always defined. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-10 11:51:22 +01:00
Florian Weimer	91c0a47ffb	elf: Merge dl-sysdep.c into the Linux version The generic version is the de-facto Linux implementation. It requires an auxiliary vector, so Hurd does not use it. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-10 11:50:52 +01:00
Adhemerval Zanella	9e94f57484	hppa: Fix bind-now audit (BZ #28857 ) On hppa, a function pointer returned by la_symbind is actually a function descriptor has the plabel bit set (bit 30). This must be cleared to get the actual address of the descriptor. If the descriptor has been bound, the first word of the descriptor is the physical address of theA function, otherwise, the first word of the descriptor points to a trampoline in the PLT. This patch also adds a workaround on tests because on hppa (and it seems to be the only ABI I have see it), some shared library adds a dynamic PLT relocation to am empty symbol name: $ readelf -r elf/tst-audit25mod1.so [...] Relocation section '.rela.plt' at offset 0x464 contains 6 entries: Offset Info Type Sym.Value Sym. Name + Addend 00002008 00000081 R_PARISC_IPLT 508 [...] It breaks some assumptions on the test, where a symbol with an empty name ("") is passed on la_symbind. Checked on x86_64-linux-gnu and hppa-linux-gnu.	2022-02-09 08:47:42 -03:00
H.J. Lu	3d9f171bfb	x86-64: Optimize bzero memset with zero as the value to set is by far the majority value (99%+ for Python3 and GCC). bzero can be slightly more optimized for this case by using a zero-idiom xor for broadcasting the set value to a register (vector or GPR). Co-developed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-02-08 15:58:56 -08:00
Dmitry V. Levin	e1d32b8364	linux: fix accuracy of get_nprocs and get_nprocs_conf [BZ #28865 ] get_nprocs() and get_nprocs_conf() use various methods to obtain an accurate number of processors. Re-introduce __get_nprocs_sched() as a source of information, and fix the order in which these methods are used to return the most accurate information. The primary source of information used in both functions remains unchanged. This also changes __get_nprocs_sched() error return value from 2 to 0, but all its users are already prepared to handle that. Old fallback order: get_nprocs: /sys/devices/system/cpu/online -> /proc/stat -> 2 get_nprocs_conf: /sys/devices/system/cpu/ -> /proc/stat -> 2 New fallback order: get_nprocs: /sys/devices/system/cpu/online -> /proc/stat -> sched_getaffinity -> 2 get_nprocs_conf: /sys/devices/system/cpu/ -> /proc/stat -> sched_getaffinity -> 2 Fixes: `342298278e` ("linux: Revert the use of sched_getaffinity on get_nproc") Closes: BZ #28865 Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-02-07 20:18:29 +00:00
Noah Goldstein	1b0c60f95b	x86: Remove SSSE3 instruction for broadcast in memset.S (SSE2 Only) commit `b62ace2740` Author: Noah Goldstein <goldstein.w.n@gmail.com> Date: Sun Feb 6 00:54:18 2022 -0600 x86: Improve vec generation in memset-vec-unaligned-erms.S Revert usage of 'pshufb' in broadcast logic as it is an SSSE3 instruction and memset.S is restricted to only SSE2 instructions.	2022-02-07 14:18:29 -06:00
Noah Goldstein	b62ace2740	x86: Improve vec generation in memset-vec-unaligned-erms.S No bug. Split vec generation into multiple steps. This allows the broadcast in AVX2 to use 'xmm' registers for the L(less_vec) case. This saves an expensive lane-cross instruction and removes the need for 'vzeroupper'. For SSE2 replace 2x 'punpck' instructions with zero-idiom 'pxor' for byte broadcast. Results for memset-avx2 small (geomean of N = 20 benchset runs). size, New Time, Old Time, New / Old 0, 4.100, 3.831, 0.934 1, 5.074, 4.399, 0.867 2, 4.433, 4.411, 0.995 4, 4.487, 4.415, 0.984 8, 4.454, 4.396, 0.987 16, 4.502, 4.443, 0.987 All relevant string/wcsmbs tests are passing. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 20:58:07 -06:00
Sunil K Pandey	d7fca835e0	x86-64: Add vector tan/tanf to libmvec microbenchmark Add vector tan/tanf and input files to libmvec microbenchmark. libmvec-tan-inputs: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 5.0 10% uniform random distribution in range (-1000.0, 1000.0) libmvec-tanf-inputs: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 5.0f 10% uniform random distribution in range (-1000.0f, 1000.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:37:13 -08:00
Sunil K Pandey	d0086fe45c	x86-64: Add vector erfc/erfcf to libmvec microbenchmark Add vector erfc/erfcf and input files to libmvec microbenchmark. libmvec-erfc-inputs: 90% Normal random distribution range: (-6.0, 6.0) mean: 0.0 sigma: 1.0 10% uniform random distribution in range (-5.9, 5.9) libmvec-erfcf-inputs: 90% Normal random distribution range: (-4.0f, 4.0f) mean: 0.0f sigma: 1.0f 10% uniform random distribution in range (-3.9f, 3.9f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:37:07 -08:00
Sunil K Pandey	bef2d0ec25	x86-64: Add vector asinh/asinhf to libmvec microbenchmark Add vector asinh/asinhf and input files to libmvec microbenchmark. libmvec-asinh-inputs: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 2.0 10% uniform random distribution in range (-1.0e6, 1.0e6) libmvec-asinhf-inputs: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 2.0f 10% uniform random distribution in range (-1.0e6f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:37:01 -08:00
Sunil K Pandey	b263a0155e	x86-64: Add vector tanh/tanhf to libmvec microbenchmark Add vector tanh/tanhf and input files to libmvec microbenchmark. libmvec-tanh-inputs: 90% Normal random distribution range: (-19.0, 19.0) mean: 0.0 sigma: 2.0 10% uniform random distribution in range (-16.0, 16.0) libmvec-tanhf-inputs: 90% Normal random distribution range: (-10.0f, 10.0f) mean: 0.0f sigma: 2.0f 10% uniform random distribution in range (-8.0f, 8.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:56 -08:00
Sunil K Pandey	475ed201c2	x86-64: Add vector erf/erff to libmvec microbenchmark Add vector erf/erff and input files to libmvec microbenchmark. libmvec-erf-inputs: 90% Normal random distribution range: (-6.0, 6.0) mean: 0.0 sigma: 1.0 10% uniform random distribution in range (-5.9, 5.9) libmvec-erff-inputs: 90% Normal random distribution range: (-4.0f, 4.0f) mean: 0.0f sigma: 1.0f 10% uniform random distribution in range (-3.9f, 3.9f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:51 -08:00
Sunil K Pandey	157bdb5f89	x86-64: Add vector acosh/acoshf to libmvec microbenchmark Add vector acosh/acoshf and input files to libmvec microbenchmark. libmvec-acosh-inputs: 90% Normal random distribution range: (1.0, DBL_MAX) mean: 1.0 sigma: 8.0 10% uniform random distribution in range (1.0, 1.0e6) libmvec-acoshf-inputs: 90% Normal random distribution range: (1.0f, FLT_MAX) mean: 1.0f sigma: 4.0f 10% uniform random distribution in range (1.0f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:46 -08:00
Sunil K Pandey	0050c9a45d	x86-64: Add vector atanh/atanhf to libmvec microbenchmark Add vector atanh/atanhf and input files to libmvec microbenchmark. libmvec-atanh-inputs: 90% Normal random distribution range: (-1.0, 1.0) mean: 0.0 sigma: 1.0 10% uniform random distribution in range (-1.0, 1.0) libmvec-atanhf-inputs: 90% Normal random distribution range: (-1.0f, 1.0f) mean: 0.0f sigma: 1.0f 10% uniform random distribution in range (-1.0f, 1.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:41 -08:00
Sunil K Pandey	171817d8c0	x86-64: Add vector log1p/log1pf to libmvec microbenchmark Add vector log1p/log1pf and input files to libmvec microbenchmark. libmvec-log1p-inputs: 70% Normal random distribution range: (-1.0, DBL_MAX) mean: 0.0 sigma: 50.0 30% uniform random distribution in range (-1.0, 1.0e6) libmvec-log1pf-inputs: 70% Normal random distribution range: (-1.0f, FLT_MAX) mean: 0.0f sigma: 50.0f 30% uniform random distribution in range (-1.0f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:36 -08:00
Sunil K Pandey	b6b2be5c2f	x86-64: Add vector log2/log2f to libmvec microbenchmark Add vector log2/log2f and input files to libmvec microbenchmark. libmvec-log2-inputs: 70% Normal random distribution range: (0.0, DBL_MAX) mean: 1.0 sigma: 50.0 30% uniform random distribution in range (0.0, 1.0e6) libmvec-log2f-inputs: 70% Normal random distribution range: (0.0f, FLT_MAX) mean: 1.0f sigma: 50.0f 30% uniform random distribution in range (0.0f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:31 -08:00
Sunil K Pandey	e43b757e06	x86-64: Add vector log10/log10f to libmvec microbenchmark Add vector log10/log10f and input files to libmvec microbenchmark. libmvec-log10-inputs: 70% Normal random distribution range: (0.0, DBL_MAX) mean: 1.0 sigma: 50.0 30% uniform random distribution in range (0.0, 1.0e6) libmvec-log10f-inputs: 70% Normal random distribution range: (0.0f, FLT_MAX) mean: 1.0f sigma: 50.0f 30% uniform random distribution in range (0.0f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:23 -08:00
Sunil K Pandey	16aec30154	x86-64: Add vector atan2/atan2f to libmvec microbenchmark Add vector atan2/atan2f and input files to libmvec microbenchmark. libmvec-atan2-inputs: arg1: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 4.0 10% uniform random distribution in range (-1.0e6, 1.0e6) arg2: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 4.0 10% uniform random distribution in range (-1.0e6, 1.0e6) libmvec-atan2f-inputs: arg1: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 4.0f 10% uniform random distribution in range (-1.0e6f, 1.0e6f) arg2: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 4.0f 10% uniform random distribution in range (-1.0e6f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:14 -08:00
Sunil K Pandey	fec48238b2	x86-64: Add vector cbrt/cbrtf to libmvec microbenchmark Add vector cbrt/cbrtf and input files to libmvec microbenchmark. libmvec-cbrt-inputs: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 10.0 10% uniform random distribution in range (-1000.0, 1000.0) libmvec-cbrtf-inputs: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 10.0f 10% uniform random distribution in range (-1000.0f, 1000.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:06 -08:00
Sunil K Pandey	6acc09c589	x86-64: Add vector sinh/sinhf to libmvec microbenchmark Add vector sinh/sinhf and input files to libmvec microbenchmark. libmvec-sinh-inputs: 90% Normal random distribution range: (-710.0, 710.0) mean: 0.0 sigma: 32.0 10% uniform random distribution in range (-500.0, 500.0) libmvec-sinhf-inputs: 90% Normal random distribution range: (-89.0f, 89.0f) mean: 0.0f sigma: 16.0f 10% uniform random distribution in range (-50.0f, 50.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:36:00 -08:00
Sunil K Pandey	049555aad4	x86-64: Add vector expm1/expm1f to libmvec microbenchmark Add vector expm1/expm1f and input files to libmvec microbenchmark. libmvec-expm1-inputs: 90% Normal random distribution range: (-708.0, 709.0) mean: 0.0 sigma: 16.0 10% uniform random distribution in range (-500.0, 500.0) libmvec-expm1f-inputs: 90% Normal random distribution range: (-87.0f, 88.0f) mean: 0.0f sigma: 8.0f 10% uniform random distribution in range (-50.0f, 50.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:54 -08:00
Sunil K Pandey	54cf4f31fe	x86-64: Add vector cosh/coshf to libmvec microbenchmark Add vector cosh/coshf and input files to libmvec microbenchmark. libmvec-cosh-inputs: 90% Normal random distribution range: (-710.0, 710.0) mean: 0.0 sigma: 32.0 10% uniform random distribution in range (-500.0, 500.0) libmvec-coshf-inputs: 90% Normal random distribution range: (-89.0f, 89.0f) mean: 0.0f sigma: 16.0f 10% uniform random distribution in range (-50.0f, 50.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:49 -08:00
Sunil K Pandey	abebb26108	x86-64: Add vector exp10/exp10f to libmvec microbenchmark Add vector exp10/exp10f and input files to libmvec microbenchmark. libmvec-exp10-inputs: 90% Normal random distribution range: (-307.0, 308.0) mean: 0.0 sigma: 16.0 10% uniform random distribution in range (-250.0, 250.0) libmvec-exp10f-inputs: 90% Normal random distribution range: (-37.0f, 38.0f) mean: 0.0f sigma: 8.0f 10% uniform random distribution in range (-25.0f, 25.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:43 -08:00
Sunil K Pandey	b0e4360778	x86-64: Add vector exp2/exp2f to libmvec microbenchmark Add vector exp2/exp2f and input files to libmvec microbenchmark. libmvec-exp2-inputs: 90% Normal random distribution range: (-1022.0, 1024.0) mean: 0.0 sigma: 16.0 10% uniform random distribution in range (-1000.0, 1000.0) libmvec-exp2f-inputs: 90% Normal random distribution range: (-126.0f, 128.0f) mean: 0.0f sigma: 8.0f 10% uniform random distribution in range (-100.0f, 100.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:34 -08:00
Sunil K Pandey	b0a1107042	x86-64: Add vector hypot/hypotf to libmvec microbenchmark Add vector hypot/hypotf and input files to libmvec microbenchmark. libmvec-hypot-inputs: arg1: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 10.0 10% uniform random distribution in range (-1000.0, 1000.0) arg1: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 10.0 10% uniform random distribution in range (-1000.0, 1000.0) libmvec-hypotf-inputs: arg1: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 10.0f 10% uniform random distribution in range (-1000.0f, 1000.0f) arg2: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 10.0f 10% uniform random distribution in range (-1000.0f, 1000.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:16 -08:00
Sunil K Pandey	e96f25427c	x86-64: Add vector asin/asinf to libmvec microbenchmark Add vector asin/asinf and input files to libmvec microbenchmark. libmvec-asin-inputs: 90% Normal random distribution range: (-1.0, 1.0) mean: 0.0 sigma: 1.0 10% uniform random distribution in range (-1.0, 1.0) libmvec-asinf-inputs: 90% Normal random distribution range: (-1.0f, 1.0f) mean: 0.0f sigma: 1.0f 10% uniform random distribution in range (-1.0f, 1.0f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:35:11 -08:00
Sunil K Pandey	7e05d94ea1	x86-64: Add vector atan/atanf to libmvec microbenchmark Add vector atan/atanf and input files to libmvec microbenchmark. libmvec-atan-inputs: arg1: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 4.0 10% uniform random distribution in range (-1.0e6, 1.0e6) arg2: 90% Normal random distribution range: (-DBL_MAX, DBL_MAX) mean: 0.0 sigma: 4.0 10% uniform random distribution in range (-1.0e6, 1.0e6) libmvec-atanf-inputs: arg1: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 4.0f 10% uniform random distribution in range (-1.0e6f, 1.0e6f) arg2: 90% Normal random distribution range: (-FLT_MAX, FLT_MAX) mean: 0.0f sigma: 4.0f 10% uniform random distribution in range (-1.0e6f, 1.0e6f) Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-02-06 12:24:32 -08:00
H.J. Lu	c328d0152d	x86_64/multiarch: Sort sysdep_routines and put one entry per line	2022-02-05 16:42:17 -08:00
H.J. Lu	1283948f23	x86: Improve L to support L(XXX_SYMBOL (YYY, ZZZ))	2022-02-05 16:42:17 -08:00
H.J. Lu	0e0199a9e0	x86-64: Fix strcmp-evex.S Change "movl %edx, %rdx" to "movl %edx, %edx" in: commit `8418eb3ff4` Author: Noah Goldstein <goldstein.w.n@gmail.com> Date: Mon Jan 10 15:35:39 2022 -0600 x86: Optimize strcmp-evex.S	2022-02-04 11:11:08 -08:00
H.J. Lu	c15efd011c	x86-64: Fix strcmp-avx2.S Change "movl %edx, %rdx" to "movl %edx, %edx" in: commit `b77b06e0e2` Author: Noah Goldstein <goldstein.w.n@gmail.com> Date: Mon Jan 10 15:35:38 2022 -0600 x86: Optimize strcmp-avx2.S	2022-02-04 11:09:10 -08:00
Sunil K Pandey	811124ce08	x86-64: Add vector acos/acosf to libmvec microbenchmark Add vector acos/acosf and input files to libmvec microbenchmark. libmvec-acos-inputs: 90% Normal random distribution range: (-1.0, 1.0) mean: 0.0 sigma: 1.0 10% uniform random distribution in range (-1.0, 1.0) libmvec-acosf-inputs: 90% Normal random distribution range: (-1.0f, 1.0f) mean: 0.0f sigma: 1.0f 10% uniform random distribution in range (-1.0f, 1.0f) Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-02-03 17:37:06 -08:00
Noah Goldstein	8418eb3ff4	x86: Optimize strcmp-evex.S Optimization are primarily to the loop logic and how the page cross logic interacts with the loop. The page cross logic is at times more expensive for short strings near the end of a page but not crossing the page. This is done to retest the page cross conditions with a non-faulty check and to improve the logic for entering the loop afterwards. This is only particular cases, however, and is general made up for by more than 10x improvements on the transition from the page cross -> loop case. The non-page cross cases as well are nearly universally improved. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-02-03 16:41:41 -06:00
Noah Goldstein	b77b06e0e2	x86: Optimize strcmp-avx2.S Optimization are primarily to the loop logic and how the page cross logic interacts with the loop. The page cross logic is at times more expensive for short strings near the end of a page but not crossing the page. This is done to retest the page cross conditions with a non-faulty check and to improve the logic for entering the loop afterwards. This is only particular cases, however, and is general made up for by more than 10x improvements on the transition from the page cross -> loop case. The non-page cross cases are improved most for smaller sizes [0, 128] and go about even for (128, 4096]. The loop page cross logic is improved so some more significant speedup is seen there as well. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-02-03 16:41:38 -06:00
Adhemerval Zanella	798d716df7	linux: Fix missing __convert_scm_timestamps (BZ #28860 ) Commit `948ce73b31` made recvmsg/recvmmsg to always call __convert_scm_timestamps for 64 bit time_t symbol, so adjust it to always build it for __TIMESIZE != 64. It fixes build for architecture with 32 bit time_t support when configured with minimum kernel of 5.1.	2022-02-03 16:59:16 -03:00
Gleb Fotengauer-Malinovskiy	97ba273b50	linux: __get_nprocs_sched: do not feed CPU_COUNT_S with garbage [BZ #28850 ] Pass the actual number of bytes returned by the kernel. Fixes: `33099d72e4` ("linux: Simplify get_nprocs") Reviewed-by: Dmitry V. Levin <ldv@altlinux.org>	2022-02-03 11:04:08 +00:00
Carlos O'Donell	e0beb0c9f1	Regenerate configure.	2022-02-03 00:10:03 -05:00
Florian Weimer	6c33b01843	Linux: Use ptrdiff_t for __rseq_offset This matches the data size initial-exec relocations use on most targets. Reviewed-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2022-02-02 22:37:20 +01:00
Adhemerval Zanella	6289d28d3c	posix: Replace posix_spawnattr_tc{get,set}pgrp_np with posix_spawn_file_actions_addtcsetpgrp_np The posix_spawnattr_tcsetpgrp_np works on a file descriptor (the controlling terminal), so it would make more sense to actually fit it on the file actions API. Also, POSIX_SPAWN_TCSETPGROUP is not really required since it is implicit by the presence of tcsetpgrp file action. The posix/tst-spawn6.c is also fixed when TTY can is not present. Checked on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-02-02 08:34:16 -03:00
Stafford Horne	3f35e7d193	or1k: Define PI_STATIC_AND_HIDDEN PI_STATIC_AND_HIDDEN means that references to static functions, data and symbols with hidden visibility do not need any run-time relocations after the final link, with the build flags used by glibc. OpenRISC follows this so enabled PI_STATIC_AND_HIDDEN by adding configure.ac and generating configure. Suggested-by: Florian Weimer <fweimer@redhat.com>	2022-02-02 20:05:12 +09:00
Samuel Thibault	355bc7f736	SET_RELHOOK: merge i386 and x86_64, and move to sysdeps/mach/hurd/x86 It is not Hurd-specific, but H.J. Lu wants it there. Also, dc.a can be used to avoid hardcoding .long vs .quad and thus use the same implementation for i386 and x86_64.	2022-02-01 20:08:25 +00:00
Ben Woodard	ce9a68c57c	elf: Fix runtime linker auditing on aarch64 (BZ #26643 ) The rtld audit support show two problems on aarch64: 1. _dl_runtime_resolve does not preserve x8, the indirect result location register, which might generate wrong result calls depending of the function signature. 2. The NEON Q registers pushed onto the stack by _dl_runtime_resolve were twice the size of D registers extracted from the stack frame by _dl_runtime_profile. While 2. might result in wrong information passed on the PLT tracing, 1. generates wrong runtime behaviour. The aarch64 rtld audit support is changed to: * Both La_aarch64_regs and La_aarch64_retval are expanded to include both x8 and the full sized NEON V registers, as defined by the ABI. * dl_runtime_profile needed to extract registers saved by _dl_runtime_resolve and put them into the new correctly sized La_aarch64_regs structure. * The LAV_CURRENT check is change to only accept new audit modules to avoid the undefined behavior of not save/restore x8. * Different than other architectures, audit modules older than LAV_CURRENT are rejected (both La_aarch64_regs and La_aarch64_retval changed their layout and there are no requirements to support multiple audit interface with the inherent aarch64 issues). * A new field is also reserved on both La_aarch64_regs and La_aarch64_retval to support variant pcs symbols. Similar to x86, a new La_aarch64_vector type to represent the NEON register is added on the La_aarch64_regs (so each type can be accessed directly). Since LAV_CURRENT was already bumped to support bind-now, there is no need to increase it again. Checked on aarch64-linux-gnu. Co-authored-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Reviewed-by: Szabolcs Nagy <szabolcs.nagy@arm.com> Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-02-01 14:49:46 -03:00
Adhemerval Zanella	32612615c5	elf: Issue la_symbind for bind-now (BZ #23734 ) The audit symbind callback is not called for binaries built with -Wl,-z,now or when LD_BIND_NOW=1 is used, nor the PLT tracking callbacks (plt_enter and plt_exit) since this would change the expected program semantics (where no PLT is expected) and would have performance implications (such as for BZ#15533). LAV_CURRENT is also bumped to indicate the audit ABI change (where la_symbind flags are set by the loader to indicate no possible PLT trace). To handle powerpc64 ELFv1 function descriptor, _dl_audit_symbind requires to know whether bind-now is used so the symbol value is updated to function text segment instead of the OPD (for lazy binding this is done by PPC64_LOAD_FUNCPTR on _dl_runtime_resolve). Checked on x86_64-linux-gnu, i686-linux-gnu, aarch64-linux-gnu, powerpc64-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-02-01 14:49:46 -03:00
Adhemerval Zanella	254d3d5aef	elf: Fix initial-exec TLS access on audit modules (BZ #28096 ) For audit modules and dependencies with initial-exec TLS, we can not set the initial TLS image on default loader initialization because it would already be set by the audit setup. However, subsequent thread creation would need to follow the default behaviour. This patch fixes it by setting l_auditing link_map field not only for the audit modules, but also for all its dependencies. This is used on _dl_allocate_tls_init to avoid the static TLS initialization at load time. Checked on x86_64-linux-gnu, i686-linux-gnu, and aarch64-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-02-01 14:49:46 -03:00
H.J. Lu	3fb18fd80c	elf: Add <dl-r_debug.h> Add <dl-r_debug.h> to get the adddress of the r_debug structure after relocation and its offset before relocation from the PT_DYNAMIC segment to support DT_DEBUG, DT_MIPS_RLD_MAP_REL and DT_MIPS_RLD_MAP. Co-developed-by: Xi Ruoyao <xry111@mengyan1223.wang>	2022-01-31 07:05:48 -08:00
H.J. Lu	77a602ebb0	tst-socket-timestamp-compat.c: Check __TIMESIZE [BZ #28837 ] time_t size is defined by __TIMESIZE, not __WORDSIZE. Check __TIMESIZE, instead of __WORDSIZE, for time_t size. This fixes BZ #28837.	2022-01-29 06:57:07 -08:00
Adhemerval Zanella	948ce73b31	Linux: Only generate 64 bit timestamps for 64 bit time_t recvmsg/recvmmsg The timestamps created by __convert_scm_timestamps only make sense for 64 bit time_t programs, 32 bit time_t programs will ignore 64 bit time_t timestamps since SO_TIMESTAMP will be defined to old values (either by glibc or kernel headers). Worse, if the buffer is not suffice MSG_CTRUNC is set to indicate it (which breaks some programs [1]). This patch makes only 64 bit time_t recvmsg and recvmmsg to call __convert_scm_timestamps. Also, the assumption to called it is changed from __ASSUME_TIME64_SYSCALLS to __TIMESIZE != 64 since the setsockopt might be called by libraries built without __TIME_BITS=64. The MSG_CTRUNC is only set for the 64 bit symbols, it should happen only if 64 bit time_t programs run older kernels. Checked on x86_64-linux-gnu and i686-linux-gnu. [1] https://github.com/systemd/systemd/pull/20567 Reviewed-by: Florian Weimer <fweimer@redhat.com>	2022-01-28 18:18:27 -03:00
Adhemerval Zanella	8fba672472	linux: Fix ancillary 64-bit time timestamp conversion (BZ #28349 , BZ#28350) The __convert_scm_timestamps only updates the control message last pointer for SOL_SOCKET type, so if the message control buffer contains multiple ancillary message types the converted timestamp one might overwrite a valid message. The test checks if the extra ancillary space is correctly handled by recvmsg/recvmmsg, where if there is no extra space for the 64-bit time_t converted message the control buffer should be marked with MSG_TRUNC. It also check if recvmsg/recvmmsg handle correctly multiple ancillary data. Checked on x86_64-linux and on i686-linux-gnu on both 5.11 and 4.15 kernel. Co-authored-by: Fabian Vogt <fvogt@suse.de> Reviewed-by: Florian Weimer <fweimer@redhat.com>	2022-01-28 17:46:44 -03:00
Florian Weimer	af121ae3e7	Fix glibc 2.34 ABI omission (missing GLIBC_2.34 in dynamic loader) The glibc 2.34 release really should have added a GLIBC_2.34 symbol to the dynamic loader. With it, we could move functions such as dlopen or pthread_key_create that work on process-global state into the dynamic loader (once we have fixed a longstanding issue with static linking). Without the GLIBC_2.34 symbol, yet another new symbol version would be needed because old glibc will fail to load binaries due to the missing symbol version in ld.so that newly linked programs will require. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-27 18:52:05 +01:00
H.J. Lu	501246c5e2	x86: Use CHECK_FEATURE_PRESENT to check HLE [BZ #27398 ] HLE is disabled on blacklisted CPUs. Use CHECK_FEATURE_PRESENT, instead of CHECK_FEATURE_ACTIVE, to check HLE.	2022-01-26 21:08:05 -08:00
Adhemerval Zanella	604814121d	hurd: Add posix_spawnattr_tc{get,set}pgrp_np on libc.abilist Commit `342cc934a3` missed the update-abi for the ABI.	2022-01-26 16:22:54 -03:00
Adhemerval Zanella	342cc934a3	posix: Add terminal control setting support for posix_spawn Currently there is no proper way to set the controlling terminal through posix_spawn in race free manner [1]. This forces shell implementations to keep using fork+exec when launching background process groups, even when using posix_spawn yields better performance. This patch adds a new GNU extension so the creating process can configure the created process terminal group. This is done with a new flag, POSIX_SPAWN_TCSETPGROUP, along with two new attribute functions: posix_spawnattr_tcsetpgrp_np, and posix_spawnattr_tcgetpgrp_np. The function sets a new attribute, spawn-tcgroupfd, that references to the controlling terminal. The controlling terminal is set after the spawn-pgroup attribute, and uses the spawn-tcgroupfd along with current creating process group (so it is composable with POSIX_SPAWN_SETPGROUP). To create a process and set the controlling terminal, one can use the following sequence: posix_spawnattr_t attr; posix_spawnattr_init (&attr); posix_spawnattr_setflags (&attr, POSIX_SPAWN_TCSETPGROUP); posix_spawnattr_tcsetpgrp_np (&attr, tcfd); If the idea is also to create a new process groups: posix_spawnattr_t attr; posix_spawnattr_init (&attr); posix_spawnattr_setflags (&attr, POSIX_SPAWN_TCSETPGROUP \| POSIX_SPAWN_SETPGROUP); posix_spawnattr_tcsetpgrp_np (&attr, tcfd); posix_spawnattr_setpgroup (&attr, 0); The controlling terminal file descriptor is ignored if the new flag is not set. This interface is slight different than the one provided by QNX [2], which only provides the POSIX_SPAWN_TCSETPGROUP flag. The QNX documentation does not specify how the controlling terminal is obtained nor how it iteracts with POSIX_SPAWN_SETPGROUP. Since a glibc implementation is library based, it is more straightforward and avoid requires additional file descriptor operations to request the caller to setup the controlling terminal file descriptor (and it also allows a bit less error handling by posix_spawn). Checked on x86_64-linux-gnu and i686-linux-gnu. [1] https://github.com/ksh93/ksh/issues/79 [2] https://www.qnx.com/developers/docs/7.0.0/index.html#com.qnx.doc.neutrino.lib_ref/topic/p/posix_spawn.html Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-01-25 14:07:53 -03:00
Florian Weimer	5b8e7980c5	Linux: Detect user namespace support in io/tst-getcwd-smallbuff Otherwise the test fails with certain container runtimes. Reviewed-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2022-01-24 18:14:24 +01:00
Siddhesh Poyarekar	23e0e8f5f1	getcwd: Set errno to ERANGE for size == 1 (CVE-2021-3999) No valid path returned by getcwd would fit into 1 byte, so reject the size early and return NULL with errno set to ERANGE. This change is prompted by CVE-2021-3999, which describes a single byte buffer underflow and overflow when all of the following conditions are met: - The buffer size (i.e. the second argument of getcwd) is 1 byte - The current working directory is too long - '/' is also mounted on the current working directory Sequence of events: - In sysdeps/unix/sysv/linux/getcwd.c, the syscall returns ENAMETOOLONG because the linux kernel checks for name length before it checks buffer size - The code falls back to the generic getcwd in sysdeps/posix - In the generic func, the buf[0] is set to '\0' on line 250 - this while loop on line 262 is bypassed: while (!(thisdev == rootdev && thisino == rootino)) since the rootfs (/) is bind mounted onto the directory and the flow goes on to line 449, where it puts a '/' in the byte before the buffer. - Finally on line 458, it moves 2 bytes (the underflowed byte and the '\0') to the buf[0] and buf[1], resulting in a 1 byte buffer overflow. - buf is returned on line 469 and errno is not set. This resolves BZ #28769. Reviewed-by: Andreas Schwab <schwab@linux-m68k.org> Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org> Signed-off-by: Qualys Security Advisory <qsa@qualys.com> Signed-off-by: Siddhesh Poyarekar <siddhesh@sourceware.org>	2022-01-24 11:00:17 +05:30
Samuel Thibault	8c86ba4463	htl: Fix cleaning the reply port If any RPC fails, the reply port will already be deallocated. __pthread_thread_terminate thus has to defer taking its name until the very last __thread_terminate_release which doesn't reply a message. But then we have to read from the pthread structure. This introduces __pthread_dealloc_finish() which does the recording of the thread termination, so the slot can be reused really only just before the __thread_terminate_release call. Only the real thread can set it, so let's decouple this from the pthread_state by just removing the PTHREAD_TERMINATED state and add a terminated field.	2022-01-22 02:17:19 +01:00
Florian Weimer	f44820821a	mips: Move DT_MIPS into <ldsodefs.h> ELF_MACHINE_XHASH_SETUP in that file needs it. Fixes commit `c90363403b` ("elf: Move _dl_setup_hash to its own file").	2022-01-19 20:11:55 +01:00
H.J. Lu	1e000d3d33	x86: Black list more Intel CPUs for TSX [BZ #27398 ] Disable TSX and enable RTM_ALWAYS_ABORT for Intel CPUs listed in: https://www.intel.com/content/www/us/en/support/articles/000059422/processors.html This fixes BZ #27398. Reviewed-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-01-18 14:20:09 -08:00
Samuel Thibault	f8b765bec4	htl: Fix build error in annexc We were getting ../scripts/evaluate-test.sh posix/annexc $? true false > /usr/src/glibc-upstream/build/posix/annexc.test-result In file included from ../include/pthread.h:1, from <stdin>:1: ../sysdeps/htl/include/pthread.h:7:62: error: missing binary operator before token "(" 7 \| # if defined __USE_EXTERN_INLINES && defined _LIBC && !IS_IN (libsupport) \| ^	2022-01-17 23:18:27 +00:00
Aurelien Jarno	c242fcce06	x86: use default cache size if it cannot be determined [BZ #28784 ] In some cases (e.g QEMU, non-Intel/AMD CPU) the cache information can not be retrieved and the corresponding values are set to 0. Commit `2d651eb926` ("x86: Move x86 processor cache info to cpu_features") changed the behaviour in such case by defining the __x86_shared_cache_size and __x86_data_cache_size variables to 0 instead of using the default values. This cause an issue with the i686 SSE2 optimized bzero/routine which assumes that the cache size is at least 128 bytes, and otherwise tries to zero/set the whole address space minus 128 bytes. Fix that by restoring the original code to only update __x86_shared_cache_size and __x86_data_cache_size variables if the corresponding cache sizes are not zero. Fixes bug 28784 Fixes commit `2d651eb926` Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-17 19:42:46 +01:00
Adhemerval Zanella	5f3a7ebc35	Linux: Add epoll_pwait2 (BZ #27359 ) It is similar to epoll_wait, with the difference the timeout has nanosecond resoluting by using struct timespec instead of int. Although Linux interface only provides 64 bit time_t support, old 32 bit interface is also provided (so keep in sync with current practice and to no force opt-in on 64 bit time_t). Checked on x86_64-linux-gnu and i686-linux-gnu. Reviewed-by: Florian Weimer <fweimer@redhat.com>	2022-01-17 14:34:54 -03:00
Adhemerval Zanella	9fe6f63638	elf: Fix 64 time_t support for installed statically binaries The usage of internal static symbol for statically linked binaries does not work correctly for objects built with -D_TIME_BITS=64, since the internal definition does not provide the expected aliases. This patch makes it to use the default stat functions instead (which uses the default 64 time_t alias and types). Checked on i686-linux-gnu. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-01-17 10:57:09 -03:00
Adhemerval Zanella	cedd498dbc	Revert "elf: Fix 64 time_t support for installed statically binaries" This reverts commit `0b8e83eb14`.	2022-01-17 10:56:58 -03:00
Samuel Thibault	41a11a5e83	hurd: optimize exec cleanup When ports are nul we do not need to request their deallocation. It is also useless to look for them in portnames.	2022-01-16 00:02:16 +01:00
Samuel Thibault	54dda2cdba	hurd: Add __rtld_execve It trivially execs with the same dtable, portarray and intarray, and only has to take care of deallocating / destroying ports (file, notably).	2022-01-15 23:42:35 +01:00
Samuel Thibault	1bd7a06a95	htl: Hide __pthread_attr's __schedparam type [BZ #23088 ] The content of the structure is only used internally, so we can make __pthread_attr_getschedparam and __pthread_attr_setschedparam convert between the public sched_param type and an internal __sched_param. This allows to avoid to spuriously expose the sched_param type. This fixes BZ #23088.	2022-01-15 21:31:08 +01:00
Samuel Thibault	c1105e34ac	htl: Clear kernel_thread field before releasing the thread structure Otherwise this is a use-after-free.	2022-01-15 21:31:08 +01:00
Samuel Thibault	67ca1c5560	hurd: Fix timer/clock_getres crash on NULL res parameter POSIX allows res to be NULL.	2022-01-15 15:37:03 +01:00
Samuel Thibault	2c040d0b90	hurd: Fix pthread_kill on exiting/ted thread We have to drop the kernel_thread port from the thread structure, to avoid pthread_kill's call to _hurd_thread_sigstate trying to reference it and fail.	2022-01-15 15:11:54 +01:00
Samuel Thibault	dfb204d87f	[hurd] Drop spurious #ifdef SHARED The whole file is already #ifdef SHARED	2022-01-15 14:23:37 +01:00
Samuel Thibault	f05faf5f22	[hurd] Call _dl_sort_maps_init in _dl_sysdep_start This follows `15a0c5730d` ("elf: Fix slow DSO sorting behavior in dynamic loader (BZ #17645)").	2022-01-15 14:21:53 +01:00
Florian Weimer	f01d482f03	s390x: Use <gcc-macros.h> in early HWCAP check This is required so that the checks still work if $(early-cflags) selects a different ISA level. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-01-14 20:17:58 +01:00
Florian Weimer	990c953bce	x86: Add x86-64-vN check to early startup This ISA level covers the glibc build itself. <dl-hwcap-check.h> cannot be used because this check (by design) happens before DL_PLATFORM_INIT and the x86 CPU flags initialization. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-14 20:17:49 +01:00
Florian Weimer	5501164866	powerpc64le: Use <gcc-macros.h> in early HWCAP check This is required so that the checks still work if $(early-cflags) selects a different ISA level. Reviewed-by: Carlos O'Donell <carlos@redhat.com> Tested-by: Carlos O'Donell <carlos@redhat.com>	2022-01-14 20:17:40 +01:00
Florian Weimer	5732a881aa	x86: HAVE_X86_LAHF_SAHF, HAVE_X86_MOVBE and -march=x86-64-vN (bug 28782) HAVE_X86_LAHF_SAHF is implied by x86-64-v2, and HAVE_X86_MOVBE by x86-64-v3. The individual flag does not appear in -fverbose-asm flag output even if the ISA level implies it. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-14 16:09:20 +01:00
Sunil K Pandey	047512374a	math: Add more inputs to atan2 accuracy tests [BZ #28765 ] This patch adds following inputs: 0x1.bcab29da0e947p-54 0x1.bc41f4d2294b8p-54 0x1.a11891ec004d4p-348 0x1.814830510be26p-348 0x1.b836ed678be29p-588 0x1.b7be6f5a03a8cp-588 0x1.a83f842ef3f73p-633 0x1.a799d8a6677ep-633 to atan2 tests and updates x86_64 double atan2 ulps. This fixes BZ #28765. Reviewed-By: Paul Zimmermann <Paul.Zimmermann@inria.fr>	2022-01-14 06:00:06 -08:00
Joseph Myers	4997a533ae	Update syscall lists for Linux 5.16 Linux 5.16 has one new syscall, futex_waitv. Update syscall-names.list and regenerate the arch-syscall.h headers with build-many-glibcs.py update-syscalls. Tested with build-many-glibcs.py.	2022-01-13 22:18:13 +00:00
Florian Weimer	a78e6a10d0	i386: Remove broken CAN_USE_REGISTER_ASM_EBP (bug 28771) The configure check for CAN_USE_REGISTER_ASM_EBP tried to compile a simple function that uses %ebp as an inline assembly operand. If compilation failed, CAN_USE_REGISTER_ASM_EBP was set 0, which eventually had these consequences: (1) %ebx was avoided as an inline assembly operand, with an assembler macro hack to avoid unnecessary register moves. (2) %ebp was avoided as an inline assembly operand, using an out-of-line syscall function for 6-argument system calls. (1) is no longer needed for any GCC version that is supported for building glibc. %ebx can be used directly as a register operand. Therefore, this commit removes the %ebx avoidance completely. This avoids the assembler macro hack, which turns out to be incompatible with the current Systemtap probe macros (which switch to .altmacro unconditionally). (2) is still needed in many build configurations. The existing configure check cannot really capture that because the simple function succeeds to compile, while the full glibc build still fails. Therefore, this commit removes the check, the CAN_USE_REGISTER_ASM_EBP macro, and uses the out-of-line syscall function for 6-argument system calls unconditionally. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-13 14:59:44 +01:00
Sunil K Pandey	49e2bf58d5	x86_64: Fix SSE4.2 libmvec atan2 function accuracy [BZ #28765 ] This patch fixes SSE4.2 libmvec atan2 function accuracy for following inputs to less than 4 ulps. {0x1.bcab29da0e947p-54,0x1.bc41f4d2294b8p-54} 4.19888 ulps {0x1.b836ed678be29p-588,0x1.b7be6f5a03a8cp-588} 4.09889 ulps This fixes BZ #28765. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2022-01-12 13:23:22 -08:00
Adhemerval Zanella	572e0c8554	Revert "linux: Fix ancillary 64-bit time timestamp conversion (BZ #28349 , BZ #28350 )" This reverts commit `21e0f45c7d`.	2022-01-12 10:35:06 -03:00
Adhemerval Zanella	21e0f45c7d	linux: Fix ancillary 64-bit time timestamp conversion (BZ #28349 , BZ #28350 ) The __convert_scm_timestamps() only updates the control message last pointer for SOL_SOCKET type, so if the message control buffer contains multiple ancillary message types the converted timestamp one might overwrite a valid message. The test check if the extra ancillary space is correctly handled by recvmsg/recvmmsg, where if there is no extra space for the 64-bit time_t converted message the control buffer should be marked with MSG_TRUNC. It also check if recvmsg/recvmmsg handle correctly multiple ancillary data. Checked on x86_64-linux and on i686-linux-gnu on both 5.11 and 4.15 kernel. Co-authored-by: Fabian Vogt <fvogt@suse.de>	2022-01-12 10:30:10 -03:00
Adhemerval Zanella	0b8e83eb14	elf: Fix 64 time_t support for installed statically binaries The usage of internal static symbol for statically linked binaries does not work correctly for objects built with -D_TIME_BITS=64, since the internal definition does not provide the expected aliases. This patch makes it to use the default stat functions instead (which uses the default 64 time_t alias and types). Checked on i686-linux-gnu.	2022-01-12 10:30:10 -03:00
Szabolcs Nagy	5a1be8ebdf	aarch64: Add HWCAP2_ECV from Linux 5.16 Indicates the availability of enhanced counter virtualization extension of armv8.6-a with self-synchronized virtual counter CNTVCTSS_EL0 usable in userspace.	2022-01-11 16:05:16 +00:00
Noah Goldstein	7e08db3359	x86: Fix __wcsncmp_evex in strcmp-evex.S [BZ# 28755] Fixes [BZ# 28755] for wcsncmp by redirecting length >= 2^56 to __wcscmp_evex. For x86_64 this covers the entire address range so any length larger could not possibly be used to bound `s1` or `s2`. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-01-10 20:31:57 -06:00
Noah Goldstein	ddf0992cf5	x86: Fix __wcsncmp_avx2 in strcmp-avx2.S [BZ# 28755] Fixes [BZ# 28755] for wcsncmp by redirecting length >= 2^56 to __wcscmp_avx2. For x86_64 this covers the entire address range so any length larger could not possibly be used to bound `s1` or `s2`. test-strcmp, test-strncmp, test-wcscmp, and test-wcsncmp all pass. Signed-off-by: Noah Goldstein <goldstein.w.n@gmail.com>	2022-01-10 20:31:46 -06:00
Szabolcs Nagy	347a5b592c	math: Fix float conversion regressions with gcc-12 [BZ #28713 ] Converting double precision constants to float is now affected by the runtime dynamic rounding mode instead of being evaluated at compile time with default rounding mode (except static object initializers). This can change the computed result and cause performance regression. The known correctness issues (increased ulp errors) are already fixed, this patch fixes remaining cases of unnecessary runtime conversions. Add float M_* macros to math.h as new GNU extension API. To avoid conversions the new M_* macros are used and instead of casting double literals to float, use float literals (only required if the conversion is inexact). The patch was tested on aarch64 where the following symbols had new spurious conversion instructions that got fixed: __clog10f __gammaf_r_finite@GLIBC_2.17 __j0f_finite@GLIBC_2.17 __j1f_finite@GLIBC_2.17 __jnf_finite@GLIBC_2.17 __kernel_casinhf __lgamma_negf __log1pf __y0f_finite@GLIBC_2.17 __y1f_finite@GLIBC_2.17 cacosf cacoshf casinhf catanf catanhf clogf gammaf_positive Fixes bug 28713. Reviewed-by: Paul Zimmermann <Paul.Zimmermann@inria.fr>	2022-01-10 14:27:17 +00:00
Florian Weimer	6b0978c14a	Restore ENTRY_POINT definition on hppa, ia64 (bug 28749) ENTRY_POINT is still needed for elf/rtld.c. Fixes commit `4fb4e7e821` ("csu: Always use __executable_start in gmon-start.c").	2022-01-07 14:47:31 +01:00
Samuel Thibault	d5b0046e3d	ttydefaults.h: Fix CSTATUS to control-t 4.4BSD actually defaults CSTATUS to control-t, so our generic header should as well.	2022-01-07 00:23:05 +01:00
Wilco Dijkstra	e5fa62b8db	AArch64: Check for SVE in ifuncs [BZ #28744 ] Add a check for SVE in the A64FX ifuncs for memcpy, memset and memmove. This fixes BZ #28744.	2022-01-06 14:36:28 +00:00
Adhemerval Zanella	65ccd641ba	debug: Remove catchsegv and libSegfault (BZ #14913 ) Trapping SIGSEGV within the process is error-prone, adds security issues, and modern analysis design tends to happen out of the process (either by attaching a debugger or by post-mortem analysis). The libSegfault also has some design problems, it uses non async-signal-safe function (backtrace) on signal handler. There are multiple alternatives if users do want to use similar functionality, such as sigsegv gnulib module or libsegfault.	2022-01-06 07:59:49 -03:00
Stafford Horne	0c3c62ca7d	or1k: Build Infrastructure Here we define the minumum linux kernel version at 5.4.0, as that is the long term support version where 32-bit architectures start to support 64-bit time API's. The OpenRISC kernel had some bugs up until version 5.8 which caused issues with glibc fork/clone, they have been backported to 5.4 but not previous versions. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	d147259b5c	or1k: ABI lists Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	7d334b1831	or1k: Linux ABI Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	1871c95f2b	or1k: Linux Syscall Interface Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	9a47b9660b	or1k: math soft float support OpenRISC support hard float but I will like to submit that after glibc soft float goes upstream. The hard float support depends on adding user access to the FPCSR, which is not supported by the kernel yet. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	9f3653b1fa	or1k: Atomics and Locking primitives Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	96882a00ce	or1k: Thread Local Storage support OpenRISC includes 3 TLS addressing models. Local Dynamic optimizations are not done in the linker and therefore use the same code sequences as Global Dynamic. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	de5c0edc80	or1k: startup and dynamic linking code Code for C runtime startup and dynamic loading including PLT layout. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:06 +09:00
Stafford Horne	6e5964311d	or1k: ABI Implementation This code deals with the OpenRISC ABI. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:05 +09:00
Stafford Horne	9dde3a24f1	linux/syscalls: Add or1k_atomic syscall for OpenRISC Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2022-01-05 06:40:05 +09:00
H.J. Lu	9288c92d00	elf: Add <dl-debug.h> Add <dl-debug.h> to setup debugging entry in PT_DYNAMIC segment to support DT_DEBUG, DT_MIPS_RLD_MAP_REL and DT_MIPS_RLD_MAP. Tested on x86-64, x32 and i686 as well as with build-many-glibcs.py.	2022-01-03 05:16:03 -08:00
Paul Eggert	634b5ebac6	Update copyright dates not handled by scripts/update-copyrights. I've updated copyright dates in glibc for 2022. This is the patch for the changes not generated by scripts/update-copyrights and subsequent build / regeneration of generated files. As well as the usual annual updates, mainly dates in --version output (minus csu/version.c which previously had to be handled manually but is now successfully updated by update-copyrights), there is a small change to the copyright notice in NEWS which should let NEWS get updated automatically next year. Please remember to include 2022 in the dates for any new files added in future (which means updating any existing uncommitted patches you have that add new files to use the new copyright dates in them).	2022-01-01 11:42:26 -08:00
Paul Eggert	581c785bf3	Update copyright dates with scripts/update-copyrights I used these shell commands: ../glibc/scripts/update-copyrights $PWD/../gnulib/build-aux/update-copyright (cd ../glibc && git commit -am"[this commit message]") and then ignored the output, which consisted lines saying "FOO: warning: copyright statement not found" for each of 7061 files FOO. I then removed trailing white space from math/tgmath.h, support/tst-support-open-dev-null-range.c, and sysdeps/x86_64/multiarch/strlen-vec.S, to work around the following obscure pre-commit check failure diagnostics from Savannah. I don't know why I run into these diagnostics whereas others evidently do not. remote: * 912-#endif remote: * 913: remote: * 914- remote: * error: lines with trailing whitespace found ... remote: *** error: sysdeps/unix/sysv/linux/statx_cp.c: trailing lines	2022-01-01 11:40:24 -08:00
Samuel Thibault	edb5ab841a	hurd: Use __trivfs_server_name instead of trivfs_server_name The latter violates namespace contraints	2022-01-01 17:51:18 +01:00
Samuel Thibault	35cf8a85ed	hurd: Bump BRK_START to 0x20000000 By nowadays uses, 256MiB is not that large for the program+libraries. Let's push the heap further to leave room for e.g. clang.	2021-12-31 18:25:49 +01:00
Samuel Thibault	8c0727af63	hurd: Avoid overzealous shared objects constraints `407765e9f2` ("hurd: Fix ELF_MACHINE_USER_ADDRESS_MASK value") switched ELF_MACHINE_USER_ADDRESS_MASK from 0xf8000000UL to 0xf0000000UL to let libraries etc. get loaded at 0x08000000. But ELF_MACHINE_USER_ADDRESS_MASK is actually only meaningful for the main program anyway, so keep it at 0xf8000000UL to prevent the program loader from putting ld.so beyond 0x08000000. And conversely, drop the use of ELF_MACHINE_USER_ADDRESS_MASK for shared objects, which don't need any constraints since the program will have already be loaded by then.	2021-12-31 18:22:46 +01:00
Adhemerval Zanella	1f17da01e6	time: Refactor timesize.h for some ABIs Commit `a4b4131355` changed default __TIMESIZE to 64, however it added sub-architecture timesize.h for powerpc, s390, and sparc. Also simplify mips by removing _MIPS_SIM usage (which would require to add sgidefs inclusion.	2021-12-31 10:58:13 -03:00
Samuel Thibault	33e8e95cbd	hurd: Make getrandom a stub inside the random translator glibc uses /dev/urandom for getrandom(), and from version 2.34 malloc initialization uses it. We have to detect when we are running the random translator itself, in which case we can't read ourself.	2021-12-31 08:54:41 +01:00
Stafford Horne	4dfa8f4870	open64: Force O_LARGEFILE on all architectures When running tests on OpenRISC which has 32-bit wordsize but 64-bit timesize it was found that O_LARGEFILE is not being set when calling open64. For 64-bit architectures the O_LARGEFILE flag is generally implied by the kernel according to force_o_largefile. However, for 32-bit architectures this is not done. For this patch we unconditionally now set the O_LARGEFILE flag for open64 class syscalls as there is no harm in doing so. Tested on the OpenRISC the build works and timezone/tst-tzset passes which was failing before. I would expect this also would fix arc. Reviewed-by: Adhemerval Zanella <adhemerval.zanella@linaro.org>	2021-12-31 07:10:19 +09:00
Sunil K Pandey	c21c7bc24e	x86-64: Add vector tan/tanf implementation to libmvec Implement vectorized tan/tanf containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector tan/tanf with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-30 10:19:13 -08:00
Sunil K Pandey	8881cca8fb	x86-64: Add vector erfc/erfcf implementation to libmvec Implement vectorized erfc/erfcf containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector erfc/erfcf with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-30 10:19:03 -08:00
Sunil K Pandey	e682d01578	x86-64: Add vector asinh/asinhf implementation to libmvec Implement vectorized asinh/asinhf containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector asinh/asinhf with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-29 11:38:56 -08:00
Sunil K Pandey	c0f36fc303	x86-64: Add vector tanh/tanhf implementation to libmvec Implement vectorized tanh/tanhf containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector tanh/tanhf with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-29 11:38:50 -08:00
Sunil K Pandey	f9ce13fdac	x86-64: Add vector erf/erff implementation to libmvec Implement vectorized erf/erff containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector erf/erff with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-29 11:38:44 -08:00
Sunil K Pandey	0625489ccc	x86-64: Add vector acosh/acoshf implementation to libmvec Implement vectorized acosh/acoshf containing SSE, AVX, AVX2 and AVX512 versions for libmvec as per vector ABI. It also contains accuracy and ABI tests for vector acosh/acoshf with regenerated ulps. Reviewed-by: H.J. Lu <hjl.tools@gmail.com>	2021-12-29 11:38:39 -08:00

1 2 3 4 5 ...

14832 Commits