Go to file
Szabolcs Nagy 72aa623345 Optimized generic expf and exp2f with wrappers
Based on new expf and exp2f code from
https://github.com/ARM-software/optimized-routines/

with wrapper on aarch64:
expf reciprocal-throughput: 2.3x faster
expf latency: 1.7x faster
without wrapper on aarch64:
expf reciprocal-throughput: 3.3x faster
expf latency: 1.7x faster
without wrapper on aarch64:
exp2f reciprocal-throughput: 2.8x faster
exp2f latency: 1.3x faster
libm.so size on aarch64:
.text size: -152 bytes
.rodata size: -1740 bytes
expf/exp2f worst case nearest rounding error: 0.502 ulp
worst case non-nearest rounding error: 1 ulp

Error checks are inline and errno setting is in separate tail called
functions, but the wrappers are kept in this patch to handle the
_LIB_VERSION==_SVID_ case.  (So e.g. errno is set twice for expf calls
and once for __expf_finite calls on targets where the new code is used.)

Double precision arithmetics is used which is expected to be faster on
most targets (including soft-float) than using single precision and it
is easier to get good precision result with it.

Const data is kept in a separate translation unit which complicates
maintenance a bit, but is expected to give good code for literal loads
on most targets and allows sharing data across expf, exp2f and powf.
(This data is disabled on i386, m68k and ia64 which have their own
expf, exp2f and powf code.)

Some details may need target specific tweaks:
- best convert and round to int operation in the arg reduction may be
different across targets.
- code was optimized on fma target, optimal polynomial eval may be
different without fma.
- gcc does not always generate good code for fp bit representation
access via unions or it may be inherently slow on some targets.

The libm-test-ulps will need adjustment because..
- The argument reduction ideally uses nearest rounded rint, but that is
not efficient on most targets, so the polynomial can get evaluated on a
wider interval in non-nearest rounding mode making 1 ulp errors common
in that case.
- The polynomial is evaluated such that it may have 1 ulp error on
negative tiny inputs with upward rounding.

	* math/Makefile (type-float-routines): Add math_errf and e_exp2f_data.
	* sysdeps/aarch64/fpu/math_private.h (TOINT_INTRINSICS): Define.
	(roundtoint, converttoint): Likewise.
	* sysdeps/ieee754/flt-32/e_expf.c: New implementation.
	* sysdeps/ieee754/flt-32/e_exp2f.c: New implementation.
	* sysdeps/ieee754/flt-32/e_exp2f_data.c: New file.
	* sysdeps/ieee754/flt-32/math_config.h: New file.
	* sysdeps/ieee754/flt-32/math_errf.c: New file.
	* sysdeps/ieee754/flt-32/t_exp2f.h: Remove.
	* sysdeps/i386/fpu/e_exp2f_data.c: New file.
	* sysdeps/i386/fpu/math_errf.c: New file.
	* sysdeps/ia64/fpu/e_exp2f_data.c: New file.
	* sysdeps/ia64/fpu/math_errf.c: New file.
	* sysdeps/m68k/m680x0/fpu/e_exp2f_data.c: New file.
	* sysdeps/m68k/m680x0/fpu/math_errf.c: New file.
2017-09-25 10:44:39 +01:00
argp Remove __need macros from errno.h (__need_Emath, __need_error_t). 2017-06-14 08:14:34 -04:00
assert Fix position of tests-unsupported definition in assert/Makefile. 2017-08-22 00:30:51 +00:00
benchtests Add exp2f and log2f benchmark trace 2017-09-20 10:04:12 +01:00
bits hurd: Fix bits/socket.h conformity 2017-09-24 22:21:41 +02:00
catgets Don't compile non-lib modules as lib modules [BZ #21864] 2017-08-21 05:34:54 -07:00
ChangeLog.old Move all old ChangeLogs to a top-level ChangeLog.old directory. 2017-09-01 09:31:43 -04:00
conform Fix mcontext_t sigcontext namespace (bug 21457). 2017-08-30 22:02:04 +00:00
crypt Remove non-add-on Banner files. 2017-09-21 17:49:51 +00:00
csu Remove Banner mechanism. 2017-09-22 17:43:42 +00:00
ctype Use locale_t, not __locale_t, throughout glibc 2017-06-20 20:30:06 -04:00
debug Enable unwind info in libc-start.c and backtrace.c 2017-09-19 15:07:58 +01:00
dev Rename xlocale.h to bits/types/__locale_t.h. 2017-06-20 20:28:11 -04:00
dirent dirent: Remove internal_function attribute 2017-08-31 16:02:40 +02:00
dlfcn dlfcn: Remove internal_function attribute 2017-08-31 16:12:03 +02:00
elf Fix $(error) use in Makefile 2017-09-10 17:39:39 +02:00
gmon gmon: Remove internal_function attribute 2017-08-31 16:16:07 +02:00
gnulib Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
grp Remove compat from DEFAULT_CONFIG lookup strings 2017-09-12 10:21:48 -07:00
gshadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
hesiod Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
hurd hurd: fix gethostname(NULL, 0) 2017-09-07 00:51:17 +02:00
iconv iconv, intl, locale, wcsmbs: Remove internal_function 2017-08-31 15:59:06 +02:00
iconvdata Add new codepage charmaps/IBM858 [BZ #21084] 2017-09-14 15:50:57 +02:00
include Make more libm functions into weak aliases. 2017-09-14 22:28:53 +00:00
inet inet: Remove internal_function attribute 2017-08-31 18:52:00 +02:00
intl iconv, intl, locale, wcsmbs: Remove internal_function 2017-08-31 15:59:06 +02:00
io Add missing libc_hidden_weak/def calls 2017-09-17 21:57:39 +02:00
libidn Remove Banner mechanism. 2017-09-22 17:43:42 +00:00
libio Finish change from “Bengali” to “Bangla” 2017-09-09 12:41:54 +02:00
locale Finish change from “Bengali” to “Bangla” 2017-09-09 12:41:54 +02:00
localedata Add new codepage charmaps/IBM858 [BZ #21084] 2017-09-14 15:50:57 +02:00
login Consolidate remaning non cancellable definitions 2017-08-23 10:47:03 -03:00
mach hurd: Remove duplicate symbol version 2017-08-28 14:19:55 +02:00
malloc Sync scratch_buffer with gnulib 2017-09-08 15:51:34 +02:00
manual Remove Banner mechanism. 2017-09-22 17:43:42 +00:00
math Optimized generic expf and exp2f with wrappers 2017-09-25 10:44:39 +01:00
mathvec Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
misc hurd: Fix build ot misc/tst-preadvwritev2-common.c 2017-09-03 17:51:15 +02:00
nis Consistently use uintN_t not u_intN_t everywhere. 2017-08-07 19:55:34 +00:00
nptl Remove non-add-on Banner files. 2017-09-21 17:49:51 +00:00
nptl_db Move all old ChangeLogs to a top-level ChangeLog.old directory. 2017-09-01 09:31:43 -04:00
nscd Remove compat from DEFAULT_CONFIG lookup strings 2017-09-12 10:21:48 -07:00
nss Remove compat from DEFAULT_CONFIG lookup strings 2017-09-12 10:21:48 -07:00
po Update translations 2017-09-11 05:50:49 +05:30
posix Use execveat syscall in fexecve (bug 22134) 2017-09-19 16:19:14 +02:00
pwd Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
resolv Remove non-add-on Banner files. 2017-09-21 17:49:51 +00:00
resource Define struct rusage in sys/wait.h when required (bug 21575). 2017-06-19 11:59:19 +00:00
rt aio: Remove internal_function function attribute 2017-08-31 15:59:06 +02:00
scripts Use MPFR 3.1.6 in build-many-glibcs.py. 2017-09-07 14:50:32 +00:00
setjmp Remove __need macros from signal.h. 2017-05-20 19:04:43 -04:00
shadow Remove __need macros from stdio.h and wchar.h. 2017-06-08 13:58:17 -04:00
signal Fix sigval namespace (bug 21944). 2017-08-16 20:33:59 +00:00
socket __opensock: Remove internal_function attribute 2017-08-17 10:18:15 +02:00
soft-fp Remove non-add-on Banner files. 2017-09-21 17:49:51 +00:00
stdio-common linux: Implement tmpfile with O_TMPFILE (BZ#21530) 2017-09-01 09:52:47 -03:00
stdlib Fix BZ# 22180. 2017-09-21 12:14:41 -07:00
streams Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
string string/stratcliff.c: Replace int with size_t [BZ #21982] 2017-08-23 08:23:02 -07:00
sunrpc sunrpc/tst-udp-nonblocking: Fix timeout value 2017-09-10 21:09:28 +02:00
support resolv_test.c: also cope with CONNREFUSED errors returned by recvfrom 2017-09-25 01:11:43 +02:00
sysdeps Optimized generic expf and exp2f with wrappers 2017-09-25 10:44:39 +01:00
sysvipc Fix test-sysvsem on some platforms 2017-01-02 18:53:50 -02:00
termios Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
time time: Remove the internal_function attribute 2017-08-31 15:59:07 +02:00
timezone zic: Use PRIdMAX to print line numbers 2017-07-25 12:34:14 +05:30
wcsmbs iconv, intl, locale, wcsmbs: Remove internal_function 2017-08-31 15:59:06 +02:00
wctype Use locale_t, not __locale_t, throughout glibc 2017-06-20 20:30:06 -04:00
.gitattributes Assume __NR_openat is always defined 2016-03-23 23:35:08 +01:00
.gitignore Add *.pyc to .gitignore 2015-05-18 15:26:26 +05:30
abi-tags Remove the bulk of the NaCl port. 2017-05-20 08:09:10 -04:00
aclocal.m4 gmon: Add test for basic mcount/gprof functionality 2017-08-15 15:49:45 +02:00
ChangeLog Optimized generic expf and exp2f with wrappers 2017-09-25 10:44:39 +01:00
config.h.in i386: Do not set internal_function 2017-08-14 15:08:48 +02:00
config.make.in gmon: Add test for basic mcount/gprof functionality 2017-08-15 15:49:45 +02:00
configure gmon: Add test for basic mcount/gprof functionality 2017-08-15 15:49:45 +02:00
configure.ac Check linker support for INSERT in linker script 2017-08-04 12:17:30 -07:00
COPYING Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
COPYING.LIB Update to latest versions of GPL-2.0 and LGPL-2.1 2013-09-09 12:52:48 +10:00
extra-lib.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
gen-locales.mk Split locale generation snippet into a separate file 2015-05-13 13:05:28 +05:30
INSTALL Update contributors and latest gcc and binutils versions 2017-08-02 18:22:58 +05:30
libc-abis
libof-iterator.mk Rename cppflags-iterator.mk to libof-iterator.mk, remove extra-modules.mk. 2017-05-09 07:06:29 -04:00
LICENSES Expand LICENSES file. 2012-12-05 21:56:15 +00:00
MAINTAINERS Add MAINTAINERS 2017-05-11 13:38:30 -04:00
Makeconfig Support mcount/gprof test with GCC defaulting to PIE 2017-08-17 04:56:01 -07:00
Makefile Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
Makefile.in New make target to only build benchmark binaries 2016-04-20 10:23:28 +05:30
Makerules Place $(elf-objpfx)sofini.os last [BZ #22051] 2017-08-31 06:28:46 -07:00
NEWS Optimized generic expf and exp2f with wrappers 2017-09-25 10:44:39 +01:00
o-iterator.mk
README Require Linux kernel 3.2 or later on x86 / x86_64. 2017-05-08 10:45:20 +00:00
Rules Suppress internal declarations for most of the testsuite. 2017-05-11 19:27:59 -04:00
shlib-versions Extend NSS test suite 2017-07-17 15:52:44 -04:00
test-skeleton.c Update copyright dates with scripts/update-copyrights. 2017-01-01 00:14:16 +00:00
version.h Open master for development 2017-08-02 19:23:16 +05:30

This directory contains the sources of the GNU C Library.
See the file "version.h" for what release version you have.

The GNU C Library is the standard system C library for all GNU systems,
and is an important part of what makes up a GNU system.  It provides the
system API for all programs written in C and C-compatible languages such
as C++ and Objective C; the runtime facilities of other programming
languages use the C library to access the underlying operating system.

In GNU/Linux systems, the C library works with the Linux kernel to
implement the operating system behavior seen by user applications.
In GNU/Hurd systems, it works with a microkernel and Hurd servers.

The GNU C Library implements much of the POSIX.1 functionality in the
GNU/Hurd system, using configurations i[4567]86-*-gnu.  The current
GNU/Hurd support requires out-of-tree patches that will eventually be
incorporated into an official GNU C Library release.

When working with Linux kernels, this version of the GNU C Library
requires Linux kernel version 3.2 or later.

Also note that the shared version of the libgcc_s library must be
installed for the pthread library to work correctly.

The GNU C Library supports these configurations for using Linux kernels:

	aarch64*-*-linux-gnu
	alpha*-*-linux-gnu
	arm-*-linux-gnueabi
	hppa-*-linux-gnu	Not currently functional without patches.
	i[4567]86-*-linux-gnu
	x86_64-*-linux-gnu	Can build either x86_64 or x32
	ia64-*-linux-gnu
	m68k-*-linux-gnu
	microblaze*-*-linux-gnu
	mips-*-linux-gnu
	mips64-*-linux-gnu
	powerpc-*-linux-gnu	Hardware or software floating point, BE only.
	powerpc64*-*-linux-gnu	Big-endian and little-endian.
	s390-*-linux-gnu
	s390x-*-linux-gnu
	sh[34]-*-linux-gnu
	sparc*-*-linux-gnu
	sparc64*-*-linux-gnu
	tilegx-*-linux-gnu
	tilepro-*-linux-gnu

If you are interested in doing a port, please contact the glibc
maintainers; see http://www.gnu.org/software/libc/ for more
information.

See the file INSTALL to find out how to configure, build, and install
the GNU C Library.  You might also consider reading the WWW pages for
the C library at http://www.gnu.org/software/libc/.

The GNU C Library is (almost) completely documented by the Texinfo manual
found in the `manual/' subdirectory.  The manual is still being updated
and contains some known errors and omissions; we regret that we do not
have the resources to work on the manual as much as we would like.  For
corrections to the manual, please file a bug in the `manual' component,
following the bug-reporting instructions below.  Please be sure to check
the manual in the current development sources to see if your problem has
already been corrected.

Please see http://www.gnu.org/software/libc/bugs.html for bug reporting
information.  We are now using the Bugzilla system to track all bug reports.
This web page gives detailed information on how to report bugs properly.

The GNU C Library is free software.  See the file COPYING.LIB for copying
conditions, and LICENSES for notices about a few contributions that require
these additional notices to be distributed.  License copyright years may be
listed using range notation, e.g., 1996-2015, indicating that every year in
the range, inclusive, is a copyrightable year that would otherwise be listed
individually.