glibc/sysdeps/x86_64/fpu/multiarch
Andrew Senkevich 04f496d602 Vector cosf for x86_64.
Here is implementation of vectorized cosf containing SSE, AVX,
AVX2 and AVX512 versions according to Vector ABI
<https://groups.google.com/forum/#!topic/x86-64-abi/LmppCfN1rZ4>.

    * sysdeps/x86_64/fpu/Makefile (libmvec-support): Added new files.
    * sysdeps/x86_64/fpu/Versions: New versions added.
    * sysdeps/x86_64/fpu/svml_s_cosf4_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf4_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf4_core_sse4.S: New file.
    * sysdeps/x86_64/fpu/svml_s_cosf8_core_avx.S: New file.
    * sysdeps/x86_64/fpu/svml_s_cosf8_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf8_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf8_core_avx2.S: New file.
    * sysdeps/x86_64/fpu/svml_s_cosf16_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core.S: New file.
    * sysdeps/x86_64/fpu/multiarch/svml_s_cosf16_core_avx512.S: New file.
    * sysdeps/x86_64/fpu/svml_s_wrapper_impl.h: New file.
    * sysdeps/x86_64/fpu/svml_s_cosf_data.S: New file.
    * sysdeps/x86_64/fpu/svml_s_cosf_data.h: New file.
    * sysdeps/x86_64/fpu/multiarch/Makefile (libmvec-sysdep_routines): Added
    build of SSE, AVX2 and AVX512 IFUNC versions.
    * sysdeps/unix/sysv/linux/x86_64/libmvec.abilist: New versions added.
    * sysdeps/x86/fpu/bits/math-vector.h: Added SIMD declaration for cosf.
    * NEWS: Mention addition of x86_64 vector cosf.
2015-06-09 18:29:47 +03:00
..
doasin-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
dosincos-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_asin-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_asin.c Fix asin missing underflows (bug 16351). 2015-02-26 17:18:54 +00:00
e_atan2-avx.c Remove code without too much effects 2011-10-28 16:55:01 -04:00
e_atan2-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_atan2.c Fix atan / atan2 missing underflows (bug 15319). 2015-02-18 21:10:49 +00:00
e_exp-avx.c Add AVX optimized versions for some x86-64 math functions 2011-10-25 21:34:55 -04:00
e_exp-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_exp.c Fix implicit __isinf declarations in exp. 2014-03-24 22:00:32 +00:00
e_log-avx.c Add AVX optimized versions for some x86-64 math functions 2011-10-25 21:34:55 -04:00
e_log-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_log.c Remove various ABS macros and replace uses with fabs (or in one case abs) 2015-05-15 11:04:40 +00:00
e_pow-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
e_pow.c Fix pow overflow in non-default rounding modes (bug 16315). 2014-06-23 20:12:33 +00:00
halfulp-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
Makefile Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
mpa-avx.c New __sqr function as a faster special case of __mul 2013-02-14 10:31:09 +05:30
mpa-fma4.c New __sqr function as a faster special case of __mul 2013-02-14 10:31:09 +05:30
mpatan2-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
mpatan-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
mpexp-avx.c Add AVX optimized versions for some x86-64 math functions 2011-10-25 21:34:55 -04:00
mpexp-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
mplog-avx.c Add AVX optimized versions for some x86-64 math functions 2011-10-25 21:34:55 -04:00
mplog-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
mpsqrt-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
mptan-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
s_atan-avx.c Remove code without too much effects 2011-10-28 16:55:01 -04:00
s_atan-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
s_atan.c Use -msse2avx option for x86-64 libm functions 2012-01-28 14:48:46 -05:00
s_ceil-c.c
s_ceil.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_ceilf-c.c
s_ceilf.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_floor-c.c
s_floor.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_floorf-c.c
s_floorf.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_fma.c Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_fmaf.c Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_nearbyint-c.c Add optimized nearbyint{,f} for x86-64 2011-10-18 09:13:23 -04:00
s_nearbyint.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_nearbyintf-c.c Add optimized nearbyint{,f} for x86-64 2011-10-18 09:13:23 -04:00
s_nearbyintf.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_rint-c.c
s_rint.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_rintf-c.c
s_rintf.S Update copyright dates with scripts/update-copyrights. 2015-01-02 16:29:47 +00:00
s_sin-avx.c Remove code without too much effects 2011-10-28 16:55:01 -04:00
s_sin-fma4.c Disable use of FMA instructions in branred 2012-03-21 23:58:50 +01:00
s_sin.c Use -msse2avx option for x86-64 libm functions 2012-01-28 14:48:46 -05:00
s_tan-avx.c Remove code without too much effects 2011-10-28 16:55:01 -04:00
s_tan-fma4.c Disable use of FMA instructions in branred 2012-03-21 23:58:50 +01:00
s_tan.c Use -msse2avx option for x86-64 libm functions 2012-01-28 14:48:46 -05:00
sincos32-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
slowexp-avx.c Add AVX optimized versions for some x86-64 math functions 2011-10-25 21:34:55 -04:00
slowexp-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
slowpow-fma4.c Cleanup FMA4 patch 2011-10-25 00:56:33 -04:00
svml_d_cos2_core_sse4.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_d_cos2_core.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_d_cos4_core_avx2.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_d_cos4_core.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_d_cos8_core_avx512.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_d_cos8_core.S Start of series of patches with x86_64 vector math functions. 2015-06-09 14:25:49 +03:00
svml_s_cosf4_core_sse4.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
svml_s_cosf4_core.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
svml_s_cosf8_core_avx2.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
svml_s_cosf8_core.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
svml_s_cosf16_core_avx512.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00
svml_s_cosf16_core.S Vector cosf for x86_64. 2015-06-09 18:29:47 +03:00