glibc/sysdeps
Adhemerval Zanella 3323476641 i686: Use generic sinf implementation for SSE2 version
Performance seems to be similar (gcc 11.2.1 on a Ryzen 9 5900X),
the generic algorithm shows slight better performance for
the 'workload-huge.wrf' input set.

* s_sinf-sse2.S:
  "sinf": {
   "": {
    "duration": 3.72405e+09,
    "iterations": 2.38374e+08,
    "max": 63.973,
    "min": 11.211,
    "mean": 15.6227
   },
   "workload-random.wrf": {
    "duration": 3.76923e+09,
    "iterations": 8.4e+07,
    "reciprocal-throughput": 17.6355,
    "latency": 72.108,
    "max-throughput": 5.67037e+07,
    "min-throughput": 1.38681e+07
   },
   "workload-huge.wrf": {
    "duration": 3.76943e+09,
    "iterations": 6e+07,
    "reciprocal-throughput": 29.3493,
    "latency": 96.2985,
    "max-throughput": 3.40724e+07,
    "min-throughput": 1.03844e+07
   }
  }

* generic s_sinf.c:
  "sinf": {
   "": {
    "duration": 3.70989e+09,
    "iterations": 2.18025e+08,
    "max": 69.782,
    "min": 11.1,
    "mean": 17.0159
   },
   "workload-random.wrf": {
    "duration": 3.77213e+09,
    "iterations": 9.6e+07,
    "reciprocal-throughput": 17.5402,
    "latency": 61.0459,
    "max-throughput": 5.70119e+07,
    "min-throughput": 1.63811e+07
   },
   "workload-huge.wrf": {
    "duration": 3.81576e+09,
    "iterations": 5.6e+07,
    "reciprocal-throughput": 38.2111,
    "latency": 98.0659,
    "max-throughput": 2.61704e+07,
    "min-throughput": 1.01972e+07
   }
  }

Checked on i686-linux-gnu.

Reviewed-by: H.J. Lu <hjl.tools@gmail.com>
2022-06-01 10:47:44 -03:00
..
aarch64 Revert "[AArch64][BZ #17711] Fix extern protected data handling" 2022-05-23 13:37:05 -07:00
alpha alpha: Remove _dl_skip_args usage 2022-05-30 16:32:22 -03:00
arc arc: Remove _dl_skip_args usage 2022-05-30 16:32:29 -03:00
arm arm: Remove _dl_skip_args usage 2022-05-30 16:32:26 -03:00
csky csky: Remove _dl_skip_args usage 2022-05-30 16:32:33 -03:00
generic elf: Remove _dl_skip_args 2022-05-30 16:33:54 -03:00
gnu Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
hppa hppa: Remove _dl_skip_args usage (BZ# 29165) 2022-05-30 16:32:35 -03:00
htl htl: Fix initializing the key lock 2022-02-14 19:29:02 +01:00
hurd hurd: Fix pthread_kill on exiting/ted thread 2022-01-15 15:11:54 +01:00
i386 i686: Use generic sinf implementation for SSE2 version 2022-06-01 10:47:44 -03:00
ia64 ia64: Remove _dl_skip_args usage 2022-05-30 16:33:08 -03:00
ieee754 i686: Use generic sinf implementation for SSE2 version 2022-06-01 10:47:44 -03:00
m68k m68k: Remove _dl_skip_args usage 2022-05-30 16:33:11 -03:00
mach elf: Remove _dl_skip_args 2022-05-30 16:33:54 -03:00
microblaze microblaze: Remove _dl_skip_args usage 2022-05-30 16:33:14 -03:00
mips mips: Remove _dl_skip_args usage 2022-05-30 16:33:16 -03:00
nios2 nios2: Remove _dl_skip_args usage (BZ# 29187) 2022-05-30 16:33:20 -03:00
nptl nptl: Add backoff mechanism to spinlock loop 2022-05-09 14:38:40 -07:00
or1k elf: Replace PI_STATIC_AND_HIDDEN with opposite HIDDEN_VAR_NEEDS_DYNAMIC_RELOC 2022-04-26 09:26:22 -07:00
posix gmon: Remove unused sprofil.c functions 2022-03-23 14:29:25 -03:00
powerpc math: Add math-use-builtins-fabs (BZ#29027) 2022-05-23 17:49:18 -03:00
pthread Fix deadlock when pthread_atfork handler calls pthread_atfork or dlclose 2022-05-25 11:27:31 +02:00
riscv riscv: Remove _dl_skip_args usage 2022-05-30 16:33:22 -03:00
s390 s390: Remove _dl_skip_args usage 2022-05-30 16:33:25 -03:00
sh sh: Remove _dl_skip_args usage 2022-05-30 16:33:28 -03:00
sparc sparc: Remove _dl_skip_args usage 2022-05-30 16:33:31 -03:00
unix Add SOL_SMC from Linux 5.18 to bits/socket.h 2022-05-31 13:49:53 +00:00
wordsize-32 Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
wordsize-64 Update copyright dates with scripts/update-copyrights 2022-01-01 11:40:24 -08:00
x86 elf: Optimize _dl_new_hash in dl-new-hash.h 2022-05-23 10:38:40 -05:00
x86_64 x86_64: Optimize sincos where sin/cos is optimized (bug 29193) 2022-06-01 10:29:52 +02:00