glibc

mirror of https://sourceware.org/git/glibc.git synced 2024-11-10 15:20:10 +00:00

Author	SHA1	Message	Date
Paul Zimmermann	50a8dd367e	benchtests/README update. Improve documentation of the 'name' directive and the 'workload' mechanism. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2020-08-04 12:44:41 -04:00
H.J. Lu	e52434a2e4	benchtests: Restore the clock_gettime option commit `7621e38bf3` Author: Wilco Dijkstra <Wilco.Dijkstra@arm.com> Date: Tue Jan 29 17:43:45 2019 +0000 Add generic hp-timing support removed the clock_gettime option. Restore the clock_gettime option for some x86 CPUs on which value from RDTSC may not be incremented at a fixed rate. Reviewed-by: Carlos O'Donell <carlos@redhat.com>	2020-06-05 09:48:07 -07:00
Wilco Dijkstra	7621e38bf3	Add generic hp-timing support Add missing generic hp_timing support. It uses clock_gettime (CLOCK_MONOTONIC) which has unspecified starting time, nano-second accuracy, and should faster on architectures that implementes the symbol as vDSO. Checked on aarch64-linux-gnu, x86_64-linux-gnu, and i686-linux-gnu. I also checked the builds for all afected ABIs. * benchtests/Makefile (USE_CLOCK_GETTIME) Remove. * benchtests/README: Update description. * benchtests/bench-timing.h: Default to hp-timing. * sysdeps/generic/hp-timing.h (HP_TIMING_DIFF, HP_TIMING_ACCUM_NT, HP_TIMING_PRINT): Remove. (HP_TIMING_NOW): Add generic implementation. (hp_timing_t): Change to uint64_t.	2019-03-22 17:30:44 -03:00
H.J. Lu	7cc65773f0	x86: Support RDTSCP for benchtests RDTSCP waits until all previous instructions have executed and all previous loads are globally visible before reading the counter. RDTSC doesn't wait until all previous instructions have been executed before reading the counter. All x86 processors since 2010 support RDTSCP instruction. This patch adds RDTSCP support to benchtests. * benchtests/Makefile (CPPFLAGS-nonlib): Add -DUSE_RDTSCP if USE_RDTSCP is defined. * sysdeps/x86/hp-timing.h (HP_TIMING_NOW): Use RDTSCP if USE_RDTSCP is defined.	2018-10-24 02:19:34 -07:00
Victor Rodriguez	0422ed1e84	benchtests: Enable BENCHSET to run subset of tests This patch adds BENCHSET variable to benchtests/Makefile in order to provide the capability to run a list of subsets of benchmark tests, ie; make bench BENCHSET="bench-pthread bench-math malloc-thread" This helps users to benchmark specific glibc area ChangeLog: * benchtests/Makefile:Add BENCHSET to allow subsets of benchmarks to be run. * benchtests/README: Add documentation for: Running subsets of benchmarks. Signed-off-by: Victor Rodriguez <victor.rodriguez.bahena@intel.com> Signed-off-by: Icarus Sparry <icarus.w.sparry@intel.com> Reviewed-By: Siddhesh Poyarekar <siddhesh@sourceware.org>	2017-11-28 19:57:46 +05:30
Siddhesh Poyarekar	06b1de2378	benchtests: Use argparse to parse arguments Make the script more usable by adding proper command line options along with a way to query the options. The script is capable of doing a bunch of things right now like choosing a base for comparison, choosing to generate graphs, etc. and they should be accessible via command line switches. * benchtests/scripts/compare_strings.py: Use argparse. * benchtests/README: Document existence of compare_strings.py.	2017-09-16 11:47:32 +05:30
Wilco Dijkstra	beb52f502f	Improve math benchmark infrastructure Improve support for math function benchmarking. This patch adds a feature that allows accurate benchmarking of traces extracted from real workloads. This is done by iterating over all samples rather than repeating each sample many times (which completely ignores branch prediction and cache effects). A trace can be added to existing math function inputs via "## name: workload-<name>", followed by the trace. * benchtests/README: Describe workload feature. * benchtests/bench-skeleton.c (main): Add support for benchmarking traces from workloads.	2017-06-20 16:26:26 +01:00
Siddhesh Poyarekar	2d304f3c6f	benchtests: Support for cross-building benchmarks This patch adds full support for cross-building benchmarks. Some benchmarks like those that need locales to be generated cannot be built and are hence skipped for cross builds. Tested by cross building for aarch64 on x86_64 and then running the generated benchmark on aarch64. * benchtests/Makefile (wcsmbs-benchset): Include only for native builds and runs. (LOCALES): Likewise. (bench-build): Build timing-type here instead of the bench target. Generate locale only for native builds. * benchtests/README: Add note for cross-building.	2016-04-20 13:19:01 +05:30
Siddhesh Poyarekar	bfdda211c6	benchtests: Update README to include instructions for bench-build target	2016-04-20 10:58:20 +05:30
Siddhesh Poyarekar	15eaf6ffe3	benchtests: Add new directive for benchmark initialization hook Add a new 'init' directive that specifies the name of the function to call to do function-specific initialization. This is useful for benchmarks that need to do a one-time initialization before the functions are executed.	2014-05-26 12:37:29 +05:30
Torvald Riegel	6a5d6ea128	benchtests: Add pthread_once common-case test. We have a single thread that runs a no-op initialization once and then repeatedly runs checks of the initialization (i.e., an acquire load and conditional jump) in a tight loop. This gives us, on average, the best-case latency of pthread_once (the initialization is the exactly-once slow path, and we're not looking at initialization-related synchronization overheads in this case).	2014-04-10 21:22:28 +02:00
Siddhesh Poyarekar	df26ea5359	Implement benchmarking script in python Implemented the benchmark script in python since it is much cleaner and simpler to maintain.	2014-03-21 17:32:50 +05:30
Siddhesh Poyarekar	9298ecba15	Accept output arguments to benchmark functions This patch adds the ability to accept output arguments to functions being benchmarked, by nesting the argument type in <> in the args directive. It includes the sincos implementation as an example, where the function would have the following args directive: ## args: double:<double >:<double > This simply adds a definition for a static variable whose pointer gets passed into the function, so it's not yet possible to pass something more complicated like a pre-allocated string or array. That would be a good feature to add if a function needs it. The values in the input file will map only to the input arguments. So if I had a directive like this for a function foo: ## args: int:<int >:int:<int > and I have a value list like this: 1, 2 3, 4 5, 6 then the function calls generated would be: foo (1, &out1, 2, &out2); foo (3, &out1, 4, &out2); foo (5, &out1, 6, &out2);	2013-12-05 10:12:59 +05:30
Torvald Riegel	40fefba1b5	benchtests: Add include-sources directive. This adds the "include-sources" directive to scripts/bench.pl. This allows for including source code (vs including headers, which might get a different search path) after the inclusion of any headers.	2013-10-10 14:45:30 +03:00
Siddhesh Poyarekar	a357259bf8	Add more directives to benchmark input files This patch adds some more directives to the benchmark inputs file, moving functionality from the Makefile and making the code generation script a bit cleaner. The function argument and return types that were earlier added as variables in the makefile and passed to the script via command line arguments are now the 'args' and 'ret' directive respectively. 'args' should be a colon separated list of argument types (skipped if the function doesn't accept any arguments) and 'ret' should be the return type. Additionally, an 'includes' directive may have a comma separated list of headers to include in the source. For example, the pow input file now looks like this: 42.0, 42.0 1.0000000000000020, 1.5 I did this to unclutter the benchtests Makefile a bit and eventually eliminate dependency of the tests on the Makefile and have tests depend on their respective include files only.	2013-10-07 11:51:25 +05:30
Siddhesh Poyarekar	c1f75dc386	Begin porting string performance tests to benchtests This is the initial support for string function performance tests, along with copying tests for memcpy and memcpy-ifunc as proof of concept. The string function benchmarks perform operations at different alignments and for different sizes and compare performance between plain operations and the optimized string operations. Due to this their output is incompatible with the function benchmarks where we're interested in fastest time, throughput, etc. In future, the correctness checks in the benchmark tests can be removed. Same goes for the performance measurements in the string/test-*.	2013-06-11 15:08:13 +05:30
Siddhesh Poyarekar	fef94eab0b	Add a README for benchtests Move instructions from the Makefile here and expand on them.	2013-05-21 14:59:50 +05:30

17 Commits