AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Yann Collet	4b6a94f0cc	clarified comments on LDM parameters	2018-02-01 17:07:27 -08:00
Yann Collet	2bfc79ab8d	removed bitstream.h dependency	2018-02-01 16:13:04 -08:00
Yann Collet	823a28a1f4	Merge pull request #1000 from facebook/progressiveFlush Progressive flush	2018-01-30 22:49:47 -08:00
Yann Collet	9f8ed23b5b	bumped version number to v1.3.4 also added a paragraph on using compression level with training mode as this is a recurrent question (see for example #1004)	2018-01-27 22:23:26 -08:00
ne-sted	50aea2f293	cli: fix align of defaults	2018-01-24 15:07:22 +02:00
Yann Collet	cb5eba8e20	add `zcat` symlink support, suggested by @wtarreau added some test also updated relevant doc + fixed a mistake in `lz4` symlink support : lz4 utility doesn't remove source files by default (like zstd, but unlike gzip). The symlink must behave the same.	2018-01-19 11:26:35 -08:00
Yann Collet	70f81d6030	zstdmt uses POOL_tryAdd() to call a new worker so that it's no longer a blocking call. This makes it possible to stream out data gradually, while waiting for a worker to become available.	2018-01-19 10:01:40 -08:00
Yann Collet	4d08ba8b77	fileio: READY_FOR_UPDATE() is now a function-like macro as suggested by @terrelln	2018-01-18 11:27:13 -08:00
Yann Collet	aa79c18e3f	fixed a few access contention passes thread sanitizer test	2018-01-17 17:18:19 -08:00
Yann Collet	394eec697b	Introduce ZSTD_getFrameProgression() Produces 3 statistics for ongoing frame compression : - ingested - consumed (effectively compressed) - produced Ingested can be larger than consumed due to buffering effect. For the time being, this patch mostly fixes the % ratio issue, since it computes consumed / produced, instead of ingested / produced. That being said, update is not "smooth", because on a slow enough setting, fileio spends most of its time waiting for a worker to complete its job. This could be improved thanks to more granular flushing i.e. start flushing before ongoing job is fully completed.	2018-01-17 16:39:02 -08:00
Yann Collet	58dd7de640	zstdmt: fixed an endless loop on allocation failure this happened on 32-bits build when requiring a too large input buffer, typically on wlog=29, creating jobs of 2 GB size. also : zstd32 now compiles with multithread support enabled by default (can be disabled with HAVE_THREAD=0)	2018-01-17 12:10:15 -08:00
Yann Collet	3e1e57db27	fix fileio progression status update The compression % is no longer correct, since it's no longer possible to make direct correlation between nb bytes read and nb bytes written due to large internal buffer inside CCtx (exacerbated with --long). The current "fix" is to no longer display the %. A more complex solution will have to count exactly how much data has been consumed and compressed internally, within CCtx buffers.	2018-01-16 17:35:00 -08:00
Yann Collet	10c213761a	cli: fix for no-MT mode when cli is compiled without MT support, invoking ZSTD_p_nonBlockingMode result in an error code. This patch only sets ZSTD_p_nonBlockingMode when ZSTD_MULTITHREAD is set, meaning there is MT support. The error code could also be intentionnally ignored (there is no side effect).	2018-01-16 17:28:11 -08:00
Yann Collet	1dba98d563	introduced parameter ZSTD_p_nonBlockingMode This new parameter makes it possible to call streaming ZSTDMT with a single thread set which is non blocking. It makes it possible for the main thread to do other tasks in parallel while the worker thread does compression. Typically, for zstd cli, it means it can do I/O stuff. Applied within fileio.c, this patch provides non-negligible gains during compression. Tested on my laptop, with enwik9 (1000000000 bytes) : time zstd -f enwik9 With traditional single-thread blocking mode : real 0m9.557s user 0m8.861s sys 0m0.538s With new single-worker non blocking mode : real 0m7.938s user 0m8.049s sys 0m0.514s => 20% faster	2018-01-16 16:15:47 -08:00
Yann Collet	58ecf13e02	zstdmt : can compress at block granularity offering perspective of more accurate progression report.	2018-01-13 13:18:57 -08:00
Yann Collet	1edf33764e	Merge pull request #974 from terrelln/dstfile [fileio] Improve safety of output file modifications	2018-01-10 19:02:48 +01:00
Yann Collet	752880ffed	Merge pull request #963 from facebook/benchfix fix: bench can accept hlog custom parameter	2018-01-06 06:57:02 +01:00
Nick Terrell	ed9611dc62	[fileio] Don't call FIO_remove() on stdout or /dev/null	2018-01-05 11:50:24 -08:00
Nick Terrell	282ad05e0a	[fileio] Use FIO_remove() everywhere for safety	2018-01-05 11:44:45 -08:00
Nick Terrell	fd63140e1c	[util] Refuse to set file stat on non-regular file	2018-01-05 11:44:25 -08:00
Pádraig Brady	e0596715dc	zstd: fix crash when not overwriting existing files This fixes the following crash: $ touch exists $ programs/zstd -r examples/ -o exists zstd: exists already exists; not overwritten Segmentation fault (core dumped) * programs/fileio.c (FIO_compressMultipleFilenames): Handle the case where we're not overwriting the destination. Reported at https://bugzilla.redhat.com/1530049	2018-01-02 15:24:09 +00:00
Yann Collet	c707c6e9f2	fix: bench can accept hlog custom parameter was ignored during initialization	2017-12-27 13:32:05 +01:00
Yann Collet	cc9e026866	Merge pull request #952 from terrelln/merge-end [fileio] Merge end loop for small optimization	2017-12-15 10:27:53 -08:00
Yann Collet	2cff66b62f	version bump to v1.3.3	2017-12-14 16:11:20 -08:00
Nick Terrell	f48d34edba	[fileio] Merge end loop for small optimization	2017-12-14 15:52:24 -08:00
Yann Collet	a0ac8c895c	Merge pull request #950 from facebook/srcSizeAdaptation fix adaptation on srcSize	2017-12-14 14:48:31 -08:00
Yann Collet	2e97a6d464	fixed minor declaration-after-statement warning	2017-12-13 18:50:05 -08:00
Yann Collet	5432ef6921	fixes adaptation on srcSize This patch restores capability for each file to receive adapted compression parameters depending on its size. The bug breaking this feature was relatively silly : setting a parameter with a value "0" is supposed to be a no-op. Unfortunately, it would pin down compression parameters as if they were manually set, preventing later automatic adaptation. Unfortunately, I'm currently short of a test case that could check this situation and trigger an error. Compression parameters selection between tableID 0,1,2,3 is largely internal, leaving no trace to outside world, not even in frame header.	2017-12-13 17:45:26 -08:00
Nick Terrell	4680e85bdf	Allow -o with multiple files	2017-12-13 17:44:34 -08:00
Yann Collet	4d0dfafa7b	Merge pull request #949 from terrelln/rrm [fileio] Refuse to remove non-regular file	2017-12-13 17:36:39 -08:00
Nick Terrell	82bc8fe0cc	[fileio] Refuse to remove non-regular file	2017-12-13 13:38:26 -08:00
Nick Terrell	b5e7f6c0f3	[fileio] Fix window size MB calculation Test command: ``` head -c 10000 /dev/zero \| ./zstd -c --zstd=wlog=12 \| ./zstd -M2048 -t ```	2017-12-13 10:57:01 -08:00
Yann Collet	31293330d0	It's still necessary to check PLATFORM_POSIX_VERSION for clock_gettime() glibc/uclibc is not enough	2017-12-04 16:31:59 -08:00
Yann Collet	0097469238	removed a few redundant #include	2017-12-04 16:02:42 -08:00
Yann Collet	e46194bbf9	fix #911 : changed detection macro for clock_gettime() The new macro might be a bit too restrictive. Systems which do not support new test will simply default to <time.h>'s `clock_t clock()`, suffering lesser benchmark accuracy. Should it matter, the detection macro will have to be upgraded.	2017-12-04 15:57:01 -08:00
Yann Collet	55faa5492d	fileio: fixed LZ4F invocation from assert()	2017-12-04 11:26:59 -08:00
Yann Collet	af2fbbcb0d	Merge pull request #939 from facebook/shorterCircleCI Faster CircleCI tests	2017-12-04 11:22:30 -08:00
Yann Collet	71f012e5bf	zstdcli: fixed minor warning when bench module not enabled one variable defined but not used	2017-12-01 17:42:46 -08:00
Yann Collet	a1b24e6262	Merge pull request #938 from terrelln/time Use util.h for timing	2017-12-01 16:40:38 -08:00
Nick Terrell	dab8cfa3c7	Combine definitions of SEC_TO_MICRO	2017-11-30 19:40:53 -08:00
Nick Terrell	9a2f6f477b	Use util.h for timing	2017-11-30 14:57:25 -08:00
Yann Collet	2f22a6ec50	Merge branch 'dev' into opt3	2017-11-28 15:03:58 -08:00
Yann Collet	0a0a212934	zstd_opt: changed cost formula There was a flaw in the formula which compared literal cost with match cost : at a given position, a non-null literal suite is going to be part of next sequence, while if position ends a previous match, to immediately start another match, next sequence will have a litlength of zero. A litlength of zero has a non-null cost. It follows that literals cost should be compared to match cost + litlength==0. Not doing so gave a structural advantage to matches, which would be selected more often. I believe that's what led to the creation of the strange heuristic which added a complex cost to matches. The heuristic was actually compensating. It was probably created through multiple trials, settling for best outcome on a given scenario (I suspect silesia.tar). The problem with this heuristic is that it's hard to understand, and unfortunately, any future change in the parser would impact the way it should be calculated and its effects. The "proper" formula makes it possible to remove this heuristic. Now, the problem is : in a head to head comparison, it's sometimes better, sometimes worse. Note that all differences are small (< 0.01 ratio). In general, the newer formula is better for smaller files (for example, calgary.tar and enwik7). I suspect that's because starting statistics are pretty poor (another area of improvement). However, for silesia.tar specifically, it's worse at level 22 (while being better at level 17, so even compression level has an impact ...). It's a pity that zstd -22 gets worse on silesia.tar. That being said, I like that the new code gets rid of strange variables, which were introducing complexity for any future evolution (faster variants being in mind). Therefore, in spite of this detrimental side effect, I tend to be in favor of it.	2017-11-28 14:07:03 -08:00
W. Felix Handte	baff9dd15e	Fix LZ4 Compression Buffer Overflow Fixes issue where, when `zstd --format=lz4` is fed an input larger than 128KB, the read overruns the input buffer. This changes Zstd to use LZ4 with chained 64KB blocks. This is technically a breaking change in that some third party LZ4 implementations may not support linked blocks. However, progress should not be allowed to be stopped by such petty concerns as backwards compatibility!	2017-11-28 12:07:26 -05:00
Yann Collet	743b23878e	install: changed variable MANDIR into MAN1DIR MANDIR still exists, and is now the parent of MAN1DIR	2017-11-27 13:47:35 -08:00
Yann Collet	2fd765498a	updated man page following patch #931 by @scottchiefbaker	2017-11-24 17:20:54 -08:00
Yann Collet	c857ee850a	minor update	2017-11-24 16:44:28 -08:00
Scott Baker	31a191b178	Include information about the benchmark output/methodology Addresses #930	2017-11-22 20:34:25 -08:00
Yann Collet	daebc7fe26	bench: slightly adjusted display format adapt accuracy depending on value. makes it possible to have higher accuracy for small value, notably small compression speed. This capability is expected to be useful while modifying optimal parser.	2017-11-18 15:54:32 -08:00
Nick Terrell	a6052af0e8	[zstd] Fix rare bug with signal handler	2017-11-17 16:38:56 -08:00

1 2 3 4 5 ...

1167 Commits