AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Yann Collet	a8ddf1d370	disable 2-passes strategy	2018-05-22 15:06:36 -07:00
Nick Terrell	49cf880513	Approximate FSE encoding costs for selection Estimate the cost for using FSE modes `set_basic`, `set_compressed`, and `set_repeat`, and select the one with the lowest cost. * The cost of `set_basic` is computed using the cross-entropy cost function `ZSTD_crossEntropyCost()`, using the normalized default count and the count. * The cost of `set_repeat` is computed using `FSE_bitCost()`. We check the previous table to see if it is able to represent the distribution. * The cost of `set_compressed` is computed with the entropy cost function `ZSTD_entropyCost()`, together with the cost of writing the normalized count `ZSTD_NCountCost()`.	2018-05-22 14:33:22 -07:00
Yann Collet	27af35c110	Merge pull request #1143 from facebook/tableLevels Update table of compression levels	2018-05-19 14:40:37 -07:00
Yann Collet	5381369cb1	Merge branch 'dev' into tableLevels	2018-05-18 18:23:27 -07:00
Yann Collet	b0b3fb517d	updated compression levels for blocks of 256KB	2018-05-18 17:17:12 -07:00
Nick Terrell	7cbb8bbbbf	[cover] Small compression ratio improvement The cover algorithm selects one segment per epoch, and it selects the epoch size such that `epochs * segmentSize ~= dictSize`. Selecting less epochs gives the algorithm more candidates to choose from for each segment it selects, and then it will loop back to the first epoch when it hits the last one. The trade off is that now it takes longer to select each segment, since it has to look at more data before making a choice. I benchmarked on the following data sets using this command: ```sh $ZSTD -T0 -3 --train-cover=d=8,steps=256 $DIR -r -o dict && $ZSTD -3 -D dict -rc $DIR \| wc -c ``` \| Data set \| k (approx) \| Before \| After \| % difference \| \|--------------\|------------\|----------\|----------\|--------------\| \| GitHub \| ~1000 \| 738138 \| 746610 \| +1.14% \| \| hg-changelog \| ~90 \| 4295156 \| 4285336 \| -0.23% \| \| hg-commands \| ~500 \| 1095580 \| 1079814 \| -1.44% \| \| hg-manifest \| ~400 \| 16559892 \| 16504346 \| -0.34% \| There is some noise in the measurements, since small changes to `k` can have large differences, which is why I'm using `steps=256`, to try to minimize the noise. However, the GitHub data set still has some noise. If I run the GitHub data set on my Mac, which presumably lists directory entries in a different order, so the dictionary builder sees the files in a different order, or I use `steps=1024` I see these results. \| Run \| Before \| After \| % difference \| \|------------\|--------\|--------\|--------------\| \| steps=1024 \| 738138 \| 734470 \| -0.50% \| \| MacBook \| 738451 \| 737132 \| -0.18% \| Question: Should we expose this as a parameter? I don't think it is necessary. Someone might want to turn it up to exchange a much longer dictionary building time in exchange for a slightly better dictionary. I tested `2`, `4`, and `16`, and `4` got most of the benefit of `16` with a faster running time.	2018-05-18 16:15:27 -07:00
Yann Collet	5cbef6e094	Merge branch 'dev' into staticDictCost	2018-05-18 16:03:06 -07:00
Yann Collet	a95e9e80d1	adding some debug functions to observe statistics	2018-05-18 14:09:42 -07:00
fbrosson	291824f49d	__builtin_prefetch did probably not exist before gcc 3.1.	2018-05-18 18:40:11 +00:00
fbrosson	16bb8f1f9e	Drop colon in asm snippet to make old versions of gcc happy.	2018-05-18 17:05:36 +00:00
Yann Collet	af3da079d1	fixed minor conversion warning	2018-05-17 17:27:27 -07:00
Yann Collet	8572b4d09f	fixed a pretty complex bug when combining ldm + btultra	2018-05-17 16:13:53 -07:00
Yann Collet	134388ba6b	collect statistics for first block in ultra mode this patch makes btultra do 2 passes on the first block, the first one being dedicated to collecting statistics so that the 2nd pass is more accurate. It translates into a very small compression ratio gain : enwik7, level 20: blocks 4K : 2.142 -> 2.153 blocks 16K : 2.447 -> 2.457 blocks 64K : 2.716 -> 2.726 On the other hand, the cpu cost is doubled. The trade off looks bad. Though, that's ultimately a price to pay to reach better compression ratio. So it's only enabled when setting btultra.	2018-05-17 12:24:30 -07:00
Yann Collet	a243020d37	slightly improved weight calculation translating into a tiny compression ratio improvement	2018-05-17 11:19:44 -07:00
Yann Collet	63eeeaa1dd	update table levels for blocks <= 16K also : allow hlog to be slighly larger than windowlog, as it's apparently good for both speed and compression ratio.	2018-05-16 16:13:37 -07:00
Yann Collet	18fc3d3cd5	introduced bit-fractional cost evaluation this improves compression ratio by a tiny amount. It also reduces speed by a small amount. Consequently, bit-fractional evaluation is only turned on for btultra.	2018-05-16 14:53:35 -07:00
Yann Collet	9938b17d4c	Merge pull request #1135 from facebook/frameCSize decompress: changed error code when input is too large	2018-05-15 11:02:53 -07:00
Nick Terrell	30d9c84b1a	Fix failing Travis tests	2018-05-15 09:46:20 -07:00
Yann Collet	0b31304c8d	Merge branch 'dev' into staticDictCost	2018-05-14 18:09:26 -07:00
Yann Collet	2c26df0e13	opt: removed static prices after testing, it's actually always better to use dynamic prices albeit initialised from dictionary.	2018-05-14 18:04:08 -07:00
Yann Collet	f372ffc64d	Merge pull request #1127 from facebook/staticDictCost Improved optimal parser with dictionary	2018-05-14 17:45:50 -07:00
Yann Collet	d59cf02df0	decompress: changed error code when input is too large ZSTD_decompress() can decompress multiple frames sent as a single input. But the input size must be the exact sum of all compressed frames, no more. In the case of a mistake on srcSize, being larger than required, ZSTD_decompress() will try to decompress a new frame after current one, and fail. As a consequence, it will issue an error code, ERROR(prefix_unknown). While the error is technically correct (the decoder could not recognise the header of _next_ frame), it's confusing, as users will believe that the first header of the first frame is wrong, which is not the case (it's correct). It makes it more difficult to understand that the error is in the source size, which is too large. This patch changes the error code provided in such a scenario. If (at least) a first frame was successfully decoded, and then following bytes are garbage values, the decoder assumes the provided input size is wrong (too large), and issue the error code ERROR(srcSize_wrong).	2018-05-14 15:32:28 -07:00
Yann Collet	c9227ee16b	update table for 128 KB blocks	2018-05-13 17:15:07 -07:00
Yann Collet	b4250489cf	update compression levels for large inputs	2018-05-13 01:53:38 -07:00
Yann Collet	761758982e	replaced FSE_count by FSE_count_simple to reduce usage of stack memory. Also : tweaked a few comments, as suggested by @terrelln	2018-05-11 16:03:37 -07:00
Yann Collet	3193d692c2	minor patch, ensuring LIBDIR is created before installation follow-up from #1123	2018-05-11 11:31:48 -07:00
Yann Collet	99ddca43a6	fixed wrong assertion base can actually overflow	2018-05-10 19:48:09 -07:00
Yann Collet	0d7626672d	fixed c++ conversion warning	2018-05-10 18:17:21 -07:00
Yann Collet	09d0fa29ee	minor adjusting of weights	2018-05-10 18:13:48 -07:00
Yann Collet	1a26ec6e8d	opt: init statistics from dictionary instead of starting from fake "default" statistics.	2018-05-10 17:59:12 -07:00
Yann Collet	74b1c75d64	btopt : minor adjustment of update frequencies	2018-05-10 16:32:36 -07:00
Yann Collet	ac6105463a	opt: minor improvements to log traces slight improvement when using fractional-bit evaluation (opt:dictionay)	2018-05-09 15:46:11 -07:00
Yann Collet	c39061cb7b	fixed declaration-after-statement warning	2018-05-09 12:07:25 -07:00
Yann Collet	4d5bd32a00	added traces to look at symbol costs evaluation looks correct.	2018-05-09 12:00:12 -07:00
Yann Collet	c0da0f5e9e	switchable bit-approximation / fractional-bit accuracy modes also : makes it possible to select nb of fractional bits.	2018-05-09 10:48:09 -07:00
Yann Collet	ba2ad9b6b9	implemented fractional bit cost evaluation for FSE symbols. While it seems to work, the gains are negligible compared to rough maxNbBits evaluation. There are even a few losses sometimes, that still need to be explained. Furthermode, there are still cases where btlazy2 does a better job than btopt, which seems rather strange too.	2018-05-08 17:43:13 -07:00
Yann Collet	1aff63b114	opt: shift all costs by 8 bits (* 256) making it possible to represent fractional bit costs.	2018-05-08 16:19:04 -07:00
Yann Collet	6a3c34aa58	opt: estimate cost of both Hufman and FSE symbols For FSE symbols : provide an upper bound, in nb of bits, since cost function is not able to store fractional bit costs.	2018-05-08 16:11:21 -07:00
Yann Collet	338f738c24	pass entropy tables to optimal parser for proper estimation of symbol's weights when using dictionary compression. Note : using only huffman costs is not good enough, presumably because sequence symbol costs are incorrect.	2018-05-08 15:37:06 -07:00
Yann Collet	a155061328	minor code refactor for readability removed some useless operations from optimal parser (should not change performance, too small a difference)	2018-05-08 12:32:44 -07:00
Baruch Siach	9a0643b633	lib/Makefile: create include directory before headers installation Make sure that $(INCLUDEDIR) exists before copying the headers there. Otherwise, the contest of header files is copied over $(DESTDIR)$(INCLUDEDIR), making it a regular file. While at it, remove $(DESTDIR)$(INCLUDEDIR) from the list of directories to create in the install-pc target. The install-pc target does not need this directory.	2018-05-08 20:59:44 +03:00
Yann Collet	ad4524d605	fix ZSTD_compressBlock() associated with CDict reported by @let-def. It's actually a bug in ZSTD_compressBegin_usingCDict() which would pass a wrong pledgedSrcSize value (0 instead of ZSTD_CONTENTSIZE_UNKNOWN) resulting in wrong window size, resulting in downsized seqStore, resulting in segfault when writing into the seqStore later in the process. Added a test in fuzzer to cover this use case (fails before the patch).	2018-05-07 12:54:13 -07:00
Peter Seiderer	64bfdca5b9	Split library install target into pc, static, shared and include only target Signed-off-by: Peter Seiderer <ps.report@gmx.net>	2018-04-30 20:32:32 +02:00
Nick Terrell	ca77822ddf	Fix parameter adjustment with dictionary The new advanced API basically set `requestedParams = appliedParams` when using a dictionary. This halted all parameter adjustment, which can hurt compression ratio if, for example, the window log is small for the first call, but the rest of the files are large. This patch fixes the bug, and checks that the `requestedParams` don't change in the new advanced API when using a dictionary, and generally in the fuzzer.	2018-04-25 16:32:29 -07:00
Yann Collet	12f60b8c98	clarified documentation related to refPrefix()	2018-04-25 10:17:06 -07:00
Yann Collet	ace856a835	updated documentation of streaming compression api	2018-04-24 14:44:27 -07:00
taigacon	2c3ad05812	Fix the problem that enables DYNAMIC_BMI2 macro by mistake on ARM architecture with Clang (#1110 )	2018-04-23 15:41:50 -07:00
Nick Terrell	e8c9dc5cea	Fix documentation	2018-04-13 12:43:38 -07:00
Nick Terrell	c0987986e5	Only reset CDict in ZSTD_CCtx_resetParameters()	2018-04-13 11:26:40 -07:00
Nick Terrell	9f76eebd17	Add ZSTD_CCtx_resetParameters() function * Fix docs for `ZSTD_CCtx_reset()`. * Add `ZSTD_CCtx_resetParameters()`. Fixes #1094.	2018-04-12 16:54:07 -07:00
Nick Terrell	3c3f59e68f	Enforce pledgeSrcSize whenever known (#1106 ) The test fails before the patch and passes after. Fixes #1095.	2018-04-12 16:02:03 -07:00
Nick Terrell	280a236e9e	Add ZSTD_CCtx(Param)?_getParameter() function Closes #1096.	2018-04-12 11:50:12 -07:00
Yann Collet	04212178b5	doc : clarified advanced API usage sticky parameters only work with `ZSTD_compress_generic()`	2018-04-10 11:40:36 -07:00
Yann Collet	ad5ba6cdcf	updated comment on parameters that can be changed during compression	2018-04-09 17:39:07 -07:00
Yann Collet	1da629f2ad	Merge pull request #1104 from terrelln/fast-train Allow negative compression levels in training	2018-04-09 14:16:20 -07:00
Nick Terrell	569e2abccd	Allow negative compression levels in training * Set `dictCLevel` in `zstdcli.c`. * Only set to default level if the compression level `== 0`, not `<= 0`.	2018-04-09 12:12:03 -07:00
Yann Collet	4195b36dd7	Merge pull request #1100 from bket/stable_sort zstd requires a stable sort.	2018-04-05 11:39:27 -07:00
Yann Collet	f35b8ba9da	updated ZSTD_p_chainLog description	2018-04-05 11:05:11 -07:00
Björn Ketelaars	462aed6811	zstd requires a stable sort. On OpenBSD qsort() is not guaranteed to be stable, their mergesort() is. This fixes issue #1088. All the hard work has been done by @terrelln.	2018-04-05 07:59:16 +02:00
Yann Collet	55f67502f4	Merge pull request #1098 from terrelln/nd-mt Only load extra table positions for CDicts	2018-04-02 15:38:20 -07:00
Nick Terrell	295ab0dbfa	Only load extra table positions for CDicts Zstdmt uses prefixes to load the overlap between segments. Loading extra positions makes compression non-deterministic, depending on the previous job the context was used for. Since loading extra position takes extra time as well, only do it when creating a `ZSTD_CDict`. Fixes #1077.	2018-04-02 14:41:30 -07:00
Yann Collet	5b616fa269	Merge pull request #1090 from bket/openbsd Fix building zstd on OpenBSD.	2018-04-02 14:15:26 -07:00
Björn Ketelaars	9d3048346d	Fix building zstd on OpenBSD.	2018-03-31 10:46:20 +02:00
Yann Collet	8be984ec45	fixed comments as suggested by @terrelln	2018-03-30 20:09:27 -07:00
Yann Collet	e6e848bfe9	added ZSTD_getFrameHeader_advanced() makes it possible to request frame header from a magicless frame	2018-03-29 17:51:08 -06:00
Yann Collet	a6694838e1	added more code documentation for ZSTD_getFrameHeader()	2018-03-29 15:24:17 -06:00
René Rebe	21eb26d664	fixed legacy/zstd_v* with older gcc version, by guarding builtin_* like in other files	2018-03-25 20:35:15 +02:00
Yann Collet	ad15c1b724	added __has_attribute() define for non-clang compilers	2018-03-23 19:04:48 -07:00
Yann Collet	52ca7c6c56	make DYNAMIC_BMI2 support of clang conditional to __has_attribute() to support older clang versions such as 3.4	2018-03-23 18:45:42 -07:00
Yann Collet	29b021f9a0	Merge pull request #1067 from facebook/targetLength removed limit ZSTD_TARGETLENGTH_MAX	2018-03-22 10:38:33 -07:00
Nick Terrell	ad344033df	Fix broken assertion The `avgJobSize` must not be lower than 256 KB for single-pass mode. In `zstd.h` we say the minimum value for `ZSTD_p_jobSize` is 1 MB, so ensure that we always pick a size >= 1 MB. Found by libFuzzer fuzzer tests with large input limits.	2018-03-21 16:20:30 -07:00
Yann Collet	153bc1c004	removed limit ZSTD_TARGETLENGTH_MAX this makes it possible to specify extremely large negative compression levels, achieving the side effect as "no compression". It will also be possible to define larger targetlength for ultra compression mode. There is no adverse side effect due to removing this limit.	2018-03-21 15:50:05 -07:00
Yann Collet	a99c4a3621	Merge branch 'dev' into advancedDecompress	2018-03-21 06:08:28 -07:00
Yann Collet	87b0cf05bd	Merge pull request #1057 from facebook/lrmSettings LRM parameters	2018-03-21 05:59:39 -07:00
Yann Collet	d1bf609abf	Merge pull request #1059 from terrelln/mt-ldm Integrate ldm with zstdmt	2018-03-20 17:50:20 -07:00
Yann Collet	e0cb8d19c6	fixed legacy test case	2018-03-20 17:48:22 -07:00
Yann Collet	878728dc26	fixed several comments by @terrelln	2018-03-20 16:35:14 -07:00
Yann Collet	e1c52faace	Merge pull request #1060 from facebook/compressImpl merge bmi2 implementation of encodeSequence into zstd_compress.c	2018-03-20 16:19:42 -07:00
Yann Collet	6cda8c932c	added test with ZSTD_decompress_generic() + ZSTD_DCtx_refPrefix() also : clarified stage condition to accept new parameters, fixed initializers correspondingly.	2018-03-20 16:16:13 -07:00
Yann Collet	0dadb6b70d	implemented ZSTD_DCtx_refPrefix*()	2018-03-20 15:45:56 -07:00
Yann Collet	569b8ba4d9	implemented ZSTD_DCtx_refDDict()	2018-03-20 15:43:49 -07:00
Nick Terrell	a3b76a77ef	Quiet appveyor warnings	2018-03-20 15:34:40 -07:00
Yann Collet	6873fec658	changed dictMore for dictContentType which seems clearer to describe what the variable/argument is about.	2018-03-20 15:13:14 -07:00
Yann Collet	31b54b6eea	updated ZSTD_initStaticDDict() prototype can also specify dictContentType.	2018-03-20 14:52:02 -07:00
Nick Terrell	136b9e2392	Fix external sequence corner cases * Clear external sequences when we reset the `ZSTD_CCtx`. * Skip external sequences when a block is too small to compress.	2018-03-20 14:50:28 -07:00
Yann Collet	353117c5d7	implemented ZSTD_DCtx_loadDictionary*() this required updating ZSTD_createDDict_advanced() to accept a dictContentType parameter (raw, full, auto).	2018-03-20 13:40:29 -07:00
Yann Collet	451357f37f	Merge pull request #1058 from facebook/cctxParams updated CCtxParams API	2018-03-20 12:36:12 -07:00
Yann Collet	2ed5af0766	merge bmi2 implementation of encodeSequence into zstd_compress.c	2018-03-19 19:10:31 -07:00
Nick Terrell	d19f803a3b	Fix window size for 1 worker + flushing	2018-03-19 18:56:39 -07:00
Nick Terrell	24d9edbdd8	Set ldmParams to 0 when disabled	2018-03-19 18:23:54 -07:00
Nick Terrell	4b92574feb	Fix corner cases exposed by zstreamtest	2018-03-19 17:54:04 -07:00
Nick Terrell	94c77710a9	Integrate ldm with zstdmt Integrate ldm into zstdmt by running it in serial and in order in the first step of each job, in the same place as the hash gets updated. The input buffer is sized to fit the whole LDM window and 2 full buffers of slack. Input buffers cannot be reused until the LDM step is done with them. After the LDM step is finished, the jobs don't actually have access to the full window, only the overlap. Tested on a few different multi-GB files with and without sanitizers, and with different numbers of threads.	2018-03-19 16:29:03 -07:00
Nick Terrell	aa4dbd09a1	Pull job/overlap log logic into common function (#1055 ) Prepares for LDM integration by separating the job size and overlap logic into helper functions.	2018-03-19 15:56:36 -07:00
Yann Collet	c8b3d389fd	updated CCtxParams API to respect naming convention : ZSTD_CCtxParams_*()	2018-03-19 15:07:26 -07:00
Yann Collet	6f4d0778a5	make it possible to express compression parameters in any order	2018-03-19 14:41:23 -07:00
Nick Terrell	2253d01b27	Move XXH64_update() into worker threads * Computes the XXH hash in the worker threads. * Workers get a sequence number and wait until ther number shows up. On error, ensures that its sequence is finished, so future threads don't get blocked. * Sets up for ldm integration, which will go in the same spot.	2018-03-19 11:08:27 -07:00
Yann Collet	9618c0c804	make it possible to specify LDM parameters in any order	2018-03-19 11:07:04 -07:00
Yann Collet	ec0959e701	Merge branch 'dev' into mt-single	2018-03-18 01:06:31 -07:00
Nick Terrell	4af1fafeb8	Restore setting loadedDictEnd Setting `loadedDictEnd` was accidently removed from `ZSTD_loadDictionaryContent()`, which means that dictionary compression will only be able to reference the parts of the dictionary within the window. The spec allows us to reference the entire dictionary so long as even one byte is in the window. `ZSTD_enforceMaxDist()` incorrectly always allowed offsets up to `loadedDictEnd` beyond the window, even once the dictionary was out of range. When overflow protection kicked in, the check `current > loadedDictEnd + maxDist` is incorrect if `loadedDictEnd` isn't reset back to zero. `current` could be reset below the value, which would incorrectly allow references beyond the window. This bug is present in `master`, but is very hard to trigger, since it requires both dictionaries and data which triggers overflow correction.	2018-03-16 14:54:06 -07:00
Yann Collet	cbc71e40f6	moving LRM parameters out of experimental section into "normal" range, start pinned at 160.	2018-03-15 17:22:40 -07:00

1 2 3 4 5 ...

2293 Commits