AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Nick Terrell	374f868354	Update whitespace	2017-05-10 17:48:42 -07:00
Nick Terrell	5f2c7213c7	Merge remote-tracking branch 'upstream/dev' into btopt * upstream/dev: (305 commits) added test for ZSTD_estimateCStreamSize() changed variable name, for clarity fixed ZSTD_estimateCStreamSize() shortened ZSTD_createCStream_Advanced() fixed symbols test added ZSTD_estimateDStreamSize() changed name frameParams into frameHeader regroup memory usage function declarations separated ZSTD_estimateCStreamSize() from ZSTD_estimateCCtxSize() bumped version number added ZSTD_estimateCDictSize() and ZSTD_estimateDDictSize() Updated ZSTD_freeCCtx() updated ZSTD_estimateCCtxSize() Updated ZSTD_sizeof_CCtx() merged CCtx and CStream as a single same object cli : -d and -t do not stop after a failed decompression added dev branch CircleCI badge added dev branch Appveyor badge keep dev branch status only creates a binary archive without the `programs` directory ...	2017-05-10 16:49:58 -07:00
Yann Collet	669346fe8b	fixed ZSTD_estimateCStreamSize() https://github.com/facebook/zstd/pull/689#discussion_r115637721	2017-05-10 11:08:00 -07:00
Yann Collet	6fb2f24132	shortened ZSTD_createCStream_Advanced() https://github.com/facebook/zstd/pull/689#discussion_r115637613	2017-05-10 11:06:06 -07:00
Yann Collet	fa8dadb294	separated ZSTD_estimateCStreamSize() from ZSTD_estimateCCtxSize() for clarity	2017-05-08 18:24:16 -07:00
Yann Collet	a1d6704d7f	added ZSTD_estimateCDictSize() and ZSTD_estimateDDictSize() it complements ZSTD_estimateCCtxSize() for the special case of ZSTD_initCStream_usingDict()	2017-05-08 17:51:49 -07:00
Yann Collet	7855366598	Updated ZSTD_freeCCtx() which can also contain streaming buffers now. Redirected ZSTD_freeCStream() towards it.	2017-05-08 17:15:00 -07:00
Yann Collet	fc5145955a	updated ZSTD_estimateCCtxSize() added a parameter streaming, to estimate memory allocation size when the CCtx is used for streaming (CStream). Note : this function is not able to estimate memory cost of a potential internal CDict which can only happen when starting with ZSTD_initCStream_usingDict()	2017-05-08 17:07:59 -07:00
Yann Collet	791d744279	Updated ZSTD_sizeof_CCtx() can now contain buffers if object used as CStream. ZSTD_sizeof_CStream() is now just a thin wrapper of ZSTD_sizeof_CCtx().	2017-05-08 16:17:30 -07:00
Yann Collet	0be6fd3429	merged CCtx and CStream as a single same object To be changed : ZSTD_sizeof_CCtx(), ZSTD_estimateCCtxSize()	2017-05-08 16:08:01 -07:00
Yann Collet	a51cab6e68	Merge pull request #678 from facebook/apiChange Breaking API Change around CDict	2017-04-28 10:02:45 -07:00
Yann Collet	29297c6751	Changed default level 18 (large input) Previous -18 : 4.7 MB/s, R:3.833 New -18 : 5.1 MB/s. R:3.825 It's a better fit within -17 (6.8 MB/s) and -19 (4.0 MB/s) The new level 18 also uses significantly less memory. And, it makes a good transition between level 17 (mml5) and level 19 (mml3). Up to now, there was no level with mml4. (note : minmatch setting can have a large impact on some (specific) datasets)	2017-04-27 17:44:01 -07:00
Yann Collet	1c3ab0c77f	fixed init error on Visual 2008	2017-04-27 12:57:11 -07:00
Yann Collet	77bf59ef50	added ZSTD_initCStream_usingCDict_advanced()	2017-04-27 11:43:04 -07:00
Yann Collet	f4bd857d81	created ZSTD_compress_usingCDict_advanced()	2017-04-27 11:31:55 -07:00
Yann Collet	69a54d138a	fixed compilation warning : declaration-after-statement	2017-04-27 01:11:26 -07:00
Yann Collet	31533bacce	Changed ZSTD_createCDict_advanced() It now only uses compressionParameters as argument. It produces many changes throughout user code, though hopefully they tend to be simple : just provide the cParams part from existing ZSTD_parameters. Some programs might depend on ZSTD_createCDict_advanced() to pass frame parameters. This change will force them to revisit this strategy and fix it, since frame parameters are effectively silently ignored in current version.	2017-04-27 00:29:04 -07:00
Yann Collet	768df129d2	changed ZSTD_compressBegin_usingCDict() No longer takes `pledgedSrcSize` as argument this is in line with similar functions ZSTD_compress_usingCDict() and ZSTD_initCStream_usingCDict().	2017-04-26 15:42:10 -07:00
Yann Collet	e42afbc6fa	Comply with suggested comments by @terrelln created FSE_CTABLE_SIZE() and FSE_DTABLE_SIZE()	2017-04-26 11:39:35 -07:00
Yann Collet	7271203bdb	transferred entropy scratch space from CCtx into workSpace Saved 6 KB	2017-04-20 23:21:19 -07:00
Yann Collet	a408645f50	made some room for entropy scratch space	2017-04-20 23:09:39 -07:00
Yann Collet	71aaa32c3c	transferred FSE tables from CCtx into workspace Saved 5 KB from CCtx	2017-04-20 23:03:38 -07:00
Yann Collet	71ddeb67b1	made room in workspace for FSE tables still need to be transferred from CCtx into workspace	2017-04-20 22:54:54 -07:00
Yann Collet	a34a39c183	changed size evaluation of entropy tables so that memcpy() does no longer depends on fse pointer being a static table	2017-04-20 18:26:25 -07:00
Yann Collet	7bb60b17d8	init entropy table pointers only once per workSpace resize	2017-04-20 17:38:56 -07:00
Yann Collet	e6fa70a0a1	reorganized ZSTD_resetCCtx_internal() clearer separation between variables and buffers clearer buffers category kept static buffers at the beginning, favoring cache locality (it will be easier to add FSE tables there later) This break a few assumptions that hashTable was always at the beginning. This is fixed. And remaining assumptions (namely that tables stand next to each other in memory) are now tested with assert.	2017-04-20 17:28:31 -07:00
Yann Collet	c17e020c9a	disable assert when compiling paramgrill paramgrill is a benchmark calibration function. Speed accuracy is critical, it cannot be altered by assert.	2017-04-20 12:50:02 -07:00
Yann Collet	16f9c572fc	Merge branch 'dev' into compressionFlow	2017-04-20 11:16:40 -07:00
Yann Collet	e348dad305	minor long line reformatting	2017-04-20 11:14:13 -07:00
Yann Collet	2c5514c759	fixed ZSTDMT_initCStream_advanced() Must use the new ZSTD_compressBegin_usingCDict_advanced() to enforce correct frame parameters	2017-04-18 22:52:41 -07:00
Yann Collet	a4cab80183	added ZSTD_copyCCtx_internal() which respects provided fParams.	2017-04-18 14:54:54 -07:00
Yann Collet	30fb499208	Changed ZSTD_resetCCtx_advanced() into ZSTD_resetCCtx_internal() for naming consistency : _advanced() can be invoked while _internal() are strictly static	2017-04-18 14:08:50 -07:00
Yann Collet	715b9aa113	created ZSTD_compressBegin_usingCDict_advanced()	2017-04-18 13:55:53 -07:00
Yann Collet	4f818182b8	clarified frame parameters for ZSTD_compress*_usingCDict() created ZSTD_compressBegin_usingCDict_internal(), which gives direct control to frame Parameters. ZSTD_resetCStream_internal() now points into it.	2017-04-17 18:29:06 -07:00
Yann Collet	c47c68f6ca	proper evaluation of Huffman CTable size	2017-04-17 16:14:21 -07:00
Yann Collet	88009a8ba2	removed srcSize control from CStream since it's already done from lower bufferless API level	2017-04-12 00:51:24 -07:00
Yann Collet	20d5e03893	content size is controlled at bufferless level so it's active for all entry points Also : added relevant test (wrong content size) in fuzzer	2017-04-11 18:34:02 -07:00
Yann Collet	4ee6b15dac	force contentSizeFlag=0 when using ZSTD_initCStream_usingCDict() because by definition srcSize is not known when using this prototype. added relevant test Note : this use was already working, because at a later stage (both ZSTD_compressBegin_usingCDict() and ZSTD_copyCCtx()) pledgedSrcSize=0 is translated into "unknown", no matter the frame parameter. This is not correct, but of little importance, as the medium term plan is to no longer set fParams within CDict	2017-04-11 11:59:44 -07:00
Yann Collet	ab9162ebb4	simplified call graph by calling ZSTD_compressBegin_internal() instead of ZSTD_compressBegin_advanced()	2017-04-11 10:46:20 -07:00
Yann Collet	e88034fe26	simplified ZSTD_initCStream*() flow all variants converge towards ZSTD_initCStream_stage2()	2017-04-10 22:24:02 -07:00
Yann Collet	4b987ad8ce	Introduce ZSTD_initCStream_internal() This is now the regroup point for ZSTD_initCStream*() functions ZSTD_initCStream_advanced() now properly checks for parameters validity. Also : added <assert.h> usage inside zstd_compress.c Needs ZSTD_DEBUG=1 macro to be triggered. Will be triggered by default from `tests` directory	2017-04-10 17:50:44 -07:00
Yann Collet	0181fef545	ensure cctx internal buffer is correctly sized in case of memory error	2017-04-06 01:25:26 -07:00
Yann Collet	02d37aa1c1	ensure correct size of internal buffers in case of error	2017-04-05 14:53:51 -07:00
Nick Terrell	405d2a1027	Explicitly convert scratchBuffer to unsigned*	2017-04-04 16:35:31 -07:00
Nick Terrell	16a739cab0	Switch call of FSE_count() to FSE_count_wksp()	2017-04-04 16:17:21 -07:00
Yann Collet	7cf78f1be7	Protects ZSTD_compressBegin_usingCDict() vs NULL cdict dereference Will issue an error (GENERIC) is cdict==NULL	2017-04-04 12:38:14 -07:00
Nick Terrell	26b046a7c4	Remove unnecessary dictID store	2017-04-03 21:46:28 -07:00
Nick Terrell	39a6cc5172	Make ZSTD_compress_usingCDict() respect contentSizeFlag	2017-04-03 21:09:55 -07:00
Nick Terrell	62ecad3819	Fix ZSTD_initCStream_usingCDict() to use dictionary	2017-04-03 21:05:59 -07:00
Yann Collet	30c7698970	optimize ZSTDMT_compress() memory usage does no longer allocate temporary buffers when there is enough room in dstBuffer to decompress directly there. (previous method would skip that for 1st chunk only). Also : fix ZSTD_compressBound() for small srcSize	2017-03-31 18:27:03 -07:00
Yann Collet	3f75d52527	Changed ZSTD_compressBound() required so that if Total = A+B compressBound(Total) <= compressBound(A) + compressBound(B) under condition of a minimum size for A and B Will help for ZSTDMT_compress() memory allocation	2017-03-31 17:11:38 -07:00
Yann Collet	eea7858e2b	fixed minor warnings in debug code	2017-03-30 16:47:19 -07:00
Yann Collet	34cc487d05	overlap at full windowSize for max compression level as it provides max compression ratio	2017-03-30 16:23:22 -07:00
Yann Collet	458e955c23	improved ZSTDMT_compress() Use a bit more threads by default. Uses overlap segments to boost compression ratio (like the streaming variant)	2017-03-30 15:51:58 -07:00
Yann Collet	6476c51b86	Merge pull request #637 from facebook/zstdmt Zstdmt	2017-03-30 14:18:37 -07:00
Nick Terrell	5152fb2cb2	Convert all tabs to spaces	2017-03-29 18:51:58 -07:00
Yann Collet	ca5a8bbe36	re-added patch ...	2017-03-29 17:15:27 -07:00
Yann Collet	2e2e78de47	removed unnecessary restriction on minmatchLength it's now transparently translated to nearest value when unsupported (7->6) (3->4)	2017-03-29 16:02:47 -07:00
Yann Collet	933ce4a1dd	fix : minmatch 7 conversion minmatch 7 now converted to minmatch 6 for strategies which do not support 7 Used to folded into "default", which applied minmatch 4	2017-03-29 14:35:38 -07:00
Yann Collet	2238870eb6	Merge pull request #625 from facebook/loadCDict limited CDict acceptation criteria to be the same as DDict	2017-03-24 16:06:20 -07:00
Yann Collet	16a0b10781	fixed ZSTD_loadZstdDictionary() forgot to add the dictionary content (tests were not failing, just compressing less). Also : added size protections when adding dict content since hc/bt table filling would fail if size < 8	2017-03-24 12:46:46 -07:00
Yann Collet	23776ce290	fixed ERROR_GENERIC on dstSize_tooSmall required by users which depends on this error code to size dest buffer	2017-03-23 17:59:50 -07:00
Yann Collet	bea78e8fc2	limited CDict acceptation criteria to be the same as DDict	2017-03-23 15:46:06 -07:00
Nick Terrell	eaf69b07f0	Zero pointers after freeing	2017-03-21 13:20:59 -07:00
Nick Terrell	f35ef5c8e9	Whitespace only: tabs to spaces	2017-03-09 12:51:33 -08:00
Nick Terrell	eeb31eed39	s/ZSTD_btopt2/ZSTD_btultra/g	2017-03-09 11:44:25 -08:00
Yann Collet	a41a4ed39a	Merge pull request #594 from terrelln/bugs Small fixes	2017-03-08 14:56:07 -08:00
Nick Terrell	e06c303475	Fix ZSTD_sizeof_CStream()	2017-03-08 13:45:10 -08:00
Sean Purcell	881abe44f1	Reduce point at which we reduce offsets to protect against UB	2017-03-07 16:58:08 -08:00
Sean Purcell	3437bf2feb	Add build targets to the Makefile, and update CircleCI tests	2017-03-06 15:05:02 -08:00
Nick Terrell	54c4babd8f	Always check Huffman tables for ZSTD_lazy+ The compressor always reuses the existing Huffman table if the literals size is at most 1 KiB. If the compression strategy is `ZSTD_lazy` or stronger always check to see if reusing the previous table or creating a new table is better. This doesn't yet weigh in decompression speed. I don't want to add any heuristics there until I have real data to work with to ensure that the heuristic works for at least one use case, preferably more.	2017-03-03 16:49:38 -08:00
Yann Collet	f44b55c18d	Merge pull request #584 from terrelln/huff-repeat Allow compressor to repeat Huffman tables	2017-03-02 17:20:11 -08:00
Nick Terrell	d051cd5b43	Use workspace for count and CTable	2017-03-02 16:38:07 -08:00
Sean Purcell	553f67e0c1	Remove 'generic' inline strategy Seems to avoid performance loss for compression. Same strategy tested on decompression side, did not appear to improve speed.	2017-03-02 15:18:13 -08:00
Sean Purcell	3d95925a59	Merge remote-tracking branch 'origin/dev' into m32	2017-03-02 15:17:56 -08:00
Nick Terrell	a419777eb1	Allow compressor to repeat Huffman tables * Compressor saves most recently used Huffman table and reuses it if it produces better results. * I attempted to preserve CPU usage profile. I intentionally left all of the existing heuristics in place. There is only a speed difference on the second block and later. When compressing large enough blocks (say >= 4 KiB) there is no significant difference in compression speed. Dictionary compression of one block is the same speed for blocks with literals <= 1 KiB, and after that the difference is not very significant. * In the synthetic data, with blocks 10 KB or smaller, most blocks can't use repeated tables because the previous block did not contain a symbol that the current block contains. Once blocks are about 12 KB or more, most previous blocks have valid Huffman tables for the current block, and the compression ratio and decompression speed jumped. * In silesia blocks as small as 4KB can frequently reuse the previous Huffman table (85%), but it isn't as profitable, and the previous Huffman table only gets used about 3% of the time. * Microbenchmarks show that `HUF_validateCTable()` takes ~55 ns and `HUF_estimateCompressedSize()` takes ~35 ns. They are decently well optimized, the first versions took 90 ns and 120 ns respectively. `HUF_validateCTable()` could be twice as fast, if we cast the `HUF_CElt` to a `U32` and compare to 0. However, `U32` has an alignment of 4 instead of 2, so I think that might be undefined behavior. * I've ran `zstreamtest` compiled normally, with UASAN and with MSAN for 4 hours each. The worst case for the speed difference is a bunch of small blocks in the same frame. I modified `bench.c` to compress the input in a single frame but with blocks of the given block size, set by `-B`. Benchmarks on level 1: \| Program \| Block size \| Corpus \| Ratio \| Compression MB/s \| Decompression MB/s \| \|-----------\|------------\|-----------\|-------\|------------------\|--------------------\| \| zstd.base \| 256 \| synthetic \| 2.364 \| 110.0 \| 297.0 \| \| zstd \| 256 \| synthetic \| 2.367 \| 108.9 \| 297.0 \| \| zstd.base \| 256 \| silesia \| 2.204 \| 93.8 \| 415.7 \| \| zstd \| 256 \| silesia \| 2.204 \| 93.4 \| 415.7 \| \| zstd.base \| 512 \| synthetic \| 2.594 \| 144.2 \| 420.0 \| \| zstd \| 512 \| synthetic \| 2.599 \| 141.5 \| 425.7 \| \| zstd.base \| 512 \| silesia \| 2.358 \| 118.4 \| 432.6 \| \| zstd \| 512 \| silesia \| 2.358 \| 119.8 \| 432.6 \| \| zstd.base \| 1024 \| synthetic \| 2.790 \| 192.3 \| 594.1 \| \| zstd \| 1024 \| synthetic \| 2.794 \| 192.3 \| 600.0 \| \| zstd.base \| 1024 \| silesia \| 2.524 \| 148.2 \| 464.2 \| \| zstd \| 1024 \| silesia \| 2.525 \| 148.2 \| 467.6 \| \| zstd.base \| 4096 \| synthetic \| 3.023 \| 300.0 \| 1000.0 \| \| zstd \| 4096 \| synthetic \| 3.024 \| 300.0 \| 1010.1 \| \| zstd.base \| 4096 \| silesia \| 2.779 \| 223.1 \| 623.5 \| \| zstd \| 4096 \| silesia \| 2.779 \| 223.1 \| 636.0 \| \| zstd.base \| 16384 \| synthetic \| 3.131 \| 350.0 \| 1150.1 \| \| zstd \| 16384 \| synthetic \| 3.152 \| 350.0 \| 1630.3 \| \| zstd.base \| 16384 \| silesia \| 2.871 \| 296.5 \| 883.3 \| \| zstd \| 16384 \| silesia \| 2.872 \| 294.4 \| 898.3 \|	2017-03-02 13:27:52 -08:00
Sean Purcell	d44703d145	Offsets >= 32MB in 32-bits mode	2017-03-01 16:27:56 -08:00
Yann Collet	4bcc69b761	solves warnings when compiling with global XXH_STATIC_LINKING_ONLY XXH_STATIC_LINKING_ONLY protection macro is intended to be triggered just before the include. The main idea is to keep this setting local : user module shall explicitly understand and accept the static linking restriction which becomes transparent when triggering the macro at project level. Global definition also triggers redefinition warnings for user modules which do locally define the macro. This new version compiles lib and cli without warning when the macro is set globally. That's not a scenario to be recommended, since it trades a local effect for a global one, but it was easy enough to provide from zstd side.	2017-03-01 11:33:25 -08:00
Yann Collet	dccd6b6f65	cli : fix : --rm is silent when input is stdin previously, app would produce an error message, and stop.	2017-02-27 15:57:50 -08:00
Yann Collet	14312d833e	zstdmt : fix : loading prefix from previous segments There used to be a (very small) chance that loading prefix from previous segment would be confused with a real zstd dictionary. For that to happen, the prefix needs to start with the same value as dictionary magic. That's 1 chance in 4 billions if all values have equal probability. But in fact, since some values are more common (0x00000000 for example) others are less common, and dictionary magic was selected to be one of them, so probabilities are likely even lower. Anyway, this risk is no down to zero by adding a new CCtx parameter : ZSTD_p_forceRawDict Current parameter policy : the parameter "stick" to its CCtx, so any dictionary loading after ZSTD_p_forceRawDict is set will be loaded in "raw" ("content only") mode, even if CCtx is re-used multiple times with multiple different dictionary. It's up to the user to reset this value differently if it needs so.	2017-02-23 23:42:12 -08:00
Yann Collet	831b4890ce	minor tests/Makefile refactoring and update of zstd_manual,html	2017-02-23 23:09:10 -08:00
Sean Purcell	83038d236a	Fix bug in FSE distribution normalization	2017-02-22 13:52:48 -08:00
Przemyslaw Skibinski	d8114e5802	zstd_compress.c: fix memory leaks	2017-02-21 18:59:56 +01:00
Anders Oleson	517577bf53	spelling fixes in comments i.e. occurred labeled Huffman	2017-02-20 12:08:59 -08:00
Yann Collet	2252d29a5a	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-02-15 12:00:50 -08:00
Yann Collet	4596037042	updated fse version feature minor refactoring (removing FSE_abs()) also : fix a few minor issues recently introduced in examples	2017-02-15 12:00:03 -08:00
Nick Terrell	ecf90ca24b	[zstdmt] Fix MSAN failure with ZSTD_p_forceWindow Reproduction steps: ``` make zstreamtest CC=clang CFLAGS="-O3 -g -fsanitize=memory -fsanitize-memory-track-origins" ./zstreamtest -vv -t4178 -i4178 -s4531 ``` How to get to the error in gdb (may be a more efficient way): * 2 breaks at zstd_compress.c:2418 -- in ZSTD_compressContinue_internal() * 2 breaks at zstd_compress.c:2276 -- in ZSTD_compressBlock_internal() * 1 break at zstd_compress.c:1547 Why the error occurred: When `zc->forceWindow == 1`, after calling `ZSTD_loadDictionaryContent()` we have `zc->loadedDictEnd == zc->nextToUpdate == 0`. But, we've really loaded up to `iend` into the dictionary. Then in `ZSTD_compressBlock_internal()` we see that `current > zc->nextToUpdate + 384`, so we load the last 192 bytes a second time. In this case the bytes we are loading are a block of all 0s, starting in the previous block. So when we are loading the last 192 bytes, we find a `match` in the future, 183 bytes beyond `ip`. Since the block is all 0s, the match extends to the end of the block. But in `ZSTD_count()` we only check that `pIn < pInLoopLimit`, but since `pMatch > pIn`, `pMatch` eventually points past the end of the buffer, causing the MSAN failure. The fix: The line changed sets sets `zc->nextToUpdate` to the end of the dictionary. This is the behavior that existed before `ZSTD_p_forceWindow` was introduced. This fixes the exposing test case. Since the code doesn't fail without `zc->forceWindow`, it makes sense that this works. I've run the command `./zstreamtest -T2mn` 64 times without failures. CI should also verify nothing obvious broke.	2017-02-13 19:11:22 -08:00
Sean Purcell	2db7249265	Make pledgedSrcSize meaning clear for other functions - Added tests - Moved new size functions to static link only	2017-02-09 11:49:58 -08:00
Sean Purcell	0f5c95af44	Disambiguate pledgedSrcSize == 0 - Modify ZSTD CLI to only set contentSizeFlag if it _knows_ the size - Change pzstd to stop setting contentSizeFlag without accurate pledgedSrcSize	2017-02-08 15:12:46 -08:00
Yann Collet	48bed91606	Merge pull request #527 from facebook/zstdmt zstdmt refinements	2017-01-31 16:36:46 -08:00
Yann Collet	b2e1b3d670	fixed overlapLog==0 => no overlap	2017-01-30 14:54:46 -08:00
Yann Collet	3672d06d06	zstdmt : section size is set to be a minimum of overlapSize the minimum size condition size is applied transparently (no warning, no error) like previous minimum section size condition (1 KB) which still applies.	2017-01-30 13:35:45 -08:00
Yann Collet	88df1aed61	changed advanced parameter overlapLog Follows a positive logic (increasing value => increasing overlap) which is easier to use	2017-01-30 11:00:00 -08:00
Nick Terrell	b42dd27ef5	Add include guards and extern C	2017-01-27 16:00:19 -08:00
Yann Collet	f6d4a786fc	reduced zstdmt latency when using small custom section sizes with high compression levels Previous version was requiring a fairly large initial amount of input data before starting to create compression jobs. This new version starts the process much sooner.	2017-01-27 15:55:30 -08:00
Yann Collet	717c65d690	Merge pull request #519 from inikep/dev11 Dev11	2017-01-26 14:23:44 -08:00
Yann Collet	8dafb1acf5	CLI : automatically set overlap size to max (windowSize) for max compression level	2017-01-25 17:01:13 -08:00
Yann Collet	06e7697f96	added test of new parameter ZSTD_p_forceWindow	2017-01-25 16:39:03 -08:00
Yann Collet	bb0027405a	fixed zstdmt corruption issue when enabling overlapped sections see Asana board for detailed explanation on why and how to fix it	2017-01-25 16:25:38 -08:00
Yann Collet	943cff9c37	fixed zstdmt cli freeze issue with large nb of threads fileio.c was continually pushing more content without giving a chance to flush compressed one. It would block the job queue when input data was accumulated too fast (requiring to define many threads). Fixed : fileio flushes whatever it can after each input attempt.	2017-01-25 12:35:19 -08:00
Yann Collet	dc8dae596a	overlapped section, for improved compression Sections 2+ read a bit of data from previous section in order to improve compression ratio. This also costs some CPU, to reference read data. Read data is currently fixed to window>>3 size	2017-01-24 22:32:12 -08:00
Yann Collet	f14a669054	refactor job creation code shared accross ZSTDMT_{compress,flush,end}Stream(), for easier maintenance	2017-01-24 17:41:49 -08:00
Yann Collet	512cbe8c10	zstdmt cli and API allow selection of section sizes By default, section sizes are 4x window size. This new setting allow manual selection of section sizes. The larger they are, the (slightly) better the compression ratio, but also the higher the memory allocation cost, and eventually the lesser the nb of possible threads, since each section is compressed by a single thread. It also introduces a prototype to set generic parameters, ZSTDMT_setMTCtxParameter() The idea is that it's possible to add enums to extend the list of parameters that can be set this way. This is more long-term oriented than a fixed-size struct. Consider it as a test.	2017-01-24 17:08:53 -08:00
Yann Collet	3488a4a473	ZSTDMT now supports frame checksum	2017-01-24 11:48:40 -08:00
Przemyslaw Skibinski	96f152f708	improved ZSTD_compressBlock_opt_extDict_generic	2017-01-24 13:18:50 +01:00
Yann Collet	94364bf87a	refactor ZSTDMT streaming flush code now shared by both ZSTDMT_compressStream() and ZSTDMT_flushStream()	2017-01-23 11:50:44 -08:00
Yann Collet	1cbf251e43	ZSTDMT streaming : fall back to (regular) single thread mode when nbThreads==1	2017-01-23 01:43:58 -08:00
Yann Collet	84581ff8d7	ZSTDMT_compressCCtx : fallback to single-thread mode when nbChunks==1	2017-01-23 01:20:27 -08:00
Yann Collet	1a2547f654	ZSTDMT_compressStream() becomes blocking when required to ensure forward progresses In some (rare) cases, job list could be blocked by a first job still being processed, while all following ones are completed, waiting to be flushed. In such case, the current job-table implementation is unable to accept new job. As a consequence, a call to ZSTDMT_compressStream() can be useless (nothing read, nothing flushed), with the risk to trigger a busy-wait on the caller side (needlessly loop over ZSTDMT_compressStream() ). In such a case, ZSTDMT_compressStream() will block until the first job is completed and ready to flush. It ensures some forward progress by guaranteeing it will flush at least a part of the completed job. Energy-wasting busy-wait is avoided.	2017-01-22 23:49:52 -08:00
Yann Collet	c593348722	ZSTDMT_initCStream_usingDict() can outlive dict Like ZSTD_initCStream_usingDict(), ZSTDMT_initCStream_usingDict() now keep a copy of dict internally. This way, dict can be released : it does not longer have to outlive all future compression sessions.	2017-01-22 16:44:15 -08:00
Yann Collet	9d6f7637ec	protected (mutex) read to jobCompleted, as suggested by @terrelln	2017-01-21 22:14:08 -08:00
Yann Collet	0cf74fa957	optimized pool allocation by 1 slot	2017-01-21 22:06:49 -08:00
Yann Collet	6ed29a8f44	minor : tab to spaces	2017-01-21 21:56:36 -08:00
Yann Collet	317604e0ad	fixed : compilation of zstreamtest in dll mode	2017-01-20 17:18:41 -08:00
Yann Collet	d7e3cb58c5	Resolved merge conflict dev+zstdmt	2017-01-20 16:44:50 -08:00
cyan4973	2e3b659ae1	fixed minor warnings (Visual, conversion, doxygen)	2017-01-20 14:43:09 -08:00
cyan4973	5fba09fa41	updated util's time for Windows compatibility Correctly measures time on Posix systems when running with Multi-threading Todo : check Windows measurement under multi-threading	2017-01-20 12:57:31 -08:00
Yann Collet	b459aad5b4	renamed savedRep into repToConfirm	2017-01-19 17:33:37 -08:00
Yann Collet	500014af49	zstd cli can now compress using multi-threading added : command -T# added : ZSTD_resetCStream() (zstdmt_compress) added : FIO_setNbThreads() (fileio)	2017-01-19 17:04:28 -08:00
Yann Collet	19d670ba9d	Added ZSTDMT_initCStream_advanced() variant Correctly compress with custom params and dictionary Added relevant fuzzer test in zstreamtest Also : new macro ZSTDMT_SECTION_LOGSIZE_MIN, which sets a minimum size for a full job (note : a flush() command can still generate a partial job anytime)	2017-01-19 15:32:07 -08:00
Yann Collet	736788f8e8	added streaming fuzzer tests for MT API Also : fixed corner case, where nb of jobs completed becomes > jobQueueSize which is possible when many flushes are issued while there is not enough dst buffer to flush completed ones.	2017-01-19 12:15:29 -08:00
Yann Collet	32dfae6f98	fixed Multi-threaded compression MT compression generates a single frame. Multi-threading operates by breaking the frames into independent sections. But from a decoder perspective, there is no difference : it's just a suite of blocks. Problem is, decoder preserves repCodes from previous block to start decoding next block. This is also valid between sections, since they are no different than changing block. Previous version would incorrectly initialize repcodes to their default value at the beginning of each section. When using them, there was a mismatch between encoder (default values) and decoder (values from previous block). This change ensures that repcodes won't be used at the beginning of a new section. It works by setting them to 0. This only works with regular (single segment) variants : extDict variants will fail ! Fortunately, sections beyond the 1st one belong to this category. To be checked : btopt strategy. This change was only validated from fast to btlazy2 strategies.	2017-01-19 10:32:55 -08:00
Yann Collet	37226c1e9f	Simplified compressChunk job minor refactoring : compression done in a single call on first chunk Avoid a mutable hSize variable and eventual recombination to cSize at the end	2017-01-19 10:18:17 -08:00
Yann Collet	6073b3e6b8	ZSTDMT_endStream : nullify input buffer after flush There will be no more input after ZSTDMT_endStream invocation : only flush/end is allowed (to fully collect compressed result).	2017-01-18 15:32:38 -08:00
Yann Collet	3a01c46b26	ZSTDMT_initCStream() supports restart from invalid state ZSTDMT_initCStream() will correcly scrub for resources when it detects that previous compression was not properly finished.	2017-01-18 15:18:17 -08:00
Yann Collet	4885f591b3	trap compression errors, collect back resources from workers	2017-01-18 14:11:37 -08:00
Yann Collet	563ef8acf4	CCtxPool starts empty, as suggested by @terrelln Also : make zstdmt now a target from root	2017-01-18 12:12:10 -08:00
Yann Collet	a6db7a7b9b	fixed cmaketest (buffer_t){NULL,0} is not considered a constant. {NULL,0} is.	2017-01-18 11:57:34 -08:00
Yann Collet	0d6b8f65a9	ZSTDMT_free() scrubs potentially unfinished jobs to release their resources In some complex scenarios (free() without finishing compression), it is possible that some resources are still into jobs and not collected back into pools. In which case, previous version of free() would miss them. This would be equivalent to a leak. New version ensures that it even foes after such resource. It requires job consumers to properly mark resources as released, by replacing entries by NULL after releasing back to the pool. Obviously, it's not recommended to free() zstdmt context mid-term, still that's now a supported scenario. The same methodology is also used to ensure proper resource collection after an error is detected. Still to do : - detect compression errors (not just allocation ones) - properly manage resource when init() is called without finishing previous compression.	2017-01-17 17:46:33 -08:00
Yann Collet	d0a1d45582	ZSTDMT_{flush,end}Stream() now block on next job completion when nothing to flush The main issue was to avoid a caller to continually loop on {flush,end}Stream() when there was nothing ready to be flushed but still some compression work ongoing in a worker thread. The continuous loop would have resulted in wasted energy. The new version makes call to {flush,end}Stream blocking when there is nothing ready to be flushed. Of course, if all worker threads have exhausted job, it will return zero (all flush completed). Note : There are still some remaining issues to report error codes and properly collect back resources into pools when an error is triggered.	2017-01-17 16:15:18 -08:00
Yann Collet	a73c412932	completed ZSTDMT streaming compression Provides the baseline compression API : size_t ZSTDMT_initCStream(ZSTDMT_CCtx* zcs, int compressionLevel); size_t ZSTDMT_compressStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output, ZSTD_inBuffer* input); size_t ZSTDMT_flushStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output); size_t ZSTDMT_endStream(ZSTDMT_CCtx* zcs, ZSTD_outBuffer* output); Not tested yet	2017-01-17 15:31:16 -08:00
Sean Purcell	57d423c5df	Don't create dict in streaming apis if dictSize == 0	2017-01-17 14:31:35 -08:00
Gregory Szorc	7d6f478d15	Set dictionary ID in ZSTD_initCStream_usingCDict() When porting python-zstandard to use ZSTD_initCStream_usingCDict() so compression dictionaries could be reused, an automated test failed due to compressed content changing. I tracked this down to ZSTD_initCStream_usingCDict() not setting the dictID field of the ZSTD_CCtx attached to the ZSTD_CStream instance. I'm not 100% convinced this is the correct or full solution, as I'm still seeing one automated test failing with this change.	2017-01-14 17:44:54 -08:00
Yann Collet	5b726dbe4d	fix gcc-arm warning "suggest braces around empty body"	2017-01-12 17:46:46 +01:00
Yann Collet	ad9f6bd123	zstdmt : fix : resources properly collected even when early fail In previous version, main function would return early when detecting a job error. Late threads resources were therefore not collected back into pools. New version just register the error, but continue the collecting process. All buffers and context should be released back to pool before leaving main function.	2017-01-12 03:06:35 +01:00
Yann Collet	b05c4828ea	zstdmt : correctly check for cctx and buffer allocation Result from getBuffer and getCCtx could be NULL when allocation fails. Now correctly checks : job creation stop and last job reports an allocation error. releaseBuffer and releaseCCtx are now also compatible with NULL input. Identified a new potential issue : when early job fails, later jobs are not collected for resource retrieval.	2017-01-12 02:01:28 +01:00
Yann Collet	107bcbbbc2	zstdmt : changed internal naming from frame to chunk Since the result of mt compression is a single frame, changed naming, which implied the concatenation of multiple frames. minor : ensures that content size is written in header	2017-01-12 01:25:46 +01:00
Yann Collet	5eb749e734	ZSTDMT_compress() creates a single frame The new strategy involves cutting frame at block level. The result is a single frame, preserving ZSTD_getDecompressedSize() As a consequence, bench can now make a full round-trip, since the result is compatible with ZSTD_decompress(). This strategy will not make it possible to decode the frame with multiple threads since the exact cut between independent blocks is not known. MT decoding needs further discussions.	2017-01-11 18:21:25 +01:00
Yann Collet	04cbc36499	minor refactor (release CCtx 1st) and comment clarification	2017-01-11 16:08:08 +01:00
Yann Collet	085179bb78	fixed ZSTDMT_createCCtx() : checked inner objects are properly created	2017-01-11 15:58:05 +01:00
Yann Collet	8ce1cc2bec	improved ZSTD_createCCtxPool() cancellation use ZSTD_freeCCtxPool() to release the partially created pool. avoids to duplicate logic. Also : identified a new difficult corner case : when freeing the Pool, all CCtx should be previously released back to the pool. Otherwise, it means some CCtx are still in use. There is currently no clear policy on what to do in such a case. Note : it's supposed to never happen. Since pool creation/usage is static, it has no external user, which limits risks.	2017-01-11 15:44:26 +01:00
Yann Collet	47557ba2b2	fixed ZSTDMT_createCCtxPool() when inner CCtx creation fails	2017-01-11 15:35:56 +01:00
Yann Collet	f1cb55192c	fixed linux warnings	2017-01-02 01:11:55 +01:00
Yann Collet	0ec6a95ba1	minor fixes	2017-01-02 00:49:42 +01:00
Yann Collet	2ec635a162	use pthread_cond to send signals between threads	2017-01-01 17:31:33 +01:00
Yann Collet	3b9d434356	extended ZSTDMT code support for non-MT systems and WIN32 (preliminary)	2016-12-31 16:32:19 +01:00
Yann Collet	c8efc1c874	simplified Buffer Pool	2016-12-31 14:45:33 +01:00
Yann Collet	3b29dbd9e8	new zstdmt version using generic treadpool	2016-12-31 06:04:25 +01:00
Yann Collet	c6a6417458	bench correctly measures time for multi-threaded compression (posix only)	2016-12-31 03:31:26 +01:00
Yann Collet	e70912c72b	Changed : input divided into roughly equal parts. Debug : can measure time waiting for mutexes to unlock.	2016-12-29 01:24:01 +01:00
Yann Collet	6c0ed9483a	compression threads use ZSTD_compressCCtx()	2016-12-28 17:08:28 +01:00
Yann Collet	ce9e1452fd	protect buffer pool with a mutex	2016-12-28 15:31:19 +01:00
Yann Collet	3d93f2fce7	first zstdmt sketch	2016-12-27 07:19:36 +01:00
Yann Collet	aca113f4f5	fixed ZSTD_sizeof_?Dict()	2016-12-23 22:25:03 +01:00
Yann Collet	4e5eea61a8	added ZSTD_createDDict_byReference()	2016-12-21 16:44:35 +01:00
Yann Collet	1f57c2ed32	added : ZSTD_createCDict_byReference()	2016-12-21 16:20:11 +01:00
Nick Terrell	8157a4c3cc	Fix dictionary loading bug causing an MSAN failure Offset rep codes must be in the range `[1, dictSize)`. Fix dictionary loading to reject `0` as a offset rep code.	2016-12-20 10:47:52 -08:00
Yann Collet	d564faa3c6	fix : ZSTD_initCStream_srcSize() correctly set srcSize in frame header	2016-12-18 21:39:15 +01:00
Yann Collet	e795c8a5f6	Added ZSTD_initCStream_srcSize(). Added relevant test cases in zstreamtest	2016-12-13 17:00:14 +01:00
Yann Collet	c3a5c4bef8	introduced cycleLog	2016-12-12 00:47:30 +01:00
Yann Collet	c261f71f6a	minor variation of rescale fix	2016-12-12 00:25:07 +01:00
Nick Terrell	3826207a70	Simplify segfault fix Take advantage of the fact that `chainLog <= windowLog`.	2016-12-10 18:46:55 -08:00
Nick Terrell	0012332ce0	Fix compression segfault When the overflow protection kicks in, it makes sure that ip - ctx->base isn't too large. However, it didn't ensure that saved offsets are still valid. This change ensures that any valid offsets (<= windowLog) are still representable after the update. The bug would shop up on line 1056, when `offset_1 > current + 1`, which causes an underflow. This in turn, would cause a segfault on line 1063. The input must necessarily be longer than 1 GB for this issue to occur. Even then, it only occurs if one of the last 3 matches is larger than the chain size and block size.	2016-12-09 17:15:33 -08:00
Yann Collet	825dffbc43	moved zbuff source files into lib/deprecated	2016-12-05 19:28:19 -08:00
Yann Collet	a0d742b1e4	introduced HUF_buildCTable_wksp(), to reduce stack memory usage	2016-12-01 17:47:30 -08:00
Yann Collet	643d9a234b	replaced usage of FSE_buildCTable by FSE_buildCTable_wksp, using less stack space in the process	2016-12-01 16:24:04 -08:00
Yann Collet	e928f7e16d	introduced ext_wksp variants of count to reduce stack memory usage	2016-12-01 16:13:35 -08:00
Yann Collet	979cab412b	fixed some minor visual silent cast warnings. introduced FSE_count_parallel_wksp().	2016-11-30 18:10:38 -08:00
Yann Collet	5e00b848a8	FSE_compress_wksp() uses less stack space	2016-11-30 16:46:13 -08:00
Yann Collet	d79a9a00d9	Introduced FSE_compress_wksp() and FSE_buildCTable_wksp() to reduce stack memory usage	2016-11-30 15:52:20 -08:00
Yann Collet	25f46dcc0f	minor const	2016-11-29 16:59:27 -08:00
Przemyslaw Skibinski	fc4193bda5	fixed g++ warnings	2016-11-23 18:17:18 +01:00
Przemyslaw Skibinski	9ca65af810	zstd_opt.h: improved price function	2016-11-23 17:22:54 +01:00
Przemyslaw Skibinski	ad3e94512c	fixed warnings from static analyzer in zstd_opt.h	2016-11-21 20:22:12 +01:00
Przemyslaw Skibinski	3d18088b38	updated windres	2016-11-17 18:04:41 +01:00
Yann Collet	407a11f63e	fixed Visual compatibility	2016-11-03 15:52:01 -07:00
Nick Terrell	d82efd8a70	ZSTD_compress_usingDict() when dict gets loaded Specify that when `dict == NULL \|\| dictSize < 8` no dictionary gets loaded. Also add some periods.	2016-11-02 18:07:16 -07:00
Yann Collet	ee5b725823	ZSTD_initCStream() optimization : do not allocate a CDict when no dictionary used	2016-10-27 14:20:55 -07:00
Yann Collet	335ad5d4d4	added ZSTD_initDStream_usingDDict() . slightly optimized ZSTD_initDStream() when no dictionary . fixed ZSTD_sizeof_CStream() .	2016-10-25 17:47:02 -07:00
Yann Collet	9516234e67	first sketch for ZSTD_initCStream_usingCDict()	2016-10-25 16:19:52 -07:00
Yann Collet	62d9a7ddfd	Merge pull request #429 from inikep/btopt2 Btopt2	2016-10-25 14:48:43 -07:00
Przemyslaw Skibinski	5c5f01f3da	added ZSTD_btopt2 strategy	2016-10-25 12:25:07 +02:00
Nick Terrell	b2c39a22b0	Fix compiler narrowing warning	2016-10-24 14:50:13 -07:00
Nick Terrell	f698ad6deb	Merge remote-tracking branch 'upstream/dev' into fixes * upstream/dev: added doc\zstd_manual.html added contrib\gen_html zstd_compression_format.md moved to doc/ Fix small bug in ZSTD_execSequence() improved ZSTD_compressBlock_opt_extDict_generic protect ZSTD_decodeFrameHeader() from invalid usage, as suggested by @spaskob zstd_opt.h: small improvement in compression ratio improved dicitonary segment merge use implicit rules to compile zstd_decompress.c detect early impossible decompression scenario in legacy decoder v0.5 no repeat mode in legacy v0.5 fixed invalid invocation of dictionary in legacy decoder v0.5 fix edge case fix command line interpretation fixed minor corner case zstd.h: added the Introduction section fixed clang 3.5 warnings zstd.h: updated comments	2016-10-24 13:10:13 -07:00
Nick Terrell	f9c9af3c2e	Reject dictionaries with incomplete entropy tables If a dictionary specifies that a symbol has probability zero in its `matchLength`, `literalLength`, or `offset` FSE table, but the symbol appears when compressing input, the compressor fails. Ensure that dictionaries support all `matchLength`, and `literalLength` codes. They must also support all of the `offset` codes required to represent every possible offset that can appear in the first block.	2016-10-24 10:42:44 -07:00
Przemyslaw Skibinski	3ee94a7600	zstd_compression_format.md moved to doc/	2016-10-24 15:58:07 +02:00
Przemyslaw Skibinski	4732074a71	improved ZSTD_compressBlock_opt_extDict_generic	2016-10-21 11:19:00 +02:00
Przemyslaw Skibinski	d365ae3497	zstd_opt.h: small improvement in compression ratio	2016-10-20 11:49:02 +02:00
Yann Collet	197a55ee7b	fix edge case	2016-10-18 11:27:52 -07:00
Nick Terrell	fd98087047	Fix stack buffer overflow in HUF_readCTable() If `w ==0` on line 153, then `CTable[n].nbBits == tableLog + 1`. Then `nbPerRank[CTable[n].nbBits]` and `valPerRank[CTable[n].nbBits]` are stack buffer overflows.	2016-10-17 18:16:59 -07:00
Nick Terrell	bfd943ace5	Fix buffer overrun in ZSTD_loadDictEntropyStats() The table log set by `FSE_readNCount()` was not checked in `ZSTD_loadDictEntropyStats()`. This caused `FSE_buildCTable()` to stack/heap overflow in a few places. The benchmarks look good, there is no obvious compression performance regression: > ./zstds/zstd.opt.0 -i10 -b1 -e10 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 271.6 MB/s , 716.8 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 204.8 MB/s , 671.1 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 156.8 MB/s , 658.6 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 136.4 MB/s , 665.3 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 98.9 MB/s , 647.0 MB/s 6#silesia.tar : 211988480 -> 62979643 (3.366), 65.2 MB/s , 670.4 MB/s 7#silesia.tar : 211988480 -> 61974560 (3.421), 44.9 MB/s , 688.2 MB/s 8#silesia.tar : 211988480 -> 61028308 (3.474), 32.4 MB/s , 711.9 MB/s 9#silesia.tar : 211988480 -> 60416751 (3.509), 21.1 MB/s , 718.1 MB/s 10#silesia.tar : 211988480 -> 60174239 (3.523), 22.2 MB/s , 721.8 MB/s > ./compress_zstds/zstd.opt.1 -i10 -b1 -e10 ~/bench/silesia.tar 1#silesia.tar : 211988480 -> 73656930 (2.878), 273.8 MB/s , 722.0 MB/s 2#silesia.tar : 211988480 -> 70162842 (3.021), 203.2 MB/s , 666.6 MB/s 3#silesia.tar : 211988480 -> 66997986 (3.164), 157.4 MB/s , 666.5 MB/s 4#silesia.tar : 211988480 -> 66002591 (3.212), 132.1 MB/s , 661.9 MB/s 5#silesia.tar : 211988480 -> 65008480 (3.261), 96.8 MB/s , 641.6 MB/s 6#silesia.tar : 211988480 -> 62979643 (3.366), 63.1 MB/s , 677.0 MB/s 7#silesia.tar : 211988480 -> 61974560 (3.421), 44.3 MB/s , 678.2 MB/s 8#silesia.tar : 211988480 -> 61028308 (3.474), 33.1 MB/s , 708.9 MB/s 9#silesia.tar : 211988480 -> 60416751 (3.509), 21.5 MB/s , 710.1 MB/s 10#silesia.tar : 211988480 -> 60174239 (3.523), 21.9 MB/s , 723.9 MB/s	2016-10-17 16:55:52 -07:00
Yann Collet	2b361cf2f1	minor opt	2016-10-14 16:09:07 -07:00
Nick Terrell	3b9cdf9220	Fix ubsan failures (pass NULL to memcpy)	2016-10-12 20:54:42 -07:00
Yann Collet	cf409a7e2a	fixed : init*_advanced() followed by reset() with different pledgedSrcSiz	2016-09-26 16:41:05 +02:00
Yann Collet	97b378a6f8	Streaming : dictionary compression on multiple files / segments can correctly provide srcSize into header (when provided) using pledgedSrcSize.	2016-09-21 17:20:19 +02:00
Yann Collet	993060e0f2	cli : better adaptation to small files	2016-09-21 16:46:08 +02:00
Yann Collet	a6bdf55759	fixed memory leak	2016-09-15 17:02:06 +02:00
Yann Collet	644a8da88a	fixed minor conversion warning	2016-09-15 16:16:21 +02:00
Yann Collet	4cb212938c	introduced ZSTD_resetCStream()	2016-09-15 14:54:07 +02:00
Yann Collet	fa0c09760c	variable renaming	2016-09-15 14:11:01 +02:00
Yann Collet	d7c6589df8	support ZSTD_sizeof_*() on NULL added ZSTD_sizeof_CDict()	2016-09-15 02:57:27 +02:00
Yann Collet	64deef3bee	Fixed srcSize=1	2016-09-14 00:16:07 +02:00
Yann Collet	ac8bace6b1	support large skippable frames	2016-09-07 14:54:23 +02:00
Yann Collet	95d07d7447	introduced CHECK_E	2016-09-06 16:38:51 +02:00
Yann Collet	3e21ec5b01	introduced CHECK_F	2016-09-06 15:36:19 +02:00
Yann Collet	5c956d593c	FORCE_INLINE common definition	2016-09-06 15:05:19 +02:00
Yann Collet	edbcd9f5b2	fixed zbufftest	2016-09-06 14:30:57 +02:00
Yann Collet	b624922b14	fixed checksum	2016-09-06 11:16:57 +02:00
Yann Collet	a7737f6a60	improved compression on small files when using same parameters	2016-09-06 09:44:59 +02:00
Yann Collet	7ae67bb18a	small compression speed gains with using_CDict	2016-09-06 06:28:05 +02:00
Yann Collet	855766d73d	clarified dictionary in format description	2016-09-02 17:04:49 -07:00
Yann Collet	1563bfeabc	fixing FORCE_INLINE for older compilers (#330 )	2016-09-02 11:44:21 -07:00
David Lam	e10f7f3dcb	merge	2016-08-30 12:03:36 -07:00
Yann Collet	4ded9e591c	added boilerplate	2016-08-30 11:06:28 -07:00
Yann Collet	14200a20f0	Fixed issue #304 , reported by @borzunov	2016-08-30 06:51:00 -07:00
David Lam	da9d3b7057	Cleanup some errors in typedef comments and remove duplicated HOWTO from zbuff_decompress.c	2016-08-29 17:31:51 -07:00
Yann Collet	23b6e05d8e	ZSTD_malloc() and ZSTD_free(), to simplify customMem	2016-08-28 21:05:43 -07:00
Yann Collet	e19a9ef05d	update compression level table	2016-08-26 20:02:49 +02:00
Yann Collet	87c18b2ebd	fixed multiple minor warnings for XCode	2016-08-26 01:43:47 +02:00
Yann Collet	0d59a6f73a	removed debug strings	2016-08-25 22:42:46 +02:00
Yann Collet	5a02b69215	reinforced fix for huge files	2016-08-25 22:24:59 +02:00
Yann Collet	96bdd87de4	fixed : compression bug on very large files	2016-08-25 14:34:42 +02:00
inikep	a3a47ec4d0	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev' into Other	2016-08-24 21:25:49 +02:00
Yann Collet	a2cdffe556	fixed wrong parameter issue	2016-08-24 19:42:15 +02:00
inikep	e416e30019	remove unnecessary comments	2016-08-24 17:32:09 +02:00
inikep	4e90f6c1e0	removed ZSTD_LOG_ENCODE and ZSTD_LOG_BLOCK	2016-08-24 17:24:11 +02:00
inikep	83388e109f	removed ZSTD_LOG_PARSER	2016-08-24 17:22:20 +02:00
inikep	8a36f8527c	removed stats in debug mode	2016-08-24 17:19:12 +02:00
Yann Collet	24b68a5b23	update cLevel table for 256KB	2016-08-24 14:22:26 +02:00
Yann Collet	c54692faeb	improved level 3	2016-08-24 01:45:46 +02:00
Yann Collet	70e3b31306	fixed playtests on os-x	2016-08-23 01:18:06 +02:00
Yann Collet	cb3276329a	added sizeof CStream and DStream	2016-08-23 00:31:59 +02:00
Yann Collet	d1733f7417	fixed crc bug in rare timing conditions within bench.c	2016-08-21 01:04:46 +02:00
Yann Collet	c411902230	fixed g++ conversion warning	2016-08-17 01:50:54 +02:00
Yann Collet	53e17fbd5e	updated streaming API	2016-08-17 01:39:22 +02:00
Yann Collet	104e5b072d	added : streaming decompression API	2016-08-16 15:11:28 +02:00
Yann Collet	5a0c8e2439	new streaming API (compression)	2016-08-16 15:11:27 +02:00
inikep	5f49eba512	added usage of rep[0]-1 for the optimal parser	2016-08-10 15:01:53 +02:00
inikep	98e08cbe34	fixed: tree not updated after finding very long rep matches	2016-08-10 15:00:30 +02:00
Yann Collet	280f9a8754	minor comment	2016-08-08 00:44:00 +02:00
Yann Collet	0763905f44	ZSTD_compress_usingCDict() correctly provides original size by default in frame header Fixed dictionary examples	2016-08-03 01:57:57 +02:00
Yann Collet	346efccc35	fixed doc typo	2016-08-02 14:26:00 +02:00
Yann Collet	c55eb18c11	Merge pull request #267 from inikep/dev08 fixed ZSTD_compressBlock_opt_extDict_generic	2016-07-31 22:00:16 +02:00
inikep	056df510aa	fixed ZSTD_compressBlock_opt_extDict_generic	2016-07-31 20:08:53 +02:00
Yann Collet	917fe188f1	Implemented repOffset "minus 1" on ll==0	2016-07-31 04:01:57 +02:00
Yann Collet	3b2bd1d11c	zstd_opt uses same tables as zstd_compress	2016-07-30 13:21:41 +02:00
Yann Collet	3c6b808870	minor decompression speed gains	2016-07-30 03:20:47 +02:00
Yann Collet	c0ce4f1211	slightly improved compression speed	2016-07-30 00:55:13 +02:00
Yann Collet	ed57d8530a	new seqStore	2016-07-29 21:22:17 +02:00
Yann Collet	c00d30fbe4	Merge pull request #264 from inikep/dev08 Dev08	2016-07-29 17:42:30 +02:00
inikep	6b68ba2079	zstd_opt.h: fixed checking of rep codes (2)	2016-07-29 16:45:39 +02:00
inikep	59b86fc141	zstd_opt.h: fixed checking of rep codes	2016-07-29 11:00:33 +02:00
Yann Collet	60ba31c570	zbuff uses ZSTD_compressEnd()	2016-07-28 19:55:09 +02:00
Yann Collet	16e73033ad	introduced stage zbf_end	2016-07-28 16:32:34 +02:00
Yann Collet	62470b4bab	Changed ZSTD_compressEnd()	2016-07-28 15:29:08 +02:00
Yann Collet	19c1002e46	applied ZSTD_compressContinueThenEnd()	2016-07-28 01:25:46 +02:00
Yann Collet	5b56739b63	created ZSTD_compressContinueThenEnd()	2016-07-28 01:17:22 +02:00
Yann Collet	c991cc1828	new frame end, 32-bits checksums	2016-07-28 00:55:43 +02:00
Yann Collet	d4180cad9c	minor code refactoring	2016-07-27 21:21:36 +02:00
Yann Collet	731ef16fc1	minor code style refactoring	2016-07-27 21:05:12 +02:00
Yann Collet	4110534886	ZSTD_maxCLevel() is promoted to "stable" API (#254 , by @FrancescAlted)	2016-07-27 15:09:11 +02:00
Yann Collet	c154d9d6a2	better support for large dictionaries (> 128 KB)	2016-07-27 14:37:00 +02:00
inikep	003c7a8568	optimal parser: removed ZSTD_REP_INIT	2016-07-27 11:07:13 +02:00
Eric Biggers	e4d0265ea9	Replace remaining references to "direct mode" with "single segment mode"	2016-07-26 13:22:27 -07:00
Yann Collet	38b75ddeb2	removed special case all-1 huffman distribution	2016-07-24 15:35:59 +02:00
Yann Collet	7ed5e33b89	minor comment changes	2016-07-24 14:26:11 +02:00
Yann Collet	f8e7b5363f	unified encoding types	2016-07-23 16:31:49 +02:00
Yann Collet	c2e1a68d81	changed streamNb order to 1-4-4-4	2016-07-22 17:30:52 +02:00
Yann Collet	32faf6c8e7	fixed conversion warnings	2016-07-22 14:37:09 +02:00
Yann Collet	198e6aac44	Literals header fields use little endian convention	2016-07-22 14:37:09 +02:00
Yann Collet	6fa05a2371	cBlockSize uses little-endian convention	2016-07-22 14:37:09 +02:00
Yann Collet	5894ea8d01	updated cLevels	2016-07-22 14:36:46 +02:00
Yann Collet	d5c5a77990	minor comments clarifications	2016-07-20 13:35:14 +02:00
Yann Collet	cf05b9d477	ZSTD_getBlockSizeMax()	2016-07-18 16:52:10 +02:00
Yann Collet	e557fd5e92	minor compression level corrections	2016-07-17 16:21:37 +02:00
Yann Collet	6cacd34d44	minor formatting changes	2016-07-15 17:58:13 +02:00
Yann Collet	98c8884999	added target zstd in root Makefile	2016-07-15 16:12:38 +02:00
Yann Collet	961b6a0e34	ZSTD_compressBlock() limits block size depending on windowLog parameter	2016-07-15 11:58:49 +02:00
Yann Collet	227cc39e15	improved efficiency for large messages with small dictionaries	2016-07-15 11:27:09 +02:00
Yann Collet	ea2ecdc315	fixed issue with small dictionary	2016-07-14 23:27:31 +02:00
Yann Collet	b23e1ce319	removed debugging traces	2016-07-14 17:46:38 +02:00
Yann Collet	8847238cac	simplified ZSTD_estimateCCtxSize()	2016-07-14 17:05:38 +02:00
Yann Collet	69c2cdb45c	fixed conversion warning	2016-07-14 16:52:45 +02:00
Yann Collet	5e80dd3261	fixed minor coverity warnings	2016-07-13 19:21:57 +02:00
Yann Collet	2b1a3638e6	changed macro name to ZSTDCLI_CLEVEL_DEFAULT	2016-07-13 15:16:00 +02:00
Yann Collet	3c242e79d3	updated compression levels table	2016-07-13 14:56:24 +02:00
Yann Collet	eed2081e55	fixed conversion warning	2016-07-12 15:11:40 +02:00
Yann Collet	a43a854cdb	updated paramgrill	2016-07-12 13:42:10 +02:00
Yann Collet	73d74a05b9	fixed dfast strategy	2016-07-12 13:03:48 +02:00
Yann Collet	45dc35628c	first version of doubleFast	2016-07-12 09:47:31 +02:00
Yann Collet	3ae543ce75	added ZSTD_estimateCCtxSize()	2016-07-11 03:12:17 +02:00
Yann Collet	e09d38e921	removed `mem.h` dependency from `zbuff.h` (experimental section)	2016-07-07 13:17:37 +02:00
Yann Collet	52c04fe58f	removed `mem.h` dependency from `zstd.h` (experimental section)	2016-07-07 11:53:18 +02:00
Yann Collet	d57dffbe76	ZSTD_storeSeq takes an U32 as offset type	2016-07-03 01:48:26 +02:00
Yann Collet	302ff036f6	simplified repcodes for lazy_extDict	2016-07-03 01:28:16 +02:00
Yann Collet	9634f67107	fix lazy parser	2016-07-03 01:23:58 +02:00
Yann Collet	92d75667e4	fix for fast mode	2016-07-03 01:10:53 +02:00
Yann Collet	5e734ad09b	revert fix	2016-07-02 23:55:34 +02:00
Yann Collet	0d5bf8f06f	fixed risk of segfault on very large files (multiple GB)	2016-07-02 21:39:47 +02:00
Yann Collet	2fa9904844	update specification and comments	2016-07-01 20:55:28 +02:00
Yann Collet	c093208ab8	fix : potential leak (#229 )	2016-06-30 14:07:30 +02:00
Yann Collet	6c6e1751f6	use ZSTD_getParams() to simplify code	2016-06-27 15:28:45 +02:00
Yann Collet	3d2cd7f816	Introduced ZSTD_getParams() bench now uses ZSTD_createCDict_advanced()	2016-06-27 15:12:26 +02:00
Yann Collet	d4f4e58ee1	fixed ZSTD_decompressBlock() using multiple blocks	2016-06-27 01:31:35 +02:00
Yann Collet	3755eb8fea	fixed strict-aliasing warning on gcc6	2016-06-22 13:15:53 +02:00
Yann Collet	391a128794	fix : segfault in command line during automatic overwrite protection mode	2016-06-21 17:06:25 +02:00
Yann Collet	bda68c253b	refactored ZBUFF_compressEnd() for better maintainability	2016-06-21 15:18:11 +02:00
Yann Collet	aa29226b7c	fix : ZBUFF_compressEnd() gives right amount remaining to flush, including future epilogue	2016-06-21 14:04:57 +02:00
Yann Collet	f15c1cb00c	Fixed : ZBUFF_compressEnd() called multiple times with too small dst buffer (#206 )	2016-06-21 13:11:48 +02:00
Yann Collet	22d76322ce	minor refactor	2016-06-21 08:01:51 +02:00
Yann Collet	a436a529bc	minor : fast_extDict does no longer skip first byte	2016-06-20 23:34:04 +02:00
Yann Collet	4623d11571	new correction, less extreme replacement value	2016-06-20 19:15:37 +02:00
Yann Collet	5477cc25f7	fixed corruption error related to inter-blocks rep-offset	2016-06-20 18:31:25 +02:00
Yann Collet	06d9a73b48	minor refactor, using `WILDCOPY_OVERLENGTH` macro instead of hard-coded 8	2016-06-19 14:27:21 +02:00
Yann Collet	19cab46f2f	Joined `seqStore` initialization at dispatch point	2016-06-17 12:54:52 +02:00
Yann Collet	23ba41533a	Fixed zstd_opt encoding error with repeat-offsets	2016-06-16 13:20:46 +02:00
Yann Collet	736d419289	strengthened dict loading on decompresson side	2016-06-16 01:05:04 +02:00
Yann Collet	52a0622beb	RepsCodes are saved into Dict (uncomplete : need decompression to regenerate them)	2016-06-16 01:05:04 +02:00
Yann Collet	efd0b4993a	fixed fuzzer error (inter-block repeated offsets)	2016-06-16 00:53:56 +02:00
Yann Collet	d059092897	fixed conversion warnings	2016-06-14 15:34:24 +02:00
Yann Collet	45c03c564f	fixed corruption with inter-blocks repeated offsets	2016-06-14 13:46:11 +02:00
Yann Collet	4266c0a2fd	adding inter-blocks rep-offsets	2016-06-14 01:49:25 +02:00
Yann Collet	43dfe01919	Check `repIndex` for validity	2016-06-13 21:43:06 +02:00
Yann Collet	9dd12742f3	`litBlockType_t` is an `enum`	2016-06-10 00:12:26 +02:00
Yann Collet	302fb53a76	Removed `ZSTD_*_usingPrepared?Ctx()` declaration from public space	2016-06-07 12:16:49 +02:00
Yann Collet	81e13ef7cf	first implementation of the new dictionary API (untested)	2016-06-07 00:51:51 +02:00
Yann Collet	2cc72f1fd3	fixed initialization issue in bench	2016-06-06 17:50:07 +02:00
Yann Collet	e3d529403d	fixed initialization mismatch in `ZSTD_copyCCtx()`	2016-06-06 11:07:33 +02:00
Yann Collet	142acbdea7	fixed minor visual conversion warning	2016-06-06 00:46:56 +02:00
Yann Collet	673f0d7cdc	new frame format, allowing custom window size	2016-06-06 00:26:38 +02:00
Yann Collet	d0e2cd15cb	Merged `fse_static` into `fse.h` . Now requires `FSE_STATIC_LINKING_ONLY` macro.	2016-06-05 00:58:01 +02:00
Yann Collet	130fe11394	merged `huf_static.h` into `huf.h` . Requires `HUF_STATIC_LINKING_ONLY` macro.	2016-06-05 00:42:28 +02:00
Yann Collet	dc048d18d3	minor comment (detailing an `#include` motivation)	2016-06-05 00:32:23 +02:00
Yann Collet	49bb0041af	removed `ZSTD_highbit()` from `zstd_internal.h`, as it is only used by `zstd_compress.c`	2016-06-04 20:17:38 +02:00
Yann Collet	d3b7f8d21f	Merged `zstd_static.h` into `zstd.h` . Now requires `ZSTD_STATIC_LINKING_ONLY` macro	2016-06-04 19:47:02 +02:00
Yann Collet	ac110a1f21	Removed ZBUFF internal util function from public area	2016-06-04 19:16:49 +02:00
Yann Collet	5347aee8f7	merged `zbuff_static.h` into `zbuff.h` . Now requires `ZBUFF_STATIC_LINKING_ONLY` macro	2016-06-04 19:12:48 +02:00
Yann Collet	f4f5affdf7	restore ZBUFF full-block-size, for better performance on small input	2016-06-03 23:09:28 +02:00
inikep	3640396b1a	fixed: deallocation of structures in case of error in ZBUFF_createCCtx and ZBUFF_createDCtx	2016-06-03 16:36:50 +02:00
inikep	3763c77f6b	defaultCustomNULL replaced with defaultCustomMem	2016-06-03 13:28:20 +02:00
inikep	36fac00149	removed calloc calls from lib/	2016-06-03 13:23:04 +02:00
inikep	db2f540414	added defaultCustomNULL	2016-06-03 12:56:56 +02:00
inikep	b74a468fad	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev070' into dev070	2016-06-02 22:09:09 +02:00
inikep	2866951558	opaque parameter for custom memory allocation functions	2016-06-02 13:04:18 +02:00
Yann Collet	70d1301d6e	Changed `ZSTD_adjustCParams()` prototype `ZSTD_adjustCParams()` is now automatically invoked at the end of `ZSTD_getCParams()`	2016-06-01 18:45:34 +02:00
Yann Collet	202844ebd0	fixed zbufftest :	2016-06-01 00:44:36 +02:00
Yann Collet	f2a3b6e7b4	added : frame content checksum	2016-05-31 22:23:45 +02:00
Yann Collet	c46fb924df	added dictionary ID (incomplete)	2016-05-29 05:01:04 +02:00
inikep	02c244bf78	Merge remote-tracking branch 'refs/remotes/Cyan4973/dev' into dev	2016-05-24 17:15:04 +02:00
inikep	fb5df613cf	zstd_stats.h included only in debug mode	2016-05-24 15:36:37 +02:00

... 5 6 7 8 9 ...

666 Commits