AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Yann Collet	9af909bf35	Merge pull request #1624 from facebook/smallwlog Improves compression ratio for small windowLog	2019-06-14 17:28:21 -07:00
Nick Terrell	cdb9481e38	[libzstd] Optimize ZSTD_insertBt1() for repetitive data We would only skip at most 192 bytes at a time before this diff. This was added to optimize long matches and skip the middle of the match. However, it doesn't handle the case of repetitive data. This patch keeps the optimization, but also handles repetitive data by taking the max of the two return values. ``` > for n in $(seq 9); do echo strategy=$n; dd status=none if=/dev/zero bs=1024k count=1000 \| command time -f %U ./zstd --zstd=strategy=$n >/dev/null; done strategy=1 0.27 strategy=2 0.23 strategy=3 0.27 strategy=4 0.43 strategy=5 0.56 strategy=6 0.43 strategy=7 0.34 strategy=8 0.34 strategy=9 0.35 ``` At level 19 with multithreading the compressed size of `silesia.tar` regresses 300 bytes, and `enwik8` regresses 100 bytes. In single threaded mode `enwik8` is also within 100 bytes, and I didn't test `silesia.tar`. Fixes Issue #1634.	2019-06-05 20:34:00 -07:00
Yann Collet	b3af1873a0	better title formatting for html documentation must pay attention to /** and /*! patterns.	2019-06-04 10:35:40 -07:00
Yann Collet	b5c98fbfd0	Added comments on I/O buffer sizes for streaming It seems this is still a confusing topic, as in https://github.com/klauspost/compress/issues/109 .	2019-06-04 10:26:16 -07:00
Yann Collet	80d6ccea79	removed UINT32_MAX apparently not guaranteed on all platforms, replaced by UINT_MAX.	2019-05-31 17:27:07 -07:00
Yann Collet	fce4df3ab7	fixed wrong assert in double_fast	2019-05-31 17:06:28 -07:00
Yann Collet	a968099038	minor code cleaning for new index invalidation strategy	2019-05-31 16:52:37 -07:00
Yann Collet	d605f482c7	make double_fast compatible with new index invalidation strategy	2019-05-31 16:50:04 -07:00
Yann Collet	a30febaeeb	Made fast strategy compatible with new offset validation strategy fast mode does the same thing as before : it pre-emptively invalidates any index that could lead to offset > maxDistance. It's supposed to help speed. But this logic is performed inside zstd_fast, so that other strategies can select a different behavior.	2019-05-31 16:34:55 -07:00
Yann Collet	58adb1059f	extended exact window size to greedy/lazy modes	2019-05-31 16:08:48 -07:00
Yann Collet	bc601bdc6d	first implementation of small window size for btopt noticeably improves compression ratio when window size is small (< 18). enwik7 level 19 windowLog `dev` `smallwlog` improvement 23 3.577 3.577 0.02% 22 3.536 3.538 0.06% 21 3.462 3.467 0.14% 20 3.364 3.377 0.39% 19 3.244 3.272 0.86% 18 3.110 3.166 1.80% 17 2.843 3.057 7.53% 16 2.724 2.943 8.04% 15 2.594 2.822 8.79% 14 2.456 2.686 9.36% 13 2.312 2.523 9.13% 12 2.162 2.361 9.20% 11 2.003 2.182 8.94%	2019-05-31 15:55:12 -07:00
Yann Collet	b13a9207f9	Merge pull request #1623 from facebook/fullbench fullbench minor improvements	2019-05-31 14:40:19 -07:00
Yann Collet	ed38b645db	fullbench: pass proper parameters in scenario 43	2019-05-29 15:26:06 -07:00
Yann Collet	9719fd616c	removed nextToUpdate3 from ZSTD_window it's now a local variable of ZSTD_compressBlock_opt()	2019-05-28 16:18:12 -07:00
Yann Collet	33dabc8c80	get bt matches : made it a bit clearer which parameters are input and output	2019-05-28 16:11:32 -07:00
Yann Collet	327cf6fac1	nextToUpdate3 does not need to be maintained outside of zstd_opt.c It's re-synchronized with nextToUpdate at beginning of each block. It only needs to be tracked from within zstd_opt block parser. Made the logic clear, so that no code tried to maintain this variable. An even better solution would be to make nextToUpdate3 an internal variable of ZSTD_compressBlock_opt_generic(). That would make it possible to remove it from ZSTD_matchState_t, thus restricting its visibility to only where it's actually useful. This would require deeper changes though, since the matchState is the natural structure to transport parameters into and inside the parser.	2019-05-28 15:26:52 -07:00
Yann Collet	6453f8158f	complementary code comments on variables used / impacted during maxDist check	2019-05-28 14:12:16 -07:00
Yann Collet	4baecdf72a	added comments to better understand enforceMaxDist()	2019-05-28 13:15:48 -07:00
Tyler-Tran	cb47871a0a	[dictBuilder] Be more specific than ERROR(generic) (#1616 ) * Specify errors at a finer granularity than `ERROR(generic)`. * Add tests for bad parameters in the dictionary builder.	2019-05-22 18:57:50 -07:00
Nick Terrell	5f228f8db2	[libzstd] Add a ZSTD_STATIC_ASSERT for BIT_DStream_status	2019-04-23 14:22:16 -07:00
Nick Terrell	a892e25374	[libzstd] Error if all sequence bits aren't consumed	2019-04-23 14:07:36 -07:00
Nick Terrell	0fd322f812	[legacy] Fix ZSTDv0_decodeSequence() Version <= 0.5 could read beyond the end of `dumps`, which points into the input buffer. * Check the validity of `dumps` before using it, if it is out of bounds return garbage values. There is no return code for this function. * Introduce `MEM_readLE24()` for simplicity, since I don't want to trust that there is an extra byte after `dumps`.	2019-04-19 11:34:52 -07:00
Nick Terrell	2536771134	[legacy] Fix Huffman jump table reads in v01 and v05	2019-04-18 16:20:42 -07:00
Nick Terrell	579f3d7794	[legacy] Fix bug in ZSTD_decodeSeqHeaders()	2019-04-18 13:41:10 -07:00
Nick Terrell	ac098c7f5f	[legacy] Fix a bug in ZSTDv06_findFrameSizeInfoLegacy()	2019-04-18 13:33:26 -07:00
Nick Terrell	ee130a9889	[libzstd] Check the size in readSkippableFrameSize()	2019-04-17 11:41:55 -07:00
Nick Terrell	5922f4e2ae	[legacy] Return the right error code	2019-04-17 11:34:52 -07:00
Nick Terrell	450feb0f95	[libzstd] Fix ZSTD_decompressBound() on bad skippable frames The function didn't verify that the skippable frame size is correct.	2019-04-17 11:29:42 -07:00
Nick Terrell	a17fe4c9e5	[visual] Fix unreachable code warning	2019-04-16 11:32:35 -07:00
Nick Terrell	de0499f7fa	[libzstd] Require ZSTD_MULTITHREAD to create a ZSTDMT_CCtx ZSTDMT was broken when compiled without ZSTD_MULTITHREAD defined, because `ZSTD_CCtx_setParameter(cctx, ZSTD_c_nbWorkers, nbWorkerss)` failed. It was detected by the MSVC test which runs the fuzzer with multithreading disabled. This is a very niche use case of a deprecated API, because the API is inefficient and synchronous, since `threading.h` will be synchronous. Users almost certainly don't want this, and anyone who tested their code should realize that it is broken. Therefore, I think it is safe to require `ZSTD_MULTITHREAD` to be defined to use ZSTDMT.	2019-04-15 23:04:46 -07:00
Josh Soref	a880ca239b	Spelling (#1582 ) * spelling: accidentally * spelling: across * spelling: additionally * spelling: addresses * spelling: appropriate * spelling: assumed * spelling: available * spelling: builder * spelling: capacity * spelling: compiler * spelling: compressibility * spelling: compressor * spelling: compression * spelling: contract * spelling: convenience * spelling: decompress * spelling: description * spelling: deflate * spelling: deterministically * spelling: dictionary * spelling: display * spelling: eliminate * spelling: preemptively * spelling: exclude * spelling: failure * spelling: independence * spelling: independent * spelling: intentionally * spelling: matching * spelling: maximum * spelling: meaning * spelling: mishandled * spelling: memory * spelling: occasionally * spelling: occurrence * spelling: official * spelling: offsets * spelling: original * spelling: output * spelling: overflow * spelling: overridden * spelling: parameter * spelling: performance * spelling: probability * spelling: receives * spelling: redundant * spelling: recompression * spelling: resources * spelling: sanity * spelling: segment * spelling: series * spelling: specified * spelling: specify * spelling: subtracted * spelling: successful * spelling: return * spelling: translation * spelling: update * spelling: unrelated * spelling: useless * spelling: variables * spelling: variety * spelling: verbatim * spelling: verification * spelling: visited * spelling: warming * spelling: workers * spelling: with	2019-04-12 11:18:11 -07:00
Nick Terrell	aafe97b67d	[libzstd] Switch dictUses to an enum	2019-04-10 16:50:35 -07:00
Nick Terrell	50b9c41196	[libzstd] Fix decompression dictionary bugs and clean up initialization Bugs: * `ZSTD_DCtx_refPrefix()` didn't clear the dictionary after the first use. Fix and add a test case. * `ZSTD_DCtx_reset()` always cleared the dictionary. Fix and add a test case. * After calling `ZSTD_resetDStream()` you could no longer load a dictionary, since the stage was set to `zdss_loadHeader`. Fix and add a test case. Cleanup: * Make `ZSTD_initDStream()` and `ZSTD_resetDStream()` wrap the new advanced API, and add test cases. Document the equivalent of these functions in the advanced API and document the unstable functions as deprecated.	2019-04-10 12:59:02 -07:00
Nick Terrell	824aaa695f	[libzstd] Fix ZSTD_decompressDCtx() with a dictionary * `ZSTD_decompressDCtx()` did not use the dictionary loaded by `ZSTD_DCtx_loadDictionary()`. * Add a unit test. * A stacked diff uses `ZSTD_decompressDCtx()` in the `dictionary_round_trip` and `dictionary_decompress` fuzzers.	2019-04-09 17:59:27 -07:00
Nick Terrell	48a6427d22	[libzstd] Fix ZSTD_compress2() for multithreaded compression `ZSTD_compress2()` wouldn't wait for multithreaded compression to finish. We didn't find this because ZSTDMT will block when it can compress all in one go, but it can't do that if it doesn't have enough output space, or if `ZSTD_c_rsyncable` is enabled. Since we will already sometimes block when using `ZSTD_e_end`, I've changed `ZSTD_e_end` and `ZSTD_e_flush` to guarantee maximum forward progress. This simplifies the API, and helps users avoid the easy bug that was made in `ZSTD_compress2()` * Found by the libfuzzer fuzzers. * Added a test case that catches the problem. * I will make the fuzzers sometimes allocate less than `ZSTD_compressBound()` output space.	2019-04-09 16:24:17 -07:00
Nick Terrell	e649fad7aa	[dictBuilder] Fix displayLevel for corpus warning Pass the displaylevel into the corpus warning, because it is used in fast cover and cover, so it needs to respect the local level.	2019-04-08 20:00:18 -07:00
Nick Terrell	bfcd5b81d7	[libzstd] Don't check the dictID in fuzzing mode When `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` is defined don't check the dictID. This check makes the fuzzers job harder, and it is at the very beginning.	2019-04-08 19:57:41 -07:00
Nick Terrell	947548c24f	Remove double the from README	2019-04-08 16:50:18 -07:00
Nick Terrell	641e594309	[libzstd] Remove ZSTDMT from the shared object * Remove ZSTDMT from the shared object by default. * Provide a macro `ZSTD_LEGACY_MULTITHREADED_API` to override it. * Document it in `lib/README.md`.	2019-04-07 18:47:52 -07:00
Nick Terrell	1dfe37fea9	[libzstd] Stabilize ZSTD_getDictID_*() functions	2019-04-05 18:59:30 -07:00
Nick Terrell	ce388fe4d2	[libzstd] Fix return value docs for ZSTD_compressStream2()	2019-04-05 17:44:07 -07:00
Nick Terrell	7231ea72a8	[libzstd] Reword the streaming docs for the new API	2019-04-03 19:21:05 -07:00
Nick Terrell	cf7d601bf5	Move the dictionary API and mark the legacy API * Move the dictionary API below the streaming API * Mark the legacy streaming API as redundant	2019-04-03 19:16:40 -07:00
Nick Terrell	d7d89513d6	Stabilize advance API This commit moves the candidate advanced API to the stable section. It makes some minor whitespace changes, but it doesn't change any of the wording of the documentation. I'll put up a separate PR that tweaks some of the documentation once this lands, so that it is easier to review. NOTE: Even though these functions are now in stable, they aren't stable until the next release (in under 1 month). It is possible that they change until then.	2019-04-03 18:43:20 -07:00
Nick Terrell	0827edeace	[libzstd] Bump the library version to 1.4.0 Bumps the library version to 1.4.0 in preparation to stabilize the advanced API.	2019-04-03 18:43:20 -07:00
Nick Terrell	72a3fbc0e4	Merge pull request #1562 from terrelln/2fast [libzstd] Speed up single segment zstd_fast by 5%	2019-04-03 18:08:15 -07:00
Nick Terrell	00679da22b	[libzstd] Setting ZSTD_d_maxWindowLog to 0 means default	2019-04-02 19:20:52 -07:00
Nick Terrell	95624b77e4	[libzstd] Speed up single segment zstd_fast by 5% This PR is based on top of PR #1563. The optimization is to process two input pointers per loop. It is based on ideas from [igzip] level 1, and talking to @gbtucker. \| Platform \| Silesia \| Enwik8 \| \|-------------------------\|-------------\|--------\| \| OSX clang-10 \| +5.3% \| +5.4% \| \| i9 5 GHz gcc-8 \| +6.6% \| +6.6% \| \| i9 5 GHz clang-7 \| +8.0% \| +8.0% \| \| Skylake 2.4 GHz gcc-4.8 \| +6.3% \| +7.9% \| \| Skylake 2.4 GHz clang-7 \| +6.2% \| +7.5% \| Testing on all Silesia files on my Intel i9-9900k with gcc-8 \| Silesia File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| silesia.tar \| +0.17% \| +6.6% \| \| dickens \| +0.25% \| +7.0% \| \| mozilla \| +0.02% \| +6.8% \| \| mr \| -0.30% \| +10.9% \| \| nci \| +1.28% \| +4.5% \| \| ooffice \| -0.35% \| +10.7% \| \| osdb \| +0.75% \| +9.8% \| \| reymont \| +0.65% \| +4.6% \| \| samba \| +0.70% \| +5.9% \| \| sao \| -0.01% \| +14.0% \| \| webster \| +0.30% \| +5.5% \| \| xml \| +0.92% \| +5.3% \| \| x-ray \| -0.00% \| +1.4% \| Same tests on Calgary. For brevity, I've only included files where compression ratio regressed or was much better. \| Calgary File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| calgary.tar \| +0.30% \| +7.1% \| \| geo \| -0.14% \| +25.0% \| \| obj1 \| -0.46% \| +15.2% \| \| obj2 \| -0.18% \| +6.0% \| \| pic \| +1.80% \| +9.3% \| \| trans \| -0.35% \| +5.5% \| We gain 0.1% of compression ratio on Silesia. We gain 0.3% of compression ratio on enwik8. I also tested on the GitHub and hg-commands datasets without a dictionary, and we gain a small amount of compression ratio on each, as well as speed. I tested the negative compression levels on Silesia on my Intel i9-9900k with gcc-8: \| Level \| Ratio Change \| Speed Change \| \|-------\|--------------\|--------------\| \| -1 \| +0.13% \| +6.4% \| \| -2 \| +4.6% \| -1.5% \| \| -3 \| +7.5% \| -4.8% \| \| -4 \| +8.5% \| -6.9% \| \| -5 \| +9.1% \| -9.1% \| Roughly, the negative levels now scale half as quickly. E.g. the new level 16 is roughly equivalent to the old level 8, but a bit quicker and smaller. If you don't think this is the right trade off, we can change it to multiply the step size by 2, instead of adding 1. I think this makes sense, because it gives a bit slower ratio decay. [igzip]: https://github.com/01org/isa-l/tree/master/igzip	2019-04-02 19:02:50 -07:00
Nick Terrell	56682a7709	Fix ZSTD_estimateCStreamSize_usingCCtxParams() It wasn't using the ZSTD_CCtx_params correctly. It must actualize the compression parameters by calling ZSTD_getCParamsFromCCtxParams() to get the real window log. Tested by updating the streaming memory usage example in the next commit. The CHECK() failed before this patch, and passes after. I also added a unit test to zstreamtest.c that failed before this patch, and passes after.	2019-04-01 18:02:52 -07:00
Nick Terrell	425ce5547c	Merge pull request #1563 from terrelln/dms-sep [libzstd] Split out zstd_fast dict match state function	2019-03-29 16:19:21 -06:00
Nick Terrell	f00407b640	Split out zstd_fast dict match state function	2019-03-29 10:39:16 -06:00
shakeelrao	dca73db30c	fix srcSize typo and add new UTIL func to comment	2019-03-28 17:50:34 -07:00
Nick Terrell	d0f5ba36fb	[cover] Improvements for small or homogeneous data * The algorithm would bail as soon as it found one epoch that contained no new segments. Change it so it now has to fail >= 10 times in a row (10 for fastcover, 10-100 for cover). * The algorithm uses the `maxDict` size to decide the epoch size. When this size is absurdly large, it causes tiny epochs. Lower bound the epoch size at 10x the segment size, and warn the user that their training set is too small. Fixes #1554	2019-03-22 14:14:46 -07:00
Nick Terrell	6b053b9f60	[lib] Allow ZSTD_CCtx_loadDictionary() to be called before parameters are set * After loading a dictionary only create the cdict once we've started the compression job. This allows the user to pass the dictionary before they set other settings, and is in line with the rest of the API. * Add tests that mix the 3 dictionary loading APIs. * Add extra tests for `ZSTD_CCtx_loadDictionary()`. * The first 2 tests added fail before this patch. * Run the regression test suite.	2019-03-21 16:13:53 -07:00
Nick Terrell	20f9ff7e53	Update documentation to tell how to replace the old streaming API with the new one.	2019-03-21 16:08:58 -07:00
Nick Terrell	e55da9e963	Wrap the new advanced api completely	2019-03-21 10:54:40 -07:00
shakeelrao	186ded6d91	Fix typo in legacy documentation	2019-03-19 01:44:08 -07:00
shakeelrao	5740eb6769	Remove extraneous spacing in comments	2019-03-18 21:05:35 -07:00
shakeelrao	0a3fa6f909	Add legacy mode in documentation	2019-03-18 20:33:15 -07:00
shakeelrao	20aa1b455c	Stylistic changes	2019-03-17 19:35:43 -07:00
shakeelrao	0033bb4785	Update documentation for ZSTD_frameSizeInfo	2019-03-17 17:41:27 -07:00
shakeelrao	19b75b6ecb	Test new ZSTD_findFrameCompressedSize and update documentation	2019-03-15 18:04:19 -07:00
shakeelrao	8cd423a659	Reorder declaration in ZSTD_findFrameSizeInfoLegacy	2019-03-15 16:20:34 -07:00
shakeelrao	60796e76b0	Add legacy support to decompressBound	2019-03-15 16:10:37 -07:00
Nick Terrell	f52a7d8faa	Merge pull request #1547 from shakeelrao/fix-error Fix incorrect error code in ZSTD_errorFrameSizeInfo	2019-03-15 10:57:49 -07:00
Nick Terrell	787b76904a	[libzstd] Allow compression parameters to be set with a cdict The order you set parameters in the advanced API is not supposed to matter. However, once you call `ZSTD_CCtx_refCDict()` the compression parameters cannot be changed. Remove that restriction, and document what parameters are used when using a CDict. If the CCtx is in dictionary mode, then the CDict's parameters are used. If the CCtx is not in dictionary mode, then its requested parameters are used.	2019-03-13 16:10:05 -07:00
Nick Terrell	0594e8135b	[libzstd] Free local cdict when referencing cdict We no longer care about the `cdictLocal` after calling `ZSTD_CCtx_refCDict()`, so we should free it to save some memory.	2019-03-13 14:54:31 -07:00
shakeelrao	79827a179f	Fix incorrectly assigned value in ZSTD_errorFrameSizeInfo As documented in `zstd.h`, ZSTD_decompressBound returns `ZSTD_CONTENTSIZE_ERROR` if an error occurs (not `ZSTD_CONTENTSIZE_UNKNOWN`). This is consistent with the error checking made in ZSTD_decompressBound, particularly line 545.	2019-03-13 01:23:07 -07:00
shakeelrao	9ad3f31d33	update documentation for decompressBound	2019-03-02 17:56:10 -08:00
shakeelrao	95dfd48143	update formatting	2019-03-01 23:11:15 -08:00
shakeelrao	1e08c49f75	add stylistic changes	2019-03-01 18:29:35 -08:00
shakeelrao	2bb5eec711	update missing error case to CONTENTSIZE_ERROR	2019-03-01 00:12:16 -08:00
shakeelrao	44ae395b3e	change nbBlocks to size_t for consistency	2019-03-01 00:05:59 -08:00
shakeelrao	03026c3b1d	change compressedBound to ULL	2019-03-01 00:03:50 -08:00
shakeelrao	8930c3c79b	implement API-level changes	2019-02-28 22:55:18 -08:00
shakeelrao	dce9a09772	initialize local vars in decompressBound	2019-02-28 03:01:21 -08:00
shakeelrao	515c506b4c	switch frameBound type to ULL	2019-02-28 02:10:17 -08:00
shakeelrao	d0a3f25697	change return type to ULL	2019-02-28 01:52:01 -08:00
shakeelrao	c9d674b60d	Remove autogenerated test file	2019-02-28 01:29:04 -08:00
shakeelrao	97d3d28dab	Fix decl-after-stmnt build error	2019-02-28 01:24:54 -08:00
shakeelrao	820af1e078	Provide an API function to estimate decompressed size. Introduces a new utility function `ZSTD_findFrameCompressedSize_internal` which is equivalent to `ZSTD_findFrameCompressSize`, but accepts an additional output parameter `bound` that computes an upper-bound for the compressed data in the frame. The new API function is named `ZSTD_decompressBound` to be consistent with `zstd_compressBound` (the inverse operation). Clients will now be able to compute an upper-bound for their compressed payloads instead of guessing a large size. Implements https://github.com/facebook/zstd/issues/1536.	2019-02-28 00:42:49 -08:00
Nick Terrell	be3bd70c57	Merge pull request #1532 from terrelln/cctx-params [libzstd] Rename ZSTD_CCtxParam_* to ZSTD_CCtxParams_*	2019-02-20 10:46:46 -08:00
Nick Terrell	7ad7ba3178	[libzstd] Rename ZSTD_CCtxParam_* to ZSTD_CCtxParams_*	2019-02-19 17:44:52 -08:00
Nick Terrell	9f9630f455	[Windows] Don't use a .def file	2019-02-19 16:52:38 -08:00
Nick Terrell	0c86d23467	[Windows] Move public headers to include/	2019-02-19 15:49:48 -08:00
Nick Terrell	f4abba02ba	[libzstd] Clean up parameter code * Move all ZSTDMT parameter setting code to ZSTD_CCtxParams_Parameter(). ZSTDMT now calls these functions, so we can keep all the logic in the same place. Clean up `ZSTD_CCtx_setParameter()` to only add extra checks where needed. * Clean up `ZSTDMT_initJobCCtxParams()` by copying all parameters by default, and then zeroing the ones that need to be zeroed. We've missed adding several parameters here, and it makes more sense to only have to update it if you change something in ZSTDMT. * Add `ZSTDMT_cParam_clampBounds()` to clamp a parameter into its valid range. Use this to keep backwards compatibility when setting ZSTDMT parameters, which clamp into the valid range.	2019-02-19 13:22:37 -08:00
Nick Terrell	3d7377b874	[libzstd] Handle uncompressed literals	2019-02-15 14:58:11 -08:00
Nick Terrell	f9513115e4	[libzstd] Add ZSTD_c_literalCompressionMode flag It controls the literals compression. It is either `auto`, `huffman`, or `uncompressed`. It defaults to `auto`, which is the current behavior.	2019-02-13 14:59:22 -08:00
Nick Terrell	197a5737c8	Merge pull request #1516 from terrelln/dict-doc [zdict] Improve documentation	2019-02-01 19:04:05 -05:00
Nick Terrell	21616d8a77	[zdict] Improve documentation	2019-02-01 15:19:32 -08:00
Peter (Stig) Edwards	894bbda44c	-Wformat-security not needed with -Wformat=2	2019-02-01 09:31:02 +00:00
W. Felix Handte	501eb25102	Rename FORWARD_ERROR -> FORWARD_IF_ERROR	2019-01-29 12:56:07 -05:00
W. Felix Handte	429987c9a6	Add Comment	2019-01-28 17:35:31 -05:00
W. Felix Handte	2179ce00e1	Remove CHECK_E Macro	2019-01-28 17:33:13 -05:00
W. Felix Handte	03e040a966	Replace Uses of CHECK_E with RETURN_ERROR_IF(*_isError(...	2019-01-28 17:33:01 -05:00
W. Felix Handte	7ebd897157	Remove CHECK_F Macro	2019-01-28 17:16:32 -05:00
W. Felix Handte	64bb6640f2	Replace CHECK_F Uses in zstdmt_compress.c and zstd_ddict.c	2019-01-28 17:15:57 -05:00
W. Felix Handte	cafc3b1bcb	Also Convert zstd_compress.c	2019-01-28 17:05:18 -05:00
W. Felix Handte	324e9654d3	Add grep-able String to Error Macros	2019-01-28 12:50:36 -05:00
W. Felix Handte	32fed9c7be	Switch CHECK_F Calls to FORWARD_ERROR	2019-01-28 12:45:34 -05:00
W. Felix Handte	800c87fed0	Switch Unconditional RETURN_ERROR_IF Calls to RETURN_ERROR	2019-01-28 12:45:34 -05:00
W. Felix Handte	a3538bbc6f	Add RETURN_ERROR and FORWARD_ERROR Macros	2019-01-28 12:45:26 -05:00
W. Felix Handte	c823237d7b	Convert Checks in zstd_decompress.c to RETURN_ERROR_IF	2019-01-28 12:23:14 -05:00
W. Felix Handte	ea031f4ea2	Convert Checks in zstd_decompress_block.c to RETURN_ERROR_IF	2019-01-28 11:56:39 -05:00
W. Felix Handte	54fa31f03b	Add RETURN_ERROR_IF Macro That Logs Debug Information When Check Fails	2019-01-28 11:43:33 -05:00
Yann Collet	f9e4f89252	improved comments for adjustCParams() and getCParams()	2019-01-02 12:18:40 -08:00
Yann Collet	0fb4b21d1a	updated libzstd documentation	2018-12-25 03:10:07 -08:00
Yann Collet	e980ba212f	Merge pull request #1471 from facebook/nofloat guard functions using floating point for debug mode only	2018-12-23 12:35:51 -08:00
Yann Collet	aae5bc538a	Merge pull request #1470 from facebook/U32 fix confusion between unsigned <-> U32	2018-12-23 12:35:39 -08:00
Yann Collet	c9dfb7e445	guard functions using floating point for debug mode only they are only used to print debug messages. Requested in #1386,	2018-12-22 09:09:40 -08:00
Yann Collet	ededcfca57	fix confusion between unsigned <-> U32 as suggested in #1441. generally U32 and unsigned are the same thing, except when they are not ... case : 32-bit compilation for MIPS (uint32_t == unsigned long) A vast majority of transformation consists in transforming U32 into unsigned. In rare cases, it's the other way around (typically for internal code, such as seeds). Among a few issues this patches solves : - some parameters were declared with type `unsigned` in .h, but with type `U32` in their implementation .c . - some parameters have type unsigned*, but the caller user a pointer to U32 instead. These fixes are useful. However, the bulk of changes is about %u formating, which requires unsigned type, but generally receives U32 values instead, often just for brevity (U32 is shorter than unsigned). These changes are generally minor, or even annoying. As a consequence, the amount of code changed is larger than I would expect for such a patch. Testing is also a pain : it requires manually modifying `mem.h`, in order to lie about `U32` and force it to be an `unsigned long` typically. On a 64-bit system, this will break the equivalence unsigned == U32. Unfortunately, it will also break a few static_assert(), controlling structure sizes. So it also requires modifying `debug.h` to make `static_assert()` a noop. And then reverting these changes. So it's inconvenient, and as a consequence, this property is currently not checked during CI tests. Therefore, these problems can emerge again in the future. I wonder if it is worth ensuring proper distinction of U32 != unsigned in CI tests. It's another restriction for coding, adding more frustration during merge tests, since most platforms don't need this distinction (hence contributor will not see it), and while this can matter in theory, the number of platforms impacted seems minimal. Thoughts ?	2018-12-21 18:09:41 -08:00
Yann Collet	c8d1fda982	update aarch64 test to xenial in an attempt to circumvent the `ld` bug	2018-12-21 15:08:48 -08:00
Yann Collet	8f35c7f94c	Merge pull request #1466 from facebook/noDictPresent fixed : better error message	2018-12-20 19:01:27 -08:00
Yann Collet	41b45b84a1	Merge pull request #1465 from facebook/noFilePresent fixed : detection of non-existing file	2018-12-20 17:21:04 -08:00
Yann Collet	ed2fb6bd57	fixed : better error message when dictionary missing during benchmark. Also : refactored ZSTD_fillHashTable(), just for readability (it does the same thing)	2018-12-20 17:20:07 -08:00
Yann Collet	e4ae24c229	Merge pull request #1420 from felixhandte/zstd-decompress-minimal Various Macros to Allow Building Extremely Minimal Decoder Library	2018-12-20 15:17:37 -08:00
Yann Collet	95784c654c	fixed shadowing of stat variable some standard lib declares a `stat` variable at global scope shadowing local declarations ....	2018-12-20 14:56:44 -08:00
Yann Collet	ffba142406	fixed file identity detection in 32-bit mode also : some library decided to use `index` as a global variable declared in standard header shadowing the ones used in fastcover.c :(	2018-12-20 14:30:30 -08:00
W. Felix Handte	91b7309115	Mask Off Unused Functions When ZSTD_FORCE_DECOMPRESS_SEQUENCES_LONG	2018-12-20 12:20:34 -08:00
W. Felix Handte	038aabde28	Mask Off Unused Functions When ZSTD_FORCE_DECOMPRESS_SEQUENCES_SHORT	2018-12-20 12:15:07 -08:00
Yann Collet	2898afab52	fixed OSSfuzz 11849 The problem was already masked, due to no longer accepting tiny blocks for statistics. But in case it could still happen with not-so-tiny blocks, there is a stricter control which ensures that nothing was already loaded prior to statistics collection.	2018-12-19 16:54:15 -08:00
W. Felix Handte	8e61ac8161	Use Unused Variable in ERR_getErrorString()	2018-12-19 12:36:10 -08:00
Yann Collet	8e0e495ce8	fixed: compression ratio discrepancy depending on initialization, the first byte of a new frame was invalidated or not. As a consequence, one match opportunity was available or not, resulting in slightly different compressed sizes (on average, 1 or 2 bytes once every 20 frames). It impacted ratio comparison between one-shot and streaming modes. This fix makes the first byte of a new frame always a valid match. Now compressed size is always the same. It also improves compressed size by a negligible amount.	2018-12-19 10:11:06 -08:00
Yann Collet	d0e15f8d32	Merge pull request #1458 from terrelln/estimate [libzstd] Fix estimate with negative levels	2018-12-18 15:12:21 -08:00
Yann Collet	04baecaeed	Merge pull request #1457 from facebook/btultra2.1 btultra2 and very small input	2018-12-18 14:46:55 -08:00
Nick Terrell	d7def456d8	[libzstd] Fix estimate with negative levels * Fix `ZSTD_estimateCCtxSize()` with negative levels. * Fix `ZSTD_estimateCStreamSize()` with negative levels. * Add a unit test to test for this error.	2018-12-18 14:24:49 -08:00
Yann Collet	ef984e7307	fix debug levels as reported by @terrelln. 2 is reserved for temporary usage only.	2018-12-18 13:40:07 -08:00
W. Felix Handte	0d606ee3db	Fix Incorrect assert()	2018-12-18 13:36:39 -08:00
W. Felix Handte	bd4afc389f	Add Logic to Makefile to Convert Make Vars to Defines	2018-12-18 13:36:39 -08:00
W. Felix Handte	ece2c18372	Document Macros in README	2018-12-18 13:36:39 -08:00
W. Felix Handte	c2d51637d9	Add Mutual-Exclusion Error	2018-12-18 13:36:39 -08:00
W. Felix Handte	c560e34c86	Add HUF_FORCE_DECOMPRESS_X2	2018-12-18 13:36:39 -08:00
W. Felix Handte	abd1567d3c	Move HUF_DGEN Up Out of X1 Definitions	2018-12-18 13:36:39 -08:00
W. Felix Handte	4a0572b215	Refactor Huffman Decompression Away From Ternary Tree in ZSTD_decodeLiteralsBlock	2018-12-18 13:36:39 -08:00
W. Felix Handte	432314b58a	Rename HUF_DECOMPRESS_MINIMAL -> HUF_FORCE_DECOMPRESS_X1	2018-12-18 13:36:39 -08:00
W. Felix Handte	4bbb8a48ad	Add ZSTD_FORCE_DECOMPRESS_SEQUENCES_LONG This macro forces behavior in the opposite direction.	2018-12-18 13:36:39 -08:00
W. Felix Handte	64553a0e35	Rename ZSTD_DECOMPRESS_MINIMAL -> ZSTD_FORCE_DECOMPRESS_SEQUENCES_SHORT	2018-12-18 13:36:39 -08:00
W. Felix Handte	605dd576ee	Remove Error Strings with ZSTD_STRIP_ERROR_STRINGS	2018-12-18 13:36:39 -08:00
W. Felix Handte	9d5f3963ff	Add Option to Not Request Inlining with ZSTD_NO_INLINE	2018-12-18 13:36:39 -08:00
W. Felix Handte	df28e5babd	Add ZSTD_DECOMPRESS_MINIMAL Macro, Which Reduces Branching of Decompress Variants	2018-12-18 13:36:39 -08:00
W. Felix Handte	f45c9df42e	Totally Hide/Disable X2 Variants when HUF_DECOMPRESS_MINIMAL is Defined	2018-12-18 13:36:39 -08:00
W. Felix Handte	36a84b07a8	Load Dictionaries as X1 Tables	2018-12-18 13:36:39 -08:00
W. Felix Handte	f9cb348776	Add HUF_DECOMPRESS_MINIMAL Macro, Which Avoids Using X2 Variants	2018-12-18 13:36:39 -08:00
Yann Collet	635783da12	btultra2 and very small srcSize When srcSize is small, the nb of symbols produced is likely too small to warrant dedicated probability tables. In which case, predefined distribution tables will be used instead. There is a cheap algorithm in btultra initialization : it presumes default distribution will be used if srcSize <= 1024. btultra2 now uses the same threshold to shut down probability estimation, since measured frequencies won't be used at entropy stage, and therefore relying on them to determine sequence cost is misleading, resulting in worse compression ratios. This fixes btultra2 performance issue on very small input. Note that, a proper way should be to determine which symbol is going to use predefined probaility and which symbol is going to use dynamic ones. But the current algorithm is unable to make a "per-symbol" decision. So this will require significant modifications.	2018-12-18 12:32:58 -08:00
Yann Collet	517d8c984c	Merge pull request #1449 from facebook/ovlog_def overlapLog default values	2018-12-18 09:45:53 -08:00
Yann Collet	373ff8b983	play around with rescale weights	2018-12-17 15:48:34 -08:00
Yann Collet	8be145a8c1	fixed default job size	2018-12-13 16:38:08 -08:00
Nick Terrell	75fa3f2eb7	Merge pull request #1446 from terrelln/overflow [libzstd] Fix infinite loop in decompression	2018-12-13 16:21:15 -08:00
Yann Collet	62180b27d5	zstdmt parameter getter/setter use `int`	2018-12-13 15:47:34 -08:00
Nick Terrell	aaea4ef924	[libzstd] Fix infinite loop in decompression When we switched `ZSTD_SKIPPABLEHEADERSIZE` to a macro, the places where we do: MEM_readLE32(ptr) + ZSTD_SKIPPABLEHEADERSIZE can now overflow `(unsigned)-8` to `0` and we infinite loop. We now check the frame size and reject sizes that overflow a U32. Note that this bug never made it into a release, and was only in the dev branch for a few days. Credit to OSS-Fuzz	2018-12-13 15:13:19 -08:00

1 2 3 4 5 ...

2929 Commits