AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Nick Terrell	824aaa695f	[libzstd] Fix ZSTD_decompressDCtx() with a dictionary * `ZSTD_decompressDCtx()` did not use the dictionary loaded by `ZSTD_DCtx_loadDictionary()`. * Add a unit test. * A stacked diff uses `ZSTD_decompressDCtx()` in the `dictionary_round_trip` and `dictionary_decompress` fuzzers.	2019-04-09 17:59:27 -07:00
Nick Terrell	48a6427d22	[libzstd] Fix ZSTD_compress2() for multithreaded compression `ZSTD_compress2()` wouldn't wait for multithreaded compression to finish. We didn't find this because ZSTDMT will block when it can compress all in one go, but it can't do that if it doesn't have enough output space, or if `ZSTD_c_rsyncable` is enabled. Since we will already sometimes block when using `ZSTD_e_end`, I've changed `ZSTD_e_end` and `ZSTD_e_flush` to guarantee maximum forward progress. This simplifies the API, and helps users avoid the easy bug that was made in `ZSTD_compress2()` * Found by the libfuzzer fuzzers. * Added a test case that catches the problem. * I will make the fuzzers sometimes allocate less than `ZSTD_compressBound()` output space.	2019-04-09 16:24:17 -07:00
Nick Terrell	e649fad7aa	[dictBuilder] Fix displayLevel for corpus warning Pass the displaylevel into the corpus warning, because it is used in fast cover and cover, so it needs to respect the local level.	2019-04-08 20:00:18 -07:00
Nick Terrell	bfcd5b81d7	[libzstd] Don't check the dictID in fuzzing mode When `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` is defined don't check the dictID. This check makes the fuzzers job harder, and it is at the very beginning.	2019-04-08 19:57:41 -07:00
Nick Terrell	947548c24f	Remove double the from README	2019-04-08 16:50:18 -07:00
Nick Terrell	641e594309	[libzstd] Remove ZSTDMT from the shared object * Remove ZSTDMT from the shared object by default. * Provide a macro `ZSTD_LEGACY_MULTITHREADED_API` to override it. * Document it in `lib/README.md`.	2019-04-07 18:47:52 -07:00
Nick Terrell	1dfe37fea9	[libzstd] Stabilize ZSTD_getDictID_*() functions	2019-04-05 18:59:30 -07:00
Nick Terrell	ce388fe4d2	[libzstd] Fix return value docs for ZSTD_compressStream2()	2019-04-05 17:44:07 -07:00
Nick Terrell	7231ea72a8	[libzstd] Reword the streaming docs for the new API	2019-04-03 19:21:05 -07:00
Nick Terrell	cf7d601bf5	Move the dictionary API and mark the legacy API * Move the dictionary API below the streaming API * Mark the legacy streaming API as redundant	2019-04-03 19:16:40 -07:00
Nick Terrell	d7d89513d6	Stabilize advance API This commit moves the candidate advanced API to the stable section. It makes some minor whitespace changes, but it doesn't change any of the wording of the documentation. I'll put up a separate PR that tweaks some of the documentation once this lands, so that it is easier to review. NOTE: Even though these functions are now in stable, they aren't stable until the next release (in under 1 month). It is possible that they change until then.	2019-04-03 18:43:20 -07:00
Nick Terrell	0827edeace	[libzstd] Bump the library version to 1.4.0 Bumps the library version to 1.4.0 in preparation to stabilize the advanced API.	2019-04-03 18:43:20 -07:00
Nick Terrell	72a3fbc0e4	Merge pull request #1562 from terrelln/2fast [libzstd] Speed up single segment zstd_fast by 5%	2019-04-03 18:08:15 -07:00
Nick Terrell	00679da22b	[libzstd] Setting ZSTD_d_maxWindowLog to 0 means default	2019-04-02 19:20:52 -07:00
Nick Terrell	95624b77e4	[libzstd] Speed up single segment zstd_fast by 5% This PR is based on top of PR #1563. The optimization is to process two input pointers per loop. It is based on ideas from [igzip] level 1, and talking to @gbtucker. \| Platform \| Silesia \| Enwik8 \| \|-------------------------\|-------------\|--------\| \| OSX clang-10 \| +5.3% \| +5.4% \| \| i9 5 GHz gcc-8 \| +6.6% \| +6.6% \| \| i9 5 GHz clang-7 \| +8.0% \| +8.0% \| \| Skylake 2.4 GHz gcc-4.8 \| +6.3% \| +7.9% \| \| Skylake 2.4 GHz clang-7 \| +6.2% \| +7.5% \| Testing on all Silesia files on my Intel i9-9900k with gcc-8 \| Silesia File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| silesia.tar \| +0.17% \| +6.6% \| \| dickens \| +0.25% \| +7.0% \| \| mozilla \| +0.02% \| +6.8% \| \| mr \| -0.30% \| +10.9% \| \| nci \| +1.28% \| +4.5% \| \| ooffice \| -0.35% \| +10.7% \| \| osdb \| +0.75% \| +9.8% \| \| reymont \| +0.65% \| +4.6% \| \| samba \| +0.70% \| +5.9% \| \| sao \| -0.01% \| +14.0% \| \| webster \| +0.30% \| +5.5% \| \| xml \| +0.92% \| +5.3% \| \| x-ray \| -0.00% \| +1.4% \| Same tests on Calgary. For brevity, I've only included files where compression ratio regressed or was much better. \| Calgary File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| calgary.tar \| +0.30% \| +7.1% \| \| geo \| -0.14% \| +25.0% \| \| obj1 \| -0.46% \| +15.2% \| \| obj2 \| -0.18% \| +6.0% \| \| pic \| +1.80% \| +9.3% \| \| trans \| -0.35% \| +5.5% \| We gain 0.1% of compression ratio on Silesia. We gain 0.3% of compression ratio on enwik8. I also tested on the GitHub and hg-commands datasets without a dictionary, and we gain a small amount of compression ratio on each, as well as speed. I tested the negative compression levels on Silesia on my Intel i9-9900k with gcc-8: \| Level \| Ratio Change \| Speed Change \| \|-------\|--------------\|--------------\| \| -1 \| +0.13% \| +6.4% \| \| -2 \| +4.6% \| -1.5% \| \| -3 \| +7.5% \| -4.8% \| \| -4 \| +8.5% \| -6.9% \| \| -5 \| +9.1% \| -9.1% \| Roughly, the negative levels now scale half as quickly. E.g. the new level 16 is roughly equivalent to the old level 8, but a bit quicker and smaller. If you don't think this is the right trade off, we can change it to multiply the step size by 2, instead of adding 1. I think this makes sense, because it gives a bit slower ratio decay. [igzip]: https://github.com/01org/isa-l/tree/master/igzip	2019-04-02 19:02:50 -07:00
Nick Terrell	56682a7709	Fix ZSTD_estimateCStreamSize_usingCCtxParams() It wasn't using the ZSTD_CCtx_params correctly. It must actualize the compression parameters by calling ZSTD_getCParamsFromCCtxParams() to get the real window log. Tested by updating the streaming memory usage example in the next commit. The CHECK() failed before this patch, and passes after. I also added a unit test to zstreamtest.c that failed before this patch, and passes after.	2019-04-01 18:02:52 -07:00
Nick Terrell	425ce5547c	Merge pull request #1563 from terrelln/dms-sep [libzstd] Split out zstd_fast dict match state function	2019-03-29 16:19:21 -06:00
Nick Terrell	f00407b640	Split out zstd_fast dict match state function	2019-03-29 10:39:16 -06:00
shakeelrao	dca73db30c	fix srcSize typo and add new UTIL func to comment	2019-03-28 17:50:34 -07:00
Nick Terrell	d0f5ba36fb	[cover] Improvements for small or homogeneous data * The algorithm would bail as soon as it found one epoch that contained no new segments. Change it so it now has to fail >= 10 times in a row (10 for fastcover, 10-100 for cover). * The algorithm uses the `maxDict` size to decide the epoch size. When this size is absurdly large, it causes tiny epochs. Lower bound the epoch size at 10x the segment size, and warn the user that their training set is too small. Fixes #1554	2019-03-22 14:14:46 -07:00
Nick Terrell	6b053b9f60	[lib] Allow ZSTD_CCtx_loadDictionary() to be called before parameters are set * After loading a dictionary only create the cdict once we've started the compression job. This allows the user to pass the dictionary before they set other settings, and is in line with the rest of the API. * Add tests that mix the 3 dictionary loading APIs. * Add extra tests for `ZSTD_CCtx_loadDictionary()`. * The first 2 tests added fail before this patch. * Run the regression test suite.	2019-03-21 16:13:53 -07:00
Nick Terrell	20f9ff7e53	Update documentation to tell how to replace the old streaming API with the new one.	2019-03-21 16:08:58 -07:00
Nick Terrell	e55da9e963	Wrap the new advanced api completely	2019-03-21 10:54:40 -07:00
shakeelrao	186ded6d91	Fix typo in legacy documentation	2019-03-19 01:44:08 -07:00
shakeelrao	5740eb6769	Remove extraneous spacing in comments	2019-03-18 21:05:35 -07:00
shakeelrao	0a3fa6f909	Add legacy mode in documentation	2019-03-18 20:33:15 -07:00
shakeelrao	20aa1b455c	Stylistic changes	2019-03-17 19:35:43 -07:00
shakeelrao	0033bb4785	Update documentation for ZSTD_frameSizeInfo	2019-03-17 17:41:27 -07:00
shakeelrao	19b75b6ecb	Test new ZSTD_findFrameCompressedSize and update documentation	2019-03-15 18:04:19 -07:00
shakeelrao	8cd423a659	Reorder declaration in ZSTD_findFrameSizeInfoLegacy	2019-03-15 16:20:34 -07:00
shakeelrao	60796e76b0	Add legacy support to decompressBound	2019-03-15 16:10:37 -07:00
Nick Terrell	f52a7d8faa	Merge pull request #1547 from shakeelrao/fix-error Fix incorrect error code in ZSTD_errorFrameSizeInfo	2019-03-15 10:57:49 -07:00
Nick Terrell	787b76904a	[libzstd] Allow compression parameters to be set with a cdict The order you set parameters in the advanced API is not supposed to matter. However, once you call `ZSTD_CCtx_refCDict()` the compression parameters cannot be changed. Remove that restriction, and document what parameters are used when using a CDict. If the CCtx is in dictionary mode, then the CDict's parameters are used. If the CCtx is not in dictionary mode, then its requested parameters are used.	2019-03-13 16:10:05 -07:00
Nick Terrell	0594e8135b	[libzstd] Free local cdict when referencing cdict We no longer care about the `cdictLocal` after calling `ZSTD_CCtx_refCDict()`, so we should free it to save some memory.	2019-03-13 14:54:31 -07:00
shakeelrao	79827a179f	Fix incorrectly assigned value in ZSTD_errorFrameSizeInfo As documented in `zstd.h`, ZSTD_decompressBound returns `ZSTD_CONTENTSIZE_ERROR` if an error occurs (not `ZSTD_CONTENTSIZE_UNKNOWN`). This is consistent with the error checking made in ZSTD_decompressBound, particularly line 545.	2019-03-13 01:23:07 -07:00
shakeelrao	9ad3f31d33	update documentation for decompressBound	2019-03-02 17:56:10 -08:00
shakeelrao	95dfd48143	update formatting	2019-03-01 23:11:15 -08:00
shakeelrao	1e08c49f75	add stylistic changes	2019-03-01 18:29:35 -08:00
shakeelrao	2bb5eec711	update missing error case to CONTENTSIZE_ERROR	2019-03-01 00:12:16 -08:00
shakeelrao	44ae395b3e	change nbBlocks to size_t for consistency	2019-03-01 00:05:59 -08:00
shakeelrao	03026c3b1d	change compressedBound to ULL	2019-03-01 00:03:50 -08:00
shakeelrao	8930c3c79b	implement API-level changes	2019-02-28 22:55:18 -08:00
shakeelrao	dce9a09772	initialize local vars in decompressBound	2019-02-28 03:01:21 -08:00
shakeelrao	515c506b4c	switch frameBound type to ULL	2019-02-28 02:10:17 -08:00
shakeelrao	d0a3f25697	change return type to ULL	2019-02-28 01:52:01 -08:00
shakeelrao	c9d674b60d	Remove autogenerated test file	2019-02-28 01:29:04 -08:00
shakeelrao	97d3d28dab	Fix decl-after-stmnt build error	2019-02-28 01:24:54 -08:00
shakeelrao	820af1e078	Provide an API function to estimate decompressed size. Introduces a new utility function `ZSTD_findFrameCompressedSize_internal` which is equivalent to `ZSTD_findFrameCompressSize`, but accepts an additional output parameter `bound` that computes an upper-bound for the compressed data in the frame. The new API function is named `ZSTD_decompressBound` to be consistent with `zstd_compressBound` (the inverse operation). Clients will now be able to compute an upper-bound for their compressed payloads instead of guessing a large size. Implements https://github.com/facebook/zstd/issues/1536.	2019-02-28 00:42:49 -08:00
Nick Terrell	be3bd70c57	Merge pull request #1532 from terrelln/cctx-params [libzstd] Rename ZSTD_CCtxParam_* to ZSTD_CCtxParams_*	2019-02-20 10:46:46 -08:00
Nick Terrell	7ad7ba3178	[libzstd] Rename ZSTD_CCtxParam_* to ZSTD_CCtxParams_*	2019-02-19 17:44:52 -08:00

1 2 3 4 5 ...

2796 Commits