AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Yann Collet	d228b6b0d0	btlazy2 : optimization for dictionary compression we want the dictionary table to be fully sorted, not just lazily filled. Dictionary loading is a bit more intensive, but it saves cpu cycles for match search during compression.	2017-12-29 19:14:18 +01:00
Yann Collet	02f64ef955	btlazy2: fixed interaction between unsortedMark and reduceTable	2017-12-29 19:08:51 +01:00
Yann Collet	64482c2c97	fixed bug in dubt the chain of unsorted candidates could grow beyond lowLimit.	2017-12-29 17:04:37 +01:00
Yann Collet	f36da5b4d9	minor speed optimization : index overflow prevention new code supposed to be easier to auto-vectorize	2017-12-29 14:40:33 +01:00
Yann Collet	5235d8d6ba	first implementation of delayed update for btlazy2 This is a pretty nice speed win. The new strategy consists in stacking new candidates as if it was a hash chain. Then, only if there is a need to actually consult the chain, they are batch-updated, before starting the match search itself. This is supposed to be beneficial when skipping positions, which happens a lot when using lazy strategy. The baseline performance for btlazy2 on my laptop is : 15#calgary.tar : 3265536 -> 955985 (3.416), 7.06 MB/s , 618.0 MB/s 15#enwik7 : 10000000 -> 3067341 (3.260), 4.65 MB/s , 521.2 MB/s 15#silesia.tar : 211984896 -> 58095131 (3.649), 6.20 MB/s , 682.4 MB/s (only level 15 remains for btlazy2, as this strategy is squeezed between lazy2 and btopt) After this patch, and keeping all parameters identical, speed is increased by a pretty good margin (+30-50%), but compression ratio suffers a bit : 15#calgary.tar : 3265536 -> 958060 (3.408), 9.12 MB/s , 621.1 MB/s 15#enwik7 : 10000000 -> 3078318 (3.249), 6.37 MB/s , 525.1 MB/s 15#silesia.tar : 211984896 -> 58444111 (3.627), 9.89 MB/s , 680.4 MB/s That's because I kept `1<<searchLog` as a maximum number of candidates to update. But for a hash chain, this represents the total number of candidates in the chain, while for the binary, it represents the maximum depth of searches. Keep in mind that a lot of candidates won't even be visited in the btree, since they are filtered out by the binary sort. As a consequence, in the new implementation, the effective depth of the binary tree is substantially shorter. To compensate, it's enough to increase `searchLog` value. Here is the result after adding just +1 to searchLog (level 15 setting in this patch): 15#calgary.tar : 3265536 -> 956311 (3.415), 8.32 MB/s , 611.4 MB/s 15#enwik7 : 10000000 -> 3067655 (3.260), 5.43 MB/s , 535.5 MB/s 15#silesia.tar : 211984896 -> 58113144 (3.648), 8.35 MB/s , 679.3 MB/s aka, almost the same compression ratio as before, but with a noticeable speed increase (+20-30%). This modification makes btlazy2 more competitive. A new round of paramgrill will be necessary to determine which levels are impacted and could adopt the new strategy.	2017-12-28 16:58:57 +01:00
Yann Collet	473362e922	Merge pull request #958 from facebook/continueCCtx fix a subtle issue in continue mode	2017-12-20 00:12:50 +01:00
Yann Collet	cafedcbbe4	ZSTD_resetCCtx_internal: fixed order of arguments params1 was swapped with params2. This used to be a non-issue when testing for strict equality, but now that some tests look for "sufficient size" `<=`, order matters.	2017-12-19 21:49:04 +01:00
Yann Collet	9096088f45	changed variable name for clarity, suggested by @terrelln	2017-12-19 21:20:46 +01:00
Yann Collet	f299fa39ac	fix a subtle issue in continue mode The deep fuzzer tests caught a subtle bug that was probably there for a long time. The impact of the bug is not a crash, or any other clear error signal, rather, it reduces performance, by cutting data into smaller blocks. Eventually, the following test would fail because it produces too many 1-byte blocks, requiring more space than buffer can provide : `./zstreamtest_asan --mt -s3514 -t1678312 -i1678314` The root scenario is as follows : - Create context, initialize it using explicit parameters or a `cdict` to pin them down, set `pledgedSrcSize=1` - The compression parameters will not be adapted, but `windowSize` and `blockSize` will be automatically set to `1`. `windowSize` and `blockSize` are dynamic values, set within `ZSTD_resetCCtx_internal()`. The automatic adaptation makes it possible to generate smaller contexts for smaller input sizes. - Complete compression - New compression with same context, using same parameters, but `pledgedSrcSize=ZSTD_CONTENTSIZE_UNKNOWN` trigger "continue mode" - Continue mode doesn't modify blockSize, because it used to depend on `windowLog` only, but in fact, it also depends on `pledgedSrcSize`. - The "old" blocksize (1) is still there, next compression will use this value to cut input into blocks, resulting in more blocks and worse performance than necessary performance. Given the scenario, and its possible variants, I'm surprised it did not show up before. But I suspect it did show up, it's just that it never triggered an error, because "worse performance" is not a trigger. The above test is a special corner case, where performance is so impacted that it reaches an error case. The fix works, but I'm not completely pleased. I think the current code relies too much on implied relations between variables. This will likely break again in the future when some related part of the code change. Unfortunately, no time to make larger changes if we want to keep the release target for zstd v1.3.3. So a longer term fix will have to be considered after the release. To do : create a reliable test case which triggers this scenario for CI tests.	2017-12-19 09:43:03 +01:00
Yann Collet	5c2f2ebfdb	zstdmt via compress_generic: reduce opportunity to free/create mtctx `zstreamtest --newapi` (and `--opaqueapi`) create and destroy way too many threads resulting in failure of tsan tests, and potentially connected to the qemu flaky tests. This is because, at each test, the nb of threads can be changed (random). The `--no-big-tests` directive reduce this choice to 1/2 threads, in order to limit memory usage, especially for qemu and 32-bits builds. Unfortunately, swapping between 1 and 2 threads is enough to constantly create/destroy new mtctx. This patch takes advantage of the following property : via compress_generic, no internal mtctx is needed for nbThreads < 2. As a consequence, when nbThreads == 2, the currently active mtctx is necessarily good. This dramatically reduces the nb of thread creations when invoking `zstreamtest --newapi --no-big-tests` (only when parent cctx itself is created, which is randomized to 1/256 tests). Expected outcome : - at a minimum : tsan tests shall now work continuously without exploding the thread counter - at best : flaky qemu tests on `zstreamtest --newapi --no-big-tests` may stop being flaky, due to less stress from constant thread creation/destruction Real world impact : minimal, I don't expect users to constantly change `nbThreads` between each invocation. If `nbThreads` remains stable, existing implementation re-uses existing mtctx. Also : `zstreamtest --newapi` but without `--no-big-tests` doesn't benefit as much, since this test can select a random `nbThreads` value between 1 and 4. The current patch only reduces opportunity to free/create mtctx (for example : 2->1->2 doesn't need a new mtctx) but doesn't completely eliminate it, since `nbThreads` can still change between 2/3/4. A more complete solution could be to only use 2 out of 4 allocated threads, thus keeping the pool at a constant size. This would require a larger change to `POOL_*` api though.	2017-12-16 12:48:13 -08:00
Yann Collet	3cbfac1cdb	updated levels 15-20 taking advantage of `btopt` improved speed to tune parameters. Levels 16-19 are stronger than previous release, making the graph more favorable. In theory, I should also update small-size tables, but I got lazy on that one ...	2017-12-14 23:29:00 -08:00
Yann Collet	8c41a9cb1e	Merge pull request #951 from facebook/lastBlock saves 3-bytes on small input with streaming API	2017-12-14 15:39:50 -08:00
Yann Collet	a0ac8c895c	Merge pull request #950 from facebook/srcSizeAdaptation fix adaptation on srcSize	2017-12-14 14:48:31 -08:00
Yann Collet	281f06e01f	saves 3-bytes on small input with streaming API zstd streaming API was adding a null-block at end of frame for small input. Reason is : on small input, a single block is enough. ZSTD_CStream would size its input buffer to expect a single block of this size, automatically triggering a flush on reaching this size. Unfortunately, that last byte was generally received before the "end" directive (at least in `fileio`). The later "end" directive would force the creation of a 3-bytes last block to indicate end of frame. The solution is to not flush automatically, which is btw the expected behavior. It happens in this case because blocksize is defined with exactly the same size as input. Just adding one-byte is enough to stop triggering the automatic flush. I initially looked at another solution, solving the problem directly in the compression context. But it felt awkward. Now, the underlying compression API `ZSTD_compressContinue()` would take the decision the close a frame on reaching its expected end (`pledgedSrcSize`). This feels awkward, a responsability over-reach, beyond the definition of this API. ZSTD_compressContinue() is clearly documented as a guaranteed flush, with ZSTD_compressEnd() generating a guaranteed end. I faced similar issue when trying to port a similar mechanism at the higher streaming layer. Having ZSTD_CStream end a frame automatically on reaching `pledgedSrcSize` can surprise the caller, since it did not explicitly requested an end of frame. The only sensible action remaining after that is to end the frame with no additional input. This adds additional logic in the ZSTD_CStream state to check this condition. Plus some potential confusion on the meaning of ZSTD_endStream() with no additional input (ending confirmation ? new 0-size frame ?) In the end, just enlarging input buffer by 1 byte feels the least intrusive change. It's also a contract remaining inside the streaming layer, so the logic is contained in this part of the code. The patch also introduces a new test checking that size of small frame is as expected, without additional 3-bytes null block.	2017-12-14 11:47:02 -08:00
Yann Collet	c005df136f	Merge pull request #947 from facebook/fix944 Fix #944	2017-12-14 10:01:52 -08:00
Yann Collet	2e97a6d464	fixed minor declaration-after-statement warning	2017-12-13 18:50:05 -08:00
Yann Collet	5432ef6921	fixes adaptation on srcSize This patch restores capability for each file to receive adapted compression parameters depending on its size. The bug breaking this feature was relatively silly : setting a parameter with a value "0" is supposed to be a no-op. Unfortunately, it would pin down compression parameters as if they were manually set, preventing later automatic adaptation. Unfortunately, I'm currently short of a test case that could check this situation and trigger an error. Compression parameters selection between tableID 0,1,2,3 is largely internal, leaving no trace to outside world, not even in frame header.	2017-12-13 17:45:26 -08:00
Yann Collet	d23eb9a098	zstreamtest : added missing CHECK_Z()	2017-12-13 15:35:49 -08:00
Nick Terrell	22727a7467	Fix cdict compressor repcodes	2017-12-13 11:31:20 -08:00
Yann Collet	e28305fcca	fix #944 : ZSTDMT with large files and dictionary now works correctly windowLog is now enforced from provided compression parameters, instead of being copied blindly from `cdict` where it could be smaller. also : - fix a minor bug in zstreamtest --mt : advanced parameters must be set before init - changed advanced parameter name to ZSTDMT_jobSize	2017-12-12 18:04:58 -08:00
Yann Collet	03832b7aa5	re-added test case messing with revert ... :(	2017-12-12 14:01:54 -08:00
Yann Collet	8a104fda05	Revert "Created a test case which reliably reproduces bug #944 " This reverts commit `5098d1fbe2`.	2017-12-12 12:51:49 -08:00
Yann Collet	5098d1fbe2	Created a test case which reliably reproduces bug #944 in zstreamtest.	2017-12-12 12:48:31 -08:00
Yann Collet	dfc697e967	comment clarification	2017-12-08 12:16:49 -05:00
Yann Collet	c029ee1f0b	ZSTD_initCStream_srcSize() considers "0" to mean "unknown" to not break existing programs relying on this behavior. Might be changed to mean "empty" in the future.	2017-12-07 17:13:10 -05:00
Yann Collet	3aa2b27a89	fix #942 : streaming interface does not compress after ZSTD_initCStream() While the final result is still, technically, a frame, the resulting frame expands initial data instead of compressing it. This is because the streaming API creates a tiny 1-byte buffer for input, because it believes input is empty (0-bytes), because in the past, 0 used to mean "unknown" instead. This patch fixes the issue. Todo : add a test which traps the issue.	2017-12-07 02:52:50 -05:00
Yann Collet	5e1f34b7e4	setParameter : no side-effect on setting a compression parameter last such side-effect was modifying cctx->loadedDictEnd on setting forceWindow. It is no a useless operation, so it's removed. No side-effect left when setting a compression parameter.	2017-12-01 21:17:09 -08:00
Yann Collet	78290874a5	fixed Visual warning on minor interface discrepancy	2017-11-29 17:01:14 -08:00
Yann Collet	d3c59edac9	removed long-range-mode tests from `zstreamtest --no-big-tests`	2017-11-29 16:42:20 -08:00
Yann Collet	998a93b784	simplified ZSTD_CCtx_setParametersUsingCCtxParams() Any ZSTD_CCtx_setParameter() shall just write the requested parameter, without further action. Any action shall be taken at parameter application only (during init). It makes it possible to just copy CCtxParams from external container to internal state, and get rid of the more complex code which was trying to compensate for missing actions.	2017-11-29 16:13:05 -08:00
Yann Collet	23767e950a	fix one UB pointer arithmetic in encoder Instead of calculating distance between 2 memory objects, which is UB, we extract the offset from object 1, and transfer it into object 2.	2017-11-17 13:24:51 -08:00
Yann Collet	15768cabb5	fixed some complex scenarios Fixed : multithreading to compress some small data with dictionary Fixed : ZSTD_initCStream_usingCDict() Improved streaming memory usage when pledgedSrcSize is known.	2017-11-16 15:18:18 -08:00
Yann Collet	05dffe43a7	Fixed Btree update ZSTD_updateTree() expected to be followed by a Bt match finder, which would update zc->nextToUpdate. With the new optimal match finder, it's not necessarily the case : a match might be found during repcode or hash3, and stops there because it reaches sufficient_len, without even entering the binary tree. Previous policy was to nonetheless update zc->nextToUpdate, but the current position would not be inserted, creating "holes" in the btree, aka positions that will no longer be searched. Now, when current position is not inserted, zc->nextToUpdate is not update, expecting ZSTD_updateTree() to fill the tree later on. Solution selected is that ZSTD_updateTree() takes care of properly setting zc->nextToUpdate, so that it no longer depends on a future function to do this job. It took time to get there, as the issue started with a memory sanitizer error. The pb would have been easier to spot with a proper `assert()`. So this patch add a few of them. Additionnally, I discovered that `make test` does not enable `assert()` during CLI tests. This patch enables them. Unfortunately, these `assert()` triggered other (unrelated) bugs during CLI tests, mostly within zstdmt. So this patch also fixes them. - Changed packed structure for gcc memory access : memory sanitizer would complain that a read "might" reach out-of-bound position on the ground that the `union` is larger than the type accessed. Now, to avoid this issue, each type is independent. - ZSTD_CCtxParams_setParameter() : @return provides the value of parameter, clamped/fixed appropriately. - ZSTDMT : changed constant name to ZSTDMT_JOBSIZE_MIN - ZSTDMT : multithreading is automatically disabled when srcSize <= ZSTDMT_JOBSIZE_MIN, since only one thread will be used in this case (saves memory and runtime). - ZSTDMT : nbThreads is automatically clamped on setting the value.	2017-11-16 12:18:56 -08:00
Yann Collet	4202b2e8a6	merged rep search into btMatchSearch but there is a tree corruption somewhere ... bug hunt ongoing	2017-11-14 20:38:52 -08:00
Yann Collet	9a11f70dc3	merged repcode search into BT match search this version has same speed as branch `opt` which is itself 5-10% slower than branch `dev` (no identified reason) It does not compress exactly the same as `opt` or `dev`, maybe because it doesn't stop search after repcodes, leading to sometimes better compression, sometimes worse (by a small margin). warning : _extDict path does not work for the time being This means that benchmark module works, but file module will fail with large files (and high compression level). Objective is to fuse _extDict path into current one, in order to have a single parser to maintain.	2017-11-13 02:23:48 -08:00
Yann Collet	100d8ad6be	lib/compress: created ZSTD_LLcode() and ZSTD_MLcode() transform length into code. Since transformation is needed in several places throughout the code, better write the logic in one place.	2017-11-08 12:43:05 -08:00
Yann Collet	ee441d5d2b	renamed zstd_compress.h into zstd_compress_internal.h to emphasize the fact that all definitions it contains must remain private, accross lib/compress modules.	2017-11-07 16:15:23 -08:00
Yann Collet	150354c5fe	minor refactor added some traces and assert related to hunting a potential ubsan error in 32-bits more (it ends up being a compiler-side issue : https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82802). Modified one pointer arithmetic expression for a more conformant way.	2017-11-01 16:57:48 -07:00
Yann Collet	428e8b3bf4	fix : ZSTD_compress_generic(,,,ZSTD_e_end) automatically sets pledgedSrcSize as per documentation, on ZSTD_setPledgedSrcSize() : > If all data is provided and consumed in a single round, > this value (pledgedSrcSize) is overriden by srcSize instead. This wasn't applied before compression level is transformed into compression parameters. As a consequence, small input missed compression parameters adaptation. It seems to work fine now : compression was compared with ZSTD_compress_advanced(), results were the same.	2017-11-01 13:15:23 -07:00
Nick Terrell	86b8134cad	[libzstd] Fix parameter selection for empty input ZSTD_compress() and friends would treat an empty input as an unknown size when selecting parameters. Thus, they would drastically overallocate the context. Tell ZSTD_getParams() that the source size is 1 when it is empty.	2017-10-25 17:24:15 -07:00
Yann Collet	1ff8a8c109	Merge pull request #891 from facebook/contentSize Content size	2017-10-17 17:24:51 -07:00
Yann Collet	13bfe885aa	edited ZSTD_initCStream_advanced() comment	2017-10-16 14:06:22 -07:00
Nick Terrell	7f961ba6cd	Don't allow default tables to repeat It isn't useful in any case to repeat default tables. Saves a few bytes on Silesia, since we don't trigger the dictionary heuristic. Before: 211988480 => 73651998 bytes After: 211988480 => 73651721 bytes	2017-10-16 11:37:56 -07:00
Yann Collet	fc8d293460	dictionary compression use correct file size estimation when determining compression parameters to compress one file only. For multiple files, it still "bets" that files are going to be small. There was also a bug recently added in ZSTD_CCtx_loadDictionary_advanced() making it incapable to use pledgedSrcSize to determine compression parameters.	2017-10-14 01:21:43 -07:00
Yann Collet	213ef3b510	fixed ZSTD_initCStream_advanced() behavior, which depends on contentSizeFlag, and a stream fuzzer test, which was incorrect (relied on 0 being unconditionnally transformed into `ZSTD_CONTENTSIZE_UNKNOWN`)	2017-10-13 19:01:58 -07:00
Yann Collet	3c1e3f8ec9	contentSizeFlag enabled by default would also fail for streaming and MT operations fixed	2017-10-13 18:32:06 -07:00
Yann Collet	fb44516641	ensure fParams.contentSizeFlag starts at 1 such default was failing for ZSTD_compressBegin/ZSTD_compressContinue fixed too	2017-10-13 17:39:13 -07:00
Yann Collet	dd18d73e7e	fileio: content size is enabled by default	2017-10-13 16:32:18 -07:00
Nick Terrell	ced6e6189c	Add DEBUGLOG() that prints FSE encoding types	2017-10-13 14:55:23 -07:00
Nick Terrell	24ac2dbd2a	Fix invalid use of dictionary offcode table Fixes #888.	2017-10-13 12:47:03 -07:00
Yann Collet	a9e5705077	minor code formatting added a trace during sequence encoding	2017-10-13 02:36:16 -07:00
Nick Terrell	a86a7097ec	Ensure dictionary Huff table can encode any symbol * Ensure that the dictionary Huffman CTable has maxSymbolValue 255. * Fix a stack buffer overflow during compression dictionary loading.	2017-10-03 13:22:13 -07:00
Yann Collet	004fd34fd9	Merge pull request #876 from facebook/srcSize CLI Fix : srcSize written in frame headers when compressing multiple files	2017-10-02 15:02:05 -07:00
Nick Terrell	86e83e926f	[libzstd] Set CLEVEL_CUSTOM correctly In `ZSTD_compressBegin_advanced()`, `ZSTD_parameters` are used to set the compression parameters, but the level didn't get set to `CLEVEL_CUSTOM`, so `ZSTD_compressBlock()` used the wrong parameters when checking the source size.	2017-10-02 13:43:30 -07:00
Yann Collet	6e930c13d1	Merge branch 'dev' into compressBound	2017-10-01 11:24:02 -07:00
Yann Collet	dc404119e5	ZSTD_adjustCParams_internal : minor optimization	2017-09-30 15:02:40 -07:00
Nick Terrell	c5d6dde502	Don't `size -= 1` in ZSTD_adjustCParams() The window size could end up too small if the source size is 2^n + 1. Credit to OSS-Fuzz	2017-09-30 14:20:06 -07:00
Yann Collet	5b10345b26	added ZSTD_COMPRESSBOUND() as a macro ZSTD_compressBound() works fine, but is only useful for dynamic allocation. For static allocation, only a macro can provide the amount during compilation time.	2017-09-29 23:17:41 -07:00
Yann Collet	8afb151c9b	cli: fixed wrong initialization in MT mode It's not good to mix old and new API ZSTD_resetCStream() doesn't just set pledgedSrcSize : it also sets the CCtx for a single thread compression. Problem is, when 2+ threads are defined in cctx->requestedParams, ZSTD_compress_generic() will want to start MT compression, since initialization is supposed to have already happened (thanks to ZSTD_resetCStream()) except that the underlying ZSTDMT_CCtx* object is not created, resulting in a segfault. This is an invalid construction (correct one is to use ZSTD_CCtx_setPledgedSrcSize()). I haven't found a nice way to mitigate this impact if someone makes the same mistake. At some point, removing the old API to keep only the new API within fileio.c will limit these risks.	2017-09-29 22:14:37 -07:00
Yann Collet	fbd5ab7027	minor fix : no longer use fake srcSize during resource creation srcSize is read and provided at each file, not at resource creation. This used to be useful with older API, because it could not re-adapt parameters between sessions. At some point, it will be better to remove the old code, and only keep the new_api. It works fine by now.	2017-09-29 19:40:27 -07:00
Yann Collet	db1668a43b	fix : srcSize written in frame header when multiple files compressed This information used to be disabled when nbFiles>1. It was badly initialized later in the code, resulting in an error.	2017-09-29 18:05:18 -07:00
Yann Collet	86b4fe5b45	adjustCParams : restored previous behavior unknowns srcSize presumed small if there is a dictionary (dictSize>0) and presumed large otherwise.	2017-09-28 18:14:28 -07:00
Yann Collet	e4ec427720	Merge branch 'dev' into shorterTests fixed conflicts	2017-09-28 12:19:28 -07:00
Yann Collet	8074261d00	zstdmt : move on when not enough memory for a new input buffer just continue operations without input forward progress, instead of an error that stops current compression session.	2017-09-28 11:46:19 -07:00
Yann Collet	2cd15dd9a4	fixed minor Visual conversion warning	2017-09-28 02:33:41 -07:00
Yann Collet	9b5b47ac93	ensure adjustCParams adjust hLog and cLog even without srcSize It would previously exit when srcSize is unknown. But in the case of custom parameters, hLog and cLog can still be too large in comparison with windowLog. Reduces maximum memory allocated during zstreamtest --newapi	2017-09-28 01:25:40 -07:00
Yann Collet	54a827fff0	Merge branch 'dev' into newFormats Fixed conflicts in zstdmt_compress.c	2017-09-27 16:39:40 -07:00
Yann Collet	c994932788	fixed ZSTD_format_e value validation	2017-09-27 12:22:22 -07:00
Yann Collet	ecf1778e23	updated ZSTD_format_e value validation also updated manual	2017-09-27 11:19:21 -07:00
Yann Collet	9f0b8dfbe9	Merge branch 'dev' into newFormats	2017-09-26 14:22:39 -07:00
Nick Terrell	c233bdbaee	Increase maximum window size * Maximum window size in 32-bit mode is 1GB, since allocations for 2GB fail on my Mac. * Maximum window size in 64-bit mode is 2GB, since that is the largest power of 2 that works with the overflow prevention. * Allow `--long=windowLog` to set the window log, along with `--zstd=wlog=#`. These options also set the window size during decompression, but don't override `--memory=#` if it is set. * Present a helpful error message when the window size is too large during decompression. * The long range matcher defaults to a hash log 7 less than the window log, which keeps it at 20 for window log 27. * Keep the default long range matcher window size and the default maximum window size at 27 for the API and CLI. * Add tests that use the maximum window size and hash size for compression and decompression.	2017-09-26 14:00:01 -07:00
Yann Collet	5d8fdd1641	Merge pull request #855 from terrelln/maxoff [libzstd] Increase MaxOff	2017-09-25 16:34:29 -07:00
Yann Collet	6ee05a02b8	added ZSTD_decompress_generic() same as ZSTD_decompressStream(), just for a similar feeling as the compression side, which uses ZSTD_compress_generic()	2017-09-25 15:41:48 -07:00
Yann Collet	62568c9a42	added capability to generate magic-less frames decoder not implemented yet	2017-09-25 14:26:26 -07:00
Nick Terrell	bbe77212ef	[libzstd] Increase MaxOff	2017-09-25 13:36:18 -07:00
Yann Collet	96f0cde31a	minor function rename ZSTD_estimateCStreamSize_advanced_usingCParams -> ZSTD_estimateCStreamSize_usingCParams _usingX is clear. _advanced feels redundant	2017-09-24 16:47:02 -07:00
Yann Collet	7c3dea42ce	added prototypes for advanced parameters for decompression API required to decode custom formats	2017-09-24 15:57:29 -07:00
Nick Terrell	d6abb28951	Prepare for ZSTD_WINDOWLOG_MAX == 31	2017-09-21 17:18:41 -07:00
Yann Collet	7d1ff3817b	fix ZSTD_sizeof_CCtx() / ZSTD_sizeof_CStream() previous result was over-estimated by counting streaming buffers twice	2017-09-18 14:47:34 -07:00
Yann Collet	335780c427	fixed too strong alignment assert in ZSTD_initStaticCCtx() 64-bits fields are only 32-bits aligned on 32-bits CPU	2017-09-13 16:35:29 -07:00
Yann Collet	f1571dad8f	Merge pull request #838 from stellamplau/ldm-mergeDev Add long distance matcher	2017-09-13 13:24:08 -07:00
Stella Lau	eb3327c10a	Merge branch 'dev' of https://github.com/facebook/zstd into ldm-mergeDev	2017-09-11 15:00:01 -07:00
Stella Lau	f902bf9676	Merge branch 'ldm-integrate' into ldm-mergeDev	2017-09-11 14:55:29 -07:00
Yann Collet	f325ee4e84	fixed pass-through warning	2017-09-11 14:37:03 -07:00
Stella Lau	0d1b54db61	Explicitly cast raw numerals when left-shifting	2017-09-11 14:28:18 -07:00
Yann Collet	0d6ecc72a3	makes it possible to compile libzstd in single-thread mode without zstdmt_compress.c (#819 )	2017-09-11 14:09:34 -07:00
Yann Collet	3128e03be6	updated license header to clarify dual-license meaning as "or"	2017-09-08 00:09:23 -07:00
Stella Lau	360428c5d9	Move ldm functions to their own file	2017-09-06 18:09:26 -07:00
Stella Lau	2b99d696de	Remove debug code	2017-09-06 15:57:26 -07:00
Stella Lau	eeff55dfa8	Merge remote-tracking branch 'upstream/dev' into ldm-mergeDev	2017-09-06 15:56:32 -07:00
Yann Collet	ad0046244f	Merge pull request #831 from terrelln/split-compress Split parsers out of zstd_compress.c	2017-09-06 10:01:27 -07:00
Stella Lau	9e4060200b	Add tests and fix pointer alignment	2017-09-06 09:14:05 -07:00
Stella Lau	c706de5395	Rename and add short ldm parameters in cli	2017-09-05 21:11:18 -07:00
Stella Lau	98b85426f1	Fix setting of nextToUpdate at end of ldm matcher	2017-09-05 20:41:37 -07:00
Nick Terrell	721726d688	Split parsers out of zstd_compress.c	2017-09-05 17:10:25 -07:00
Stella Lau	08d33fe1c9	Fix parameter handling in copyCCtx with cdict	2017-09-05 15:50:20 -07:00
Stella Lau	fd0071da29	Fix parameter handling with ZSTD_copyCCtx	2017-09-05 15:34:17 -07:00
Stella Lau	67d4a6161c	Add ldmBucketSizeLog param	2017-09-02 21:55:29 -07:00
Stella Lau	a1f04d518d	Move hashEveryLog to cctxParams and update cli	2017-09-01 15:05:47 -07:00
Stella Lau	767a0b3be1	Move ldm hashLog, bucketLog, and mml to cctxParams	2017-09-01 12:24:59 -07:00
Stella Lau	17d8e0bdcc	Merge remote-tracking branch 'upstream/longRangeMatcher' into ldm-integrate	2017-09-01 10:19:38 -07:00
Stella Lau	8081becadc	Add long distance matching as a CCtxParam	2017-09-01 09:18:58 -07:00
Yann Collet	d7ad99b2ab	Merge branch 'longRangeMatcher' into dev	2017-08-31 18:08:37 -07:00
Stella Lau	6a546efb8c	Add long distance matcher Move last literals section to ZSTD_block_internal	2017-08-31 12:53:19 -07:00
Stella Lau	90a31bfa16	Pass dictMode to ZSTDMT_initCStream; fix nits - Return error code in estimate{CCtx,CStream}Size functions	2017-08-30 16:19:07 -07:00
Stella Lau	ee65701720	Minor fixes; remove formatting only changes	2017-08-29 20:27:35 -07:00
Stella Lau	a6e20e1bd7	Add test for raw content starting with dict header	2017-08-29 18:36:18 -07:00
Stella Lau	82d636b76a	Rename applyCCtxParams()	2017-08-29 18:03:06 -07:00
Stella Lau	4e835720bf	Delay creation of ZSTDMT_CCtx	2017-08-29 17:58:32 -07:00
Stella Lau	c7a18b7c21	Localize 'dictMode' from cctx to function param	2017-08-29 15:52:24 -07:00
Stella Lau	c88fb9267f	Replace 'byReference' with enum	2017-08-29 11:55:02 -07:00
Stella Lau	b5b9275e67	Rename estimateCCtxSize_advanced() and estimateCStreamSize_advanced()	2017-08-29 10:49:29 -07:00
Stella Lau	0e56a84a1e	Fix getting cParams from CCtxParams	2017-08-28 19:25:17 -07:00
Stella Lau	024098a47d	Fix parameter retrieval from cdict	2017-08-25 17:58:28 -07:00
Stella Lau	2adde898c8	Fix typo with ZSTDMT_parameter	2017-08-25 16:13:40 -07:00
Stella Lau	18224608ff	Remove ZSTD_setCCtxParameter()	2017-08-25 13:58:41 -07:00
Stella Lau	0744592d38	Add function initializing cctxParams from clevel	2017-08-25 13:36:47 -07:00
Stella Lau	9911153723	Move jobSize and overlapLog in zstdmt to cctxParams	2017-08-25 13:14:51 -07:00
Stella Lau	eb7bbab36a	Remove ZSTD_p_refDictContent and dictContentByRef	2017-08-25 11:11:45 -07:00
Stella Lau	15fdeb9e41	Enforce nbThreads<=1 for estimateCCtxSize	2017-08-24 16:28:49 -07:00
Stella Lau	2fbf0285b2	Fix interaction with ZSTD_setCCtxParameter() and cleanup	2017-08-24 11:25:41 -07:00
Stella Lau	bf3108fb50	Ensure zstdmt uses 'job version' of cctx parameters	2017-08-23 17:03:31 -07:00
Stella Lau	1c81f725ff	Remove duplicated testing code	2017-08-23 15:47:15 -07:00
Stella Lau	64ce49426b	Fix cstream compression level	2017-08-23 12:30:47 -07:00
Stella Lau	5bc2c1e982	Add prototype support for customMem with cctxParams	2017-08-23 12:03:30 -07:00
Yann Collet	e9ce1208a1	Merge pull request #812 from facebook/longRangeFix fixed extraordinary scenario where all fields use maximum nbBits	2017-08-23 11:35:28 -07:00
Stella Lau	6f1a21c7e9	Remove formatting-only changes	2017-08-23 10:24:19 -07:00
Stella Lau	11303778d0	Add function to make cctxParams from ZSTD_parameters	2017-08-22 14:53:13 -07:00
Stella Lau	23fc0e41fa	Remove 'opaque' naming from internal functions	2017-08-22 14:24:47 -07:00
Stella Lau	8fd1636776	Remove unused functions	2017-08-22 13:33:58 -07:00
Yann Collet	6b2b6a9bd5	fixed extraordinary scenario where all fields use maximum possible nb of bits simultaneously can only happen if windowLog>=27 (level 22 --ultra)	2017-08-22 12:09:21 -07:00
Stella Lau	e50ed1fa3a	Fix undefined behavior when srcSize==1	2017-08-22 11:55:42 -07:00
Stella Lau	5b956f4753	Comment out CCtx_param versions of CDict functions	2017-08-21 14:49:16 -07:00
Stella Lau	fd8a25786e	Check parameters are valid in initCCtxParams	2017-08-21 13:23:35 -07:00
Stella Lau	1c0dbe81b1	Add documentation for CCtx_params	2017-08-21 13:18:00 -07:00
Stella Lau	939f954285	Pass ZSTD_CCtx_params as const ptr when possible	2017-08-21 12:57:18 -07:00
Stella Lau	560b34f6d2	Return error code when initializing NULL cctxParams	2017-08-21 11:52:26 -07:00
Stella Lau	25be09c6b4	Set some parameters to zero before initializing cdict	2017-08-21 11:35:46 -07:00
Stella Lau	502031ca10	Use cctxParam version of createCDict internally	2017-08-21 11:00:44 -07:00
Stella Lau	91b30dbe84	Remove test parameter	2017-08-21 10:09:06 -07:00
Stella Lau	f181f33bdf	Disable tests and refactor	2017-08-21 01:59:08 -07:00
Stella Lau	023b24e6d4	Add cctx param tests	2017-08-20 22:55:07 -07:00
Stella Lau	6cee6e07e5	Add internal createCDict function	2017-08-18 22:48:31 -07:00
Stella Lau	d775519296	Add cctxParam versions of internal functions	2017-08-18 17:37:58 -07:00
Yann Collet	32fb407c9d	updated a bunch of headers for the new license	2017-08-18 16:52:05 -07:00
Stella Lau	63b8c98531	Pass cctx parameters to MTCtx	2017-08-18 16:17:24 -07:00
Stella Lau	399ae013d4	Add function to apply cctx params	2017-08-18 13:01:55 -07:00
Stella Lau	81d89d82a6	Move nbThreads to cctx params	2017-08-18 12:08:57 -07:00
Stella Lau	2300c58a6f	Move dictContentByRef to cctx params	2017-08-18 12:03:16 -07:00
Stella Lau	b6cb2ed8cb	Move dictMode to cctxParams	2017-08-18 11:43:31 -07:00
Stella Lau	97e27affcb	Move compression level to cctx params	2017-08-18 11:20:08 -07:00
Stella Lau	c0221124d5	Add function to set opaque parameters	2017-08-17 19:30:22 -07:00
Stella Lau	4169f49171	Add initialization/allocation functions for opaque params	2017-08-17 18:45:04 -07:00
Stella Lau	ade95b8bed	Add opaque interfaces for static initialization	2017-08-17 18:13:08 -07:00
Stella Lau	699f11b4f7	Create opaque parameter structure	2017-08-17 17:33:46 -07:00
Nick Terrell	565e925eb7	[libzstd] Fix FORCE_INLINE macro	2017-08-14 21:12:05 -07:00
Nick Terrell	308047eb5d	Fix compression failure on incompressible data If the destination buffer is the minimum allowed size in `ZSTD_compressSequences()` (2^17), then if the block isn't compressible compression might fail with `dstSize_tooSmall`, when it should instead emit a raw uncompressed block. Additionally, `ZSTD_compressLiterals()` implicitly called `ZSTD_noCompressLiterals()` if Huffman compression failed. Make that explicit.	2017-08-07 11:45:24 -07:00
Yann Collet	e1222544be	Merge pull request #753 from paulcruz74/adapt-approach-3 adaptive compression v1	2017-07-27 10:00:10 -07:00
Paul Cruz	6945b3c43d	removed previous version of completion for compression	2017-07-19 11:51:50 -07:00
Yann Collet	77d67fb167	Merge pull request #766 from terrelln/real-block-split [libzstd] Pull optimal parser state out of seqStore_t	2017-07-18 08:26:24 -07:00
Yann Collet	14c83b05c7	Merge pull request #765 from terrelln/real-block-split [libzstd] Remove ZSTD_CCtx* argument of ZSTD_compressSequences()	2017-07-17 19:25:55 -07:00
Nick Terrell	7a28b9e4a3	[libzstd] Pull optimal parser state out of seqStore_t	2017-07-17 15:29:11 -07:00
Yann Collet	3381bf4b84	Merge pull request #764 from terrelln/real-block-split [libzstd] Refactor ZSTD_compressSequences()	2017-07-17 14:46:01 -07:00
Nick Terrell	e198230645	[libzstd] Remove ZSTD_CCtx* argument of ZSTD_compressSequences()	2017-07-17 12:27:24 -07:00
Nick Terrell	634f012420	[libzstd] Refactor ZSTD_compressSequences()	2017-07-17 11:36:11 -07:00
Paul Cruz	50ce4eaeb6	added error detection for pthread initialization, added compression completion measurement, fixed const values	2017-07-17 10:12:44 -07:00
Yann Collet	2bd6440be0	pinned down error code enum values Note : all error codes are changed by this new version, but it's expected to be the last change for existing codes. Codes are now grouped by category, and receive a manually attributed value. The objective is to guarantee that error code values will not change in the future when introducing new codes. Intentionnal empty spaces and ranges are defined in order to keep room for potential new codes.	2017-07-13 17:12:16 -07:00
Nick Terrell	830ef4152a	[libzstd] Increase granularity of FSECTable repeat mode	2017-07-13 12:45:39 -07:00
Yann Collet	d985319337	Merge pull request #759 from terrelln/real-block-split [libzstd] Pull CTables into sub-structure	2017-07-13 10:24:19 -07:00
Nick Terrell	de0414b736	[libzstd] Pull CTables into sub-structure	2017-07-12 19:49:19 -07:00
Yann Collet	88da8f1816	fix : propagate custom allocator to ZSTDMT though ZSTD_CCtx_setParameter() also : compile fuzzer with MT enabled	2017-07-10 14:02:33 -07:00
Yann Collet	2cb9774f5e	more precise estimation of amount to flush at end of stream (single thread mode) also : can use DEBUGLEVEL variable in /tests	2017-07-04 12:39:26 -07:00
Yann Collet	2084b041f4	fixed comments	2017-07-03 15:52:19 -07:00
Yann Collet	5a77361595	fixed wrong function name in comment	2017-07-03 15:21:24 -07:00
Yann Collet	d5c046c609	implemented shortcut for zstd_compress_generic() in MT mode added ZSTDMT_compress_advanced() API	2017-06-30 14:51:01 -07:00
Yann Collet	a3d9926c40	compression optimization opportunity switch to single-pass mode directly into output buffer when outputSize >= ZSTD_compressBound(inputSize). Speed gains observed with fullbench (~+15% on level 1)	2017-06-29 14:44:49 -07:00
Yann Collet	037466245f	refactor ZSTD_check_compressionLevel_monotonicIncrease_memoryBudget() use less macro statements the initial version was meant to work with STATIC_ASSERT but since it doesn't work and needs assert() it's possible to rewrite it using normally compiled code which is better for compiler. Downside : the error message is less precise. There is a DEBUGLOG(3,) to compensate.	2017-06-28 20:24:08 -07:00
Yann Collet	2bf428df45	Merge branch 'advancedAPI2' into refPrefix	2017-06-28 16:35:49 -07:00
Yann Collet	1ca76039af	fixed -Wdeclaration-after-statement	2017-06-28 15:40:21 -07:00
Yann Collet	813535105b	added function to control monotonic memory budget increase of ZSTD_defaultCParameters[0] It's a runtime test, based on assert(), played once, on first ZSTD_getCParams() usage, when ZSTD_DEBUG is enabled.	2017-06-28 15:34:56 -07:00
Yann Collet	adbe74a8ac	adjusted compression levels to guarantee a monotonically increasing memory budget	2017-06-28 13:22:37 -07:00
Yann Collet	33a6639039	fixed ZSTD_refPrefix with Multithread-enabled CCtx	2017-06-28 11:09:43 -07:00
Yann Collet	2e4274262d	controlled dictMode	2017-06-27 17:09:12 -07:00
Yann Collet	b7372933b8	implemented ZSTD_refPrefix()	2017-06-27 15:49:12 -07:00
Yann Collet	7d3816183f	exposed ZSTD_MAGIC_DICTIONARY in zstd.h makes it easier to explain ZSTD_dictMode	2017-06-27 13:50:34 -07:00
Yann Collet	fecc721fd9	added parameter ZSTD_p_refDictContent	2017-06-27 11:46:39 -07:00
Yann Collet	dde10b23fe	refactored ZSTD_estimateDStreamSize() now uses windowSize as argument. Also : created ZSTD_estimateDStreamSize_fromFrame()	2017-06-26 17:44:26 -07:00
Yann Collet	09ae03a570	ZSTD_estimateCDictSize_advanced() ZSTD_estimateCDictSize() now uses same arguments as ZSTD_createCDict() ZSTD_estimateCDictSize_advanced() uses same arguments as ZSTD_createCDict_advanced()	2017-06-26 16:47:32 -07:00
Yann Collet	0c9a915a28	ZSTD_estimateCStreamSize_advanced()	2017-06-26 16:02:25 -07:00
Yann Collet	31af8290d1	ZSTD_estimateCCtx_advanced() ZSTD_estimateCCtx() is now a "simple" function, taking int compressionLevel as single argument. ZSTD_estimateCCtx_advanced() takes a CParams argument, which is both more complete and more complex to generate.	2017-06-26 15:52:39 -07:00
Yann Collet	ef269c1b68	Merge pull request #725 from facebook/advancedAPI2 New Advanced API	2017-06-23 09:50:47 -07:00
Yann Collet	ecb0f46866	add controls over streaming buffers	2017-06-21 17:25:01 -07:00
Yann Collet	204b6b7ef6	fixed streaming buffered allocation with CDict compression	2017-06-21 15:13:00 -07:00
Yann Collet	7bd1a2900e	added ZSTD_dictMode_e to control dictionary loading mode	2017-06-21 11:50:33 -07:00
Yann Collet	e51d51bdf7	fixed memcpy() overlap	2017-06-20 17:44:55 -07:00
Yann Collet	466f92eaa6	removed one useless streaming compression stage, detected by @terrelln	2017-06-20 16:25:29 -07:00
Yann Collet	c3bce24ef4	fixed potential dangling pointer, detected by @terrelln	2017-06-20 16:09:11 -07:00
Yann Collet	b44ab82f7a	ensure new ZSTD_strategy starts at value 1	2017-06-20 14:11:49 -07:00
Yann Collet	c08e649e95	first implementation of bench.c with new API ZSTD_compress_generic() Doesn't speed optimize this buffer-to-buffer scenario yet. Still internally defers to streaming implementation. Also : fixed a long standing bug in ZSTDMT streaming API.	2017-06-19 18:25:35 -07:00
Nick Terrell	55f9cd4942	[libzstd] Fix UBSAN failure	2017-06-19 15:12:28 -07:00

... 2 3 4 5 6 ...

664 Commits