AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Yann Collet	2108decb41	Fixed a nasty corruption bug recently introduce into the new dictionary mode. The bug could be reproduced with this command : ./zstreamtest -v --opaqueapi --no-big-tests -s4092 -t639 error was in function ZSTD_count_2segments() : the beginning of the 2nd segment corresponds to prefixStart and not the beginning of the current block (istart == src). This would result in comparing the wrong byte.	2018-06-01 18:54:34 -07:00
Yann Collet	d59cf02df0	decompress: changed error code when input is too large ZSTD_decompress() can decompress multiple frames sent as a single input. But the input size must be the exact sum of all compressed frames, no more. In the case of a mistake on srcSize, being larger than required, ZSTD_decompress() will try to decompress a new frame after current one, and fail. As a consequence, it will issue an error code, ERROR(prefix_unknown). While the error is technically correct (the decoder could not recognise the header of _next_ frame), it's confusing, as users will believe that the first header of the first frame is wrong, which is not the case (it's correct). It makes it more difficult to understand that the error is in the source size, which is too large. This patch changes the error code provided in such a scenario. If (at least) a first frame was successfully decoded, and then following bytes are garbage values, the decoder assumes the provided input size is wrong (too large), and issue the error code ERROR(srcSize_wrong).	2018-05-14 15:32:28 -07:00
Yann Collet	8be984ec45	fixed comments as suggested by @terrelln	2018-03-30 20:09:27 -07:00
Yann Collet	e6e848bfe9	added ZSTD_getFrameHeader_advanced() makes it possible to request frame header from a magicless frame	2018-03-29 17:51:08 -06:00
Yann Collet	e0cb8d19c6	fixed legacy test case	2018-03-20 17:48:22 -07:00
Yann Collet	6cda8c932c	added test with ZSTD_decompress_generic() + ZSTD_DCtx_refPrefix() also : clarified stage condition to accept new parameters, fixed initializers correspondingly.	2018-03-20 16:16:13 -07:00
Yann Collet	0dadb6b70d	implemented ZSTD_DCtx_refPrefix*()	2018-03-20 15:45:56 -07:00
Yann Collet	569b8ba4d9	implemented ZSTD_DCtx_refDDict()	2018-03-20 15:43:49 -07:00
Yann Collet	6873fec658	changed dictMore for dictContentType which seems clearer to describe what the variable/argument is about.	2018-03-20 15:13:14 -07:00
Yann Collet	31b54b6eea	updated ZSTD_initStaticDDict() prototype can also specify dictContentType.	2018-03-20 14:52:02 -07:00
Yann Collet	353117c5d7	implemented ZSTD_DCtx_loadDictionary*() this required updating ZSTD_createDDict_advanced() to accept a dictContentType parameter (raw, full, auto).	2018-03-20 13:40:29 -07:00
Yann Collet	fe321f9e2a	re-integrate ZSTD_decompressSequencesLong() into zstd_decompress.c removed zstd_decompress_impl.h	2018-03-09 19:48:06 -08:00
Yann Collet	89a2ebb971	incorporated ZSTD_decompressSequences() into zstd_decompress()	2018-03-09 19:35:57 -08:00
Yann Collet	cdb1f1433e	incorporated ZSTD_initFseState() inside zstd_decompress.c	2018-03-09 18:16:10 -08:00
Yann Collet	a166eae1ba	incorporate ZSTD_decodeSequenceLong() within zstd_decompress.c	2018-03-09 18:11:14 -08:00
Yann Collet	17626ba56e	restored ZSTD_decodeSequence() into zstd_decompress.c	2018-03-09 18:03:25 -08:00
Yann Collet	db147ea620	improved comments following @terrelln suggestions	2018-03-06 18:15:26 -08:00
Yann Collet	06ca9c7d7c	fixed 0-seq blocks in block-decompression mode	2018-03-06 01:50:19 -08:00
Yann Collet	9a91afe6ef	long offset mode : new default threshold for 32-bit	2018-03-05 16:41:08 -08:00
Yann Collet	7bd7a3ad43	long offset mode : new default threshold for 64-bits mode	2018-03-05 16:16:49 -08:00
Yann Collet	c0393a538f	fixed counting long distance weights	2018-03-05 15:12:10 -08:00
Yann Collet	cb789d2df8	re-inserted offset evaluation	2018-03-05 13:08:59 -08:00
Yann Collet	b91ddf0ae6	Merge branch 'dev' into longOffsetMode	2018-03-05 11:59:54 -08:00
Nick Terrell	6e128d3534	[BMI2] Add comments to the bmi2 variable in the contexts	2018-02-20 14:12:11 -08:00
Nick Terrell	4319132312	[decompress] Support BMI2	2018-02-13 17:00:15 -08:00
Yann Collet	2524cbd847	added code comment on how to generate default tables as suggested by @terrelln	2018-02-13 10:02:25 -08:00
Yann Collet	71c07966bb	added SEQSYMBOL_TABLE_SIZE() as suggested by @terrelln's comment	2018-02-12 16:52:15 -08:00
Yann Collet	04a3f85ce7	fixed gcc warning on a switch code path	2018-02-09 16:16:27 -08:00
Yann Collet	af48f0b62b	fix : offset table pointer when using default table	2018-02-09 15:15:46 -08:00
Yann Collet	426944c3e3	fixed strict aliasing issue tuned threshold	2018-02-09 13:24:11 -08:00
Yann Collet	64ee732694	decide long-offset mode based on offcode statistics threshold vaguely estimated	2018-02-09 12:33:28 -08:00
Yann Collet	6bfe50ad48	re-enabled ZSTD_decompressSequencesLong()	2018-02-09 09:14:25 -08:00
Yann Collet	1850597eaa	pre-calculated default decoding tables	2018-02-09 06:01:02 -08:00
Yann Collet	ab75df21ed	fixed mono-symbol distribution	2018-02-09 05:12:13 -08:00
Yann Collet	421a2716d8	fixed default fse distributions but would be better to pre-calculate tables, for speed	2018-02-09 04:50:58 -08:00
Yann Collet	95424409ea	addBits and baseline into FSE decoding table note : unfinished - need new default tables - need modify long mode	2018-02-09 04:25:15 -08:00
Yann Collet	0170cf9a7a	minor : modified ZSTD_preserveUnsortedMark() to be more vectorization friendly	2018-02-05 11:46:02 -08:00
Yann Collet	94efb1749d	faster decoding in 32-bits mode for long offsets (tentative) On my laptop: Before: ./zstd32 -b --zstd=wlog=27 silesia.tar enwik8 -S 3#silesia.tar : 211984896 -> 66683478 (3.179), 97.6 MB/s , 400.7 MB/s 3#enwik8 : 100000000 -> 35643153 (2.806), 76.5 MB/s , 303.2 MB/s After: ./zstd32 -b --zstd=wlog=27 silesia.tar enwik8 -S 3#silesia.tar : 211984896 -> 66683478 (3.179), 97.4 MB/s , 435.0 MB/s 3#enwik8 : 100000000 -> 35643153 (2.806), 76.2 MB/s , 338.1 MB/s Mileage vary, depending on file, and cpu type. But a generic rule is : x86 benefits less from "long-offset mode" than x64, maybe due to register pressure. On "entropy", long-mode is _never_ a win for x86. On my laptop though, it may, depending on file and compression level (enwik8 benefits more from "long-mode" than silesia).	2018-02-04 01:49:31 -08:00
Yann Collet	4f43ef731d	Merge branch 'dev' into constCDict	2018-01-18 13:36:43 -08:00
Yann Collet	f3b8f90b6d	changed initStatic?Dict() return type to const ZSTD_?Dict* ZSTD_create?Dict() is required to produce a ?Dict* return type because `free()` does not accept a `const type` argument. If it wasn't for this restriction, I would have preferred to create a `const ?Dict` object to emphasize the fact that, once created, a dictionary never changes (hence can be shared concurrently until the end of its lifetime). There is no such limitation with initStatic?Dict() : as stated in the doc, there is no corresponding free() function, since `workspace` is provided, hence allocated, externally, it can only be free() externally. Which means, ZSTD_initStatic?Dict() can return a `const ZSTD_?Dict*` pointer. Tested with `make all`, to catch initStatic's users, which, incidentally, also updated zstd.h documentation.	2018-01-17 14:08:48 -08:00
Yann Collet	2e23333094	ZSTDMT can now work in non-blocking mode with 1 thread it still fallbacks to single-thread blocking invocation when input is small (<1job) or when invoking ZSTDMT_compress(), which is blocking. Also : fixed a bug in new block-granular compression routine.	2018-01-16 15:28:43 -08:00
Yann Collet	f597f55675	improved btlazy2 : list of unsorted candidates can reach extDict It used to stop on reaching extDict, for simplification. As a consequence, there was a small loss of performance each time the round buffer would restart from beginning. It's not a large difference though, just several hundreds of bytes on silesia. This patch fixes it.	2017-12-30 15:12:59 +01:00
Yann Collet	03832b7aa5	re-added test case messing with revert ... :(	2017-12-12 14:01:54 -08:00
Yann Collet	8a104fda05	Revert "Created a test case which reliably reproduces bug #944 " This reverts commit `5098d1fbe2`.	2017-12-12 12:51:49 -08:00
Yann Collet	5098d1fbe2	Created a test case which reliably reproduces bug #944 in zstreamtest.	2017-12-12 12:48:31 -08:00
Yann Collet	23767e950a	fix one UB pointer arithmetic in encoder Instead of calculating distance between 2 memory objects, which is UB, we extract the offset from object 1, and transfer it into object 2.	2017-11-17 13:24:51 -08:00
Yann Collet	cdade555ee	fixed one UB pointer arithmetic	2017-11-17 11:40:08 -08:00
Nick Terrell	1fc4f593da	Allow skippable frames of any size	2017-11-01 13:07:26 -07:00
Yann Collet	7f6a783862	fixed a small error in decodeCorpus a compressed block must be strictly smaller than its decompressed size.	2017-10-07 15:19:52 -07:00
Yann Collet	54a827fff0	Merge branch 'dev' into newFormats Fixed conflicts in zstdmt_compress.c	2017-09-27 16:39:40 -07:00

1 2 3 4 5 ...

336 Commits