AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Bimba Shrestha	936aa63ff1	adding oversized check on decompression	2020-04-03 13:25:32 -07:00
Bimba Shrestha	05574ec141	adding oversizeDuration to dctx and macros	2020-04-03 13:08:29 -07:00
Carl Woffenden	7c420344d2	Single-file decoder script can now (optionally) create an encoder To complement the single-file decoder a new script was added to create an amalgamated single-file of all of the Zstd source, along with examples and (simple) tests.	2020-04-03 19:07:46 +02:00
Carl Woffenden	7202184ee0	Fixes decompressor when using -Wshorten-64-to-32 (#2062 ) Spotted on iOS when building with `-Wshorten-64-to-32` (since `__builtin_expect` returns a `long`).	2020-04-03 02:55:29 -07:00
Nick Terrell	ac58c8d720	Fix copyright and license lines * All copyright lines now have -2020 instead of -present * All copyright lines include "Facebook, Inc" * All licenses are now standardized The copyright in `threading.{h,c}` is not changed because it comes from zstdmt. The copyright and license of `divsufsort.{h,c}` is not changed.	2020-03-26 17:02:06 -07:00
Nick Terrell	8d0ee37ac0	Align decompress sequences loop to 32+16 bytes The alignment is added before the loop, so this shouldn't hurt performance in any case. The only way it hurts is if there is already performance instability, and we force it to be stable but in the bad case. This consistently gets us into the good case with gcc-{7,8,9} on an Intel i9-9900K and clang-9. gcc-5 is 5% worse than its best case but has stable performance. We get consistently good behavior on my Macbook Pro compiled with both clang and gcc-8. It ends up in the 50% from DSB and 50% from MITE case, but the performance is the same as the 85% DSB case, so thats fine.	2020-03-23 19:40:31 -07:00
Nick Terrell	7627759b4e	Merge pull request #1972 from terrelln/check-cont Move ZSTD_checkContinuity() to zstd_decompress_block.c	2020-01-23 22:02:50 -08:00
Nick Terrell	fa6a772f38	Initialize dctx->bType to silence valgrind false positive	2020-01-23 17:54:48 -08:00
Nick Terrell	cb2abc3dbe	Fix performance regression on aarch64 with clang	2020-01-23 17:31:14 -08:00
Nick Terrell	6e3cd5b024	Move ZSTD_checkContinuity() to zstd_decompress_block.c	2020-01-23 12:27:39 -08:00
Nick Terrell	036b30b555	Fix super block compression and stream raw blocks in decompression (#1947 ) Super blocks must never violate the zstd block bound of input_size + ZSTD_blockHeaderSize. The individual sub-blocks may, but not the super block. If the superblock violates the block bound we are liable to violate ZSTD_compressBound(), which we must not do. Whenever the super block violates the block bound we instead emit an uncompressed block. This means we increase the latency because of the single uncompressed block. I fix this by enabling streaming an uncompressed block, so the latency of an uncompressed block is 1 byte. This doesn't reduce the latency of the buffer-less API, but I don't think we really care. * I added a test case that verifies that the decompression has 1 byte latency. * I rely on existing zstreamtest / fuzzer / libfuzzer regression tests for correctness. During development I had several correctness bugs, and they easily caught them. * The added assert that the superblock doesn't violate the block bound will help us discover any missed conditions (though I think I got them all). Credit to OSS-Fuzz.	2020-01-10 18:02:11 -08:00
Igor Sugak	03ffda7b88	fix UBSAN's invalid-null-argument error in zstd_decompress.c (#1939 )	2020-01-08 16:17:42 -08:00
Nick Terrell	718f00ff6f	Optimize decompression speed for gcc and clang (#1892 ) * Optimize `ZSTD_decodeSequence()` * Optimize Huffman decoding * Optimize `ZSTD_decompressSequences()` * Delete `ZSTD_decodeSequenceLong()`	2019-11-25 18:26:19 -08:00
Nick Terrell	659e9f05cf	Fix null pointer addition	2019-11-20 18:36:04 -08:00
Nick Terrell	e0d6daabac	Fix Appveyor failure	2019-11-19 11:12:26 -08:00
Nick Terrell	6a7f65117e	Merge pull request #1866 from legrosbuffle/dev Optimized loop bounds to allow the compiler to unroll the loop.	2019-11-18 16:16:30 -08:00
Clement Courbet	b3c9fc27b4	Optimized loop bounds to allow the compiler to unroll the loop. This has no measurable impact on large files but improves small file decompression by ~1-2% for 10kB, benchmarked with: head -c 10000 silesia.tar > /tmp/test make CC=/usr/local/bin/clang-9 BUILD_STATIC=1 && ./lzbench -ezstd -t1,5 /tmp/test	2019-11-15 08:27:05 +01:00
Sen Huang	c787b351ea	Use ZSTD Error codes, improve explanation of ZSTD_loadCEntropy() and ZSTD_loadDEntropy()	2019-11-08 13:57:26 -05:00
Sen Huang	4b141b63e0	Revert "Move decompress symbols into zstd_internal.h, remove dependency" This reverts commit a152b4c67a5266f611db4a2eac4a79003852a795.	2019-11-08 13:57:26 -05:00
Sen Huang	84404cff6e	Move decompress symbols into zstd_internal.h, remove dependency	2019-11-08 13:57:26 -05:00
Nick Terrell	60205fec02	Fix 2 bugs in dictionary loading * Silently skip dictionaries less than 8 bytes, unless using `ZSTD_dct_fullDict`. This changes the compressor, which silently skips dictionaries <= 8 bytes. * Allow repcodes that are equal to the dictionary content size, since it is in bounds.	2019-11-01 16:52:07 -07:00
Nick Terrell	9c1860861e	Fix assert in ZSTD_safecopy In the case that `op >= oend_w` it is possible that `diff < 8` because the two buffers could be adjacent. Credit to OSS-Fuzz, which found the bug. It isn't reproducible because it depends on the memory layout.	2019-10-28 17:51:17 -07:00
Nick Terrell	b1ec94e63c	Fix ZSTD_f_zstd1_magicless for small data * Fix `ZSTD_FRAMEHEADERSIZE_PREFIX` and `ZSTD_FRAMEHEADERSIZE_MIN` to take a `format` parameter, so it is impossible to get the wrong size. * Fix the places that called `ZSTD_FRAMEHEADERSIZE_PREFIX` without taking the format into account, which is now impossible by design. * Call `ZSTD_frameHeaderSize_internal()` with `dctx->format`. * The added tests catch both bugs in `ZSTD_decompressFrame()`. Fixes #1813.	2019-10-21 21:16:17 -07:00
Nick Terrell	ddab2a94e8	Pass iend into ZSTD_storeSeq() to allow ZSTD_wildcopy()	2019-09-20 00:56:20 -07:00
Nick Terrell	cdad7fa512	Widen ZSTD_wildcopy to 32 bytes	2019-09-20 00:52:15 -07:00
Nick Terrell	efd37a64ea	Optimize decompression and fix wildcopy overread * Bump `WILDCOPY_OVERLENGTH` to 16 to fix the wildcopy overread. * Optimize `ZSTD_wildcopy()` by removing unnecessary branches and unrolling the loop. * Extract `ZSTD_overlapCopy8()` into its own function. * Add `ZSTD_safecopy()` for `ZSTD_execSequenceEnd()`. It is optimized for single long sequences, since that is the important case that can end up in `ZSTD_execSequenceEnd()`. Without this optimization, decompressing a block with 1 long match goes from 5.7 GB/s to 800 MB/s. * Refactor `ZSTD_execSequenceEnd()`. * Increase the literal copy shortcut to 16. * Add a shortcut for offset >= 16. * Simplify `ZSTD_execSequence()` by pushing more cases into `ZSTD_execSequenceEnd()`. * Delete `ZSTD_execSequenceLong()` since it is exactly the same as `ZSTD_execSequence()`. clang-8 seeds +17.5% on silesia and +21.8% on enwik8. gcc-9 sees +12% on silesia and +15.5% on enwik8. TODO: More detailed measurements, and on more datasets. Crdit to OSS-Fuzz for finding the wildcopy overread.	2019-09-19 21:07:14 -07:00
mgrice	b830599582	Improvements in zstd decode performance Summary: The idea behind wildcopy is that it can be cheaper to copy more bytes (say 8) than it is to copy less (say, 3). This change takes that further by exploiting some properties: 1. it's almost always OK to copy 16 bytes instead of 8, which means fewer copy instructions, and fewer branches 2. A 16 byte chunk size means that ~90% of wildcopy invocations will have a trip count of 1, so branch prediction will be improved. Speedup on Xeon E5-2680v4 is in the range of 3-5%. Measured wildcopy length distributions on silesia.tar: level <=8 <=16 <=24 >24 1 78.05% 11.49% 3.52% 6.94% 3 82.14% 8.99% 2.44% 6.43% 6 85.81% 6.51% 2.92% 4.76% 8 83.02% 7.31% 3.64% 6.03% 10 84.13% 6.67% 3.29% 5.91% 15 77.58% 7.55% 5.21% 9.66% 16 80.07% 7.20% 3.98% 8.75% Test Plan: benchmark silesia, make check	2019-08-29 12:25:56 -07:00
Carl Woffenden	901ea61f83	Tweaks to create a single-file decoder The CHECK_F macros differ slightly (but eventually do the same thing). Older GCC needs to fallback on the old-style pragma optimisation flags.	2019-08-21 17:49:17 +02:00
Yann Collet	0b0b83e8f3	fix test 122 it's an unsupported scenario.	2019-08-03 16:51:26 +02:00
Vivek Miglani	c7be7d2efb	Fixing compressed block size checks	2019-07-17 12:53:15 -07:00
Vivek Miglani	3f108f82fb	Return error if block size exceeds maximum	2019-07-15 12:10:21 -07:00
mgrice	812e8f2a16	perf improvements for zstd decode (#1668 ) * perf improvements for zstd decode tldr: 7.5% average decode speedup on silesia corpus at compression levels 1-3 (sandy bridge) Background: while investigating zstd perf differences between clang and gcc I noticed that even though gcc is vectorizing the loop in in wildcopy, it was not being done as well as could be done by hand. The sites where wildcopy is invoked have an interesting distribution of lengths to be copied. The loop trip count is rarely above 1, yet long copies are common enough to make their performance important.The code in zstd_decompress.c to invoke wildcopy handles the latter well but the gcc autovectorizer introduces a needlessly expensive startup check for vectorization. See how GCC autovectorizes the loop here: https://godbolt.org/z/apr0x0 Here is the code after this diff has been applied: (left hand side is the good one, right is with vectorizer on) After: https://godbolt.org/z/OwO4F8 Note that autovectorization still does not do a good job on the optimized version, so it's turned off\ via attribute and flag. I found that neither attribute nor command-line flag were entirely successful in turning off vectorization, which is why there were both. silesia benchmark data - second triad of each file is with the original code: file orig compressedratio encode decode change 1#dickens 10192446-> 4268865(2.388), 198.9MB/s 709.6MB/s 2#dickens 10192446-> 3876126(2.630), 128.7MB/s 552.5MB/s 3#dickens 10192446-> 3682956(2.767), 104.6MB/s 537MB/s 1#dickens 10192446-> 4268865(2.388), 195.4MB/s 659.5MB/s 7.60% 2#dickens 10192446-> 3876126(2.630), 127MB/s 516.3MB/s 7.01% 3#dickens 10192446-> 3682956(2.767), 105MB/s 479.5MB/s 11.99% 1#mozilla 51220480-> 20117517(2.546), 285.4MB/s 734.9MB/s 2#mozilla 51220480-> 19067018(2.686), 220.8MB/s 686.3MB/s 3#mozilla 51220480-> 18508283(2.767), 152.2MB/s 669.4MB/s 1#mozilla 51220480-> 20117517(2.546), 283.4MB/s 697.9MB/s 5.30% 2#mozilla 51220480-> 19067018(2.686), 225.9MB/s 665MB/s 3.20% 3#mozilla 51220480-> 18508283(2.767), 154.5MB/s 640.6MB/s 4.50% 1#mr 9970564-> 3840242(2.596), 262.4MB/s 899.8MB/s 2#mr 9970564-> 3600976(2.769), 181.2MB/s 717.9MB/s 3#mr 9970564-> 3563987(2.798), 116.3MB/s 620MB/s 1#mr 9970564-> 3840242(2.596), 253.2MB/s 827.3MB/s 8.76% 2#mr 9970564-> 3600976(2.769), 177.4MB/s 655.4MB/s 9.54% 3#mr 9970564-> 3563987(2.798), 111.2MB/s 564.2MB/s 9.89% 1#nci 33553445-> 2849306(11.78), 575.2MB/s , 1335.8MB/s 2#nci 33553445-> 2890166(11.61), 509.3MB/s , 1238.1MB/s 3#nci 33553445-> 2857408(11.74), 431MB/s , 1210.7MB/s 1#nci 33553445-> 2849306(11.78), 565.4MB/s , 1220.2MB/s 9.47% 2#nci 33553445-> 2890166(11.61), 508.2MB/s , 1128.4MB/s 9.72% 3#nci 33553445-> 2857408(11.74), 429.1MB/s , 1097.7MB/s 10.29% 1#ooffice 6152192-> 3590954(1.713), 231.4MB/s , 662.6MB/s 2#ooffice 6152192-> 3323931(1.851), 162.8MB/s , 592.6MB/s 3#ooffice 6152192-> 3145625(1.956), 99.9MB/s , 549.6MB/s 1#ooffice 6152192-> 3590954(1.713), 224.7MB/s , 624.2MB/s 6.15% 2#ooffice 6152192-> 3323931 (1.851), 155MB/s , 564.5MB/s 4.98% 3#ooffice 6152192-> 3145625(1.956), 101.1MB/s , 521.2MB/s 5.45% 1#osdb 10085684-> 3739042(2.697), 271.9MB/s 876.4MB/s 2#osdb 10085684-> 3493875(2.887), 208.2MB/s 857MB/s 3#osdb 10085684-> 3515831(2.869), 135.3MB/s 805.4MB/s 1#osdb 10085684-> 3739042(2.697), 257.4MB/s 793.8MB/s 10.41% 2#osdb 10085684-> 3493875(2.887), 209.7MB/s 776.1MB/s 10.42% 3#osdb 10085684-> 3515831(2.869), 130.6MB/s 727.7MB/s 10.68% 1#reymont 6627202-> 2152771(3.078), 198.9MB/s 696.2MB/s 2#reymont 6627202-> 2071140(3.200), 170MB/s 595.2MB/s 3#reymont 6627202-> 1953597(3.392), 128.5MB/s 609.7MB/s 1#reymont 6627202-> 2152771(3.078), 199.6MB/s 655.2MB/s 6.26% 2#reymont 6627202-> 2071140(3.200), 168.2MB/s 554.4MB/s 7.36% 3#reymont 6627202-> 1953597(3.392), 128.7MB/s 557.4MB/s 9.38% 1#samba 21606400-> 5510994(3.921), 338.1MB/s 1066MB/s 2#samba 21606400-> 5240208(4.123), 258.7MB/s 992.3MB/s 3#samba 21606400-> 5003358(4.318), 200.2MB/s 991.1MB/s 1#samba 21606400-> 5510994(3.921), 330.8MB/s 974MB/s 9.45% 2#samba 21606400-> 5240208(4.123), 257.9MB/s 919.4MB/s 7.93% 3#samba 21606400-> 5003358(4.318), 198.5MB/s 908.9MB/s 9.04% 1#sao 7251944-> 6256401(1.159), 194.6MB/s 602.2MB/s 2#sao 7251944-> 5808761(1.248), 128.2MB/s 532.1MB/s 3#sao 7251944-> 5556318(1.305), 73MB/s 509.4MB/s 1#sao 7251944-> 6256401(1.159), 198.7MB/s 580.7MB/s 3.70% 2#sao 7251944-> 5808761(1.248), 129.1MB/s 502.7MB/s 5.85% 3#sao 7251944-> 5556318(1.305), 74.6MB/s 493.1MB/s 3.31% 1#webster 41458703-> 13692222(3.028), 222.3MB/s 752MB/s 2#webster 41458703-> 12842646(3.228), 157.6MB/s 532.2MB/s 3#webster 41458703-> 12191964(3.400), 124MB/s 468.5MB/s 1#webster 41458703-> 13692222(3.028), 219.7MB/s 697MB/s 7.89% 2#webster 41458703-> 12842646(3.228), 153.9MB/s 495.4MB/s 7.43% 3#webster 41458703-> 12191964(3.400), 124.8MB/s 444.8MB/s 5.33% 1#xml 5345280-> 696652(7.673), 485MB/s , 1333.9MB/s 2#xml 5345280-> 681492(7.843), 405.2MB/s , 1237.5MB/s 3#xml 5345280-> 639057(8.364), 328.5MB/s , 1281.3MB/s 1#xml 5345280-> 696652(7.673), 473.1MB/s , 1232.4MB/s 8.24% 2#xml 5345280-> 681492(7.843), 398.6MB/s , 1145.9MB/s 7.99% 3#xml 5345280-> 639057(8.364), 327.1MB/s , 1175MB/s 9.05% 1#x-ray 8474240-> 6772557(1.251), 521.3MB/s 762.6MB/s 2#x-ray 8474240-> 6684531(1.268), 230.5MB/s 688.5MB/s 3#x-ray 8474240-> 6166679(1.374), 68.7MB/s 478.8MB/s 1#x-ray 8474240-> 6772557(1.251), 502.8MB/s 736.7MB/s 3.52% 2#x-ray 8474240-> 6684531(1.268), 224.4MB/s 662MB/s 4.00% 3#x-ray 8474240-> 6166679(1.374), 67.3MB/s 437.8MB/s 9.37% 7.51% * makefile changed to only pass -fno-tree-vectorize to gcc * <Replace this line with a title. Use 1 line only, 67 chars or less> Don't add "no-tree-vectorize" attribute on clang (which defines __GNUC__) * fix for warning/error with subtraction of void* pointers * fix c90 conformance issue - ISO C90 forbids mixed declarations and code * Fix assert for negative diff, only when there is no overlap * fix overflow revealed in fuzzing tests * tweak for small speed increase	2019-07-11 18:31:07 -04:00
Ephraim Park	c7c1ba3a19	Fix a constraint stricter than the spec	2019-06-26 16:43:37 -07:00
Nick Terrell	5f228f8db2	[libzstd] Add a ZSTD_STATIC_ASSERT for BIT_DStream_status	2019-04-23 14:22:16 -07:00
Nick Terrell	a892e25374	[libzstd] Error if all sequence bits aren't consumed	2019-04-23 14:07:36 -07:00
Nick Terrell	ee130a9889	[libzstd] Check the size in readSkippableFrameSize()	2019-04-17 11:41:55 -07:00
Nick Terrell	450feb0f95	[libzstd] Fix ZSTD_decompressBound() on bad skippable frames The function didn't verify that the skippable frame size is correct.	2019-04-17 11:29:42 -07:00
Josh Soref	a880ca239b	Spelling (#1582 ) * spelling: accidentally * spelling: across * spelling: additionally * spelling: addresses * spelling: appropriate * spelling: assumed * spelling: available * spelling: builder * spelling: capacity * spelling: compiler * spelling: compressibility * spelling: compressor * spelling: compression * spelling: contract * spelling: convenience * spelling: decompress * spelling: description * spelling: deflate * spelling: deterministically * spelling: dictionary * spelling: display * spelling: eliminate * spelling: preemptively * spelling: exclude * spelling: failure * spelling: independence * spelling: independent * spelling: intentionally * spelling: matching * spelling: maximum * spelling: meaning * spelling: mishandled * spelling: memory * spelling: occasionally * spelling: occurrence * spelling: official * spelling: offsets * spelling: original * spelling: output * spelling: overflow * spelling: overridden * spelling: parameter * spelling: performance * spelling: probability * spelling: receives * spelling: redundant * spelling: recompression * spelling: resources * spelling: sanity * spelling: segment * spelling: series * spelling: specified * spelling: specify * spelling: subtracted * spelling: successful * spelling: return * spelling: translation * spelling: update * spelling: unrelated * spelling: useless * spelling: variables * spelling: variety * spelling: verbatim * spelling: verification * spelling: visited * spelling: warming * spelling: workers * spelling: with	2019-04-12 11:18:11 -07:00
Nick Terrell	aafe97b67d	[libzstd] Switch dictUses to an enum	2019-04-10 16:50:35 -07:00
Nick Terrell	50b9c41196	[libzstd] Fix decompression dictionary bugs and clean up initialization Bugs: * `ZSTD_DCtx_refPrefix()` didn't clear the dictionary after the first use. Fix and add a test case. * `ZSTD_DCtx_reset()` always cleared the dictionary. Fix and add a test case. * After calling `ZSTD_resetDStream()` you could no longer load a dictionary, since the stage was set to `zdss_loadHeader`. Fix and add a test case. Cleanup: * Make `ZSTD_initDStream()` and `ZSTD_resetDStream()` wrap the new advanced API, and add test cases. Document the equivalent of these functions in the advanced API and document the unstable functions as deprecated.	2019-04-10 12:59:02 -07:00
Nick Terrell	824aaa695f	[libzstd] Fix ZSTD_decompressDCtx() with a dictionary * `ZSTD_decompressDCtx()` did not use the dictionary loaded by `ZSTD_DCtx_loadDictionary()`. * Add a unit test. * A stacked diff uses `ZSTD_decompressDCtx()` in the `dictionary_round_trip` and `dictionary_decompress` fuzzers.	2019-04-09 17:59:27 -07:00
Nick Terrell	bfcd5b81d7	[libzstd] Don't check the dictID in fuzzing mode When `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` is defined don't check the dictID. This check makes the fuzzers job harder, and it is at the very beginning.	2019-04-08 19:57:41 -07:00
Nick Terrell	00679da22b	[libzstd] Setting ZSTD_d_maxWindowLog to 0 means default	2019-04-02 19:20:52 -07:00
shakeelrao	0a3fa6f909	Add legacy mode in documentation	2019-03-18 20:33:15 -07:00
shakeelrao	20aa1b455c	Stylistic changes	2019-03-17 19:35:43 -07:00
shakeelrao	60796e76b0	Add legacy support to decompressBound	2019-03-15 16:10:37 -07:00
shakeelrao	79827a179f	Fix incorrectly assigned value in ZSTD_errorFrameSizeInfo As documented in `zstd.h`, ZSTD_decompressBound returns `ZSTD_CONTENTSIZE_ERROR` if an error occurs (not `ZSTD_CONTENTSIZE_UNKNOWN`). This is consistent with the error checking made in ZSTD_decompressBound, particularly line 545.	2019-03-13 01:23:07 -07:00
shakeelrao	95dfd48143	update formatting	2019-03-01 23:11:15 -08:00
shakeelrao	1e08c49f75	add stylistic changes	2019-03-01 18:29:35 -08:00
shakeelrao	2bb5eec711	update missing error case to CONTENTSIZE_ERROR	2019-03-01 00:12:16 -08:00
shakeelrao	44ae395b3e	change nbBlocks to size_t for consistency	2019-03-01 00:05:59 -08:00
shakeelrao	03026c3b1d	change compressedBound to ULL	2019-03-01 00:03:50 -08:00
shakeelrao	8930c3c79b	implement API-level changes	2019-02-28 22:55:18 -08:00
shakeelrao	dce9a09772	initialize local vars in decompressBound	2019-02-28 03:01:21 -08:00
shakeelrao	515c506b4c	switch frameBound type to ULL	2019-02-28 02:10:17 -08:00
shakeelrao	d0a3f25697	change return type to ULL	2019-02-28 01:52:01 -08:00
shakeelrao	c9d674b60d	Remove autogenerated test file	2019-02-28 01:29:04 -08:00
shakeelrao	97d3d28dab	Fix decl-after-stmnt build error	2019-02-28 01:24:54 -08:00
shakeelrao	820af1e078	Provide an API function to estimate decompressed size. Introduces a new utility function `ZSTD_findFrameCompressedSize_internal` which is equivalent to `ZSTD_findFrameCompressSize`, but accepts an additional output parameter `bound` that computes an upper-bound for the compressed data in the frame. The new API function is named `ZSTD_decompressBound` to be consistent with `zstd_compressBound` (the inverse operation). Clients will now be able to compute an upper-bound for their compressed payloads instead of guessing a large size. Implements https://github.com/facebook/zstd/issues/1536.	2019-02-28 00:42:49 -08:00
W. Felix Handte	501eb25102	Rename FORWARD_ERROR -> FORWARD_IF_ERROR	2019-01-29 12:56:07 -05:00
W. Felix Handte	03e040a966	Replace Uses of CHECK_E with RETURN_ERROR_IF(*_isError(...	2019-01-28 17:33:01 -05:00
W. Felix Handte	64bb6640f2	Replace CHECK_F Uses in zstdmt_compress.c and zstd_ddict.c	2019-01-28 17:15:57 -05:00
W. Felix Handte	32fed9c7be	Switch CHECK_F Calls to FORWARD_ERROR	2019-01-28 12:45:34 -05:00
W. Felix Handte	800c87fed0	Switch Unconditional RETURN_ERROR_IF Calls to RETURN_ERROR	2019-01-28 12:45:34 -05:00
W. Felix Handte	c823237d7b	Convert Checks in zstd_decompress.c to RETURN_ERROR_IF	2019-01-28 12:23:14 -05:00
W. Felix Handte	ea031f4ea2	Convert Checks in zstd_decompress_block.c to RETURN_ERROR_IF	2019-01-28 11:56:39 -05:00
Yann Collet	ededcfca57	fix confusion between unsigned <-> U32 as suggested in #1441. generally U32 and unsigned are the same thing, except when they are not ... case : 32-bit compilation for MIPS (uint32_t == unsigned long) A vast majority of transformation consists in transforming U32 into unsigned. In rare cases, it's the other way around (typically for internal code, such as seeds). Among a few issues this patches solves : - some parameters were declared with type `unsigned` in .h, but with type `U32` in their implementation .c . - some parameters have type unsigned*, but the caller user a pointer to U32 instead. These fixes are useful. However, the bulk of changes is about %u formating, which requires unsigned type, but generally receives U32 values instead, often just for brevity (U32 is shorter than unsigned). These changes are generally minor, or even annoying. As a consequence, the amount of code changed is larger than I would expect for such a patch. Testing is also a pain : it requires manually modifying `mem.h`, in order to lie about `U32` and force it to be an `unsigned long` typically. On a 64-bit system, this will break the equivalence unsigned == U32. Unfortunately, it will also break a few static_assert(), controlling structure sizes. So it also requires modifying `debug.h` to make `static_assert()` a noop. And then reverting these changes. So it's inconvenient, and as a consequence, this property is currently not checked during CI tests. Therefore, these problems can emerge again in the future. I wonder if it is worth ensuring proper distinction of U32 != unsigned in CI tests. It's another restriction for coding, adding more frustration during merge tests, since most platforms don't need this distinction (hence contributor will not see it), and while this can matter in theory, the number of platforms impacted seems minimal. Thoughts ?	2018-12-21 18:09:41 -08:00
W. Felix Handte	91b7309115	Mask Off Unused Functions When ZSTD_FORCE_DECOMPRESS_SEQUENCES_LONG	2018-12-20 12:20:34 -08:00
W. Felix Handte	038aabde28	Mask Off Unused Functions When ZSTD_FORCE_DECOMPRESS_SEQUENCES_SHORT	2018-12-20 12:15:07 -08:00
W. Felix Handte	0d606ee3db	Fix Incorrect assert()	2018-12-18 13:36:39 -08:00
W. Felix Handte	c2d51637d9	Add Mutual-Exclusion Error	2018-12-18 13:36:39 -08:00
W. Felix Handte	c560e34c86	Add HUF_FORCE_DECOMPRESS_X2	2018-12-18 13:36:39 -08:00
W. Felix Handte	abd1567d3c	Move HUF_DGEN Up Out of X1 Definitions	2018-12-18 13:36:39 -08:00
W. Felix Handte	4a0572b215	Refactor Huffman Decompression Away From Ternary Tree in ZSTD_decodeLiteralsBlock	2018-12-18 13:36:39 -08:00
W. Felix Handte	432314b58a	Rename HUF_DECOMPRESS_MINIMAL -> HUF_FORCE_DECOMPRESS_X1	2018-12-18 13:36:39 -08:00
W. Felix Handte	4bbb8a48ad	Add ZSTD_FORCE_DECOMPRESS_SEQUENCES_LONG This macro forces behavior in the opposite direction.	2018-12-18 13:36:39 -08:00
W. Felix Handte	64553a0e35	Rename ZSTD_DECOMPRESS_MINIMAL -> ZSTD_FORCE_DECOMPRESS_SEQUENCES_SHORT	2018-12-18 13:36:39 -08:00
W. Felix Handte	df28e5babd	Add ZSTD_DECOMPRESS_MINIMAL Macro, Which Reduces Branching of Decompress Variants	2018-12-18 13:36:39 -08:00
W. Felix Handte	f45c9df42e	Totally Hide/Disable X2 Variants when HUF_DECOMPRESS_MINIMAL is Defined	2018-12-18 13:36:39 -08:00
W. Felix Handte	36a84b07a8	Load Dictionaries as X1 Tables	2018-12-18 13:36:39 -08:00
W. Felix Handte	f9cb348776	Add HUF_DECOMPRESS_MINIMAL Macro, Which Avoids Using X2 Variants	2018-12-18 13:36:39 -08:00
Nick Terrell	aaea4ef924	[libzstd] Fix infinite loop in decompression When we switched `ZSTD_SKIPPABLEHEADERSIZE` to a macro, the places where we do: MEM_readLE32(ptr) + ZSTD_SKIPPABLEHEADERSIZE can now overflow `(unsigned)-8` to `0` and we infinite loop. We now check the frame size and reject sizes that overflow a U32. Note that this bug never made it into a release, and was only in the dev branch for a few days. Credit to OSS-Fuzz	2018-12-13 15:13:19 -08:00
Yann Collet	d7da3fc90a	merge dedicated dParam setters	2018-12-04 17:06:48 -08:00
Yann Collet	4b5a4f02d7	write the switch()case: differently so that it please both compilers which warn for dead code after the switch and compilers which do not detect that all branches terminate.	2018-12-04 16:59:26 -08:00
Yann Collet	85b02bf142	fixed silent conversion warning	2018-12-04 15:57:16 -08:00
Yann Collet	aec945f0dc	implemented ZSTD_dParam_getBounds() and ZSTD_DCtx_setParameter()	2018-12-04 15:35:37 -08:00
Yann Collet	34e146f548	advanced decompression function replaces by normal streaming one advanced parameters compatible with ZSTD_decompressStream().	2018-12-04 10:28:36 -08:00
Yann Collet	114bd4346e	changed enum type name to ZSTD_ResetDirective for naming consistency : types should start with a capital letter (after prefix)	2018-11-20 12:00:20 -08:00
Yann Collet	5c68639186	updated ZSTD_DCtx_reset() signature and behavior is now the same as ZSTD_CCtx_reset()	2018-11-15 16:12:39 -08:00
Yann Collet	d7e10a774a	added constant ZSTD_WINDOWLOG_LIMIT_DEFAULT answering #1407. Also : removed obsolete function ZSTD_setDStreamParameter() which could only be used with one parameter (DStream_p_maxWindowSize). Now replaced by ZSTD_DCtx_setWindowSize() (which exists since a few revisions)	2018-11-13 18:12:34 -08:00
Yann Collet	2c8fde538f	added constant ZSTD_MAGIC_SKIPPABLE_MASK and updated several API comments	2018-11-13 17:36:35 -08:00
Yann Collet	b83d1e7714	removed some `static const` variables and replaced by traditional macro constants. Unfortunately, C doesn't consider `static const` to mean "constant"	2018-11-13 16:56:32 -08:00
Yann Collet	483759a3de	Improves decompression speed when using cold dictionary by triggering the prefetching decoder path (which used to be dedicated to long-range offsets only). Figures on my laptop : no content prefetch : ~300 MB/s (for reference) full content prefetch : ~325 MB/s (before this patch) new prefetch path : ~375 MB/s (after this patch) The benchmark speed is already significant, but another side-effect is that this version prefetch less data into memory, since it only prefetches what's needed, instead of the full dictionary. This is supposed to help highly active environments such as active databases, that can't be properly measured in benchmark environment (too clean). Also : fixed the largeNbDict test program which was working improperly when setting nbBlocks > nbFiles.	2018-11-08 17:00:23 -08:00
Yann Collet	20fb9e7f36	reduced assertion strength one limit case can apparently be generated during fuzzer tests	2018-11-08 12:57:34 -08:00
Yann Collet	9126da5b5c	improve long-range decoder speed on enwik9 at level 22 (which is almost a worst case scenario), speed improves by +7% on my laptop (415 -> 445 MB/s)	2018-11-08 12:47:46 -08:00
Yann Collet	8bed4012bd	fixed decompression-only benchmark	2018-11-08 12:36:39 -08:00
Yann Collet	acd75a1448	fixed a second memset() on NULL not sure why it only triggers now, this code has been around for a while. Introduced a new error code : dstBuffer_null, I couldn't express anything even remotely similar with existing error codes set.	2018-10-29 15:03:57 -07:00
Yann Collet	9c58098200	fixed memcpy() on NULL warning memcpy(NULL, src, 0) is undefined behavior.	2018-10-29 13:57:37 -07:00
Yann Collet	8d56f4baee	added a few comments for clarifications	2018-10-26 15:21:52 -07:00
Yann Collet	450356b5af	Merge branch 'dev' into decompressblock	2018-10-26 15:03:43 -07:00

1 2 3 4 5 ...

561 Commits