AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
W. Felix Handte	e5ef935cf6	Fix Variable Capitalization	2020-02-18 13:40:58 -05:00
W. Felix Handte	73737231b9	Allow Manual Overriding of pkg-config Lib and Include Dirs When the `PCLIBDIR` or `PCINCDIR` is non-empty (either because we succeeded in removing the prefix, or because it was manually set), we don't need to perform the check. This lets us trust users who go to the trouble of setting a manual override, rather than still blindly failing the make. They'll still be prefixed with `${prefix}/` / `${exec_prefix}/` in the pkg-config file though.	2020-02-18 13:17:17 -05:00
W. Felix Handte	e668c9b528	Fix pkg-config File Generation Again Revises #1851. Fixes #1900. Replaces #1930. Thanks to @orbea, @neheb, @Polynomial-C, and particularly @eli-schwartz for pointing out the problem and suggesting solutions. Tested with ``` make -C lib clean libzstd.pc cat lib/libzstd.pc # should fail make -C lib clean libzstd.pc LIBDIR=/foo make -C lib clean libzstd.pc INCLUDEDIR=/foo make -C lib clean libzstd.pc LIBDIR=/usr/localfoo make -C lib clean libzstd.pc INCLUDEDIR=/usr/localfoo make -C lib clean libzstd.pc LIBDIR=/usr/local/lib prefix=/foo make -C lib clean libzstd.pc INCLUDEDIR=/usr/local/include prefix=/foo # should succeed make -C lib clean libzstd.pc LIBDIR=/usr/local/foo make -C lib clean libzstd.pc INCLUDEDIR=/usr/local/foo make -C lib clean libzstd.pc LIBDIR=/usr/local/ make -C lib clean libzstd.pc INCLUDEDIR=/usr/local/ make -C lib clean libzstd.pc LIBDIR=/usr/local make -C lib clean libzstd.pc INCLUDEDIR=/usr/local make -C lib clean libzstd.pc LIBDIR=/tmp/foo prefix=/tmp make -C lib clean libzstd.pc INCLUDEDIR=/tmp/foo prefix=/tmp make -C lib clean libzstd.pc LIBDIR=/tmp/foo prefix=/tmp/foo make -C lib clean libzstd.pc INCLUDEDIR=/tmp/foo prefix=/tmp/foo # should also succeed make -C lib clean libzstd.pc prefix=/foo LIBDIR=/foo/bar INCLUDEDIR=/foo/ cat lib/libzstd.pc mkdir out cd out cmake ../build/cmake make cat lib/libzstd.pc ```	2020-02-18 12:23:50 -05:00
Bimba Shrestha	80c26117a9	Line-wrapping	2020-02-03 09:38:16 -08:00
Bimba Shrestha	ee8a712af3	Using appliedParams instead of supplied params	2020-01-31 15:49:07 -08:00
Nick Terrell	e32e3e8662	Improve wildcopy performance across the board	2020-01-28 20:37:04 -08:00
Nick Terrell	7627759b4e	Merge pull request #1972 from terrelln/check-cont Move ZSTD_checkContinuity() to zstd_decompress_block.c	2020-01-23 22:02:50 -08:00
Nick Terrell	fa6a772f38	Initialize dctx->bType to silence valgrind false positive	2020-01-23 17:54:48 -08:00
Nick Terrell	cb2abc3dbe	Fix performance regression on aarch64 with clang	2020-01-23 17:31:14 -08:00
Nick Terrell	6e3cd5b024	Move ZSTD_checkContinuity() to zstd_decompress_block.c	2020-01-23 12:27:39 -08:00
Nick Terrell	a11a9271d6	Fix lowLimit underflow in overflow correction	2020-01-17 12:10:18 -08:00
Nick Terrell	036b30b555	Fix super block compression and stream raw blocks in decompression (#1947 ) Super blocks must never violate the zstd block bound of input_size + ZSTD_blockHeaderSize. The individual sub-blocks may, but not the super block. If the superblock violates the block bound we are liable to violate ZSTD_compressBound(), which we must not do. Whenever the super block violates the block bound we instead emit an uncompressed block. This means we increase the latency because of the single uncompressed block. I fix this by enabling streaming an uncompressed block, so the latency of an uncompressed block is 1 byte. This doesn't reduce the latency of the buffer-less API, but I don't think we really care. * I added a test case that verifies that the decompression has 1 byte latency. * I rely on existing zstreamtest / fuzzer / libfuzzer regression tests for correctness. During development I had several correctness bugs, and they easily caught them. * The added assert that the superblock doesn't violate the block bound will help us discover any missed conditions (though I think I got them all). Credit to OSS-Fuzz.	2020-01-10 18:02:11 -08:00
Nick Terrell	d1cc9d2797	[fuzz] Allow zero sized buffers for streaming fuzzers (#1945 ) * Allow zero sized buffers in `stream_decompress`. Ensure that we never have two zero sized buffers in a row so we guarantee forwards progress. * Make case 4 in `stream_round_trip` do a zero sized buffers call followed by a full call to guarantee forwards progress. * Fix `limitCopy()` in legacy decoders. * Fix memcpy in `zstdmt_compress.c`. Catches the bug fixed in PR #1939	2020-01-09 11:38:50 -08:00
Igor Sugak	03ffda7b88	fix UBSAN's invalid-null-argument error in zstd_decompress.c (#1939 )	2020-01-08 16:17:42 -08:00
Bimba Shrestha	b1f53b1a10	[fuzz] Dividing by targetCBlockSize instead of blockSize for nbBlocks fit (#1936 ) * Adding fail logging for superblock flow * Dividing by targetCBlockSize instead of blockSize * Adding new const and using more acurate formula for nbBlocks * Only do dstCapacity check if using superblock * Remvoing disabling logic * Updating test to make it catch more extreme case of previou bug * Also updating comment * Only taking compressEnd shortcut on non-superblock	2020-01-03 16:53:51 -08:00
Bimba Shrestha	56415efc76	Constifying, malloc check and naming nit	2019-12-17 17:16:51 -08:00
Bimba Shrestha	5225dcfc0f	Adding bool to check if enough room left for noCompress superblocks	2019-12-13 15:47:28 -08:00
Yann Collet	d73e2fb465	Merge pull request #1891 from bimbashrestha/oss [fuzz] Superblock fuzz issues	2019-12-10 13:17:00 -08:00
Bimba Shrestha	e1913dc87f	Making const, removing unnecessary indent, changing parameter order	2019-12-04 15:51:17 -08:00
Bimba Shrestha	2ec556fec2	Moving init/end functions, moving compressSuperBlock inside body()	2019-12-04 15:23:13 -08:00
Bimba Shrestha	ffb0463041	Refactor	2019-12-04 14:52:27 -08:00
Bimba Shrestha	49c6d49247	[fuzz] msan uninitialized unsigned value (#1908 ) Fixes new fuzz issue Credit to OSS-Fuzz * Initializing unsigned value * Initialilzing to 1 instead of 0 because its more conservative * Unconditionoally setting to check first and then checking zero * Moving bool to before block for c90 * Move check set before block	2019-12-04 10:02:17 -08:00
Yann Collet	5120883a9c	bumped version number so that potential issue report do not confuse `dev` with latest release	2019-12-03 17:06:42 -08:00
Bimba Shrestha	1fc9352f81	Using bss var instead of creating new bool	2019-12-02 21:39:06 -08:00
Bimba Shrestha	1f681d8592	Merge branch 'oss' of https://github.com/bimbashrestha/zstd into oss	2019-11-27 10:56:54 -08:00
Bimba Shrestha	a3a3c62b81	[fuzz] Only set HUF_repeat_valid if loaded table has all non-zero weights (#1898 ) Fixes a fuzz issue where dictionary_round_trip failed because the compressor was generating corrupt files thanks to zero weights in the table. * Only setting loaded dict huf table to valid on non-zero * Adding hasNoZeroWeights test to fse tables * Forbiding nbBits != 0 when weight == 0 * Reverting the last commit * Setting table log to 0 when weight == 0 * Small (invalid) zero weight dict test * Small (valid) zero weight dict test * Initializing repeatMode vars to check before zero check * Removing FSE changes to seperate pr * Reverting accidentally changed file * Negating bool, using unsigned, optimization nit	2019-11-26 12:24:19 -08:00
Bimba Shrestha	d4e17d0776	Negating bool, updating bool on inner branches	2019-11-26 12:17:43 -08:00
Nick Terrell	718f00ff6f	Optimize decompression speed for gcc and clang (#1892 ) * Optimize `ZSTD_decodeSequence()` * Optimize Huffman decoding * Optimize `ZSTD_decompressSequences()` * Delete `ZSTD_decodeSequenceLong()`	2019-11-25 18:26:19 -08:00
Bimba Shrestha	826b555463	Merge branch 'dev' into oss	2019-11-22 17:29:33 -08:00
Bimba Shrestha	10bce1919e	Mixed declration fix	2019-11-21 13:08:27 -08:00
Bimba Shrestha	0451accab1	Checking noCompressBlock explicitly for rep code confirmation	2019-11-21 13:06:26 -08:00
Nick Terrell	659e9f05cf	Fix null pointer addition	2019-11-20 18:36:04 -08:00
Yann Collet	2d4dcce55f	Merge pull request #1894 from felixhandte/doc-clarify-dctx-reset Easy: Update Comment on `ZSTD_initDStream()`	2019-11-19 16:18:56 -08:00
Nick Terrell	e0d6daabac	Fix Appveyor failure	2019-11-19 11:12:26 -08:00
Bimba Shrestha	8f0c2d04c8	Going back to original flow but removing else return	2019-11-19 10:03:07 -08:00
W. Felix Handte	722149cf2b	Easy: Update Comment on `ZSTD_initDStream()`	2019-11-19 01:57:15 -05:00
Nick Terrell	6a7f65117e	Merge pull request #1866 from legrosbuffle/dev Optimized loop bounds to allow the compiler to unroll the loop.	2019-11-18 16:16:30 -08:00
Nick Terrell	a839d6852c	Merge pull request #1888 from senhuang42/superblocks_fixed RLE test and re-enable RLE in main compression loop	2019-11-18 16:09:33 -08:00
Bimba Shrestha	80586f5e80	Reversing condition order and forwarding error	2019-11-18 13:53:55 -08:00
Bimba Shrestha	dade64428f	Output regular uncompressed block when compressSequences fails	2019-11-18 08:43:14 -08:00
Bimba Shrestha	2d5d961a60	Typo in comment	2019-11-15 19:00:53 -08:00
Bimba Shrestha	dba767c0bb	Leaving room for checksum	2019-11-15 18:44:51 -08:00
Vincent Torri	6b5c10b48c	shared library: rename import library with .dll.a extension mort of open source project are using this extension for the import library. The Win32 linker is supporting this extension, see https://access.redhat.com/documentation/en-US/Red_Hat_Enterprise_Linux/4/html/Using_ld_the_GNU_Linker/win32.html section "direct linking to a dll"	2019-11-15 19:46:06 +01:00
Clement Courbet	b3c9fc27b4	Optimized loop bounds to allow the compiler to unroll the loop. This has no measurable impact on large files but improves small file decompression by ~1-2% for 10kB, benchmarked with: head -c 10000 silesia.tar > /tmp/test make CC=/usr/local/bin/clang-9 BUILD_STATIC=1 && ./lzbench -ezstd -t1,5 /tmp/test	2019-11-15 08:27:05 +01:00
Sen Huang	d9646dcbb5	Fixed main compression logic changes	2019-11-14 19:39:09 -05:00
Yann Collet	4b1ac69f19	Merge pull request #1868 from senhuang42/superblocks_fixed Superblocks rebased for merge	2019-11-14 13:31:34 -08:00
Sen Huang	c26d32c91c	Change superblock #include to be last	2019-11-14 13:12:17 -05:00
Yann Collet	d67742bc5d	Merge pull request #1858 from senhuang42/dictionary_header_size Method to get dictionary header size	2019-11-14 09:44:07 -08:00
Sen Huang	c85d10d0ea	Remove mixed declarations	2019-11-08 13:57:26 -05:00
Sen Huang	d9c475f3b3	Fix static analyze error, use proper bounds for dictEnd	2019-11-08 13:57:26 -05:00
Sen Huang	d06b90692b	Move asserts to loadZstdDictionary()	2019-11-08 13:57:26 -05:00
Sen Huang	b39149e156	Expose ZSTD_reset_compressedBlockState() to shared API	2019-11-08 13:57:26 -05:00
Sen Huang	6ce335371b	Add error forwarding to loadCEntropy(), make check for dictSize >= 8 from bad merge	2019-11-08 13:57:26 -05:00
Sen Huang	4a61aaf368	Remove redundant comment	2019-11-08 13:57:26 -05:00
Sen Huang	c787b351ea	Use ZSTD Error codes, improve explanation of ZSTD_loadCEntropy() and ZSTD_loadDEntropy()	2019-11-08 13:57:26 -05:00
Sen Huang	04fb42b4f3	Integrated refactor into getDictHeaderSize, now passes tests	2019-11-08 13:57:26 -05:00
Sen Huang	0bcaf6db08	First working pass at refactor of loadZstdDictionary()	2019-11-08 13:57:26 -05:00
Sen Huang	4b141b63e0	Revert "Move decompress symbols into zstd_internal.h, remove dependency" This reverts commit a152b4c67a5266f611db4a2eac4a79003852a795.	2019-11-08 13:57:26 -05:00
Sen Huang	84404cff6e	Move decompress symbols into zstd_internal.h, remove dependency	2019-11-08 13:57:26 -05:00
Sen Huang	341e0641ed	Checks malloc() for failure, returns 0 if so	2019-11-08 13:57:26 -05:00
Sen Huang	97b7f712f3	Change to heap allocation, remove implicit type conversion	2019-11-08 13:57:25 -05:00
Sen Huang	3c36a7f13a	Add ZDICT_getHeaderSize()	2019-11-08 13:57:08 -05:00
Nick Terrell	8c474f9845	Fix parameter selection and adjustment with srcSize == 0	2019-11-07 08:58:43 -08:00
Felix Handte	5688447758	Merge pull request #1873 from felixhandte/make-overlap-log-multithread-only Fix #1861: Restrict overlapLog Parameter When Not Built With Multithreading	2019-11-06 16:56:37 -05:00
Felix Handte	ba4613602f	Merge pull request #1843 from moozzyk/issue-1637 Take ZSTD_parameters as a const pointer	2019-11-06 16:56:14 -05:00
W. Felix Handte	c13f81905a	Fix #1861 : Restrict overlapLog Parameter When Not Built With Multithreading This parameter is unused in single-threaded compression. We should make it behave like the other multithread-only parameters, for which we only accept zero when we are not built with multithreading.	2019-11-06 16:05:02 -05:00
Sen Huang	13bb7500e8	Fix frame argument to compression	2019-11-05 16:15:55 -05:00
Sen Huang	f2932fb5eb	Fix more merge conflicts	2019-11-05 15:54:05 -05:00
Sen Huang	7ce891870c	Fix merge conflicts	2019-11-05 15:51:25 -05:00
Bimba Shrestha	3fb5b106da	Replacing some literals with constants	2019-11-05 10:26:57 -08:00
Nick Terrell	60205fec02	Fix 2 bugs in dictionary loading * Silently skip dictionaries less than 8 bytes, unless using `ZSTD_dct_fullDict`. This changes the compressor, which silently skips dictionaries <= 8 bytes. * Allow repcodes that are equal to the dictionary content size, since it is in bounds.	2019-11-01 16:52:07 -07:00
Sen Huang	b9ede1c8c2	Make sure contentsize is known	2019-10-30 16:03:58 -04:00
Nick Terrell	9c1860861e	Fix assert in ZSTD_safecopy In the case that `op >= oend_w` it is possible that `diff < 8` because the two buffers could be adjacent. Credit to OSS-Fuzz, which found the bug. It isn't reproducible because it depends on the memory layout.	2019-10-28 17:51:17 -07:00
Felix Handte	01ec595b85	Merge pull request #1851 from felixhandte/pkg-config-prefix-fix In pkg-config File, Derive Lib and Include Dir from Prefix at Use-Time	2019-10-28 14:24:56 -04:00
Yann Collet	74065da4c5	updated API inline doc and manual regarding ZSTD_CDict created without a dictBuffer.	2019-10-28 11:15:41 -07:00
W. Felix Handte	74bd76c3ff	In pkg-config File, Derive Lib and Include Dir from Prefix at Use-Time Addresses #1794. Instead of deriving the lib dir and include dir at build-time, let's do it like everyone else does at pkg-config run-time. This has the disadvantage that we can no longer override LIBDIR and INCLUDEDIR in the Makefile and have that reflected in the .pc file.	2019-10-25 15:07:31 -04:00
Yann Collet	c2140e9db0	Merge pull request #1845 from facebook/zbuff improve deprecation warning macro	2019-10-25 09:59:00 -07:00
Yann Collet	a9a216a846	Merge pull request #1824 from senhuang42/new_path_for_cdict Avoid using CDict params when input is large.	2019-10-23 12:04:40 -07:00
Yann Collet	63e435dda1	improve deprecation warning macro fix #1488 although, curiously enough, I was never able to reproduce the issue (according to the bug report, it should be present while using gcc 4.8).	2019-10-23 11:59:32 -07:00
moozzyk	eda7946a36	Take ZSTD_parameters as a const pointer Fixes: #1637	2019-10-22 23:21:54 -07:00
Yann Collet	f966cd080a	added documentation on DYNAMIC_BMI2 build macro	2019-10-22 17:43:09 -07:00
Yann Collet	5d5c895b18	fix initCStream_advanced() for fast strategies Compression ratio of fast strategies (levels 1 & 2) was seriously reduced, due to accidental disabling of Literals compression. Credit to @QrczakMK, which perfectly described the issue, and implementation details, making the fix straightforward. Example : initCStream with level 1 on synthetic sample P50 : Before : 5,273,976 bytes After : 3,154,678 bytes ZSTD_compress (for comparison) : 3,154,550 Fix #1787. To follow : refactor the test which was supposed to catch this issue (and failed)	2019-10-22 15:01:38 -07:00
Yann Collet	111b0c53b0	update documentation on deprecated functions mostly : note that these functions will soon generate deprecation warnings	2019-10-22 13:51:18 -07:00
Nick Terrell	b1ec94e63c	Fix ZSTD_f_zstd1_magicless for small data * Fix `ZSTD_FRAMEHEADERSIZE_PREFIX` and `ZSTD_FRAMEHEADERSIZE_MIN` to take a `format` parameter, so it is impossible to get the wrong size. * Fix the places that called `ZSTD_FRAMEHEADERSIZE_PREFIX` without taking the format into account, which is now impossible by design. * Call `ZSTD_frameHeaderSize_internal()` with `dctx->format`. * The added tests catch both bugs in `ZSTD_decompressFrame()`. Fixes #1813.	2019-10-21 21:16:17 -07:00
Sen Huang	c2e1e54f24	((x or y) or z) == (x or y or z), remove brackets	2019-10-21 19:16:50 -04:00
Sen Huang	59c81aa31b	Line up comments :)	2019-10-21 19:12:15 -04:00
Sen Huang	dbda8c318a	Trailing comma	2019-10-21 19:10:13 -04:00
Sen Huang	0c00455ea6	Merge branch 'dev' of github.com:senhuang42/zstd into new_path_for_cdict	2019-10-21 19:06:51 -04:00
Sen Huang	5b2f4ac1a8	merge	2019-10-21 19:02:52 -04:00
Sen Huang	2ab484a5f9	Fix bad merge	2019-10-21 18:55:17 -04:00
Nick Terrell	919d1d8e93	Merge pull request #1831 from terrelln/zstdmt-bad-memset [zstdmt] Don't memset the jobDescription	2019-10-21 15:53:57 -07:00
Sen Huang	b6c3459d50	merge	2019-10-21 18:46:17 -04:00
Yann Collet	6cf04c0344	Merge pull request #1834 from facebook/winFix Windows fixes	2019-10-21 13:45:17 -07:00
Sen Huang	676f89902a	Added multiplier, renamed new enum to something more useful	2019-10-21 15:36:12 -04:00
Sen Huang	1f3a51fb52	Updated forceAttachDict param bounds	2019-10-21 15:36:12 -04:00
Sen Huang	8f69c47643	Add enum to decision process	2019-10-21 15:36:12 -04:00
Sen Huang	e4de8b098a	Added support for forcing new CDict behavior and updated enum	2019-10-21 15:36:12 -04:00
Sen Huang	9294f4826b	Changed to int from BYTE	2019-10-21 15:36:12 -04:00
Sen Huang	f0fccc8847	Changed to int from BYTE	2019-10-21 15:36:12 -04:00
Sen Huang	bb2df8c499	Trailing whitespace	2019-10-21 15:36:12 -04:00
Sen Huang	cf51501d2f	Fix test	2019-10-21 15:36:12 -04:00
Sen Huang	ea3cb6988f	Cast to BYTE to appease appveyor	2019-10-21 15:36:12 -04:00
Sen Huang	a727a85a7e	merge conflicts round 2	2019-10-21 15:36:12 -04:00
Sen Huang	053a35fd64	formatting	2019-10-21 15:35:33 -04:00
Sen Huang	3fa4daaa55	Fix error	2019-10-21 15:35:33 -04:00
Sen Huang	3328348c63	Add compressionlevel to cdict	2019-10-21 15:32:39 -04:00
Felix Handte	cf725630a6	Merge pull request #1795 from felixhandte/workspace-asan Add Poisoned Redzones to the Workspace When Compiling with ASAN	2019-10-21 12:15:17 -04:00
Sen Huang	e8aa3e486d	Updated forceAttachDict param bounds	2019-10-20 22:01:08 -04:00
Sen Huang	6d297265f9	Add enum to decision process	2019-10-20 19:02:47 -04:00
Sen Huang	1daa898c93	Added support for forcing new CDict behavior and updated enum	2019-10-20 14:03:09 -04:00
Nick Terrell	0bc39bc3a0	[zstdmt] Don't memset the jobDescription	2019-10-18 15:05:51 -07:00
Nick Terrell	243824551f	[threading] Add debug utilities	2019-10-18 15:05:34 -07:00
Yann Collet	1795133c45	refactored FIO_compressMultipleFilenames() prototype for consistency	2019-10-17 15:32:03 -07:00
Yann Collet	6446ffb277	Merge pull request #1827 from facebook/dm_Dct updated erroneous comments using ZSTD_dm_*	2019-10-17 10:30:58 -07:00
Yann Collet	19741c7d99	Merge pull request #1815 from facebook/zlibwrap make zlibWrapper strict ISO-C90 compatible	2019-10-16 16:45:15 -07:00
Yann Collet	6323966e53	updated erroneous comments using ZSTD_dm_* instead of the current ZSTD_dct_*, reported by @nigeltao (#1822)	2019-10-16 16:14:04 -07:00
Yann Collet	2d5201b0ab	removed wildcopy8() which is no longer used, noticed by @davidbolvansky	2019-10-16 14:51:33 -07:00
Sen Huang	4455f00cb8	Changed to int from BYTE	2019-10-16 15:06:02 -04:00
Sen Huang	4f7d26b0ee	Changed to int from BYTE	2019-10-16 15:05:29 -04:00
Sen Huang	cf00ea367a	Trailing whitespace	2019-10-16 10:31:27 -04:00
Sen Huang	8cb2174446	Fix test	2019-10-16 10:29:31 -04:00
Sen Huang	5e901b6f32	Cast to BYTE to appease appveyor	2019-10-15 13:58:44 -04:00
Sen Huang	5c010c9d2d	merge conflicts round 2	2019-10-15 13:10:05 -04:00
Sen Huang	a06b51879c	merge conflict	2019-10-15 12:58:50 -04:00
Sen Huang	23dac23a49	formatting	2019-10-15 12:44:48 -04:00
Sen Huang	0c8df5c928	Fix error	2019-10-15 12:28:23 -04:00
Sen Huang	a65eb39f9d	Add compressionlevel to cdict	2019-10-15 10:22:06 -04:00
Yann Collet	fb77afc626	Merge pull request #1760 from bimbashrestha/extract_sequences_api Adding api for extracting sequences from seqstore	2019-10-10 13:11:18 -07:00
W. Felix Handte	ede31da2ea	Fix CCtx Size Estimation	2019-10-10 15:02:08 -04:00
W. Felix Handte	bd6a20b8a0	Expand Default Redzone Size	2019-10-10 13:45:55 -04:00
W. Felix Handte	2c80a9f8ac	Check if CCtx in Workspace after Null Check	2019-10-10 13:40:16 -04:00
W. Felix Handte	b6987acbbf	Declare the ASAN Functions We Need, Don't Include the Header	2019-10-10 13:40:16 -04:00
W. Felix Handte	0ffae7e440	Stop Allocating Extra Space for Table Redzones	2019-10-10 13:40:16 -04:00
W. Felix Handte	a07037b784	Don't Try to Redzone the Tables	2019-10-10 13:40:16 -04:00
W. Felix Handte	0cc481ef66	Fix Workspace Size Calculation	2019-10-10 13:40:16 -04:00
W. Felix Handte	b6c0a02a17	Fix ZSTD_sizeof_matchState() Calculation	2019-10-10 13:40:16 -04:00
W. Felix Handte	8cffd6ed08	Avoid ASAN Failure in ZSTD_cwksp_free()	2019-10-10 13:40:16 -04:00
W. Felix Handte	ef0b5707c5	Refactor Freeing CCtxes / CDicts Inside Workspaces	2019-10-10 13:40:16 -04:00
W. Felix Handte	143b296cf6	Surround Workspace Allocs with Dead Zone	2019-10-10 13:40:16 -04:00
W. Felix Handte	19a0955ec9	Add `ZSTD_cwksp_alloc_size()` to Help Calculate Needed Workspace Size	2019-10-10 13:40:16 -04:00
W. Felix Handte	da88c35d41	Stop Assuming Tables are Adjacent	2019-10-10 13:40:16 -04:00
W. Felix Handte	35c30d6ca7	Poison Unused Workspace Memory	2019-10-10 13:40:16 -04:00
W. Felix Handte	edb6d884a5	Detect Whether We're Being Compiled with ASAN	2019-10-10 13:40:16 -04:00
W. Felix Handte	dc1fb684bf	Remove Unused MEM_SKIP_MSAN Macro	2019-10-10 13:40:16 -04:00
Bimba Shrestha	36528b96c4	Manually moving instead of memcpy on decoder and using genBuffer()	2019-10-03 09:26:51 -07:00
Bimba Shrestha	61ec4c2e7f	Cleaning sequence parsing logic	2019-10-03 06:42:40 -07:00
Yann Collet	cb18fffe65	enforce C90 compatibility for zlibWrapper	2019-09-24 17:50:58 -07:00
Yann Collet	ad2a2785f7	bump version number to v1.4.4 so that future reports on `dev` branch use this number instead	2019-09-24 15:15:33 -07:00
Bimba Shrestha	c04245b257	Replacing assert with memory_allocation error code throw	2019-09-23 15:42:16 -07:00
Bimba Shrestha	be0bebd24e	Adding test and null check for malloc	2019-09-23 15:08:18 -07:00
Dávid Bolvanský	1ab1a40c9c	Fixed one more place	2019-09-23 21:32:56 +02:00
Dávid Bolvanský	1f7228c040	Use clz ^ 31 instead of 31 - clz; better codegen for GCC	2019-09-23 21:23:09 +02:00
Nick Terrell	7451c6578c	Merge pull request #1804 from terrelln/wild-and-fast Optimize (de)compression and fix wildcopy overread	2019-09-21 17:04:36 -07:00
Nick Terrell	5cb7615f1f	Add UNUSED_ATTR to ZSTD_storeSeq()	2019-09-20 21:37:13 -07:00
Nick Terrell	5dc0a1d659	HINT_INLINE ZSTD_storeSeq() Clang on Mac wasn't inlining `ZSTD_storeSeq()` in level 1, which was causing a 5% performance regression. This fixes it.	2019-09-20 16:39:27 -07:00
Bimba Shrestha	f3c4fd17e3	Passing in dummy dst buffer of compressbound(srcSize)	2019-09-20 15:50:58 -07:00
Felix Handte	c047fcf7bf	Merge pull request #1806 from felixhandte/estimate-cctx-doc Update Comment on `ZSTD_estimateCCtxSize()`	2019-09-20 15:36:00 -04:00
Nick Terrell	44c65da97e	Remove literals overread in ZSTD_storeSeq() for ~neutral perf	2019-09-20 12:23:25 -07:00
W. Felix Handte	f7d9b36835	Update Comment on `ZSTD_estimateCCtxSize()`	2019-09-20 14:11:29 -04:00
Nick Terrell	fde217df04	Fix bounds check in ZSTD_storeSeq()	2019-09-20 08:25:12 -07:00
Nick Terrell	67b1f5fc72	Fix too strict assert	2019-09-20 01:23:35 -07:00
Nick Terrell	ddab2a94e8	Pass iend into ZSTD_storeSeq() to allow ZSTD_wildcopy()	2019-09-20 00:56:20 -07:00
Nick Terrell	cdad7fa512	Widen ZSTD_wildcopy to 32 bytes	2019-09-20 00:52:15 -07:00
Nick Terrell	efd37a64ea	Optimize decompression and fix wildcopy overread * Bump `WILDCOPY_OVERLENGTH` to 16 to fix the wildcopy overread. * Optimize `ZSTD_wildcopy()` by removing unnecessary branches and unrolling the loop. * Extract `ZSTD_overlapCopy8()` into its own function. * Add `ZSTD_safecopy()` for `ZSTD_execSequenceEnd()`. It is optimized for single long sequences, since that is the important case that can end up in `ZSTD_execSequenceEnd()`. Without this optimization, decompressing a block with 1 long match goes from 5.7 GB/s to 800 MB/s. * Refactor `ZSTD_execSequenceEnd()`. * Increase the literal copy shortcut to 16. * Add a shortcut for offset >= 16. * Simplify `ZSTD_execSequence()` by pushing more cases into `ZSTD_execSequenceEnd()`. * Delete `ZSTD_execSequenceLong()` since it is exactly the same as `ZSTD_execSequence()`. clang-8 seeds +17.5% on silesia and +21.8% on enwik8. gcc-9 sees +12% on silesia and +15.5% on enwik8. TODO: More detailed measurements, and on more datasets. Crdit to OSS-Fuzz for finding the wildcopy overread.	2019-09-19 21:07:14 -07:00
Bimba Shrestha	ae6d0e64ae	Addressing comments	2019-09-19 15:25:20 -07:00
Yann Collet	3cac061db5	Merge pull request #1802 from bimbashrestha/rle_block_bound_fix_pt2 Adding 4 blocks to FSE_BLOCKBOUND() in lib/common (different from las…	2019-09-18 16:32:37 -07:00
Bimba Shrestha	6e9f6813bb	adding bit container size	2019-09-18 13:49:45 -07:00
Bimba Shrestha	f9b6abb896	Adding 4 blocks to FSE_BLOCKBOUND() in lib/common (different from last week)	2019-09-18 13:29:05 -07:00
Yann Collet	bfff5b30a4	Merge pull request #1756 from mgrice/dev Improvements in zstd decode performance	2019-09-18 11:35:50 -07:00
Yann Collet	243200e5bf	minor refactor of ZSTD_fast - reduced variables lifetime - more accurate code comments	2019-09-17 14:02:57 -07:00
Bimba Shrestha	76fea3fb99	Resolving appveyor test failure implicit conversion	2019-09-16 14:02:23 -07:00
Bimba Shrestha	a874435478	Merge branch 'dev' into extract_sequences_api	2019-09-16 13:29:59 -07:00
Felix Handte	2164a130f3	Merge pull request #1780 from felixhandte/workspace-efficiency-3 Avoid Clearing Tables Even When Changing CParams	2019-09-16 14:37:05 -04:00
W. Felix Handte	72ea79cacd	Don't Include `sanitizer/msan_interface.h`, Since Not All Platforms Provide It Instead, explicitly declare the functions we use.	2019-09-16 12:08:03 -04:00
Bimba Shrestha	bff6072e3a	Bailing early when collecting sequences and documentation	2019-09-16 08:26:21 -07:00
Nick Terrell	fbeaf6989e	[libzstd] Improve advanced API docs	2019-09-15 12:41:24 -07:00
Yann Collet	09b1844d9b	Merge pull request #1784 from bimbashrestha/fse_block_bound_err Rearranging assert and allowing 4 extra for FSE_BLOCKBOUND()	2019-09-12 19:09:27 -07:00
Bimba Shrestha	fe9af338ed	Added assert to BIT_flushBits()	2019-09-12 15:35:27 -07:00
Bimba Shrestha	43da5bf27e	Rearranging assert and allowing 4 extra for FSE_BLOCKBOUND()	2019-09-12 14:43:50 -07:00
W. Felix Handte	20c69077d1	Shrink Table Valid End During Alloc Alignment / Phase Change	2019-09-11 17:14:59 -04:00
W. Felix Handte	51d90668ba	Add Assertions to Confirm that Workspace Pointers are Correctly Ordered	2019-09-11 17:14:59 -04:00
W. Felix Handte	a10c191613	`__msan_poison()` Workspace When Preparing for Re-Use	2019-09-11 17:14:45 -04:00
W. Felix Handte	7c57e2b9ca	Zero `h3size` When `h3log` is 0 This led to a nasty edgecase, where index reduction for modes that don't use the h3 table would have a degenerate table (size 4) allocated and marked clean, but which would not be re-indexed.	2019-09-11 13:14:26 -04:00
W. Felix Handte	bc020eec92	Also Shrink Clean Table Area When Reducing Indices	2019-09-11 11:40:57 -04:00
W. Felix Handte	1999b2ed9b	Update DEBUGLOG Statements	2019-09-11 11:21:00 -04:00
W. Felix Handte	13e29a56de	Shrink Clean Table Area When Copying Table Contents into Context The source matchState is potentially at a lower current index, which means that any extra table space not overwritten by the copy may now contain invalid indices. The simple solution is to unconditionally shrink the valid table area to just the area overwritten.	2019-09-11 11:18:45 -04:00
W. Felix Handte	edb3ad053e	Comments	2019-09-10 18:25:45 -04:00
W. Felix Handte	f31ef28ff8	Only Reset Indexing in `ZSTD_resetCCtx_internal()` When Necessary	2019-09-10 18:25:45 -04:00
W. Felix Handte	9968a53e91	Remove No-Longer-Used Continuation Functions	2019-09-10 18:25:45 -04:00
W. Felix Handte	1b28e80416	Remove Fast Continue Path in `ZSTD_resetCCtx_internal()`	2019-09-10 18:25:45 -04:00
W. Felix Handte	ad16eda5e4	`ZSTD_reset_matchState` Optionally Doesn't Restart Indexing	2019-09-10 18:25:45 -04:00
W. Felix Handte	5b10bb5ec3	Rename `ZSTD_compResetPolicy_e` Values and Add Comment	2019-09-10 18:25:45 -04:00
W. Felix Handte	0492b9a9ec	Accept `ZSTD_indexResetPolicy_e` Param in `ZSTD_reset_matchState()`	2019-09-10 18:25:45 -04:00
W. Felix Handte	14c5471d5e	Introduce `ZSTD_indexResetPolicy_e` Enum	2019-09-10 18:25:45 -04:00
W. Felix Handte	17b6da2e0f	Track Usable Table Space in Compression Workspace	2019-09-10 18:25:37 -04:00
Yann Collet	22bd158e0f	Merge pull request #1712 from felixhandte/workspace-efficiency-2 Allocate Internal Buffers via Workspace Abstraction	2019-09-10 15:20:29 -07:00
Bimba Shrestha	1407919d13	Addressing comments on parsing	2019-09-10 15:10:50 -07:00
Bimba Shrestha	47199480da	Cleaning up parsing per suggestion	2019-09-10 13:18:59 -07:00
W. Felix Handte	a9d373f093	Remove Empty lib/compress/zstd_cwksp.c	2019-09-10 16:03:13 -04:00
Yann Collet	5ba495b622	Merge pull request #1775 from facebook/edufix fix educational decoder	2019-09-10 12:12:08 -07:00
Yann Collet	41416f0927	Merge pull request #1773 from bimbashrestha/rle_first_block_decompression_fix Removing redundant condition in decompression, making first block rle…	2019-09-10 11:17:29 -07:00
Bimba Shrestha	e3c5825918	Fizing litLength == 0 case	2019-09-10 10:38:13 -07:00
Bimba Shrestha	9e7bb55e14	Addressing comments	2019-09-09 20:04:46 -07:00
W. Felix Handte	81208fd7c2	Forward Declare `ZSTD_cwksp_available_space` to Fix Build	2019-09-09 19:10:09 -04:00
W. Felix Handte	91bf1babd1	Inline Workspace Functions	2019-09-09 18:53:53 -04:00
W. Felix Handte	0db3ffe7ee	Forward resetCCtx Errors when Using CDict	2019-09-09 16:47:19 -04:00
W. Felix Handte	eb6f69d978	Fix sizeof_CCtx and sizeof_CDict Calculations for Statically Init'ed Objects	2019-09-09 16:45:17 -04:00
W. Felix Handte	e3703825a8	Fix workspaceTooSmall Calculation	2019-09-09 15:12:14 -04:00
W. Felix Handte	0a65a67901	Shorten `&zc->workspace` -> `ws` in `ZSTD_resetCCtx_internal()`	2019-09-09 14:59:09 -04:00
W. Felix Handte	1120e4d962	Clean Up TODOs and Comments pt. II	2019-09-09 14:04:39 -04:00
W. Felix Handte	c60e1c3be5	Nit	2019-09-09 13:34:08 -04:00
W. Felix Handte	7d7b665c90	Pull Phase Advance Logic Out into Internal Function	2019-09-09 13:34:08 -04:00
W. Felix Handte	8549ae9f1d	Hide Workspace Movement Behind Helper Function	2019-09-09 13:34:08 -04:00
W. Felix Handte	2405c03bcd	Fix DEBUGLOG Statement Levels	2019-09-09 13:34:08 -04:00
W. Felix Handte	7100d24221	Fix Rescale Continue Special Case	2019-09-09 13:34:08 -04:00
W. Felix Handte	7321e4c9f3	Remove Unused noRealloc CRP Value	2019-09-09 13:34:08 -04:00
W. Felix Handte	901bba4ca6	Re-Implement Workspace Shrinking when Oversized	2019-09-09 13:34:08 -04:00
W. Felix Handte	881bcd80ca	Cleanup from Move	2019-09-09 13:34:08 -04:00
W. Felix Handte	b511a84adc	Move Workspace Functions to Their Own File	2019-09-09 13:34:08 -04:00
W. Felix Handte	077a2d7dc9	Rename	2019-09-09 13:34:08 -04:00
W. Felix Handte	ebd162194f	Clean Up TODOs and Comments	2019-09-09 13:34:08 -04:00
W. Felix Handte	2abe0145b1	Improve Comments a Bit	2019-09-09 13:34:08 -04:00
W. Felix Handte	7a2416a863	Allocate CDict in Workspace (Rather than in Separate Allocation)	2019-09-09 13:34:08 -04:00
W. Felix Handte	65057cf009	Rewrite ZSTD_initStaticCCtx to Alloc CCtx in Workspace	2019-09-09 13:34:08 -04:00
W. Felix Handte	58b69ab15c	Only the CCtx Itself Needs to be Cleared during Static CCtx Init	2019-09-09 13:34:08 -04:00
W. Felix Handte	88c2fcd0ee	Align Alloc Pointer When Transitioning from Buffers to Aligned Allocs	2019-09-09 13:34:08 -04:00
W. Felix Handte	e936b73889	Remove Overly-Restrictive Assert	2019-09-09 13:34:08 -04:00
W. Felix Handte	75d574368b	When Loading Dict By Copy, Always Put it in the Workspace	2019-09-09 13:34:08 -04:00
W. Felix Handte	e69b67e33a	Alloc Tables Separately	2019-09-09 13:34:08 -04:00
W. Felix Handte	6177354b36	Begin Introducing Phases	2019-09-09 13:34:08 -04:00
W. Felix Handte	786f2266bb	TMP	2019-09-09 13:34:08 -04:00
W. Felix Handte	c25283cf00	Disambiguate 'workspace' and 'entropyWorkspace'	2019-09-09 13:34:08 -04:00
W. Felix Handte	ccaac852e8	Normalize Case 'workSpace' -> 'workspace'	2019-09-09 13:27:18 -04:00
Bimba Shrestha	44e122053b	Mentioning cli only in the comment as suggested	2019-09-06 14:48:41 -07:00
Yann Collet	2b0a271ed2	fix eductional decoder fix #1774 also : - fix minor compilation warnings - make sure the `test` is run during CI tests	2019-09-06 14:30:13 -07:00
Bimba Shrestha	a917cd597d	Put back omission for first rle block and updated comment as suggested	2019-09-06 13:44:25 -07:00
Bimba Shrestha	d687d603e4	Removing redundant condition in decompression, making first block rles valid to deocmpress	2019-09-06 10:46:19 -07:00
Varun S Nair	9816560649	Fixing assert and DEBUGLOG due to ZSTD_CCtx_params parameter change to const pointer	2019-09-05 15:47:17 +05:30
Varun S Nair	771645471f	Passing ZSTD_CCtx_params by const pointer	2019-09-05 15:28:30 +05:30
Bimba Shrestha	5f8b0f6890	Changing api to get sequences across all blocks	2019-08-30 09:18:44 -07:00
Yann Collet	5198347382	Merge pull request #1744 from bimbashrestha/dev Generate RLE blocks in the encoder	2019-08-29 15:19:10 -07:00
Bimba Shrestha	623b90f85d	Fixing ci-circle test complaints	2019-08-29 13:09:42 -07:00
mgrice	5d89771529	fix warning: always_inline function might not be inlinable	2019-08-29 12:32:15 -07:00
Bimba Shrestha	ece465644b	Adding api for extracting sequences from seqstore	2019-08-29 12:29:39 -07:00
mgrice	b830599582	Improvements in zstd decode performance Summary: The idea behind wildcopy is that it can be cheaper to copy more bytes (say 8) than it is to copy less (say, 3). This change takes that further by exploiting some properties: 1. it's almost always OK to copy 16 bytes instead of 8, which means fewer copy instructions, and fewer branches 2. A 16 byte chunk size means that ~90% of wildcopy invocations will have a trip count of 1, so branch prediction will be improved. Speedup on Xeon E5-2680v4 is in the range of 3-5%. Measured wildcopy length distributions on silesia.tar: level <=8 <=16 <=24 >24 1 78.05% 11.49% 3.52% 6.94% 3 82.14% 8.99% 2.44% 6.43% 6 85.81% 6.51% 2.92% 4.76% 8 83.02% 7.31% 3.64% 6.03% 10 84.13% 6.67% 3.29% 5.91% 15 77.58% 7.55% 5.21% 9.66% 16 80.07% 7.20% 3.98% 8.75% Test Plan: benchmark silesia, make check	2019-08-29 12:25:56 -07:00
Bimba Shrestha	c3e3c8bf32	Undoing the last commit (that was an accident)	2019-08-29 12:05:47 -07:00
bimbashrestha	4a1ca5e0a8	Adding method for extracting sequences.	2019-08-29 11:55:12 -07:00
bimbashrestha	e5704bbfdf	Added test for multiple blocks of zeros and fixed nit about comments	2019-08-28 08:32:34 -07:00
Nick Terrell	e9c0fc12d2	Merge pull request #1748 from terrelln/cover-deadlock [dictBuilder] Fix deadlock in *COVER error case	2019-08-27 10:17:28 -07:00
Nick Terrell	0932de54bc	[dictBuilder] Fix deadlock in *COVER error case The COVER and FASTCOVER dictionary builders can deadlock when dictionary construction errors, likely because there are too few samples, or too few distinct dmers. The deadlock only occurs when there are errors. Fixes #1746.	2019-08-26 18:19:29 -07:00
bimbashrestha	96201d9774	Added bool to cctx and fixed some comment nits	2019-08-26 15:30:41 -07:00
bimbashrestha	991cbc9024	Fixing mixed declaration compiler complaint	2019-08-26 15:00:50 -07:00
bimbashrestha	ce264ce53b	Forbiding emission of RLE when its the first block	2019-08-26 14:54:29 -07:00
bimbashrestha	33b6446ca7	Removing accidental method call	2019-08-26 14:34:43 -07:00
bimbashrestha	7b041b552e	Removing assert for rle that doesn't always hold	2019-08-26 12:26:53 -07:00
bimbashrestha	1f2bf77f2a	Using typedef U32 instead of int	2019-08-26 09:00:22 -07:00
bimbashrestha	ba46932492	Removing implicit conversion from const void* to const BYTE* and added constant for threshold	2019-08-26 08:51:34 -07:00
Carl Woffenden	c690f22e96	Merge branch 'dev' into amalgamate	2019-08-23 23:05:02 +02:00
Carl Woffenden	5144e66095	Revert "Merge remote-tracking branch 'origin/master' into dev" This reverts commit `0df29a4e5f`, reversing changes made to `69c875a0cc`.	2019-08-23 23:04:21 +02:00
Carl Woffenden	0fcaa675e0	Merge remote-tracking branch 'upstream/dev' into dev	2019-08-23 23:03:52 +02:00
Carl Woffenden	0df29a4e5f	Merge remote-tracking branch 'origin/master' into dev	2019-08-23 22:57:06 +02:00
bimbashrestha	0e3ba02cf1	Fixing more test falure errors	2019-08-22 13:54:41 -07:00
bimbashrestha	4faf3a5911	Fixing ci-circle test failure issues	2019-08-22 13:46:15 -07:00
bimbashrestha	cba5350f88	Moving RLE logic to inside ZSTD_compressBlock_internal and adding assert	2019-08-22 12:12:44 -07:00
Nick Magerko	493f95c7df	Fix merge conflicts	2019-08-22 11:51:41 -07:00
bimbashrestha	4c90d862e3	Generate RLE blocks in the encoder	2019-08-22 11:27:20 -07:00
Nick Terrell	54ad33448c	Merge pull request #1737 from terrelln/legacy-fix [legacy] Fix buffer overflow in v0.2 and v0.4 raw literals decompression	2019-08-21 10:10:24 -07:00
Carl Woffenden	901ea61f83	Tweaks to create a single-file decoder The CHECK_F macros differ slightly (but eventually do the same thing). Older GCC needs to fallback on the old-style pragma optimisation flags.	2019-08-21 17:49:17 +02:00
Yann Collet	38b6428fcd	Merge pull request #1725 from emaste/dev remove extraneous doubled ;s	2019-08-21 05:19:30 -07:00
Yann Collet	fe0877c664	Merge pull request #1721 from facebook/seq127 fixed very minor inefficiency (nbSeq==127)	2019-08-21 05:19:12 -07:00
Yann Collet	757ab66879	Merge pull request #1713 from cemeyer/fix_gcc4_build Fix the build on GCC 4.x after `812e8f2a1`	2019-08-21 05:17:42 -07:00
Nick Terrell	07f22d465d	[legacy] Fix buffer overflow in v0.2 and v0.4 raw literals decompression Extends the fix in PR#1722 to v0.2 and v0.4. These aren't built into zstd by default, and v0.5 onward are not affected. I only add the `srcSize > BLOCKSIZE` check to v0.4 because the comments say that it must hold, but the equivalent comment isn't present in v0.2. Credit to OSS-Fuzz.	2019-08-20 17:13:04 -07:00
Nick Magerko	de6a6c7364	Fix ZSTD_SRCSIZEHINT_MIN typo	2019-08-20 13:07:51 -07:00
Nick Magerko	c7a24d7a14	Define ZSTD_SRCSIZEHINT_MIN as 0	2019-08-20 13:06:15 -07:00
Nick Magerko	2d39b43906	Use int for srcSizeHint when sensible	2019-08-19 16:49:25 -07:00
Nick Magerko	09894dc2eb	Add mention of regression with poor size hints	2019-08-19 13:41:36 -07:00
Nick Magerko	fee8fbcddf	Make upper bound INT_MAX	2019-08-19 12:58:54 -07:00
Nick Magerko	edf2abf106	Fix fall-through case	2019-08-19 12:32:43 -07:00
Nick Magerko	dffbac5f89	Add --size-hint=# option	2019-08-19 11:38:49 -07:00
Ed Maste	b81d7cc6a0	remove extraneous doubled ;s	2019-08-15 21:17:06 -04:00
W. Felix Handte	a42bbb4e05	Fix Buffer Overflow in Legacy (v0.3) Raw Literals Decompression	2019-08-15 14:28:30 -04:00
Yann Collet	782bfb858a	fixed very minor inefficiency (nbSeq==127) The nbSeq "short" format (1-byte) is compatible with any value < 128. However, the code would cautiously only accept values < 127. This is not an error, because the general 2-bytes format is compatible with small values < 128. Hence the inefficiency never triggered any warning. Spotted by Intel's Smita Kumar.	2019-08-15 16:41:34 +02:00
Conrad Meyer	ff6c81d90c	Fix the build on GCC 4.x after `812e8f2a1` The ancient GCC 4.x doesn't understand the "optimize" attribute until 4.4. Fix the build on platforms with GCC 4.x < 4.4 by limiting the DONT_VECTORIZE definition to GCC 5 and greater. Noticed and patch proposed by Warner Losh <imp@FreeBSD.org>.	2019-08-08 17:25:49 -07:00
Yann Collet	01b2331ad1	bumped version number to v1.4.3	2019-08-05 17:17:16 +02:00
Yann Collet	61936ba42a	Merge pull request #1705 from josepho0918/dev Add support for IAR C/C++ Compiler for Arm	2019-08-05 15:57:28 +02:00
Yann Collet	facbe8b2c2	factored the logic selecting lowest match index as suggested by @terrelln	2019-08-05 15:18:43 +02:00
Yann Collet	0b0b83e8f3	fix test 122 it's an unsupported scenario.	2019-08-03 16:51:26 +02:00
Yann Collet	98e7c344cd	fixed strategies btopt+	2019-08-02 14:42:53 +02:00
Yann Collet	b4257b04e7	fixed strategy btlazy2	2019-08-02 14:26:26 +02:00
Yann Collet	5cf1b24aca	fixed strategies greedy, lazy & lazy2 restore dictionary compression ratio	2019-08-02 14:21:39 +02:00
Yann Collet	98692c2838	fixed compression ratio regression when dictionary-compressing medium-size inputs at levels 1-3	2019-08-01 15:58:17 +02:00
Joseph Chen	3855bc4295	Add support for IAR C/C++ Compiler for Arm	2019-07-29 15:25:58 +08:00
W. Felix Handte	8083581f9a	Bump Library Version Number to 1.4.2	2019-07-24 17:35:19 -04:00
Nick Terrell	e6edcfa795	[legacy] Fix bug in zstd-0.5 decoder The match length and literal length extra bytes could either by 2 bytes or 3 bytes in version 0.5. All earlier verions were always 3 bytes, and later version didn't have dumps. The bug, introduced by commit `0fd322f812`, was triggered when the last dump was a 2-byte dump, because we didn't separate that case from a 3-byte dump, and thought we were over-reading. I've tested this fix with every zstd version < 1.0.0 on the buggy file, and we are now always successfully decompressing with the right checksum. Fixes #1693.	2019-07-22 13:05:09 -07:00
Yann Collet	be3d2e2de8	Merge pull request #1679 from ephiepark/dev Restructure the source files	2019-07-19 15:29:07 -07:00
Vivek Miglani	c7be7d2efb	Fixing compressed block size checks	2019-07-17 12:53:15 -07:00
Ephraim Park	1dc98de279	Restructure the source files	2019-07-15 17:39:18 -07:00
Vivek Miglani	3f108f82fb	Return error if block size exceeds maximum	2019-07-15 12:10:21 -07:00
Yann Collet	8fb08b68cc	Merge pull request #1681 from facebook/level3 updated double_fast complementary insertion	2019-07-12 16:16:06 -07:00
Nick Terrell	75cfe1dc69	[ldm] Fix bug in overflow correction with large job size (#1678 ) * [ldm] Fix bug in overflow correction with large job size * [zstdmt] Respect ZSTDMT_JOBSIZE_MAX (1G in 64-bit mode) * [test] Add test that exposes the bug Sadly the test fails on our CI because it uses too much memory, so I had to comment it out.	2019-07-12 18:45:18 -04:00
Yann Collet	eaeb7f00b5	updated the _extDict variant of double fast	2019-07-12 14:17:17 -07:00
Yann Collet	e8a7f5d3ce	double-fast: changed the trade-off for a smaller positive change same number of complementary insertions, just organized differently (long at `ip-2`, short at `ip-1`).	2019-07-12 11:34:53 -07:00
mgrice	812e8f2a16	perf improvements for zstd decode (#1668 ) * perf improvements for zstd decode tldr: 7.5% average decode speedup on silesia corpus at compression levels 1-3 (sandy bridge) Background: while investigating zstd perf differences between clang and gcc I noticed that even though gcc is vectorizing the loop in in wildcopy, it was not being done as well as could be done by hand. The sites where wildcopy is invoked have an interesting distribution of lengths to be copied. The loop trip count is rarely above 1, yet long copies are common enough to make their performance important.The code in zstd_decompress.c to invoke wildcopy handles the latter well but the gcc autovectorizer introduces a needlessly expensive startup check for vectorization. See how GCC autovectorizes the loop here: https://godbolt.org/z/apr0x0 Here is the code after this diff has been applied: (left hand side is the good one, right is with vectorizer on) After: https://godbolt.org/z/OwO4F8 Note that autovectorization still does not do a good job on the optimized version, so it's turned off\ via attribute and flag. I found that neither attribute nor command-line flag were entirely successful in turning off vectorization, which is why there were both. silesia benchmark data - second triad of each file is with the original code: file orig compressedratio encode decode change 1#dickens 10192446-> 4268865(2.388), 198.9MB/s 709.6MB/s 2#dickens 10192446-> 3876126(2.630), 128.7MB/s 552.5MB/s 3#dickens 10192446-> 3682956(2.767), 104.6MB/s 537MB/s 1#dickens 10192446-> 4268865(2.388), 195.4MB/s 659.5MB/s 7.60% 2#dickens 10192446-> 3876126(2.630), 127MB/s 516.3MB/s 7.01% 3#dickens 10192446-> 3682956(2.767), 105MB/s 479.5MB/s 11.99% 1#mozilla 51220480-> 20117517(2.546), 285.4MB/s 734.9MB/s 2#mozilla 51220480-> 19067018(2.686), 220.8MB/s 686.3MB/s 3#mozilla 51220480-> 18508283(2.767), 152.2MB/s 669.4MB/s 1#mozilla 51220480-> 20117517(2.546), 283.4MB/s 697.9MB/s 5.30% 2#mozilla 51220480-> 19067018(2.686), 225.9MB/s 665MB/s 3.20% 3#mozilla 51220480-> 18508283(2.767), 154.5MB/s 640.6MB/s 4.50% 1#mr 9970564-> 3840242(2.596), 262.4MB/s 899.8MB/s 2#mr 9970564-> 3600976(2.769), 181.2MB/s 717.9MB/s 3#mr 9970564-> 3563987(2.798), 116.3MB/s 620MB/s 1#mr 9970564-> 3840242(2.596), 253.2MB/s 827.3MB/s 8.76% 2#mr 9970564-> 3600976(2.769), 177.4MB/s 655.4MB/s 9.54% 3#mr 9970564-> 3563987(2.798), 111.2MB/s 564.2MB/s 9.89% 1#nci 33553445-> 2849306(11.78), 575.2MB/s , 1335.8MB/s 2#nci 33553445-> 2890166(11.61), 509.3MB/s , 1238.1MB/s 3#nci 33553445-> 2857408(11.74), 431MB/s , 1210.7MB/s 1#nci 33553445-> 2849306(11.78), 565.4MB/s , 1220.2MB/s 9.47% 2#nci 33553445-> 2890166(11.61), 508.2MB/s , 1128.4MB/s 9.72% 3#nci 33553445-> 2857408(11.74), 429.1MB/s , 1097.7MB/s 10.29% 1#ooffice 6152192-> 3590954(1.713), 231.4MB/s , 662.6MB/s 2#ooffice 6152192-> 3323931(1.851), 162.8MB/s , 592.6MB/s 3#ooffice 6152192-> 3145625(1.956), 99.9MB/s , 549.6MB/s 1#ooffice 6152192-> 3590954(1.713), 224.7MB/s , 624.2MB/s 6.15% 2#ooffice 6152192-> 3323931 (1.851), 155MB/s , 564.5MB/s 4.98% 3#ooffice 6152192-> 3145625(1.956), 101.1MB/s , 521.2MB/s 5.45% 1#osdb 10085684-> 3739042(2.697), 271.9MB/s 876.4MB/s 2#osdb 10085684-> 3493875(2.887), 208.2MB/s 857MB/s 3#osdb 10085684-> 3515831(2.869), 135.3MB/s 805.4MB/s 1#osdb 10085684-> 3739042(2.697), 257.4MB/s 793.8MB/s 10.41% 2#osdb 10085684-> 3493875(2.887), 209.7MB/s 776.1MB/s 10.42% 3#osdb 10085684-> 3515831(2.869), 130.6MB/s 727.7MB/s 10.68% 1#reymont 6627202-> 2152771(3.078), 198.9MB/s 696.2MB/s 2#reymont 6627202-> 2071140(3.200), 170MB/s 595.2MB/s 3#reymont 6627202-> 1953597(3.392), 128.5MB/s 609.7MB/s 1#reymont 6627202-> 2152771(3.078), 199.6MB/s 655.2MB/s 6.26% 2#reymont 6627202-> 2071140(3.200), 168.2MB/s 554.4MB/s 7.36% 3#reymont 6627202-> 1953597(3.392), 128.7MB/s 557.4MB/s 9.38% 1#samba 21606400-> 5510994(3.921), 338.1MB/s 1066MB/s 2#samba 21606400-> 5240208(4.123), 258.7MB/s 992.3MB/s 3#samba 21606400-> 5003358(4.318), 200.2MB/s 991.1MB/s 1#samba 21606400-> 5510994(3.921), 330.8MB/s 974MB/s 9.45% 2#samba 21606400-> 5240208(4.123), 257.9MB/s 919.4MB/s 7.93% 3#samba 21606400-> 5003358(4.318), 198.5MB/s 908.9MB/s 9.04% 1#sao 7251944-> 6256401(1.159), 194.6MB/s 602.2MB/s 2#sao 7251944-> 5808761(1.248), 128.2MB/s 532.1MB/s 3#sao 7251944-> 5556318(1.305), 73MB/s 509.4MB/s 1#sao 7251944-> 6256401(1.159), 198.7MB/s 580.7MB/s 3.70% 2#sao 7251944-> 5808761(1.248), 129.1MB/s 502.7MB/s 5.85% 3#sao 7251944-> 5556318(1.305), 74.6MB/s 493.1MB/s 3.31% 1#webster 41458703-> 13692222(3.028), 222.3MB/s 752MB/s 2#webster 41458703-> 12842646(3.228), 157.6MB/s 532.2MB/s 3#webster 41458703-> 12191964(3.400), 124MB/s 468.5MB/s 1#webster 41458703-> 13692222(3.028), 219.7MB/s 697MB/s 7.89% 2#webster 41458703-> 12842646(3.228), 153.9MB/s 495.4MB/s 7.43% 3#webster 41458703-> 12191964(3.400), 124.8MB/s 444.8MB/s 5.33% 1#xml 5345280-> 696652(7.673), 485MB/s , 1333.9MB/s 2#xml 5345280-> 681492(7.843), 405.2MB/s , 1237.5MB/s 3#xml 5345280-> 639057(8.364), 328.5MB/s , 1281.3MB/s 1#xml 5345280-> 696652(7.673), 473.1MB/s , 1232.4MB/s 8.24% 2#xml 5345280-> 681492(7.843), 398.6MB/s , 1145.9MB/s 7.99% 3#xml 5345280-> 639057(8.364), 327.1MB/s , 1175MB/s 9.05% 1#x-ray 8474240-> 6772557(1.251), 521.3MB/s 762.6MB/s 2#x-ray 8474240-> 6684531(1.268), 230.5MB/s 688.5MB/s 3#x-ray 8474240-> 6166679(1.374), 68.7MB/s 478.8MB/s 1#x-ray 8474240-> 6772557(1.251), 502.8MB/s 736.7MB/s 3.52% 2#x-ray 8474240-> 6684531(1.268), 224.4MB/s 662MB/s 4.00% 3#x-ray 8474240-> 6166679(1.374), 67.3MB/s 437.8MB/s 9.37% 7.51% * makefile changed to only pass -fno-tree-vectorize to gcc * <Replace this line with a title. Use 1 line only, 67 chars or less> Don't add "no-tree-vectorize" attribute on clang (which defines __GNUC__) * fix for warning/error with subtraction of void* pointers * fix c90 conformance issue - ISO C90 forbids mixed declarations and code * Fix assert for negative diff, only when there is no overlap * fix overflow revealed in fuzzing tests * tweak for small speed increase	2019-07-11 18:31:07 -04:00
Yann Collet	d1327738c2	updated double_fast complementary insertion in a way which is more favorable to compression ratio, though very slightly slower (~-1%). More details in the PR.	2019-07-11 15:25:22 -07:00
Yann Collet	b01c1c679f	Merge pull request #1675 from ephiepark/dev Factor out the logic to build sequences	2019-07-10 13:32:31 -07:00
Yann Collet	b8ec4b0fd6	updated version number (to v1.4.1) also : added doc on context re-use, as suggested by @scherepanov at #1676	2019-07-09 11:43:59 -07:00
Yann Collet	096714d1b8	Merge pull request #1671 from ephiepark/dev Adding targetCBlockSize param	2019-07-03 17:47:44 -07:00
Ephraim Park	f57ac7b09e	Factor out the logic to build sequences	2019-07-03 15:42:38 -07:00
Ephraim Park	9007701670	Adding targetCBlockSize param	2019-07-03 15:41:52 -07:00
Nick Terrell	6c92ba774e	ZSTD_compressSequences_internal assert op <= oend (#1667 ) When we wrote one byte beyond the end of the buffer for RLE blocks back in 1.3.7, we would then have `op > oend`. That is a problem when we use `oend - op` for the size of the destination buffer, and allows further writes beyond the end of the buffer for the rest of the function. Lets assert that it doesn't happen.	2019-07-02 15:45:47 -07:00
Yann Collet	857e608b51	Merge pull request #1658 from facebook/memset memset() rather than reduceIndex()	2019-07-01 15:01:43 -07:00
Yann Collet	4d611ca405	Merge pull request #1664 from ephiepark/dev decodecorpus	2019-07-01 14:13:49 -07:00
Tyler-Tran	c55d2e7ba3	Adding shrinking flag for cover and fastcover (#1656 ) * Changed ERROR(GENERIC) excluding inits * editing git ignore * Edited init functions to size_t returns * moved declarations earlier * resolved issues with changes to init functions * fixed style and an error check * attempting to add tests that might trigger changes * added && die to cases expecting to fail * resolved no die on expected failed command * fixed accel to be incorrect value * Adding an automated shrinking option * Fixing build * finalizing fixes * fix? * Removing added comment in cover.h * Styling fixes * Merging with fb dev * removing megic number for default regression * Requested revisions * fixing support for fast cover * fixing casting errors * parenthesis fix * fixing some build nits * resolving travis ci syntax * might resolve all compilation issues * removed unused variable * remodeling the selectDict function * fixing bad memory access * fixing error checks * fixed erroring check in selectDict * fixing mixed declarations * modify mixed declaration * fixing nits and adding test cases * Adding requested changes + fixed bug for error checking * switched double comparison from != to < * fixed declaration typing * refactoring COVER_best_finish() and changing shrinkDict * removing the const's * modifying ZDICT_optimizeTrainFromBuffer_cover functions * fixing potential bad memcpy * fixing the error function for dict size	2019-06-27 16:26:57 -07:00
Ephraim Park	c7c1ba3a19	Fix a constraint stricter than the spec	2019-06-26 16:43:37 -07:00
Yann Collet	621adde3b2	changed naming to ZSTD_indexTooCloseToMax() Also : minor speed optimization : shortcut to ZSTD_reset_matchState() rather than the full reset process. It still needs to be completed with ZSTD_continueCCtx() for proper initialization. Also : changed position of LDM hash tables in the context, so that the "regular" hash tables can be at a predictable position, hence allowing the shortcut to ZSTD_reset_matchState() without complex conditions.	2019-06-24 14:39:29 -07:00
Yann Collet	45c9fbd6d9	prefer memset() rather than reduceIndex() when close to index range limit by disabling continue mode when index is close to limit.	2019-06-21 16:19:21 -07:00
Yann Collet	944e2e9e12	benchfn : added macro macro CONTROL() like assert() but cannot be disabled. proper separation of user contract errors (CONTROL()) and invariant verification (assert()).	2019-06-21 15:58:55 -07:00
Nick Terrell	674534a700	[zstd] Fix data corruption in niche use case * Extract the overflow correction into a helper function. * Load the dictionary `ZSTD_CHUNKSIZE_MAX = 512 MB` bytes at a time and overflow correct between each chunk. Data corruption could happen when all these conditions are true: * You are using multithreading mode * Your overlap size is >= 512 MB (implies window size >= 512 MB) * You are using a strategy >= ZSTD_btlazy * You are compressing more than 4 GB The problem is that when loading a large dictionary we don't do overflow correction. We can only load 512 MB at a time, and may need to do overflow correction before each chunk.	2019-06-21 15:47:31 -07:00
Nick Terrell	4156060ca4	[zstdmt] Update assert to use ZSTD_WINDOWLOG_MAX	2019-06-21 15:39:33 -07:00
Nick Terrell	95e2b430ea	[opt] Add asserts for corruption in ZSTD_updateTree()	2019-06-21 15:22:29 -07:00
Yann Collet	9af909bf35	Merge pull request #1624 from facebook/smallwlog Improves compression ratio for small windowLog	2019-06-14 17:28:21 -07:00
Nick Terrell	cdb9481e38	[libzstd] Optimize ZSTD_insertBt1() for repetitive data We would only skip at most 192 bytes at a time before this diff. This was added to optimize long matches and skip the middle of the match. However, it doesn't handle the case of repetitive data. This patch keeps the optimization, but also handles repetitive data by taking the max of the two return values. ``` > for n in $(seq 9); do echo strategy=$n; dd status=none if=/dev/zero bs=1024k count=1000 \| command time -f %U ./zstd --zstd=strategy=$n >/dev/null; done strategy=1 0.27 strategy=2 0.23 strategy=3 0.27 strategy=4 0.43 strategy=5 0.56 strategy=6 0.43 strategy=7 0.34 strategy=8 0.34 strategy=9 0.35 ``` At level 19 with multithreading the compressed size of `silesia.tar` regresses 300 bytes, and `enwik8` regresses 100 bytes. In single threaded mode `enwik8` is also within 100 bytes, and I didn't test `silesia.tar`. Fixes Issue #1634.	2019-06-05 20:34:00 -07:00
Yann Collet	b3af1873a0	better title formatting for html documentation must pay attention to /** and /*! patterns.	2019-06-04 10:35:40 -07:00
Yann Collet	b5c98fbfd0	Added comments on I/O buffer sizes for streaming It seems this is still a confusing topic, as in https://github.com/klauspost/compress/issues/109 .	2019-06-04 10:26:16 -07:00
Yann Collet	80d6ccea79	removed UINT32_MAX apparently not guaranteed on all platforms, replaced by UINT_MAX.	2019-05-31 17:27:07 -07:00
Yann Collet	fce4df3ab7	fixed wrong assert in double_fast	2019-05-31 17:06:28 -07:00
Yann Collet	a968099038	minor code cleaning for new index invalidation strategy	2019-05-31 16:52:37 -07:00
Yann Collet	d605f482c7	make double_fast compatible with new index invalidation strategy	2019-05-31 16:50:04 -07:00
Yann Collet	a30febaeeb	Made fast strategy compatible with new offset validation strategy fast mode does the same thing as before : it pre-emptively invalidates any index that could lead to offset > maxDistance. It's supposed to help speed. But this logic is performed inside zstd_fast, so that other strategies can select a different behavior.	2019-05-31 16:34:55 -07:00
Yann Collet	58adb1059f	extended exact window size to greedy/lazy modes	2019-05-31 16:08:48 -07:00
Yann Collet	bc601bdc6d	first implementation of small window size for btopt noticeably improves compression ratio when window size is small (< 18). enwik7 level 19 windowLog `dev` `smallwlog` improvement 23 3.577 3.577 0.02% 22 3.536 3.538 0.06% 21 3.462 3.467 0.14% 20 3.364 3.377 0.39% 19 3.244 3.272 0.86% 18 3.110 3.166 1.80% 17 2.843 3.057 7.53% 16 2.724 2.943 8.04% 15 2.594 2.822 8.79% 14 2.456 2.686 9.36% 13 2.312 2.523 9.13% 12 2.162 2.361 9.20% 11 2.003 2.182 8.94%	2019-05-31 15:55:12 -07:00
Yann Collet	b13a9207f9	Merge pull request #1623 from facebook/fullbench fullbench minor improvements	2019-05-31 14:40:19 -07:00
Yann Collet	ed38b645db	fullbench: pass proper parameters in scenario 43	2019-05-29 15:26:06 -07:00
Yann Collet	9719fd616c	removed nextToUpdate3 from ZSTD_window it's now a local variable of ZSTD_compressBlock_opt()	2019-05-28 16:18:12 -07:00
Yann Collet	33dabc8c80	get bt matches : made it a bit clearer which parameters are input and output	2019-05-28 16:11:32 -07:00
Yann Collet	327cf6fac1	nextToUpdate3 does not need to be maintained outside of zstd_opt.c It's re-synchronized with nextToUpdate at beginning of each block. It only needs to be tracked from within zstd_opt block parser. Made the logic clear, so that no code tried to maintain this variable. An even better solution would be to make nextToUpdate3 an internal variable of ZSTD_compressBlock_opt_generic(). That would make it possible to remove it from ZSTD_matchState_t, thus restricting its visibility to only where it's actually useful. This would require deeper changes though, since the matchState is the natural structure to transport parameters into and inside the parser.	2019-05-28 15:26:52 -07:00
Yann Collet	6453f8158f	complementary code comments on variables used / impacted during maxDist check	2019-05-28 14:12:16 -07:00
Yann Collet	4baecdf72a	added comments to better understand enforceMaxDist()	2019-05-28 13:15:48 -07:00
Tyler-Tran	cb47871a0a	[dictBuilder] Be more specific than ERROR(generic) (#1616 ) * Specify errors at a finer granularity than `ERROR(generic)`. * Add tests for bad parameters in the dictionary builder.	2019-05-22 18:57:50 -07:00
Nick Terrell	5f228f8db2	[libzstd] Add a ZSTD_STATIC_ASSERT for BIT_DStream_status	2019-04-23 14:22:16 -07:00
Nick Terrell	a892e25374	[libzstd] Error if all sequence bits aren't consumed	2019-04-23 14:07:36 -07:00
Nick Terrell	0fd322f812	[legacy] Fix ZSTDv0_decodeSequence() Version <= 0.5 could read beyond the end of `dumps`, which points into the input buffer. * Check the validity of `dumps` before using it, if it is out of bounds return garbage values. There is no return code for this function. * Introduce `MEM_readLE24()` for simplicity, since I don't want to trust that there is an extra byte after `dumps`.	2019-04-19 11:34:52 -07:00
Nick Terrell	2536771134	[legacy] Fix Huffman jump table reads in v01 and v05	2019-04-18 16:20:42 -07:00
Nick Terrell	579f3d7794	[legacy] Fix bug in ZSTD_decodeSeqHeaders()	2019-04-18 13:41:10 -07:00
Nick Terrell	ac098c7f5f	[legacy] Fix a bug in ZSTDv06_findFrameSizeInfoLegacy()	2019-04-18 13:33:26 -07:00
Nick Terrell	ee130a9889	[libzstd] Check the size in readSkippableFrameSize()	2019-04-17 11:41:55 -07:00
Nick Terrell	5922f4e2ae	[legacy] Return the right error code	2019-04-17 11:34:52 -07:00
Nick Terrell	450feb0f95	[libzstd] Fix ZSTD_decompressBound() on bad skippable frames The function didn't verify that the skippable frame size is correct.	2019-04-17 11:29:42 -07:00
Nick Terrell	a17fe4c9e5	[visual] Fix unreachable code warning	2019-04-16 11:32:35 -07:00
Nick Terrell	de0499f7fa	[libzstd] Require ZSTD_MULTITHREAD to create a ZSTDMT_CCtx ZSTDMT was broken when compiled without ZSTD_MULTITHREAD defined, because `ZSTD_CCtx_setParameter(cctx, ZSTD_c_nbWorkers, nbWorkerss)` failed. It was detected by the MSVC test which runs the fuzzer with multithreading disabled. This is a very niche use case of a deprecated API, because the API is inefficient and synchronous, since `threading.h` will be synchronous. Users almost certainly don't want this, and anyone who tested their code should realize that it is broken. Therefore, I think it is safe to require `ZSTD_MULTITHREAD` to be defined to use ZSTDMT.	2019-04-15 23:04:46 -07:00
Josh Soref	a880ca239b	Spelling (#1582 ) * spelling: accidentally * spelling: across * spelling: additionally * spelling: addresses * spelling: appropriate * spelling: assumed * spelling: available * spelling: builder * spelling: capacity * spelling: compiler * spelling: compressibility * spelling: compressor * spelling: compression * spelling: contract * spelling: convenience * spelling: decompress * spelling: description * spelling: deflate * spelling: deterministically * spelling: dictionary * spelling: display * spelling: eliminate * spelling: preemptively * spelling: exclude * spelling: failure * spelling: independence * spelling: independent * spelling: intentionally * spelling: matching * spelling: maximum * spelling: meaning * spelling: mishandled * spelling: memory * spelling: occasionally * spelling: occurrence * spelling: official * spelling: offsets * spelling: original * spelling: output * spelling: overflow * spelling: overridden * spelling: parameter * spelling: performance * spelling: probability * spelling: receives * spelling: redundant * spelling: recompression * spelling: resources * spelling: sanity * spelling: segment * spelling: series * spelling: specified * spelling: specify * spelling: subtracted * spelling: successful * spelling: return * spelling: translation * spelling: update * spelling: unrelated * spelling: useless * spelling: variables * spelling: variety * spelling: verbatim * spelling: verification * spelling: visited * spelling: warming * spelling: workers * spelling: with	2019-04-12 11:18:11 -07:00
Nick Terrell	aafe97b67d	[libzstd] Switch dictUses to an enum	2019-04-10 16:50:35 -07:00
Nick Terrell	50b9c41196	[libzstd] Fix decompression dictionary bugs and clean up initialization Bugs: * `ZSTD_DCtx_refPrefix()` didn't clear the dictionary after the first use. Fix and add a test case. * `ZSTD_DCtx_reset()` always cleared the dictionary. Fix and add a test case. * After calling `ZSTD_resetDStream()` you could no longer load a dictionary, since the stage was set to `zdss_loadHeader`. Fix and add a test case. Cleanup: * Make `ZSTD_initDStream()` and `ZSTD_resetDStream()` wrap the new advanced API, and add test cases. Document the equivalent of these functions in the advanced API and document the unstable functions as deprecated.	2019-04-10 12:59:02 -07:00
Nick Terrell	824aaa695f	[libzstd] Fix ZSTD_decompressDCtx() with a dictionary * `ZSTD_decompressDCtx()` did not use the dictionary loaded by `ZSTD_DCtx_loadDictionary()`. * Add a unit test. * A stacked diff uses `ZSTD_decompressDCtx()` in the `dictionary_round_trip` and `dictionary_decompress` fuzzers.	2019-04-09 17:59:27 -07:00
Nick Terrell	48a6427d22	[libzstd] Fix ZSTD_compress2() for multithreaded compression `ZSTD_compress2()` wouldn't wait for multithreaded compression to finish. We didn't find this because ZSTDMT will block when it can compress all in one go, but it can't do that if it doesn't have enough output space, or if `ZSTD_c_rsyncable` is enabled. Since we will already sometimes block when using `ZSTD_e_end`, I've changed `ZSTD_e_end` and `ZSTD_e_flush` to guarantee maximum forward progress. This simplifies the API, and helps users avoid the easy bug that was made in `ZSTD_compress2()` * Found by the libfuzzer fuzzers. * Added a test case that catches the problem. * I will make the fuzzers sometimes allocate less than `ZSTD_compressBound()` output space.	2019-04-09 16:24:17 -07:00
Nick Terrell	e649fad7aa	[dictBuilder] Fix displayLevel for corpus warning Pass the displaylevel into the corpus warning, because it is used in fast cover and cover, so it needs to respect the local level.	2019-04-08 20:00:18 -07:00
Nick Terrell	bfcd5b81d7	[libzstd] Don't check the dictID in fuzzing mode When `FUZZING_BUILD_MODE_UNSAFE_FOR_PRODUCTION` is defined don't check the dictID. This check makes the fuzzers job harder, and it is at the very beginning.	2019-04-08 19:57:41 -07:00
Nick Terrell	947548c24f	Remove double the from README	2019-04-08 16:50:18 -07:00
Nick Terrell	641e594309	[libzstd] Remove ZSTDMT from the shared object * Remove ZSTDMT from the shared object by default. * Provide a macro `ZSTD_LEGACY_MULTITHREADED_API` to override it. * Document it in `lib/README.md`.	2019-04-07 18:47:52 -07:00
Nick Terrell	1dfe37fea9	[libzstd] Stabilize ZSTD_getDictID_*() functions	2019-04-05 18:59:30 -07:00
Nick Terrell	ce388fe4d2	[libzstd] Fix return value docs for ZSTD_compressStream2()	2019-04-05 17:44:07 -07:00
Nick Terrell	7231ea72a8	[libzstd] Reword the streaming docs for the new API	2019-04-03 19:21:05 -07:00
Nick Terrell	cf7d601bf5	Move the dictionary API and mark the legacy API * Move the dictionary API below the streaming API * Mark the legacy streaming API as redundant	2019-04-03 19:16:40 -07:00
Nick Terrell	d7d89513d6	Stabilize advance API This commit moves the candidate advanced API to the stable section. It makes some minor whitespace changes, but it doesn't change any of the wording of the documentation. I'll put up a separate PR that tweaks some of the documentation once this lands, so that it is easier to review. NOTE: Even though these functions are now in stable, they aren't stable until the next release (in under 1 month). It is possible that they change until then.	2019-04-03 18:43:20 -07:00
Nick Terrell	0827edeace	[libzstd] Bump the library version to 1.4.0 Bumps the library version to 1.4.0 in preparation to stabilize the advanced API.	2019-04-03 18:43:20 -07:00
Nick Terrell	72a3fbc0e4	Merge pull request #1562 from terrelln/2fast [libzstd] Speed up single segment zstd_fast by 5%	2019-04-03 18:08:15 -07:00
Nick Terrell	00679da22b	[libzstd] Setting ZSTD_d_maxWindowLog to 0 means default	2019-04-02 19:20:52 -07:00
Nick Terrell	95624b77e4	[libzstd] Speed up single segment zstd_fast by 5% This PR is based on top of PR #1563. The optimization is to process two input pointers per loop. It is based on ideas from [igzip] level 1, and talking to @gbtucker. \| Platform \| Silesia \| Enwik8 \| \|-------------------------\|-------------\|--------\| \| OSX clang-10 \| +5.3% \| +5.4% \| \| i9 5 GHz gcc-8 \| +6.6% \| +6.6% \| \| i9 5 GHz clang-7 \| +8.0% \| +8.0% \| \| Skylake 2.4 GHz gcc-4.8 \| +6.3% \| +7.9% \| \| Skylake 2.4 GHz clang-7 \| +6.2% \| +7.5% \| Testing on all Silesia files on my Intel i9-9900k with gcc-8 \| Silesia File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| silesia.tar \| +0.17% \| +6.6% \| \| dickens \| +0.25% \| +7.0% \| \| mozilla \| +0.02% \| +6.8% \| \| mr \| -0.30% \| +10.9% \| \| nci \| +1.28% \| +4.5% \| \| ooffice \| -0.35% \| +10.7% \| \| osdb \| +0.75% \| +9.8% \| \| reymont \| +0.65% \| +4.6% \| \| samba \| +0.70% \| +5.9% \| \| sao \| -0.01% \| +14.0% \| \| webster \| +0.30% \| +5.5% \| \| xml \| +0.92% \| +5.3% \| \| x-ray \| -0.00% \| +1.4% \| Same tests on Calgary. For brevity, I've only included files where compression ratio regressed or was much better. \| Calgary File \| Ratio Change \| Speed Change \| \|--------------\|--------------\|--------------\| \| calgary.tar \| +0.30% \| +7.1% \| \| geo \| -0.14% \| +25.0% \| \| obj1 \| -0.46% \| +15.2% \| \| obj2 \| -0.18% \| +6.0% \| \| pic \| +1.80% \| +9.3% \| \| trans \| -0.35% \| +5.5% \| We gain 0.1% of compression ratio on Silesia. We gain 0.3% of compression ratio on enwik8. I also tested on the GitHub and hg-commands datasets without a dictionary, and we gain a small amount of compression ratio on each, as well as speed. I tested the negative compression levels on Silesia on my Intel i9-9900k with gcc-8: \| Level \| Ratio Change \| Speed Change \| \|-------\|--------------\|--------------\| \| -1 \| +0.13% \| +6.4% \| \| -2 \| +4.6% \| -1.5% \| \| -3 \| +7.5% \| -4.8% \| \| -4 \| +8.5% \| -6.9% \| \| -5 \| +9.1% \| -9.1% \| Roughly, the negative levels now scale half as quickly. E.g. the new level 16 is roughly equivalent to the old level 8, but a bit quicker and smaller. If you don't think this is the right trade off, we can change it to multiply the step size by 2, instead of adding 1. I think this makes sense, because it gives a bit slower ratio decay. [igzip]: https://github.com/01org/isa-l/tree/master/igzip	2019-04-02 19:02:50 -07:00
Nick Terrell	56682a7709	Fix ZSTD_estimateCStreamSize_usingCCtxParams() It wasn't using the ZSTD_CCtx_params correctly. It must actualize the compression parameters by calling ZSTD_getCParamsFromCCtxParams() to get the real window log. Tested by updating the streaming memory usage example in the next commit. The CHECK() failed before this patch, and passes after. I also added a unit test to zstreamtest.c that failed before this patch, and passes after.	2019-04-01 18:02:52 -07:00
Nick Terrell	425ce5547c	Merge pull request #1563 from terrelln/dms-sep [libzstd] Split out zstd_fast dict match state function	2019-03-29 16:19:21 -06:00
Nick Terrell	f00407b640	Split out zstd_fast dict match state function	2019-03-29 10:39:16 -06:00
shakeelrao	dca73db30c	fix srcSize typo and add new UTIL func to comment	2019-03-28 17:50:34 -07:00
Nick Terrell	d0f5ba36fb	[cover] Improvements for small or homogeneous data * The algorithm would bail as soon as it found one epoch that contained no new segments. Change it so it now has to fail >= 10 times in a row (10 for fastcover, 10-100 for cover). * The algorithm uses the `maxDict` size to decide the epoch size. When this size is absurdly large, it causes tiny epochs. Lower bound the epoch size at 10x the segment size, and warn the user that their training set is too small. Fixes #1554	2019-03-22 14:14:46 -07:00
Nick Terrell	6b053b9f60	[lib] Allow ZSTD_CCtx_loadDictionary() to be called before parameters are set * After loading a dictionary only create the cdict once we've started the compression job. This allows the user to pass the dictionary before they set other settings, and is in line with the rest of the API. * Add tests that mix the 3 dictionary loading APIs. * Add extra tests for `ZSTD_CCtx_loadDictionary()`. * The first 2 tests added fail before this patch. * Run the regression test suite.	2019-03-21 16:13:53 -07:00
Nick Terrell	20f9ff7e53	Update documentation to tell how to replace the old streaming API with the new one.	2019-03-21 16:08:58 -07:00
Nick Terrell	e55da9e963	Wrap the new advanced api completely	2019-03-21 10:54:40 -07:00
shakeelrao	186ded6d91	Fix typo in legacy documentation	2019-03-19 01:44:08 -07:00
shakeelrao	5740eb6769	Remove extraneous spacing in comments	2019-03-18 21:05:35 -07:00
shakeelrao	0a3fa6f909	Add legacy mode in documentation	2019-03-18 20:33:15 -07:00
shakeelrao	20aa1b455c	Stylistic changes	2019-03-17 19:35:43 -07:00
shakeelrao	0033bb4785	Update documentation for ZSTD_frameSizeInfo	2019-03-17 17:41:27 -07:00
shakeelrao	19b75b6ecb	Test new ZSTD_findFrameCompressedSize and update documentation	2019-03-15 18:04:19 -07:00
shakeelrao	8cd423a659	Reorder declaration in ZSTD_findFrameSizeInfoLegacy	2019-03-15 16:20:34 -07:00
shakeelrao	60796e76b0	Add legacy support to decompressBound	2019-03-15 16:10:37 -07:00
Nick Terrell	f52a7d8faa	Merge pull request #1547 from shakeelrao/fix-error Fix incorrect error code in ZSTD_errorFrameSizeInfo	2019-03-15 10:57:49 -07:00
Nick Terrell	787b76904a	[libzstd] Allow compression parameters to be set with a cdict The order you set parameters in the advanced API is not supposed to matter. However, once you call `ZSTD_CCtx_refCDict()` the compression parameters cannot be changed. Remove that restriction, and document what parameters are used when using a CDict. If the CCtx is in dictionary mode, then the CDict's parameters are used. If the CCtx is not in dictionary mode, then its requested parameters are used.	2019-03-13 16:10:05 -07:00
Nick Terrell	0594e8135b	[libzstd] Free local cdict when referencing cdict We no longer care about the `cdictLocal` after calling `ZSTD_CCtx_refCDict()`, so we should free it to save some memory.	2019-03-13 14:54:31 -07:00
shakeelrao	79827a179f	Fix incorrectly assigned value in ZSTD_errorFrameSizeInfo As documented in `zstd.h`, ZSTD_decompressBound returns `ZSTD_CONTENTSIZE_ERROR` if an error occurs (not `ZSTD_CONTENTSIZE_UNKNOWN`). This is consistent with the error checking made in ZSTD_decompressBound, particularly line 545.	2019-03-13 01:23:07 -07:00
shakeelrao	9ad3f31d33	update documentation for decompressBound	2019-03-02 17:56:10 -08:00
shakeelrao	95dfd48143	update formatting	2019-03-01 23:11:15 -08:00
shakeelrao	1e08c49f75	add stylistic changes	2019-03-01 18:29:35 -08:00
shakeelrao	2bb5eec711	update missing error case to CONTENTSIZE_ERROR	2019-03-01 00:12:16 -08:00
shakeelrao	44ae395b3e	change nbBlocks to size_t for consistency	2019-03-01 00:05:59 -08:00
shakeelrao	03026c3b1d	change compressedBound to ULL	2019-03-01 00:03:50 -08:00
shakeelrao	8930c3c79b	implement API-level changes	2019-02-28 22:55:18 -08:00
shakeelrao	dce9a09772	initialize local vars in decompressBound	2019-02-28 03:01:21 -08:00
shakeelrao	515c506b4c	switch frameBound type to ULL	2019-02-28 02:10:17 -08:00
shakeelrao	d0a3f25697	change return type to ULL	2019-02-28 01:52:01 -08:00
shakeelrao	c9d674b60d	Remove autogenerated test file	2019-02-28 01:29:04 -08:00
shakeelrao	97d3d28dab	Fix decl-after-stmnt build error	2019-02-28 01:24:54 -08:00

... 6 7 8 9 10 ...

3499 Commits