AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Nick Terrell	39a6cc5172	Make ZSTD_compress_usingCDict() respect contentSizeFlag	2017-04-03 21:09:55 -07:00
Nick Terrell	62ecad3819	Fix ZSTD_initCStream_usingCDict() to use dictionary	2017-04-03 21:05:59 -07:00
Yann Collet	30c7698970	optimize ZSTDMT_compress() memory usage does no longer allocate temporary buffers when there is enough room in dstBuffer to decompress directly there. (previous method would skip that for 1st chunk only). Also : fix ZSTD_compressBound() for small srcSize	2017-03-31 18:27:03 -07:00
Yann Collet	3f75d52527	Changed ZSTD_compressBound() required so that if Total = A+B compressBound(Total) <= compressBound(A) + compressBound(B) under condition of a minimum size for A and B Will help for ZSTDMT_compress() memory allocation	2017-03-31 17:11:38 -07:00
Yann Collet	7b70a1969e	Merge branch 'dev' into zstdmt	2017-03-31 16:22:33 -07:00
Yann Collet	53203e7c38	Merge pull request #640 from facebook/memAccess Changed memory strategy to __packed for gcc	2017-03-31 15:49:12 -07:00
Yann Collet	eea7858e2b	fixed minor warnings in debug code	2017-03-30 16:47:19 -07:00
Yann Collet	34cc487d05	overlap at full windowSize for max compression level as it provides max compression ratio	2017-03-30 16:23:22 -07:00
Yann Collet	458e955c23	improved ZSTDMT_compress() Use a bit more threads by default. Uses overlap segments to boost compression ratio (like the streaming variant)	2017-03-30 15:51:58 -07:00
Yann Collet	6476c51b86	Merge pull request #637 from facebook/zstdmt Zstdmt	2017-03-30 14:18:37 -07:00
Yann Collet	274f59919d	Changed memory strategy to __packed for gcc Method 1 __packed is always as good or better than memcpy(). But it's not portable, as it depends on compiler extension. For gcc, __pakced directive works fine. Furthermore, gcc has serious performance issues with memcpy() on ARM 32 bits. See #620	2017-03-30 12:52:14 -07:00
Nick Terrell	5152fb2cb2	Convert all tabs to spaces	2017-03-29 18:51:58 -07:00
Yann Collet	ca5a8bbe36	re-added patch ...	2017-03-29 17:15:27 -07:00
Yann Collet	2e2e78de47	removed unnecessary restriction on minmatchLength it's now transparently translated to nearest value when unsupported (7->6) (3->4)	2017-03-29 16:02:47 -07:00
Yann Collet	26769d88bc	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-03-29 15:21:30 -07:00
Yann Collet	933ce4a1dd	fix : minmatch 7 conversion minmatch 7 now converted to minmatch 6 for strategies which do not support 7 Used to folded into "default", which applied minmatch 4	2017-03-29 14:35:38 -07:00
Sean Purcell	4708394bdd	Remove extra 'F' from skippable magic mask	2017-03-29 11:46:57 -07:00
Yann Collet	4cf0093571	restored bonus rule	2017-03-26 14:51:00 -07:00
Yann Collet	69017bf253	Merge branch 'dev' into LegacyDictBuilder	2017-03-26 14:39:13 -07:00
Yann Collet	582760818f	minor refactor add const changed if for easier to add new conditions	2017-03-26 03:04:56 -07:00
Yann Collet	858f72eeb8	fixed dictBuilder issue dictionary loading would fail during entropy analysis	2017-03-26 02:50:00 -07:00
Yann Collet	ecee9f2ef8	fixed conversion warnings	2017-03-26 00:59:14 -07:00
Yann Collet	0246d5c531	Merge pull request #630 from facebook/advancedCliCommands changed advanced commands --maxdict= and --dictID=	2017-03-26 00:13:35 -07:00
Yann Collet	4c41d37fcc	changed test for new syntax --dictID= and --maxdict=	2017-03-24 18:36:56 -07:00
Yann Collet	d41f707e88	minor improvement : remove duplicates with 1 char prefix difference	2017-03-24 17:56:45 -07:00
Yann Collet	b364caf455	Merge pull request #628 from facebook/dictBuilder_limits Ensure all limits derived from same constants	2017-03-24 17:54:42 -07:00
Yann Collet	2238870eb6	Merge pull request #625 from facebook/loadCDict limited CDict acceptation criteria to be the same as DDict	2017-03-24 16:06:20 -07:00
Yann Collet	96aa3019b2	changed advanced commands --maxdict= and --dictID= now works with the `=` variant, which is the recommended one. Old variant `--dictID #` still works, for compatibility with existing scripts. Long term objective is to remove the old variant..	2017-03-24 16:04:29 -07:00
Yann Collet	9da3b215ec	Ensure all limits derived from same constants Now uses ZDICT_DICTSIZE_MIN and ZDICT_CONTENTSIZE_MIN from zdict.h. Also : reduced values to 256 and 128 respectively	2017-03-24 15:02:09 -07:00
Yann Collet	ebe9963cf6	Merge pull request #626 from facebook/stricterDictBuilder dictBuilder fails to create dictionary on certain input	2017-03-24 14:27:28 -07:00
Yann Collet	16a0b10781	fixed ZSTD_loadZstdDictionary() forgot to add the dictionary content (tests were not failing, just compressing less). Also : added size protections when adding dict content since hc/bt table filling would fail if size < 8	2017-03-24 12:46:46 -07:00
Yann Collet	23776ce290	fixed ERROR_GENERIC on dstSize_tooSmall required by users which depends on this error code to size dest buffer	2017-03-23 17:59:50 -07:00
Yann Collet	f332ece468	dictBuilder fails to create dictionary on certain input Properly expressed with an error code (see zstd_errors.h) and a cli return code != 0	2017-03-23 16:24:02 -07:00
Yann Collet	bea78e8fc2	limited CDict acceptation criteria to be the same as DDict	2017-03-23 15:46:06 -07:00
Sean Purcell	042ba122ae	Change g_displayLevel to int and fix DISPLAYUPDATE flush	2017-03-23 11:21:59 -07:00
Nick Terrell	eaf69b07f0	Zero pointers after freeing	2017-03-21 13:20:59 -07:00
Yann Collet	f3dfcdccd1	bump version number	2017-03-21 12:18:28 -07:00
Przemyslaw Skibinski	8086d623ca	updated build of Windows packages	2017-03-18 11:19:09 +01:00
Yann Collet	7e35b352c6	Merge pull request #602 from iburinoc/doc Add functions missing from manual, and fix parameter alignment	2017-03-14 14:08:41 -07:00
Sean Purcell	dec2b96536	Add functions missing from manual, and fix parameter alignment	2017-03-14 11:24:09 -07:00
Sean Purcell	9830aeeea6	Fix legacy support=0 case and accidental double include of version headers	2017-03-13 17:19:37 -07:00
Sean Purcell	120df494e9	Update builds to not support legacy v01-v03	2017-03-13 14:44:08 -07:00
Sean Purcell	334cb34edb	ZSTD_LEGACY_SUPPORT defines lowest supported version	2017-03-13 14:32:30 -07:00
Sean Purcell	784082f49c	Change gotoDict type to uPtrDiff	2017-03-10 10:34:45 -08:00
Sean Purcell	8fe5c6862c	Fix undefined behaviour in decompressor	2017-03-10 10:17:42 -08:00
Nick Terrell	e65aab8e0f	Remove 'mem.h' dependency from ZSTD_WINDOWLOG_MAX	2017-03-08 15:40:13 -08:00
Yann Collet	a41a4ed39a	Merge pull request #594 from terrelln/bugs Small fixes	2017-03-08 14:56:07 -08:00
Nick Terrell	81512e9ebe	Avoid '#define inline /* ... */' Take definition of `FORCE_INLINE` from `zstd_internal.h`.	2017-03-08 14:00:21 -08:00
Nick Terrell	e06c303475	Fix ZSTD_sizeof_CStream()	2017-03-08 13:45:10 -08:00
Sean Purcell	881abe44f1	Reduce point at which we reduce offsets to protect against UB	2017-03-07 16:58:08 -08:00
Sean Purcell	3437bf2feb	Add build targets to the Makefile, and update CircleCI tests	2017-03-06 15:05:02 -08:00
Yann Collet	8b1d004031	added -Wformat-security flag, as recommended by @pixelb	2017-03-05 21:17:32 -08:00
Yann Collet	1f2c95c5f3	minor code refactor in HUF module	2017-03-05 21:07:20 -08:00
Yann Collet	5d801278dc	Merge pull request #586 from terrelln/repeat-heuristic Always check Huffman tables for ZSTD_lazy+	2017-03-03 19:38:56 -08:00
Nick Terrell	54c4babd8f	Always check Huffman tables for ZSTD_lazy+ The compressor always reuses the existing Huffman table if the literals size is at most 1 KiB. If the compression strategy is `ZSTD_lazy` or stronger always check to see if reusing the previous table or creating a new table is better. This doesn't yet weigh in decompression speed. I don't want to add any heuristics there until I have real data to work with to ensure that the heuristic works for at least one use case, preferably more.	2017-03-03 16:49:38 -08:00
Yann Collet	1af570bd05	Merge pull request #585 from terrelln/cover-leak Fix COVER_optimizeTrainFromBuffer() resource leaks	2017-03-02 20:46:35 -08:00
Yann Collet	f44b55c18d	Merge pull request #584 from terrelln/huff-repeat Allow compressor to repeat Huffman tables	2017-03-02 17:20:11 -08:00
Yann Collet	fe5d27062e	disable prefetch-decode for 32-bits target This decoder variant is detrimental to x86 architecture likely due to register pressure. Note that the variant is disabled for all 32-bits targets. It's unclear if it would help for different architectures, such as ARM, MIPS or PowerPC.	2017-03-02 17:09:21 -08:00
Nick Terrell	d051cd5b43	Use workspace for count and CTable	2017-03-02 16:38:07 -08:00
Nick Terrell	976e325b2e	Fix COVER_optimizeTrainFromBuffer() resource leaks Thanks to @nemequ for reporting the resource leaks.	2017-03-02 15:54:39 -08:00
Sean Purcell	553f67e0c1	Remove 'generic' inline strategy Seems to avoid performance loss for compression. Same strategy tested on decompression side, did not appear to improve speed.	2017-03-02 15:18:13 -08:00
Sean Purcell	3d95925a59	Merge remote-tracking branch 'origin/dev' into m32	2017-03-02 15:17:56 -08:00
Nick Terrell	a419777eb1	Allow compressor to repeat Huffman tables * Compressor saves most recently used Huffman table and reuses it if it produces better results. * I attempted to preserve CPU usage profile. I intentionally left all of the existing heuristics in place. There is only a speed difference on the second block and later. When compressing large enough blocks (say >= 4 KiB) there is no significant difference in compression speed. Dictionary compression of one block is the same speed for blocks with literals <= 1 KiB, and after that the difference is not very significant. * In the synthetic data, with blocks 10 KB or smaller, most blocks can't use repeated tables because the previous block did not contain a symbol that the current block contains. Once blocks are about 12 KB or more, most previous blocks have valid Huffman tables for the current block, and the compression ratio and decompression speed jumped. * In silesia blocks as small as 4KB can frequently reuse the previous Huffman table (85%), but it isn't as profitable, and the previous Huffman table only gets used about 3% of the time. * Microbenchmarks show that `HUF_validateCTable()` takes ~55 ns and `HUF_estimateCompressedSize()` takes ~35 ns. They are decently well optimized, the first versions took 90 ns and 120 ns respectively. `HUF_validateCTable()` could be twice as fast, if we cast the `HUF_CElt` to a `U32` and compare to 0. However, `U32` has an alignment of 4 instead of 2, so I think that might be undefined behavior. * I've ran `zstreamtest` compiled normally, with UASAN and with MSAN for 4 hours each. The worst case for the speed difference is a bunch of small blocks in the same frame. I modified `bench.c` to compress the input in a single frame but with blocks of the given block size, set by `-B`. Benchmarks on level 1: \| Program \| Block size \| Corpus \| Ratio \| Compression MB/s \| Decompression MB/s \| \|-----------\|------------\|-----------\|-------\|------------------\|--------------------\| \| zstd.base \| 256 \| synthetic \| 2.364 \| 110.0 \| 297.0 \| \| zstd \| 256 \| synthetic \| 2.367 \| 108.9 \| 297.0 \| \| zstd.base \| 256 \| silesia \| 2.204 \| 93.8 \| 415.7 \| \| zstd \| 256 \| silesia \| 2.204 \| 93.4 \| 415.7 \| \| zstd.base \| 512 \| synthetic \| 2.594 \| 144.2 \| 420.0 \| \| zstd \| 512 \| synthetic \| 2.599 \| 141.5 \| 425.7 \| \| zstd.base \| 512 \| silesia \| 2.358 \| 118.4 \| 432.6 \| \| zstd \| 512 \| silesia \| 2.358 \| 119.8 \| 432.6 \| \| zstd.base \| 1024 \| synthetic \| 2.790 \| 192.3 \| 594.1 \| \| zstd \| 1024 \| synthetic \| 2.794 \| 192.3 \| 600.0 \| \| zstd.base \| 1024 \| silesia \| 2.524 \| 148.2 \| 464.2 \| \| zstd \| 1024 \| silesia \| 2.525 \| 148.2 \| 467.6 \| \| zstd.base \| 4096 \| synthetic \| 3.023 \| 300.0 \| 1000.0 \| \| zstd \| 4096 \| synthetic \| 3.024 \| 300.0 \| 1010.1 \| \| zstd.base \| 4096 \| silesia \| 2.779 \| 223.1 \| 623.5 \| \| zstd \| 4096 \| silesia \| 2.779 \| 223.1 \| 636.0 \| \| zstd.base \| 16384 \| synthetic \| 3.131 \| 350.0 \| 1150.1 \| \| zstd \| 16384 \| synthetic \| 3.152 \| 350.0 \| 1630.3 \| \| zstd.base \| 16384 \| silesia \| 2.871 \| 296.5 \| 883.3 \| \| zstd \| 16384 \| silesia \| 2.872 \| 294.4 \| 898.3 \|	2017-03-02 13:27:52 -08:00
Yann Collet	fdb0fd34b3	Merge pull request #583 from terrelln/set-dictid Set dictID to 0 for content only dictionaries	2017-03-02 13:15:31 -08:00
Nick Terrell	3475b9b431	Set dictID to 0 for content only dictionaries	2017-03-02 12:33:02 -08:00
Sean Purcell	d44703d145	Offsets >= 32MB in 32-bits mode	2017-03-01 16:27:56 -08:00
Yann Collet	76f0494089	xxhash can be included twice in any order Previously, followed by : would fail to include the static definitions, because the second include was simply skipped by guard macro. Now it works as intended : the missing static part is included during the second include.	2017-03-01 13:29:29 -08:00
Yann Collet	4bcc69b761	solves warnings when compiling with global XXH_STATIC_LINKING_ONLY XXH_STATIC_LINKING_ONLY protection macro is intended to be triggered just before the include. The main idea is to keep this setting local : user module shall explicitly understand and accept the static linking restriction which becomes transparent when triggering the macro at project level. Global definition also triggers redefinition warnings for user modules which do locally define the macro. This new version compiles lib and cli without warning when the macro is set globally. That's not a scenario to be recommended, since it trades a local effect for a global one, but it was easy enough to provide from zstd side.	2017-03-01 11:33:25 -08:00
Yann Collet	31432cc57d	Merge pull request #579 from iburinoc/multiframe Check to ensure ddict isn't null before dereference	2017-03-01 11:02:04 -08:00
Sean Purcell	a81d4fee58	Check to ensure ddict isn't null before dereference	2017-02-28 15:28:29 -08:00
Yann Collet	22d79762ef	fixed multi frames	2017-02-28 02:12:42 -08:00
Yann Collet	a33ae64204	fixed decoding skippable frames	2017-02-28 01:15:28 -08:00
Yann Collet	d1760113ec	Improved speed of ZSTD_decompressStream() When ZSTD_decompressStream() detects that there is enough space in dst to complete decompression in a single pass, delegates to ZSTD_decompress(), for an extra ~5% speed boost	2017-02-28 00:14:28 -08:00
Yann Collet	a81c2e7e44	Merge pull request #573 from facebook/ddict Improved DDict memory usage	2017-02-27 20:54:42 -08:00
Yann Collet	dccd6b6f65	cli : fix : --rm is silent when input is stdin previously, app would produce an error message, and stop.	2017-02-27 15:57:50 -08:00
Yann Collet	0b9b894b2d	reduced ZSTD_DDict memory usage saved 128 KB	2017-02-27 00:27:30 -08:00
Yann Collet	bd7fa21deb	added ZSTD_refDDict() Now DDict does no longer depends on DCtx duplication	2017-02-26 14:43:07 -08:00
Yann Collet	d73eebc00f	loadEntropy works on new ZSTD_entropy_t type	2017-02-26 10:16:42 -08:00
Yann Collet	8629f0e41f	created entropy structure type	2017-02-25 18:33:31 -08:00
Yann Collet	8dff956dbf	Added DDict unit test in fuzzer also : slightly modified loadEntropy : know src must points at start of dictionary	2017-02-25 10:11:15 -08:00
Yann Collet	14312d833e	zstdmt : fix : loading prefix from previous segments There used to be a (very small) chance that loading prefix from previous segment would be confused with a real zstd dictionary. For that to happen, the prefix needs to start with the same value as dictionary magic. That's 1 chance in 4 billions if all values have equal probability. But in fact, since some values are more common (0x00000000 for example) others are less common, and dictionary magic was selected to be one of them, so probabilities are likely even lower. Anyway, this risk is no down to zero by adding a new CCtx parameter : ZSTD_p_forceRawDict Current parameter policy : the parameter "stick" to its CCtx, so any dictionary loading after ZSTD_p_forceRawDict is set will be loaded in "raw" ("content only") mode, even if CCtx is re-used multiple times with multiple different dictionary. It's up to the user to reset this value differently if it needs so.	2017-02-23 23:42:12 -08:00
Yann Collet	831b4890ce	minor tests/Makefile refactoring and update of zstd_manual,html	2017-02-23 23:09:10 -08:00
Yann Collet	cce8d8ba2b	Merge pull request #560 from iburinoc/findcompressedsize Change name to to findFrameCompressedSize and add skippable support	2017-02-23 13:39:23 -08:00
Sean Purcell	83038d236a	Fix bug in FSE distribution normalization	2017-02-22 13:52:48 -08:00
Sean Purcell	64417cd2ff	Describe ambiguity around skippable frames	2017-02-22 13:29:01 -08:00
Sean Purcell	9757cc811b	Update comment	2017-02-22 12:28:21 -08:00
Sean Purcell	9050e1925e	Change name to to findFrameCompressedSize and add skippable support	2017-02-22 12:12:34 -08:00
Przemyslaw Skibinski	d8114e5802	zstd_compress.c: fix memory leaks	2017-02-21 18:59:56 +01:00
Anders Oleson	517577bf53	spelling fixes in comments i.e. occurred labeled Huffman	2017-02-20 12:08:59 -08:00
Sean Purcell	6b010dec80	execSequence copies up to 2*WILDCOPY_OVERLENGTH extra	2017-02-16 12:05:40 -08:00
Sean Purcell	887eaa9e21	Fix wildcopy overwriting data still in window	2017-02-15 16:43:45 -08:00
Yann Collet	2252d29a5a	Merge branch 'dev' of github.com:facebook/zstd into dev	2017-02-15 12:00:50 -08:00
Yann Collet	4596037042	updated fse version feature minor refactoring (removing FSE_abs()) also : fix a few minor issues recently introduced in examples	2017-02-15 12:00:03 -08:00
Yann Collet	44f82d781f	Merge pull request #545 from terrelln/force-window [zstdmt] Fix MSAN failure with ZSTD_p_forceWindow	2017-02-15 10:20:15 -08:00
Yann Collet	f0b9a8dddb	Merge pull request #547 from inikep/dev11 Avoid fseek()'s 2GiB barrier with MacOS and *BSD	2017-02-14 12:29:00 -08:00
Yann Collet	9696bfc2ad	Merge pull request #544 from ds77/avoid-empty Portable way to avoid empty unit warning in threading.c	2017-02-14 00:54:55 -08:00
Przemyslaw Skibinski	b876b96ce1	Merge remote-tracking branch 'refs/remotes/facebook/dev' into dev11	2017-02-14 09:26:03 +01:00
Nick Terrell	ecf90ca24b	[zstdmt] Fix MSAN failure with ZSTD_p_forceWindow Reproduction steps: ``` make zstreamtest CC=clang CFLAGS="-O3 -g -fsanitize=memory -fsanitize-memory-track-origins" ./zstreamtest -vv -t4178 -i4178 -s4531 ``` How to get to the error in gdb (may be a more efficient way): * 2 breaks at zstd_compress.c:2418 -- in ZSTD_compressContinue_internal() * 2 breaks at zstd_compress.c:2276 -- in ZSTD_compressBlock_internal() * 1 break at zstd_compress.c:1547 Why the error occurred: When `zc->forceWindow == 1`, after calling `ZSTD_loadDictionaryContent()` we have `zc->loadedDictEnd == zc->nextToUpdate == 0`. But, we've really loaded up to `iend` into the dictionary. Then in `ZSTD_compressBlock_internal()` we see that `current > zc->nextToUpdate + 384`, so we load the last 192 bytes a second time. In this case the bytes we are loading are a block of all 0s, starting in the previous block. So when we are loading the last 192 bytes, we find a `match` in the future, 183 bytes beyond `ip`. Since the block is all 0s, the match extends to the end of the block. But in `ZSTD_count()` we only check that `pIn < pInLoopLimit`, but since `pMatch > pIn`, `pMatch` eventually points past the end of the buffer, causing the MSAN failure. The fix: The line changed sets sets `zc->nextToUpdate` to the end of the dictionary. This is the behavior that existed before `ZSTD_p_forceWindow` was introduced. This fixes the exposing test case. Since the code doesn't fail without `zc->forceWindow`, it makes sense that this works. I've run the command `./zstreamtest -T2mn` 64 times without failures. CI should also verify nothing obvious broke.	2017-02-13 19:11:22 -08:00
Yann Collet	58af614ef2	push version and NEWS to v1.1.4	2017-02-13 18:32:44 -08:00
ds77	08e6a88a97	avoid empty translation unit warning without #pragma	2017-02-14 00:46:47 +01:00

1 2 3 4 5 ...

1441 Commits