AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
W. Felix Handte	37f17ee237	Mark Repeated Offset Table as Needing Check	2018-08-24 14:33:34 -07:00
Nick Terrell	e34e917655	Fix compiler warning	2018-08-23 17:48:06 -07:00
Nick Terrell	924944e471	[zstd] Reuse the ZSTD_CCtx more often with small data.	2018-08-23 17:48:06 -07:00
Yann Collet	2e45badff4	refactored bench.c for clarity and safety, especially at interface level	2018-08-23 14:21:18 -07:00
Yann Collet	c71c4f23d7	fix "unused parameter" in single-thread mode within newly added ZSD_toFlushNow()	2018-08-20 11:40:10 -07:00
Yann Collet	105677c6db	created ZSTDMT_toFlushNow() tells in a non-blocking way if there is something ready to flush right now. only works with multi-threading for the time being. Useful to know if flush speed will be limited by lack of production.	2018-08-17 18:11:54 -07:00
Yann Collet	1515f0bb0d	fixed more issues detected by recent version of scan-build test run on Linux	2018-08-16 15:20:25 -07:00
Yann Collet	3692c31598	Merge branch 'dev' into scanbuild	2018-08-15 13:50:49 -07:00
Yann Collet	6e66bbf5dd	fixed several minor issues detected by scan-build only notable one : writeNCount() resists better vs invalid distributions (though it should never happen within zstd anyway)	2018-08-14 16:55:35 -07:00
Yann Collet	3e4617ef54	frameProgression reports nbActiveWorkers and output flushed	2018-08-14 11:49:25 -07:00
Yann Collet	e7a49c6683	introduced command --adapt	2018-08-11 20:48:06 -07:00
Yann Collet	2dd76037be	zstd cli can increase level when input is too slow	2018-08-09 15:51:30 -07:00
Yann Collet	79a35ac20d	minor code comments improvements	2018-08-09 15:16:31 -07:00
W. Felix Handte	2ca7c69167	Fix CDict Attachment to Handle CDicts with Non-Zero Starts CDicts were previously guaranteed to be generated with `lowLimit=dictLimit=0`. This is no longer true, and so the old length and index calculations are no longer valid. This diff fixes them to handle non-zero start indices in CDicts.	2018-08-07 18:14:14 -07:00
Yann Collet	5808027abf	Merge branch 'dev' into fix1241	2018-08-03 16:08:33 -07:00
Nick Terrell	dc5a67cb7b	Disallow tableLog == srcLog	2018-08-02 11:12:17 -07:00
cyan4973	aade1e5904	Merge branch 'dev' into fix1241	2018-07-30 16:30:35 +02:00
Nick Terrell	9889bca530	[FSE] Fix division by zero When the primary normalization method fails, and `(1 << tableLog) == (maxSymbolValue + 1)`, and every symbol gets assigned normalized weight 1 or -1 in the first loop, then the next division can raise `SIGFPE`.	2018-07-27 17:30:03 -07:00
Yann Collet	6e490a2f09	Merge pull request #1237 from terrelln/init-cstream-adv Set requestedParams in ZSTD_initCStream*()	2018-07-18 16:33:30 +02:00
cyan4973	9597b438e9	fix #1241 Ensure that first input position is valid for a match even during first usage of context by starting reference at 1 (avoiding the problematic 0).	2018-07-17 18:52:57 +02:00
cyan4973	53e1f0504e	zstdmt debug traces compatibles with mingw since mingw does not have `sys/times.h`, remove this path when detecting mingw compilation.	2018-07-17 14:39:44 +02:00
Nick Terrell	6d222c437c	Set requestedParams in ZSTD_initCStream() The correct parameters are used once, but once `ZSTD_resetCStream()` is called the default parameters (level 3) are used. Fix this by setting `requestedParams` in the `ZSTD_initCStream()` functions. The added tests both fail before this patch and pass after.	2018-07-12 18:35:55 -07:00
Yann Collet	4489daec09	slightly adjusted default-distribution threshold depending on strategy. fast favors faster compression and decompression speeds.	2018-06-26 20:10:45 -07:00
Nick Terrell	b426bcc097	[zstdmt] Fix jobsize bugs (#1205 ) [zstdmt] Fix jobsize bugs * `ZSTDMT_serialState_reset()` should use `targetSectionSize`, not `jobSize` when sizing the seqstore. Add an assert that checks that we sized the seqstore using the right job size. * `ZSTDMT_compressionJob()` should check if `rawSeqStore.seq == NULL`. * `ZSTDMT_initCStream_internal()` should not adjust `mtctx->params.jobSize` (clamping to MIN/MAX is okay).	2018-06-25 15:21:08 -07:00
Yann Collet	3b53bfe4f3	Merge pull request #1200 from felixhandte/zstd-attach-dict-pref Add CCtx Param Controlling Dict Attachment Behavior	2018-06-25 12:42:31 -07:00
Yann Collet	3934e010a2	Merge pull request #1197 from facebook/poolResize Thread Pool resize	2018-06-22 14:20:07 -07:00
Yann Collet	fbd5dfc1b1	changed POOL_resize() return type to int return is now just en error code. This guarantee that `ctx` remains valid after POOL_resize(). Gets rid of internal POOL_free() operation.	2018-06-22 12:14:59 -07:00
Yann Collet	1d5648ca10	Merge pull request #1196 from felixhandte/zstd-btopt-in-place-dict ZSTD_btopt: Support Searching the Dictionary Context In-Place	2018-06-22 11:53:23 -07:00
Yann Collet	f6242d30b7	Merge pull request #1202 from facebook/barelyCompressible Increase threshold detection of poorly compressible data	2018-06-22 11:52:52 -07:00
Yann Collet	698fd00afb	huf: increase threshold detection of poorly compressible data	2018-06-21 18:32:38 -07:00
W. Felix Handte	01bb1c1016	Add CCtx Param Controlling Dict Attachment Behavior	2018-06-21 17:29:25 -04:00
W. Felix Handte	3e91dc4d6a	Add Repcode Bounds Check	2018-06-21 15:54:41 -04:00
W. Felix Handte	5bd3d4b7d2	Add Debug Log Statement	2018-06-21 15:54:07 -04:00
W. Felix Handte	3caba150c6	Fix `dmsBtLow` Test	2018-06-21 15:53:40 -04:00
W. Felix Handte	5da9bbc38e	Conceivably Dedup ZSTD_noDict and ZSTD_dictMatchState _insertBt1 Impls By reverting to the bool extDict flag, we call ZSTD_insertBt1 with the same const args in both non-extDict dictModes.	2018-06-21 11:20:01 -04:00
W. Felix Handte	5d81f71e83	Consistency in Guarding DMS-Only Variable Initializations	2018-06-20 16:54:53 -04:00
W. Felix Handte	9c14eafe3d	Also Use `matchLow` for HC3 Match	2018-06-20 15:51:14 -04:00
W. Felix Handte	0a6cf7cd1d	Minor Changes	2018-06-20 15:27:23 -04:00
W. Felix Handte	ae1f3898a2	Remove Dead(!) HC3 DMS Lookup	2018-06-20 15:27:12 -04:00
Yann Collet	93702a7a62	Merge pull request #1198 from facebook/msdebug made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 12:26:31 -07:00
cyan4973	ae0b7ffa0a	made Visual Studio compatible with DEBUGLEVEL >= 2	2018-06-20 09:45:02 -07:00
Yann Collet	166901dc72	reduced POOL_resize() restriction It's not necessary to ensure that no job is ongoing. The pool is only expanded, existing threads are preserved. In case of error, the only option is to return NULL and terminate the thread pool anyway.	2018-06-19 18:07:18 -07:00
Yann Collet	066fbbfe1c	make zstdmt resize its context when nbThreads change. Technically, it only expands. But when instructed to use less threads, the thread pool will limit nb of concurrent threads.	2018-06-19 17:28:56 -07:00
Yann Collet	6768cf53fd	Merge pull request #1190 from terrelln/ldm-adjust Adjust advanced parameters to source size	2018-06-19 14:40:56 -07:00
W. Felix Handte	03c39c540b	Fix Incorrect Param	2018-06-19 15:36:33 -04:00
W. Felix Handte	de639502aa	Update Dict Attachment Cut-Offs	2018-06-19 15:36:13 -04:00
W. Felix Handte	f0a13bcd68	Make Sure Position 0 Gets Into the Tree	2018-06-19 15:10:06 -04:00
W. Felix Handte	87fe4788a3	Fix Compression Ratio Regression #1	2018-06-19 13:01:21 -04:00
W. Felix Handte	4bb79f9c55	Misc Changes	2018-06-19 13:01:21 -04:00
W. Felix Handte	2091f34e9e	Find Proper Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	64348a15f1	Misc Fixes	2018-06-19 13:01:21 -04:00
W. Felix Handte	ade8586ce6	Find `mls == 3` Matches	2018-06-19 13:01:21 -04:00
W. Felix Handte	ce743312e2	Fix Typo	2018-06-19 13:01:21 -04:00
W. Felix Handte	a075864756	Switch `!= ZSTD_extDict` to `== ZSTD_noDict`	2018-06-19 13:01:21 -04:00
W. Felix Handte	1e03377bde	Implement RepCode Check	2018-06-19 13:01:21 -04:00
W. Felix Handte	ccbf067973	Add _dictMatchState Functions	2018-06-19 13:01:21 -04:00
W. Felix Handte	d5d8240967	Convert `extDict` Flag to `dictMode` Enum	2018-06-19 13:01:21 -04:00
W. Felix Handte	93c3184d44	Attach Dicts when Using ZSTD_btopt and ZSTD_btultra	2018-06-19 13:01:21 -04:00
Nick Terrell	3841dbac84	Adjust advanced parameters to source size In the new advanced API, adjust the parameters even if they are explicitly set. This mainly applies to the `windowLog`, and accordingly the `hashLog` and `chainLog`, when the source size is known.	2018-06-18 15:49:31 -07:00
Yann Collet	e30f13bde0	Merge pull request #1185 from felixhandte/zstd-btlazy-in-place-dict ZSTD_btlazy2: Support Searching the Dictionary Context In-Place	2018-06-18 13:29:44 -07:00
Yann Collet	9698d2fb72	Merge pull request #1189 from facebook/hist histogram module	2018-06-14 20:39:52 -04:00
Yann Collet	6901c94cd6	avoid duplicate code comments when a function is decribed in hist.h, do not describe it again in hist.c to avoid future doc synchronization issues.	2018-06-14 19:47:05 -04:00
Yann Collet	a71513bec6	Merge pull request #1184 from facebook/debug Grouped debug functions into debug.h	2018-06-14 16:21:53 -04:00
W. Felix Handte	0c654d22c8	Force Inline BtFindBestMatch	2018-06-14 14:54:39 -04:00
Yann Collet	2d76defbfe	grouped all histogram functions into hist.c renamed functions with HIST_* prefix	2018-06-13 19:49:31 -04:00
W. Felix Handte	0551de4b5a	Search Dict for Matches	2018-06-13 16:06:28 -04:00
W. Felix Handte	ace9cfa950	Attach Dicts when Using ZSTD_btlazy2	2018-06-13 16:06:28 -04:00
Yann Collet	fa41bcc2c2	grouped debug functions into debug.h There were 2 competing set of debug functions within zstd_internal.h and bitstream.h. They were mostly duplicate, and required care to avoid messing with each other. There is now a single implementation, shared by both. Significant change : The macro variable ZSTD_DEBUG does no longer exist, it has been replaced by DEBUGLEVEL, which required modifying several source files.	2018-06-13 15:43:09 -04:00
W. Felix Handte	d53200a846	Fix Cast Warning	2018-06-13 14:58:36 -04:00
W. Felix Handte	b82063b266	Extend Dictionary Matches Backwards	2018-06-13 14:58:36 -04:00
W. Felix Handte	d53a04211c	Update Dictionary Attachment Cutoff Values Again	2018-06-13 14:58:36 -04:00
W. Felix Handte	2162aa9f18	Do Not Inline DMS Search Function	2018-06-13 14:58:36 -04:00
W. Felix Handte	338bede9b5	Also Implement Depth Repcode Checks	2018-06-13 14:58:36 -04:00
W. Felix Handte	555ab9f8cf	Apply Match Continuation Bug Fix	2018-06-13 14:58:36 -04:00
W. Felix Handte	c87dd2121d	Update Dictionary Attachment Cutoff Values	2018-06-13 14:58:36 -04:00
W. Felix Handte	6204b6d592	Check Dict Match State in ZSTD_HcFindBestMatch_generic	2018-06-13 14:58:36 -04:00
W. Felix Handte	211a61b69b	Focus on Non-BT Impls for the Moment	2018-06-13 14:58:36 -04:00
W. Felix Handte	2e93736a77	Remove Pre-Existing Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	3b82a23a35	Second Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	a2a24bebec	First Repcode Check	2018-06-13 14:58:36 -04:00
W. Felix Handte	f74c2cd673	Disallow Too-Long Repcodes When Using an Attached Dict	2018-06-13 14:58:36 -04:00
W. Felix Handte	c14db94450	Rename `base` -> `prefixLowest`	2018-06-13 14:58:36 -04:00
W. Felix Handte	5d90708a0a	Go Back to Separate Intermediate Functions for Different Dict Modes	2018-06-13 14:58:36 -04:00
W. Felix Handte	f84fc63a43	Further Templatize Intermediate Functions on dictMode	2018-06-13 14:58:36 -04:00
W. Felix Handte	529d3a5acd	Convert Existing U32 extDict Vars to ZSTD_dictMode Enums	2018-06-13 14:58:36 -04:00
W. Felix Handte	33e2240fac	Attach Dict When Using ZSTD_lazy Strategies	2018-06-13 14:58:36 -04:00
W. Felix Handte	90cfc799e5	Add _dictMatchState Stubs for ZSTD_lazy Functions	2018-06-13 14:58:36 -04:00
W. Felix Handte	a85ecb32bd	Add dictMode Param to ZSTD_compressBlock_lazy_generic	2018-06-13 14:58:36 -04:00
Yann Collet	b2632bcf6c	Merge pull request #1174 from duc0/document_default_level Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-12 12:09:01 -07:00
Duc Ngo	e34c000e44	Expose ZSTD_CLEVEL_DEFAULT and update documentation	2018-06-08 11:33:44 -07:00
Yann Collet	3050733042	Merge branch 'dev' into negLevels	2018-06-07 15:51:35 -07:00
Yann Collet	c2c47e24e0	support targetlen==0 with strategy==ZSTD_fast to mean "normal compression", targetlen >= 1 now means "disable huffman compression of literals"	2018-06-07 15:49:01 -07:00
Yann Collet	a57b4df85f	removed literalCompression directive in this version, literal compression is always disabled for ZSTD_fast strategy. Performance parity between ZSTD_compress_advanced() and ZSTD_compress_generic()	2018-06-07 15:24:12 -07:00
Yann Collet	8537bfd85c	fuzzer: make negative compression level fail result of ZSTD_compress_advanced() is different from ZSTD_compress_generic() when using negative compression levels because the disabling of huffman compression is not passed in parameters.	2018-06-07 15:12:13 -07:00
Yann Collet	e3c42c739b	clean ZSTD_compress() initialization The (pretty old) code inside ZSTD_compress() was making some pretty bold assumptions on what's inside a CCtx and how to init it. This is pretty fragile by design. CCtx content evolve. Knowledge of how to handle that should be concentrate in one place. A side effect of this strategy is that ZSTD_compress() wouldn't check for BMI2 capability, and is therefore missing out some potential speed opportunity. This patch makes ZSTD_compress() use the same initialization and release functions as the normal creator / destructor ones. Measured on my laptop, with a custom version of bench manually modified to use ZSTD_compress() (instead of the advanced API) : This patch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 312.2 MB/s , 723.8 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 226.2 MB/s , 649.8 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 169.4 MB/s , 636.7 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 136.7 MB/s , 619.2 MB/s dev branch : 1#silesia.tar : 211984896 -> 73651053 (2.878), 291.7 MB/s , 727.5 MB/s 2#silesia.tar : 211984896 -> 70163650 (3.021), 216.2 MB/s , 655.7 MB/s 3#silesia.tar : 211984896 -> 66996749 (3.164), 162.2 MB/s , 633.1 MB/s 4#silesia.tar : 211984896 -> 65998319 (3.212), 130.6 MB/s , 618.6 MB/s	2018-06-07 14:05:25 -07:00
Yann Collet	f1ea383f45	context can be sized down even with constant parameters when parameters are "equivalent", the context is re-used in continue mode, hence needed workspace size is not recalculated. This incidentally also evades the size-down check and action. This patch intercepts the "continue mode" so that the size-down check and action is actually triggered.	2018-06-06 15:04:12 -07:00
Yann Collet	e5e17d009f	changed member name to workSpaceOversizedDuration	2018-06-06 15:00:27 -07:00
Yann Collet	f7392f3dc9	added test case	2018-06-05 14:53:28 -07:00
Yann Collet	3d523c741b	added workSpaceTooLarge and workSpaceWasteful also : slightly increased speed of test fuzzer.16	2018-06-05 11:42:48 -07:00
Yann Collet	357c648c3f	changed a few variable names to unify naming convention	2018-06-04 17:10:50 -07:00
Yann Collet	2108decb41	Fixed a nasty corruption bug recently introduce into the new dictionary mode. The bug could be reproduced with this command : ./zstreamtest -v --opaqueapi --no-big-tests -s4092 -t639 error was in function ZSTD_count_2segments() : the beginning of the 2nd segment corresponds to prefixStart and not the beginning of the current block (istart == src). This would result in comparing the wrong byte.	2018-06-01 18:54:34 -07:00
Yann Collet	7c33b48221	Merge pull request #1151 from felixhandte/zstd-dfast-in-place-dict-goto ZSTD_dfast: Support Searching the Dictionary Context In-Place (Alternate `goto` Implementation)	2018-05-31 17:37:09 -07:00
W. Felix Handte	48deab92de	Allow Different Dict Attachment Cut-Offs for Different Strategies	2018-05-31 17:37:44 -04:00
W. Felix Handte	f86796639e	Remove Incorrect and Extraneous Repcode Bounds Check	2018-05-31 17:02:29 -04:00
Yann Collet	809f2f9322	minor update of literal cost function just assert() there is no negative cost evaluation for literals	2018-05-29 15:34:50 -07:00
Yann Collet	463a0fe38b	simplified optimal parser removed "cached" structure. prices are now saved in the optimal table. Primarily done for simplification. Might improve speed by a little. But actually, and surprisingly, also improves ratio in some circumstances.	2018-05-29 14:07:25 -07:00
Yann Collet	bb6eaf6495	Merge pull request #1153 from facebook/dynThreshold changed dynamic fse threshold for offset	2018-05-26 08:43:45 -07:00
Yann Collet	e916c365a1	fixed minor visual warning	2018-05-25 20:43:09 -07:00
Yann Collet	a7fdceeccd	changed dynamic fse threshold for offset recent experienced showed that default distribution table for offset can get it wrong pretty quickly with the nb of symbols, while it remains a reasonable choice much longer for lengths symbols. Changed the formula, so that dynamic threshold is now 32 symbols for offsets. It remains at 64 symbols for lengths. Detection based on defaultNormLog	2018-05-25 17:41:16 -07:00
Yann Collet	4b3a36d5d8	Merge branch 'dev' into lowCompression	2018-05-25 15:45:03 -07:00
Yann Collet	5f177f1c53	btultra accepts blocks with poorer compression ratio zstd rejects blocks which do not compress by at least a certain amount. In which case, such block is simply emitted uncompressed (even if a little bit of compression could be achieved). This is better for decompression speed, hence for energy. The logic is controlled by ZSTD_minGain(). The rule is applied uniformly, at all compression levels. This change makes btultra accepts blocks with poor compression ratios. We presume that users of btultra mode prefers compression ratio over some decompress speed gains. The threshold for minimum gain is lowered for btultra from s>>6 (~1.5% minimum gain) to s>>7 (~0.8% minimum gain). This is a prudent change. Not sure if it's large enough.	2018-05-25 15:19:52 -07:00
Yann Collet	e2c0e3d437	slightly nudge choices towards less sequences also slightly improve some strange detrimental corner cases.	2018-05-25 14:52:21 -07:00
W. Felix Handte	5b292b5685	Check Long + 1 Matches in Both Prefix and Dict in Bothe Short Match Paths	2018-05-25 13:13:57 -04:00
W. Felix Handte	88b733b380	Interleave Prefix and Dict Searches	2018-05-25 13:13:57 -04:00
W. Felix Handte	1850025156	Refactor ZSTD_dfast to Use `goto`s	2018-05-25 13:13:57 -04:00
W. Felix Handte	43606f9c83	... When I Said "HashTable", I Meant "ChainTable"	2018-05-25 13:13:28 -04:00
W. Felix Handte	ec7efe88f5	Fix Off-By-One Error	2018-05-25 13:13:28 -04:00
W. Felix Handte	2bfe43267e	Disallow Too-Long Repcodes When Using an Attached Dict	2018-05-25 13:13:28 -04:00
W. Felix Handte	b97ad3f457	Port Changes Made to ZSTD_fast to ZSTD_dfast	2018-05-25 13:13:28 -04:00
W. Felix Handte	2313cca1b7	Implement Second Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	0998f10813	Implement First Repcode Check	2018-05-25 13:13:28 -04:00
W. Felix Handte	50c5b2bb90	Find Dict Hash Table Matches	2018-05-25 13:13:28 -04:00
W. Felix Handte	7a25f7ef5b	Existing Repcode Check Only Applies to noDict Case	2018-05-25 13:13:28 -04:00
W. Felix Handte	8b241da4df	Properly Initialize Repcode Values	2018-05-25 13:13:28 -04:00
W. Felix Handte	7097a03749	Add Necessary Dict Variables	2018-05-25 13:13:28 -04:00
W. Felix Handte	aacbbf4f9a	Rename 'lowest' to 'localLowest' to Prepare to Introduce Dict Indices	2018-05-25 13:13:28 -04:00
W. Felix Handte	c10d1b4011	Skeleton for In-Place Impl for ZSTD_dfast	2018-05-25 13:13:28 -04:00
Yann Collet	f6ad59ab5c	Merge branch 'dev' into staticDictCost	2018-05-24 16:21:02 -07:00
Yann Collet	b5ef32fea7	Merge branch 'dev' into fracFse	2018-05-24 14:09:49 -07:00
Yann Collet	776128d16f	fix corner case when requiring cost of an FSE symbol ensure that, when frequency[symbol]==0, result is (tableLog + 1) bits with both upper-bit and fractional-bit estimates. Also : enable BIT_DEBUG in /tests	2018-05-24 13:59:11 -07:00
Yann Collet	08c5be5db3	Merge pull request #1117 from felixhandte/zstd-fast-in-place-dict ZSTD_fast: Support Searching the Dictionary Context In-Place	2018-05-23 19:32:25 -07:00
Nick Terrell	06b70179da	Work around bug in zstd decoder (#1147 ) Work around bug in zstd decoder Pull request #1144 exercised a new path in the zstd decoder that proved to be buggy. Avoid the extremely rare bug by emitting an uncompressed block.	2018-05-23 18:02:30 -07:00
W. Felix Handte	d9c7e67125	Assert that Dict and Current Window are Adjacent in Index Space	2018-05-23 17:53:03 -04:00
W. Felix Handte	298d24fa57	Make loadedDictEnd an Index, not the Dict Len	2018-05-23 17:53:03 -04:00
W. Felix Handte	7ef85e0618	Fixes in re Comments	2018-05-23 17:53:03 -04:00
W. Felix Handte	582b7f85ed	Don't Attach Empty Dict Contents In weird corner cases, they produce unexpected results...	2018-05-23 17:53:03 -04:00
W. Felix Handte	9c92223468	Avoid Undefined Behavior in Match Ptr Calculation	2018-05-23 17:53:03 -04:00
W. Felix Handte	a44ab3b475	Remove Out-of-Date Comment	2018-05-23 17:53:03 -04:00
W. Felix Handte	95bdf20a87	Moar Renames	2018-05-23 17:53:03 -04:00
W. Felix Handte	7e0402e738	Also Attach Dict When Source Size is Unknown	2018-05-23 17:53:03 -04:00
W. Felix Handte	3ba70cc759	Clear the Dictionary When Sliding the Window	2018-05-23 17:53:03 -04:00
W. Felix Handte	b05ae9b608	Refine ip Initialization to Avoid ARM Weirdness	2018-05-23 17:53:03 -04:00
W. Felix Handte	1a7b34ef28	Use New Index Invariant to Simplify Conditionals	2018-05-23 17:53:03 -04:00
W. Felix Handte	2d598e6fed	Force Working Context Indices Greater than Dict Indices	2018-05-23 17:53:03 -04:00
W. Felix Handte	d005e5daf4	Whitespace Fix	2018-05-23 17:53:03 -04:00
W. Felix Handte	154eb09419	Switch to Original Match Calc for noDict Repcode Check	2018-05-23 17:53:03 -04:00
W. Felix Handte	191fc74a51	Rename 'hasDict' to 'dictMode'	2018-05-23 17:53:03 -04:00
W. Felix Handte	ae4fcf7816	Respond to PR Comments; Formatting/Style/Lint Fixes	2018-05-23 17:53:03 -04:00
W. Felix Handte	ca26cecc7a	Rename and Reformat	2018-05-23 17:53:03 -04:00
W. Felix Handte	66bc1ca641	Change Cut-Off to 8 KB	2018-05-23 17:53:03 -04:00
W. Felix Handte	c31ee3c7f8	Fix Rep Code Initialization	2018-05-23 17:53:03 -04:00
W. Felix Handte	b67196f30d	Coalesce hasDictMatchState and extDict Checks into One Enum and Rename Stuff	2018-05-23 17:53:03 -04:00
W. Felix Handte	265c2869d1	Split Wrapper Functions to Cause Inlining	2018-05-23 17:53:03 -04:00
W. Felix Handte	6929964d65	Add bounds check in repcode tests	2018-05-23 17:53:03 -04:00
W. Felix Handte	70a537d1d7	Initial Repcode Check Support for Ext Dict Ctx	2018-05-23 17:53:03 -04:00
W. Felix Handte	8d24ff0353	Preliminary Support in ZSTD_compressBlock_fast_generic() for Ext Dict Ctx	2018-05-23 17:53:03 -04:00
W. Felix Handte	d18a405779	Refer to the Dictionary Match State In-Place (Sometimes)	2018-05-23 17:53:03 -04:00
Nick Terrell	e3959d5eba	Fixes	2018-05-22 16:06:33 -07:00
Yann Collet	7a8b3496b4	Merge branch 'dev' into staticDictCost	2018-05-22 15:10:05 -07:00
Yann Collet	a8ddf1d370	disable 2-passes strategy	2018-05-22 15:06:36 -07:00
Nick Terrell	49cf880513	Approximate FSE encoding costs for selection Estimate the cost for using FSE modes `set_basic`, `set_compressed`, and `set_repeat`, and select the one with the lowest cost. * The cost of `set_basic` is computed using the cross-entropy cost function `ZSTD_crossEntropyCost()`, using the normalized default count and the count. * The cost of `set_repeat` is computed using `FSE_bitCost()`. We check the previous table to see if it is able to represent the distribution. * The cost of `set_compressed` is computed with the entropy cost function `ZSTD_entropyCost()`, together with the cost of writing the normalized count `ZSTD_NCountCost()`.	2018-05-22 14:33:22 -07:00
Yann Collet	5381369cb1	Merge branch 'dev' into tableLevels	2018-05-18 18:23:27 -07:00
Yann Collet	b0b3fb517d	updated compression levels for blocks of 256KB	2018-05-18 17:17:12 -07:00
Yann Collet	5cbef6e094	Merge branch 'dev' into staticDictCost	2018-05-18 16:03:06 -07:00
Yann Collet	a95e9e80d1	adding some debug functions to observe statistics	2018-05-18 14:09:42 -07:00
Yann Collet	af3da079d1	fixed minor conversion warning	2018-05-17 17:27:27 -07:00
Yann Collet	8572b4d09f	fixed a pretty complex bug when combining ldm + btultra	2018-05-17 16:13:53 -07:00
Yann Collet	134388ba6b	collect statistics for first block in ultra mode this patch makes btultra do 2 passes on the first block, the first one being dedicated to collecting statistics so that the 2nd pass is more accurate. It translates into a very small compression ratio gain : enwik7, level 20: blocks 4K : 2.142 -> 2.153 blocks 16K : 2.447 -> 2.457 blocks 64K : 2.716 -> 2.726 On the other hand, the cpu cost is doubled. The trade off looks bad. Though, that's ultimately a price to pay to reach better compression ratio. So it's only enabled when setting btultra.	2018-05-17 12:24:30 -07:00
Yann Collet	a243020d37	slightly improved weight calculation translating into a tiny compression ratio improvement	2018-05-17 11:19:44 -07:00
Yann Collet	63eeeaa1dd	update table levels for blocks <= 16K also : allow hlog to be slighly larger than windowlog, as it's apparently good for both speed and compression ratio.	2018-05-16 16:13:37 -07:00
Yann Collet	18fc3d3cd5	introduced bit-fractional cost evaluation this improves compression ratio by a tiny amount. It also reduces speed by a small amount. Consequently, bit-fractional evaluation is only turned on for btultra.	2018-05-16 14:53:35 -07:00
Nick Terrell	30d9c84b1a	Fix failing Travis tests	2018-05-15 09:46:20 -07:00
Yann Collet	0b31304c8d	Merge branch 'dev' into staticDictCost	2018-05-14 18:09:26 -07:00
Yann Collet	2c26df0e13	opt: removed static prices after testing, it's actually always better to use dynamic prices albeit initialised from dictionary.	2018-05-14 18:04:08 -07:00
Yann Collet	f372ffc64d	Merge pull request #1127 from facebook/staticDictCost Improved optimal parser with dictionary	2018-05-14 17:45:50 -07:00
Yann Collet	c9227ee16b	update table for 128 KB blocks	2018-05-13 17:15:07 -07:00
Yann Collet	b4250489cf	update compression levels for large inputs	2018-05-13 01:53:38 -07:00
Yann Collet	761758982e	replaced FSE_count by FSE_count_simple to reduce usage of stack memory. Also : tweaked a few comments, as suggested by @terrelln	2018-05-11 16:03:37 -07:00
Yann Collet	99ddca43a6	fixed wrong assertion base can actually overflow	2018-05-10 19:48:09 -07:00
Yann Collet	09d0fa29ee	minor adjusting of weights	2018-05-10 18:13:48 -07:00
Yann Collet	1a26ec6e8d	opt: init statistics from dictionary instead of starting from fake "default" statistics.	2018-05-10 17:59:12 -07:00
Yann Collet	74b1c75d64	btopt : minor adjustment of update frequencies	2018-05-10 16:32:36 -07:00
Yann Collet	ac6105463a	opt: minor improvements to log traces slight improvement when using fractional-bit evaluation (opt:dictionay)	2018-05-09 15:46:11 -07:00
Yann Collet	c39061cb7b	fixed declaration-after-statement warning	2018-05-09 12:07:25 -07:00
Yann Collet	4d5bd32a00	added traces to look at symbol costs evaluation looks correct.	2018-05-09 12:00:12 -07:00
Yann Collet	c0da0f5e9e	switchable bit-approximation / fractional-bit accuracy modes also : makes it possible to select nb of fractional bits.	2018-05-09 10:48:09 -07:00
Yann Collet	ba2ad9b6b9	implemented fractional bit cost evaluation for FSE symbols. While it seems to work, the gains are negligible compared to rough maxNbBits evaluation. There are even a few losses sometimes, that still need to be explained. Furthermode, there are still cases where btlazy2 does a better job than btopt, which seems rather strange too.	2018-05-08 17:43:13 -07:00
Yann Collet	1aff63b114	opt: shift all costs by 8 bits (* 256) making it possible to represent fractional bit costs.	2018-05-08 16:19:04 -07:00
Yann Collet	6a3c34aa58	opt: estimate cost of both Hufman and FSE symbols For FSE symbols : provide an upper bound, in nb of bits, since cost function is not able to store fractional bit costs.	2018-05-08 16:11:21 -07:00
Yann Collet	338f738c24	pass entropy tables to optimal parser for proper estimation of symbol's weights when using dictionary compression. Note : using only huffman costs is not good enough, presumably because sequence symbol costs are incorrect.	2018-05-08 15:37:06 -07:00
Yann Collet	a155061328	minor code refactor for readability removed some useless operations from optimal parser (should not change performance, too small a difference)	2018-05-08 12:32:44 -07:00
Yann Collet	ad4524d605	fix ZSTD_compressBlock() associated with CDict reported by @let-def. It's actually a bug in ZSTD_compressBegin_usingCDict() which would pass a wrong pledgedSrcSize value (0 instead of ZSTD_CONTENTSIZE_UNKNOWN) resulting in wrong window size, resulting in downsized seqStore, resulting in segfault when writing into the seqStore later in the process. Added a test in fuzzer to cover this use case (fails before the patch).	2018-05-07 12:54:13 -07:00
Nick Terrell	ca77822ddf	Fix parameter adjustment with dictionary The new advanced API basically set `requestedParams = appliedParams` when using a dictionary. This halted all parameter adjustment, which can hurt compression ratio if, for example, the window log is small for the first call, but the rest of the files are large. This patch fixes the bug, and checks that the `requestedParams` don't change in the new advanced API when using a dictionary, and generally in the fuzzer.	2018-04-25 16:32:29 -07:00
Nick Terrell	c0987986e5	Only reset CDict in ZSTD_CCtx_resetParameters()	2018-04-13 11:26:40 -07:00
Nick Terrell	9f76eebd17	Add ZSTD_CCtx_resetParameters() function * Fix docs for `ZSTD_CCtx_reset()`. * Add `ZSTD_CCtx_resetParameters()`. Fixes #1094.	2018-04-12 16:54:07 -07:00
Nick Terrell	3c3f59e68f	Enforce pledgeSrcSize whenever known (#1106 ) The test fails before the patch and passes after. Fixes #1095.	2018-04-12 16:02:03 -07:00
Nick Terrell	280a236e9e	Add ZSTD_CCtx(Param)?_getParameter() function Closes #1096.	2018-04-12 11:50:12 -07:00
Nick Terrell	295ab0dbfa	Only load extra table positions for CDicts Zstdmt uses prefixes to load the overlap between segments. Loading extra positions makes compression non-deterministic, depending on the previous job the context was used for. Since loading extra position takes extra time as well, only do it when creating a `ZSTD_CDict`. Fixes #1077.	2018-04-02 14:41:30 -07:00
Yann Collet	29b021f9a0	Merge pull request #1067 from facebook/targetLength removed limit ZSTD_TARGETLENGTH_MAX	2018-03-22 10:38:33 -07:00
Nick Terrell	ad344033df	Fix broken assertion The `avgJobSize` must not be lower than 256 KB for single-pass mode. In `zstd.h` we say the minimum value for `ZSTD_p_jobSize` is 1 MB, so ensure that we always pick a size >= 1 MB. Found by libFuzzer fuzzer tests with large input limits.	2018-03-21 16:20:30 -07:00

... 2 3 4 5 6 ...

1189 Commits