AuroraMiddleware/zstd - zstd - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
W. Felix Handte	5390fee4f7	Rename and Move DD_BLOG Constant to ZSTD_LAZY_DDSS_BUCKET_LOG	2020-09-10 18:51:52 -04:00
W. Felix Handte	f1b428fdac	Rename enableDedicatedDictSearch to dedicatedDictSearch in MatchState This makes it clear that not only is the feature allowed here, we're actually using it, as opposed to the CCtxParam field, in which it's enabled, but we may or may not be using it.	2020-09-10 18:51:52 -04:00
W. Felix Handte	34b545acb0	Add a ZSTD_dedicatedDictSearch ZSTD_dictMode_e to Allow Const Propagation Speed +1.5%.	2020-09-10 18:51:52 -04:00
Bimba Shrestha	31e581bf65	adding enableDedicatedDictSearch to matchState_t	2020-09-10 18:51:52 -04:00
Bimba Shrestha	f10d4e313c	adding ZSTD_dedicatedDictSearch_defaultCParameters variable	2020-09-10 18:51:52 -04:00
Bimba Shrestha	c497cb6716	Add ZSTD_c_enableDedicatedDictSearch Param	2020-09-10 18:51:52 -04:00
Nick Terrell	a90779397a	[lib] Reduce zstd stack usage by 1KB	2020-09-09 14:35:39 -07:00
Nick Terrell	f91ed5c766	[lib] s/current/curr because it collides with Linux Kernel macro	2020-09-09 14:35:39 -07:00
Nick Terrell	c465f24457	ZSTD_ prefix mem{cpy,move,set},malloc,calloc,free	2020-08-26 12:26:03 -07:00
Niadb	216a63dcf7	Add files via upload	2020-07-28 02:52:52 -06:00
Nick Terrell	08981d2638	[lib] Allow compression dictionaries with missing symbols Allow compression to use dictionaries with missing symbols in their entropy tables. We set the FSE repeat mode to check when there are missing symbols, and set the FSE repeat mode to valid when all symbols are present. Note that when not all symbols are present, the heuristics which favor dictionary tables for lower compression levels won't activate. Tested by manually creating a dictionary with missing symbols of every type, and validing that the compressor rejects it before this change, and accepts it after this change. Also, I ran the `dictionary_loader` fuzzer for >1 hour of CPU time without running into cases where compression succeeds, but decompression fails. Fixes #2174.	2020-06-12 17:57:19 -07:00
Nick Terrell	4e0515916d	[lib] Fix repcode validation in no dict mode	2020-05-12 11:57:15 -07:00
W. Felix Handte	6028827fee	Rewrite Include Paths to be Relative Addresses #1998.	2020-05-04 15:20:26 -04:00
W. Felix Handte	c7da66c9cf	Purge C++-Style Comments (`// ...`), Make Compilation Succeed Under C90	2020-05-04 10:59:15 -04:00
W. Felix Handte	5e5f262612	Add (Possibly Empty) Info Strings to All Variadic Error Handling Macro Invocations	2020-05-04 10:58:55 -04:00
Nick Terrell	e103d7b4a6	Fix superblock mode (#2100 ) Fixes: Enable RLE blocks for superblock mode Fix the limitation that the literals block must shrink. Instead, when we're within 200 bytes of the next header byte size, we will just use the next one up. That way we should (almost?) always have space for the table. Remove the limitation that the first sub-block MUST have compressed literals and be compressed. Now one sub-block MUST be compressed (otherwise we fall back to raw block which is okay, since that is streamable). If no block has compressed literals that is okay, we will fix up the next Huffman table. Handle the case where the last sub-block is uncompressed (maybe it is very small). Before it would skip superblock in this case, now we allow the last sub-block to be uncompressed. To do this we need to regenerate the correct repcodes. Respect disableLiteralsCompression in superblock mode Fix superblock mode to handle a block consisting of only compressed literals Fix a off by 1 error in superblock mode that disabled it whenever there were last literals Fix superblock mode with long literals/matches (> 0xFFFF) Allow superblock mode to repeat Huffman tables Respect ZSTD_minGain(). Tests: Simple check for the condition in #2096. When the simple_round_trip fuzzer enables superblock mode, it checks that the compressed size isn't expanded too much. Remaining limitations: O(targetCBlockSize^2) because we recompute statistics every sequence Unable to split literals of length > targetCBlockSize into multiple sequences Refuses to generate sub-blocks that don't shrink the compressed data, so we could end up with large sub-blocks. We should emit those sections as uncompressed blocks instead. ... Fixes #2096	2020-05-01 16:11:47 -07:00
Bimba Shrestha	5b0a452cac	Adding --long support for --patch-from (#1959 ) * adding long support for patch-from * adding refPrefix to dictionary_decompress * adding refPrefix to dictionary_loader * conversion nit * triggering log mode on chainLog < fileLog and removing old threshold * adding refPrefix to dictionary_round_trip * adding docs * adding enableldm + forceWindow test for dict * separate patch-from logic into FIO_adjustParamsForPatchFromMode * moving memLimit adjustment to outside ifdefs (need for decomp) * removing refPrefix gate on dictionary_round_trip * rebase on top of dev refPrefix change * making sure refPrefx + ldm is < 1% of srcSize * combining notes for patch-from * moving memlimit logic inside fileio.c * adding display for optimal parser and long mode trigger * conversion nit * fuzzer found heap-overflow fix * another conversion nit * moving FIO_adjustMemLimitForPatchFromMode outside ifndef * making params immutable * moving memLimit update before createDictBuffer call * making maxSrcSize unsigned long long * making dictSize and maxSrcSize params unsigned long long * error on files larger than 4gb * extend refPrefix test to include round trip * conversion to size_t * making sure ldm is at least 10x better * removing break * including zstd_compress_internal and removing redundant macros * exposing ZSTD_cycleLog() * using cycleLog instead of chainLog * add some more docs about user optimizations * formatting	2020-04-17 15:58:53 -05:00
Nick Terrell	ac58c8d720	Fix copyright and license lines * All copyright lines now have -2020 instead of -present * All copyright lines include "Facebook, Inc" * All licenses are now standardized The copyright in `threading.{h,c}` is not changed because it comes from zstdmt. The copyright and license of `divsufsort.{h,c}` is not changed.	2020-03-26 17:02:06 -07:00
Bimba Shrestha	a89c45bdbd	Typo	2020-03-10 15:19:48 -05:00
Bimba Shrestha	dba3abc95a	Missed returns	2020-03-05 12:20:59 -08:00
Bimba Shrestha	a75e5f2ffc	bitscan add undef check	2020-03-05 11:52:15 -08:00
Nick Terrell	a11a9271d6	Fix lowLimit underflow in overflow correction	2020-01-17 12:10:18 -08:00
Nick Terrell	659e9f05cf	Fix null pointer addition	2019-11-20 18:36:04 -08:00
Yann Collet	4b1ac69f19	Merge pull request #1868 from senhuang42/superblocks_fixed Superblocks rebased for merge	2019-11-14 13:31:34 -08:00
Yann Collet	d67742bc5d	Merge pull request #1858 from senhuang42/dictionary_header_size Method to get dictionary header size	2019-11-14 09:44:07 -08:00
Sen Huang	b39149e156	Expose ZSTD_reset_compressedBlockState() to shared API	2019-11-08 13:57:26 -05:00
Sen Huang	6ce335371b	Add error forwarding to loadCEntropy(), make check for dictSize >= 8 from bad merge	2019-11-08 13:57:26 -05:00
Sen Huang	c787b351ea	Use ZSTD Error codes, improve explanation of ZSTD_loadCEntropy() and ZSTD_loadDEntropy()	2019-11-08 13:57:26 -05:00
Sen Huang	0bcaf6db08	First working pass at refactor of loadZstdDictionary()	2019-11-08 13:57:26 -05:00
Nick Terrell	8c474f9845	Fix parameter selection and adjustment with srcSize == 0	2019-11-07 08:58:43 -08:00
Sen Huang	7ce891870c	Fix merge conflicts	2019-11-05 15:51:25 -05:00
Yann Collet	fb77afc626	Merge pull request #1760 from bimbashrestha/extract_sequences_api Adding api for extracting sequences from seqstore	2019-10-10 13:11:18 -07:00
Nick Terrell	5cb7615f1f	Add UNUSED_ATTR to ZSTD_storeSeq()	2019-09-20 21:37:13 -07:00
Nick Terrell	5dc0a1d659	HINT_INLINE ZSTD_storeSeq() Clang on Mac wasn't inlining `ZSTD_storeSeq()` in level 1, which was causing a 5% performance regression. This fixes it.	2019-09-20 16:39:27 -07:00
Nick Terrell	44c65da97e	Remove literals overread in ZSTD_storeSeq() for ~neutral perf	2019-09-20 12:23:25 -07:00
Nick Terrell	fde217df04	Fix bounds check in ZSTD_storeSeq()	2019-09-20 08:25:12 -07:00
Nick Terrell	67b1f5fc72	Fix too strict assert	2019-09-20 01:23:35 -07:00
Nick Terrell	ddab2a94e8	Pass iend into ZSTD_storeSeq() to allow ZSTD_wildcopy()	2019-09-20 00:56:20 -07:00
Nick Terrell	efd37a64ea	Optimize decompression and fix wildcopy overread * Bump `WILDCOPY_OVERLENGTH` to 16 to fix the wildcopy overread. * Optimize `ZSTD_wildcopy()` by removing unnecessary branches and unrolling the loop. * Extract `ZSTD_overlapCopy8()` into its own function. * Add `ZSTD_safecopy()` for `ZSTD_execSequenceEnd()`. It is optimized for single long sequences, since that is the important case that can end up in `ZSTD_execSequenceEnd()`. Without this optimization, decompressing a block with 1 long match goes from 5.7 GB/s to 800 MB/s. * Refactor `ZSTD_execSequenceEnd()`. * Increase the literal copy shortcut to 16. * Add a shortcut for offset >= 16. * Simplify `ZSTD_execSequence()` by pushing more cases into `ZSTD_execSequenceEnd()`. * Delete `ZSTD_execSequenceLong()` since it is exactly the same as `ZSTD_execSequence()`. clang-8 seeds +17.5% on silesia and +21.8% on enwik8. gcc-9 sees +12% on silesia and +15.5% on enwik8. TODO: More detailed measurements, and on more datasets. Crdit to OSS-Fuzz for finding the wildcopy overread.	2019-09-19 21:07:14 -07:00
Yann Collet	bfff5b30a4	Merge pull request #1756 from mgrice/dev Improvements in zstd decode performance	2019-09-18 11:35:50 -07:00
Yann Collet	243200e5bf	minor refactor of ZSTD_fast - reduced variables lifetime - more accurate code comments	2019-09-17 14:02:57 -07:00
Bimba Shrestha	a874435478	Merge branch 'dev' into extract_sequences_api	2019-09-16 13:29:59 -07:00
Bimba Shrestha	9e7bb55e14	Addressing comments	2019-09-09 20:04:46 -07:00
W. Felix Handte	b511a84adc	Move Workspace Functions to Their Own File	2019-09-09 13:34:08 -04:00
W. Felix Handte	077a2d7dc9	Rename	2019-09-09 13:34:08 -04:00
W. Felix Handte	ebd162194f	Clean Up TODOs and Comments	2019-09-09 13:34:08 -04:00
W. Felix Handte	2abe0145b1	Improve Comments a Bit	2019-09-09 13:34:08 -04:00
W. Felix Handte	75d574368b	When Loading Dict By Copy, Always Put it in the Workspace	2019-09-09 13:34:08 -04:00
W. Felix Handte	e69b67e33a	Alloc Tables Separately	2019-09-09 13:34:08 -04:00
W. Felix Handte	6177354b36	Begin Introducing Phases	2019-09-09 13:34:08 -04:00

1 2 3 4

160 Commits