SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-10-19 11:30:15 +00:00

Author	SHA1	Message	Date
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
Alan Baker	755e5c9420	Transform to combine consecutive access chains * Combines OpAccessChain, OpInBoundsAccessChain, OpPtrAccessChain and OpInBoundsPtrAccessChain * New folding rule to fold add with 0 for integers * Converts to a bitcast if the result type does not match the operand type V	2018-07-31 13:42:47 -04:00
Alan Baker	b49f76fd62	Handle undef literal value in vector shuffle Fixes #1731 * Updated folding rules related to vector shuffle to account for the undef literal value: * FoldVectorShuffleFeedingShuffle * FoldVectorShuffleFeedingExtract * FoldVectorShuffleWithConstants * These rules would commit memory violations due to treating the undef literal value as an accessible composite component	2018-07-20 11:32:43 -04:00
Steven Perron	95b4d47e34	Fix infinite loop while folding OpVectorShuffle (#1722 ) When folding an OpVectorShuffle where the first operand is defined by an OpVectorShuffle, is unused, and is equal to the second, we end up with an infinite loop. This is because we think we change the instruction, but it does not actually change. So we keep trying to folding the same instruction. This commit fixes up that specific issue. When the operand is unused, we replace it with Null.	2018-07-13 12:43:00 -04:00
Steven Perron	63c1d8fb15	Fix size error when folding vector shuffle. (#1721 ) When folding a vector shuffle that feeds another vector shuffle causes the size of the first operand to change, when other indices have to be adjusted reletive to the new size.	2018-07-13 11:20:02 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	4cc6cd184a	Pass the IRContext into the folding rules. (#1709 ) This CL updates the folding rules to receive the IRContext as a paramter instead of retrieving off of the Instruction. Issue #1703	2018-07-12 09:12:23 -04:00
Steven Perron	e63551deac	Add folding rule to merge a vector shuffle feeding another one.	2018-07-11 14:44:46 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
dan sinclair	3dad1cda11	Change libspirv to spvtools namespace (#1678 ) This CL changes all of the libspirv namespace code to spvtools to match the rest of the code base.	2018-07-07 09:38:00 -04:00
dan sinclair	76e0bde196	Move utils/ to spvtools::utils Currently the utils/ folder uses both spvutils:: and spvtools::utils. This CL changes the namespace to consistenly be spvtools::utils to match the rest of the codebase.	2018-07-06 16:47:46 -04:00
Steven Perron	a45d4cac61	Move folding routines into a class The folding routines are currently global functions. They also rely on data in an std::map that holds the folding rules for each opcode. This causes that map to not have a clear owner, and therefore never gets deleted. There has been a request to delete this map. To implement this, we will create a InstructionFolder class that owns the maps. The IRContext will own the InstructionFolder instance. Then the global functions will become public memeber functions of the InstructionFolder. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1659.	2018-07-05 17:52:43 -04:00
Steven Perron	2eb9bfb5b6	Remove stores of undef. When storing an undef, any value is valid, including the one already in that memory location. So we can avoid the store.	2018-06-29 09:49:19 -04:00
Steven Perron	fe2fbee294	Delete the insert-extract-elim pass. Replaces anything that creates an insert-extract-elim pass and create a simplifiation pass instead. Then delete the implementation of the pass. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-06-01 10:13:39 -04:00
Steven Perron	745dd00af9	Fold FMix feeding Extract, and use the simplification pass. We add a new rule to the folding rules to fold an FMix feeding an extract when the alpha value for the element being extracted is either 0 or 1. In those case, we can simple extract from one of the operands to the FMix. With that change the simplification pass completely subsumes the insert-extract elimination pass. So we remove the insert-extract elimination passes and replce them with calls to the simplification pass. In a follow up PR, we should delete the insert-extract elimination pass. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-05-25 14:42:59 -04:00
Steven Perron	70bb3c1cc2	Fold divide and multiply by same value. We want to fold code like (x*y)/x and other permutations of this. Fixes #1531.	2018-05-02 10:18:37 -04:00
Steven Perron	e1bcd2b2d8	Fold OpVectorTimesScalar and OpPhi better. If one of the operands to an OpVectorTimesScalar instruction is zero, then the result will be the 0 vector. Currently we do not fold the insturction unless both operands are constants. This change fixes that. We also allow folding of OpPhi instructions where the incoming values are either an OpUndef or the OpPhi instruction itself. As with other cases, this can be simplified to the OpUndef.	2018-04-26 12:41:16 -04:00
Steven Perron	53bc1623ec	Fold OpDot Adding three rules to fold OpDot (implemented as two). - When an OpDot has two constants, then fold to the resulting const. - When one of the inputs is the 0 vector, then fold to zero. - When one of the inputs is a single 1 with 0s, then rewrite to an OpCompositeExtract of the appropriate element. This will help find even more folding opportunities. Contributes to #709.	2018-04-10 13:09:37 -04:00
Eleni Maria Stea	045cc8f75b	Fixes compile errors generated with -Wpedantic This patch fixes the compile errors generated when the options SPIRV_WARN_EVERYTHING and SPIRV_WERROR (that force -Wpedantic) are set to cmake.	2018-03-22 09:40:11 -04:00
Greg Fischer	077249b67f	Fix InsertFeedingExtract rule when extract remains.	2018-03-12 22:06:23 -04:00
Alan Baker	bc9cfee6fa	Fixes #1385 . Grab correct input to calculate indices. * Added tests to catch the bug	2018-03-07 16:07:40 -05:00
Alan Baker	5f50e6209c	Fixes #1376 . Don't handle half folding gracefully. * Added early returns to folding rules to prevent half attempts * Added some tests	2018-03-06 14:00:02 -05:00
Alan Baker	52bceb3569	Handles more cases of redundant selects * Handles OpConstantNull and vector types * vector selects (except against a null) are converted to vector shuffles * Added tests	2018-03-02 14:28:08 -05:00
Alan Baker	ce5941a642	Fixes #1357 . Support null constants better in folding * getFloatConstantKind() now handles OpConstantNull * PerformOperation() now handles OpConstantNull for vectors * Fixed some instances where we would attempt to merge a division by 0 * added tests	2018-02-28 23:12:27 -05:00
Alan Baker	9457cabbce	Fixes #1354 . Do not merge integer division. * Removes merging of div with a div or mul for integers * Updated tests	2018-02-28 13:33:21 -05:00
Steven Perron	588f4fcc95	Add more folding rules for vector shuffle. Adds rule to fold OpVectorShuffle with constant inputs. Adds rules to fold OpCompositeExtrac being fed by an OpVectorShuffle.	2018-02-27 21:20:22 -05:00
Alan Baker	802cf053c7	Merge arithmetic with non-trivial constant operands Adding basis of arithmetic merging * Refactored constant collection in ConstantManager * New rules: * consecutive negates * negate of arithmetic op with a constant * consecutive muls * reciprocal of div * Removed IRContext::CanFoldFloatingPoint * replaced by Instruction::IsFloatingPointFoldingAllowed * Fixed some bad tests * added some header comments Added PerformIntegerOperation * minor fixes to constants and tests * fixed IntMultiplyBy1 to work with 64 bit ints * added tests for integer mul merging Adding test for vector integer multiply merging Adding support for merging integer add and sub through negate * Added tests Adding rules to merge mult with preceding divide * Has a couple tests, but needs more * Added more comments Fixed bug in integer division folding * Will no longer merge through integer division if there would be a remainder in the division * Added a bunch more tests Adding rules to merge divide and multiply through divide * Improved comments * Added tests Adding rules to handle mul or div of a negation * Added tests Changes for review * Early exit if no constants are involved in more functions * fixed some comments * removed unused declaration * clarified some logic Adding new rules for add and subtract * Fold adds of adds, subtracts or negates * Fold subtracts of adds, subtracts or negates * Added tests	2018-02-27 13:02:13 -05:00
Arseny Kapoulkine	309be423cc	Add folding for redundant add/sub/mul/div/mix operations This change implements instruction folding for arithmetic operations that are redundant, specifically: x + 0 = 0 + x = x x - 0 = x 0 - x = -x x * 0 = 0 * x = 0 x * 1 = 1 * x = x 0 / x = 0 x / 1 = x mix(a, b, 0) = a mix(a, b, 1) = b Cache ExtInst import id in feature manager This allows us to avoid string lookups during optimization; for now we just cache GLSL std450 import id but I can imagine caching more sets as they become utilized by the optimizer. Add tests for add/sub/mul/div/mix folding The tests cover scalar float/double cases, and some vector cases. Since most of the code for floating point folding is shared, the tests for vector folding are not as exhaustive as scalar. To test sub->negate folding I had to implement a custom fixture.	2018-02-20 18:29:27 -05:00
Steven Perron	9d95a91a9f	Fix folding insert feeding extract I mixed up two cases when folding an OpCompositeExtract that is feed by and OpCompositeInsert. The specific cases are demonstracted in the new test. I mixed up the conditions for the cases, and treated one like the other. Fixes #1323.	2018-02-20 11:22:51 -05:00
Arseny Kapoulkine	32a8e04c7d	Add folding of redundant OpSelect insns We can fold OpSelect into one of the operands in two cases: - condition is constant - both results are the same Even if the original shader doesn't have either of these, if-conversion pass sometimes ends up generating instructions like %7127 = OpSelect %int %3220 %7058 %7058 And this optimization cleans them up.	2018-02-15 10:03:22 -05:00
Steven Perron	1d7b1423f9	Add folding of OpCompositeExtract and OpConstantComposite constant instructions. Create files for constant folding rules. Add the rules for OpConstantComposite and OpCompositeExtract.	2018-02-09 17:52:33 -05:00
Steven Perron	06cdb96984	Make use of the instruction folder. Implementation of the simplification pass. - Create pass that calls the instruction folder on each instruction and propagate instructions that fold to a copy. This will do copy propagation as well. - Did not use the propagator engine because I want to modify the instruction as we go along. - Change folding to not allocate new instructions, but make changes in place. This change had a big impact on compile time. - Add simplification pass to the legalization passes in place of insert-extract elimination. - Added test cases for new folding rules. - Added tests for the simplification pass - Added a method to the CFG to apply a function to the basic blocks in reverse post order. Contributes to #1164.	2018-02-07 23:01:47 -05:00
Steven Perron	bc1ec9418b	Add general folding infrastructure. Create the folding engine that will 1) attempt to fold an instruction. 2) iterates on the folding so small folding rules can be easily combined. 3) insert new instructions when needed. I've added the minimum number of rules needed to test the features above.	2018-02-02 12:24:11 -05:00

33 Commits