SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-11-25 21:10:04 +00:00

Author	SHA1	Message	Date
Alastair Donaldson	52e9cc9301	spirv-fuzz: Improve debugging facilities (#3074 ) Adds an option to run the validator on the SPIR-V binary after each fuzzer pass has been applied, to help identify when the fuzzer has made the module invalid. Also adds a helper method to allow dumping of the sequence of transformations that have been applied to a JSON file.	2019-11-27 18:05:56 +00:00
Steven Perron	54385458ca	Handle unreachable block when computing register pressure (#3070 ) Fixes #3053	2019-11-27 09:45:17 -05:00
greg-lunarg	868ca3954c	Improve RegisterSizePasses (#3059 )	2019-11-27 09:41:50 -05:00
David Neto	a62012cede	Add test with explicit example of stripping reflection info (#3064 ) * Add test with explicit example of stripping reflection info * Improve the comment on StripReflectEnd2EndExample	2019-11-26 16:20:45 -05:00
Sarah	8312c523ee	Permit the debug instructions in WebGPU SPIR-V (#3063 ) Add tests	2019-11-26 14:04:57 -05:00
Ryan Harrison	45dde9ad6d	Add missing dealloc (#3061 ) Fixes #3060	2019-11-20 10:38:35 -05:00
Ryan Harrison	2ee9aaa288	Initialize binary for use as guard later (#3058 ) Fixes #3057	2019-11-19 16:25:06 -05:00
Steven Perron	0391d0823e	Handle OpPhi with no in operands in value numbering (#3056 ) Fixes #3043	2019-11-19 09:45:39 -05:00
Ryan Harrison	57b4cb40b2	Convert stderr and stdout in status to strings on assignment (#3049 ) This avoids Python2 vs Python3 issues related to how we decode bytes later on in the tests. Switching over to using unittest instead of nosetest	2019-11-18 16:35:20 -05:00
alan-baker	ab3cdcaef5	Fix operand access of composite in upgrade memory model (#3021 ) Fixes #2992 * Accessing aggregate subtype used the wrong operand * Added a test	2019-11-12 13:41:38 -05:00
alan-baker	1a18d491f2	Validate array stride does not cause overlap (#3028 ) Fixes #3027 * Disallow array stride 0 * Check array stride against element size * Fix up tests * Add new tests	2019-11-12 13:36:53 -05:00
Ehsan	12e54dae16	Update Offset to ConstOffset bitmask if operand is constant. (#3024 ) Update Offset to ConstOffset bitmask if operand is constant. Fixes #3005	2019-11-11 22:35:14 -05:00
Alastair Donaldson	041f0a0249	spirv-fuzz: simplify transformation for replacing an id with a synonym (#3020 ) Prior to this change, TransformationReplaceIdWithSynonym was designed to be able to replace an id with some synonymous data descriptor, possibly necessitating extracting from a composite into a fresh id in order to get at the synonymous data. This change simplifies things so that TransformationReplaceIdWithSynonym just allows one id to be replaced by another id. It is the responsibility of the associated fuzzer pass - FuzzerPassApplyIdSynonyms - to perform the extraction operations, using e.g. TransformationCompositeExtract.	2019-11-07 16:19:06 +00:00
alan-baker	528c00c016	Re-enable OpReadClockKHR validation (#3013 ) Re-enable OpReadClockKHR validation Fixes #2952 * Refactor some common scope validation * Perform correct validation for scope in OpReadClockKHR * Scope must be Subgroup or Device * new tests	2019-11-07 09:51:38 -05:00
Alastair Donaldson	dc59b4b075	spirv-fuzz: vector shuffle transformation (#3015 ) Inroduces a new transformation that adds a vector shuffle instruction to the module, with associated facts about how the result vector of the shuffle relates to the input vectors. A fuzzer pass to add such transformations is not yet in place.	2019-11-06 17:11:54 +00:00
Alastair Donaldson	3724cfbea8	spirv-fuzz: better computation of data synonym facts (#3010 ) When a data synonym fact about two composites is added, data synonym facts between all sub-components of the composites are also added. Furthermore, when data synonym facts been all sub-components of two composites are known, a data synonym fact relating the two composites is added. Identification of this case is done in a lazy manner, when questions about data synonym facts are asked. The change introduces helper methods to get the size of an array type and the number of elements of a struct type, and fixes TransformationCompositeExtract to invalidate analyses appropriately.	2019-11-05 16:45:14 +00:00
Alastair Donaldson	fb6bac889e	spirv-fuzz: make equivalence classes deterministic (#3011 ) An equivalence relation is computed by traversing the tree of values rooted at the class's representative. Children were represented by unordered sets, meaning that the order of values in an equivalence class could be nondeterministic. This change makes things deterministic by representing children using a vector. The path compression optimization employed in the implementation of the underlying union-find data structure has the potential to change the order in which elements appear in an equivalence class by changing the structure of the tree, so the guarantee of determinism is limited to being a deterministic function of the manner in which the equivalence relation is updated and inspected.	2019-11-05 15:34:05 +00:00
Alastair Donaldson	f1e5cd73f6	spirv-fuzz: improvements to representation of data synonym facts (#3006 ) This change fixes a bug in EquivalenceRelation, changes the interface of EquivalenceRelation to avoid exposing (potentially nondeterministic) unordered sets, and changes the interface of FactManager to allow querying data synonyms directly. These interface changes have required a lot of corresponding changes to client code and tests.	2019-11-01 17:50:01 +00:00
Ryan Harrison	5f6fb2f346	Reset pointers before iterating in fuzzer to avoid double free (#3003 ) Fixes #3002	2019-11-01 11:39:05 -04:00
David Neto	618ee50942	Fix some clang-tidy issues in graphics_robust_access_pass (#2998 ) One remains: the fact that the image-texel-pointer modification is mostly dead code. But that's intentional for now.	2019-10-30 14:00:34 -04:00
greg-lunarg	5ea7099374	Add two new simplifications. (#2984 ) Implements the following simplifications: (a - b) + b => a (a * b) + (a * c) => a * (b + c) Also adds logic to simplification to handle rules that create new operations that might need simplification, such as the second rule above. Only perform the second simplification if the multiplies have the add as their only use. Otherwise this is a deoptimization of size and performance.	2019-10-28 08:19:38 -07:00
Alastair Donaldson	fac166162f	spirv-fuzz: Transformation to extract from a composite object (#2991 ) At present, TransformationReplaceIdWithSynonym both extracts elements from composite objects and replaces uses of ids with synonyms. This new TransformationCompositeExtract class will allow that transformation to be broken into smaller transformations.	2019-10-28 09:33:08 +00:00
Alastair Donaldson	ec12de9131	spirv-fuzz: rename class, and fix bug related to dominance (#2990 ) Class TransformationConstructComposite has been renamed to TransformationCompositeConstruct, to correspond to the name of the SPIR-V instruction (as is done with e.g. TransformationCopyObject). Running tests revealed an issue related to checking dominance in TransformationReplaceIdWithSynonym, which is also fixed here.	2019-10-27 18:11:07 +00:00
Alastair Donaldson	0dbd4e358a	spirv-fuzz: Rework management of data synonyms (#2989 ) This change uses the recently-added equivalence relation class to re-work the way synonyms between data values are managed by the fact manager. The tests for 'transformation_replace_id_with_synonym' have been temporarily removed. This is because those tests are going to be split into a number of test classes in an upcoming PR, once some other refactorings have been applied, and it would be burdensome to temporarily refactor all the tests to be in a working state for this intermediate change.	2019-10-25 17:37:55 +01:00
Alastair Donaldson	b34fa73193	spirv-fuzz: add class to represent equivalence relation (#2988 ) Adds a templated class for representing an equivalence relation on a value data type. This will be used by spirv-fuzz for representing sets of distinct pieces of data in a shader that are known to have equal values.	2019-10-25 12:46:52 +01:00
Alastair Donaldson	570582d8d6	spirv-fuzz: fuzzer pass to adjust memory access operands (#2968 ) A new pass that gives spirv-fuzz the ability to adjust the memory operand masks associated with memory access instructions (such as OpLoad and OpCopy Memory). Fixes #2940.	2019-10-22 18:05:35 +01:00
greg-lunarg	02910ffdff	Instrument: Add missing def-use analysis. (#2985 )	2019-10-22 07:24:54 -07:00
Alastair Donaldson	8357b878d1	spirv-fuzz: add missing functionality for matrix composites (#2974 ) Support for matrix composites had been omitted in a previous PR; this change adds the support that was missing. Fixes #2971.	2019-10-22 14:23:13 +01:00
Steven Perron	6a9be627c7	Keep NOPs when comparing with original binary (#2931 ) We have a check that ensures that the optimizer did not change the binary when it says that it did not. However, when the binary is converted back to a binary, we made a decision to remove OpNop instructions. This means that any spv file that contains a NOP originally will fail this check. To get around this, we convert the module to a second binary that keeps the OpNop instructions. That binary is compared against the original. Fixes https://crbug.com/1010191	2019-10-18 09:53:29 -04:00
alan-baker	2a3cbe7c3f	Check that derivatives operate on 32-bit values (#2983 ) * Add a check that derivative functions only operate on scalar or vector 32-bit floating point values * Added tests to disallow half derivatives	2019-10-18 09:02:25 -04:00
Jakub Kuderski	e3da3143b2	Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980 )	2019-10-17 13:53:34 -04:00
Ryan Harrison	2ca4fcfdc2	Add fuzzer for spirv-dis call path (#2977 ) Fixes #2970	2019-10-17 12:30:47 -04:00
Jakub Kuderski	e99b918221	Support constant-folding UConvert and SConvert (#2960 )	2019-10-16 16:29:55 -04:00
Ryan Harrison	8e89778531	Add fuzzer for spirv-as call path (#2976 ) Fixes #2969	2019-10-16 15:25:03 -04:00
Alastair Donaldson	00170cc5e6	spirv-fuzz: Refactor 'copy object' and 'construct composite' transformations (#2966 ) Rework these transformations to identify instructions via (base, opcode, skip-count) triples, rather than (base, offset) pairs.	2019-10-15 20:00:17 +01:00
David Neto	964dc52df5	Update SPIR-V binary header test for SPIR-V 1.5 (#2967 )	2019-10-15 18:29:10 +01:00
Alastair Donaldson	1b6fd37fa6	spirv-fuzz: Refactor 'split blocks' to identify instructions differently (#2961 ) This change refactors the 'split blocks' transformation so that an instruction is identified via a base, opcode, and number of those opcodes to be skipped when searching from the base, as opposed to the previous design which used a base and offset.	2019-10-14 17:00:46 +01:00
alan-baker	2276e59788	Validate that selections are structured (#2962 ) * Validate that selections are structured WIP * new checks that switch and conditional branch are proceeded by a selection merge where necessary * Don't consider unreachable blocks * Add some tests * Changed how labels are marked as seen * Moved check to more appropriate place * Labels are now marked as seen when there are encountered in a terminator instead of when the block is checked * more tests * more tests * Method comment * new test for a bad case	2019-10-11 17:01:30 -04:00
Alastair Donaldson	3eda1b9ff1	spirv-fuzz: Rework id descriptors (#2959 ) A refactoring that separates the identification of an instruction from the identification of a use in an instruction, to enable the former to be used independently of the latter.	2019-10-11 10:13:06 +01:00
Alastair Donaldson	eba98c4eb7	spirv-fuzz: Add fuzzer pass to add NoContraction decorations (#2950 ) A new pass that allows the fuzzer to add NoContraction decorations to arithmetic instructions. Fixes #2936.	2019-10-11 09:15:47 +01:00
Alastair Donaldson	91232f7f75	spirv-fuzz: Add fuzzer pass to change function controls (#2951 ) A new pass that allows the fuzzer to change the 'function control' operand of OpFunction instructions. Fixes #2939.	2019-10-11 07:10:47 +01:00
Paul Thomson	feb1549213	reduce: add large tests and fix (#2947 ) * Add larger reducer tests. * Fix conditional_branch_to_simple_conditional_branch_opportunity pass.	2019-10-10 17:12:42 +01:00
Alastair Donaldson	253806adc4	spirv-fuzz: Add fuzzer pass to change loop controls (#2949 ) A new pass that allows the fuzzer to change the 'loop control' operand (and associated literal operands) of OpLoopMerge instructions. Fixes #2938. Fixes #2943.	2019-10-10 13:34:38 +01:00
alan-baker	c1d42038f7	Disable scope validation for OpReadClockKHR (#2953 ) See #2952 Disabled until specification is clarified	2019-10-09 15:02:07 -04:00
Steven Perron	32f76efa6c	Link cfg and dominator analysis in the context (#2946 ) Fixes #2889	2019-10-08 10:16:18 -04:00
Alastair Donaldson	5910bb8e94	spirv-fuzz: add transformation and pass to construct composites (#2941 ) Adds a fuzzer pass and transformation to create a composite (array, matrix, struct or vector) from available constituent components, and inform the fact manager that each component of the new composite is synonymous with the id that was used to construct it. This allows the "replace id with synonym" pass to then replace uses of said ids with uses of elements extracted from the composite. Fixes #2858.	2019-10-08 14:04:10 +01:00
Paul Thomson	2f6a87f610	reduce: improve remove unref instr pass (#2945 ) * Remove Impl struct in Reducer; we can re-add it later (in a cleaner fashion) if we need to. * Add cleanup passes in Reducer; needed so that removal of constants can be disabled during the main passes, and then enabled during cleanup passes, otherwise some main passes can perform worse due to lack of available constants. * Delete passes: remove op name, remove relaxed precision. And delete associated tests. * Add more tests for remove unreferenced instructions. * Always return and write the output file, even if there was a reduction failure. * Only exit with 0 if the reduction completed or we hit the reduction step limit.	2019-10-08 13:02:34 +01:00
Alastair Donaldson	81d227f36b	spirv-fuzz: add disabled test to document known issue (#2942 ) Issue #2919 identifies a problem in spirv-fuzz's ability to determine when it is safe to add a new control flow edge without breaking dominance rules. This change adds a (currently disabled) test to expose the issue, and a comment to document that the current solution is incomplete.	2019-10-08 11:26:08 +01:00
Alastair Donaldson	26dba32c43	spirv-fuzz: Add fuzzer pass to change selection controls (#2944 ) A new pass that allows the fuzzer to change the 'selection control' operand of OpSelectionControl instructions. Fixes #2937.	2019-10-08 11:25:34 +01:00
Jeremy Hayes	3c7ff8d4f0	Enable OpTypeCooperativeMatrix specialization (#2927 )	2019-10-07 09:52:48 -04:00
Steven Perron	c18c9ff6bc	Handle OpKill better (#2933 ) We want to handle OpKill better. The wrap opkill causes lots of extra code to be generated, even when they are not needed to avoid the main problem: OpKill cannot be found directly in a continue construct. This change will be more selective on which functions the OpKill will be wrapped and inlining will avoid inlining. Fixes #2912	2019-10-04 13:05:32 -04:00
greg-lunarg	ad3d23f478	Generate null pointer by converting uint64 zero to pointer. (#2935 ) Fixes #2929.	2019-10-04 12:26:38 -04:00
Aaron Hagan	bc37fd585a	Add SPV_KHR_shader_clock validation (#2879 )	2019-10-03 13:35:35 -04:00
alan-baker	9d7428b052	Validate physical storage buffer restrictions (#2930 ) * Physical storage buffer cannot be used with OpConstantNull, OpPtrEqual, OpPtrNotEqual or OpPtrDiff * new tests * see also #2929	2019-10-02 21:12:57 -04:00
Steven Perron	9eb1c9a4c4	Add continue construct analysis to struct cfg analysis (#2922 ) * Add continue construct analysis to struct cfg analysis Add the ability to identify which blocks are in the continue construct for a loop, and to get functions that are called from those blocks, directly or indirectly. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/2912.	2019-10-01 10:27:09 -04:00
Steven Perron	85c67b5e08	Record trailing line dbg instructions (#2926 ) There is nothing in the spir-v spec that says the last instructions in a module cannot be OpLine or OpNoLine. However, the code that parses the module will simply drop these instructions. We add code that will preserve these instructions. Strip-debug-info is updated to remove these instructions. Fixes https://crbug.com/1000689.	2019-09-27 16:03:45 -04:00
Ryan Harrison	4075b921f9	Add removing references to debug instructions when removing them (#2923 ) Fixes #2921	2019-09-27 13:23:06 -05:00
alan-baker	10951a7c9a	Refactor the InstructionPass (#2924 ) * move checks to more appropriate locations * remove some duplicated checks * New function to check valid storage classes * updated tests	2019-09-27 00:06:36 -04:00
Alastair Donaldson	84b1976061	spirv-fuzz: do not allow a dead break to target an unreachable block (#2917 ) Because dominance information becomes a bit unreliable when blocks are unreachable, this change makes it so that the 'dead break' transformation will not introduce a break to an unreachable block. Fixes #2907.	2019-09-26 10:57:05 +01:00
alan-baker	510ca9d616	Only allow previously declared forward refs in structs (#2920 ) Fixes https://crbug.com/1008130 * Restore a missing check that the only valid forward references in structs are previously declared forward pointers	2019-09-25 18:11:22 -04:00
Steven Perron	2a11f365bc	Handle id overflow in wrap-opkill (#2916 ) New code in wrap-opkill does not handle id overflow correctly. We fix that up. Fixes https://crbug.com/1007144	2019-09-25 17:42:58 -04:00
Alastair Donaldson	70097c7761	spirv-fuzz: do not replace struct indices with synonyms (#2915 ) This change introduces a robust check for whether an index in an access chain is indexing into a struct, in which case the index needs to be an OpConstant and cannot be replaced with a synonym. Fixes #2906.	2019-09-25 16:52:35 +01:00
Alastair Donaldson	c1e03834e3	spirv-fuzz: Fixes to preconditions for adding dead break/continue edges (#2904 ) Issues #2898 and #2900 identify some cases where adding a dead continue would lead to an invalid module, and these turned out to be due to the lack of sensible dominance information when a continue target is unreachable. This change requires that the header of a loop dominates the loop's continue target if a dead continue is to be added. Furthermore, issue #2905 identified a shortcoming in the algorithm being used to identify when it is OK, from a dominance point of view, to add a new break/continue edge to a control flow graph. This change replaces that algorithm with a simpler and more obviously correct algorithm (that incidentally does not require the new edge to be a break/continue edge in particular). Fixes #2898. Fixes #2900. Fixes #2905.	2019-09-25 16:51:41 +01:00
Alastair Donaldson	7bc114ba2f	spirv-fuzz: do not replace a pointer argument to a function call with a synonym (#2901 ) Before this change, spirv-fuzz would replace a pointer argument to a function call with a synonym, which is problematic when the synonym is not a memory object declaration, since function call arguments are required to be memory object declarations. This change adds a check to ensure that such a replacement is not made. Fixes #2896.	2019-09-25 12:17:29 +01:00
Alastair Donaldson	290f6a820d	spirv-fuzz: do not replace boolean constant argument to OpPhi instruction (#2903 ) Before this change, spirv-fuzz would replace a constant boolean argument to an OpPhi with the result of a binary operation, inserting the instruction to compute the binary operation right before the OpPhi, leading to an invalid module. This change conservatively disallows replacing OpPhi arguments. Issue #2902 notes that there is scope for being less conservative. Fixes #2897.	2019-09-25 12:16:25 +01:00
alan-baker	527a689307	Remove validate_datarules.cpp (#2911 ) * Checks moved into individual opcode validation * removes duplicated checks * Add check that forward pointer points to struct	2019-09-24 17:55:12 -04:00
Steven Perron	55ea57a785	Handle extract with no indexes (#2910 ) * Handle extract with no indexes It is possible that OpCompositeExtract instructions will not have any indexes. This is not handled well by scalar replacement and instruction folding. Fixes https://crbug.com/1006435 * Fix typo.	2019-09-24 16:19:31 -04:00
Steven Perron	6f26d9ad81	Handle id overflow in convert local access chains (#2908 ) Fixes https://crbug.com/1004453	2019-09-24 14:04:54 -04:00
Alastair Donaldson	958f7e72a7	Employ the "swarm testing" idea in spirv-fuzz (#2890 ) This change to spirv-fuzz uses ideas from "Swarm Testing" (Groce et al. 2012), so that a random subset of fuzzer passes are enabled. These passes are then applied repeatedly in a randomized fashion, with the aggression with which they are applied being randomly chosen per pass. There is plenty of scope for refining the probabilities introduce in this change; this is just meant to be a reasonable first effort.	2019-09-23 16:29:19 +01:00
Steven Perron	6b07212659	Use OpReturn* in wrap-opkill (#2886 ) * Use OpReturn* in wrap-opkill The warp-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-20 10:32:27 -04:00
Alastair Donaldson	4653127262	Fix to CMakeLists for spirv-fuzz tests (#2888 ) A previous change that disabled long-running tests by default failed to enable short-running tests when long-running tests are enabled. This change fixes that problem.	2019-09-20 15:23:25 +01:00
Alastair Donaldson	7275a71654	Allow validation during spirv-fuzz replay (#2873 ) To aid in debugging issues in spirv-fuzz, this change adds an option whereby the SPIR-V module is validated after each transformation is applied during replay. This can assist in finding a transformation that erroneously makes the module invalid, so that said transformation can be debugged.	2019-09-20 10:54:09 +01:00
Alastair Donaldson	4eee71e78f	Disable long-running fuzzer tests by default (#2887 ) spirv-fuzz has useful tests that run the fuzzer and shrinker, to give the whole tool a good shake up, effectively "fuzzing the fuzzer". The problems that this detects are sensitive to the source of randomness that is used, which can change from test platform to test platform. It is thus not a good idea to run these tests by default during continuous integration - they may end up failing due to environtal factors, making it look like an unrelated change has broken the fuzzer when really the fuzzer has revealed an already-existing bug in itself. This change makes the tests disabled by default; they can enabled during dedicated testing of the fuzzer.	2019-09-20 09:43:26 +01:00
Steven Perron	61edde52a0	Revert "Use OpReturn* in wrap-opkill" This reverts commit `87f0fa432f`.	2019-09-19 22:39:56 -04:00
Steven Perron	87f0fa432f	Use OpReturn* in wrap-opkill The warp-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-19 22:34:57 -04:00
Steven Perron	248c80b049	Handle OpConstantNull in copy-prop-arrays. (#2870 ) Many of the places in copy propagate arrays assumes that integer constant will be defined by an OpConstant instruction. That is not always true. We fix these spots by allowing for an OpConstantNull.	2019-09-19 10:24:00 -04:00
Alastair Donaldson	e59b60de07	Fix detection of blocks bypassed by new edge (#2874 ) Fixes an issue where the blocks that would be bypassed by a new break or continue control flow edge were not properly detected. Fixes #2871.	2019-09-18 20:50:08 +01:00
Alastair Donaldson	0a07cd1c9a	Add fuzzer pass to replace ids with synonyms (#2857 ) If the fuzzer's fact manager knows that ids A and B are synonymous, it can replace a use of A with a use of B, so long as various conditions hold (e.g. the definition of B must dominate the use of A, and it is not legal to replace a use of an OpConstant in a struct's access chain with a synonym that is not an OpConstant). This change adds a fuzzer pass to sprinke such synonym replacements through the module.	2019-09-18 20:47:08 +01:00
alan-baker	bbb29870b5	Relaxed bitcast with pointers (#2878 ) * When input or result is a pointer type also allow 32-bit integer vectors for the other type * Relaxation only applies to SPIR-V 1.5 or in the presence of SPV_KHR_physical_storage_buffer * new tests	2019-09-18 11:55:39 -04:00
Raun Krisch	99793fa67d	Adding valilidation checks for OpEntryPoint duplicate names and execution mode (#2862 )	2019-09-16 19:13:30 -04:00
alan-baker	9325619353	Extra resource interface validation (#2864 ) * Vulkan specific checks * storage buffer variables must be structs or arrays of structs * storage buffer struct must be Block decorated * uniform struct must be Block or BufferBlock decorated * new tests	2019-09-16 10:46:31 -04:00
alan-baker	1e146e8a34	Split capability tests (#2866 )	2019-09-13 16:48:42 -04:00
alan-baker	5a48c0da15	SPIRV-Tools support for SPIR-V 1.5 (#2865 ) * Ensure same enum values have consistent extension lists * val: fix checking of capabilities The operand for an OpCapability should only be checked for the extension or core version. The InstructionPass registers a capability, and all its implied sub-capabilities before actually checking the operand to an OpCapability. * Add basic support for SPIR-V 1.5 - Adds SPV_ENV_UNIVERSAL_1_5 - Command line tools default to spv1.5 environment - SPIR-V 1.5 incorporates several extensions. Now the disassembler prefers outputing the non-EXT or non-KHR names. This requires updates to many tests, to make strings match again. - Command line tests: Expect SPIR-V 1.5 by default * Test validation of SPIR-V 1.5 incorporated extensions Starting with 1.5, incorporated features no longer require the associated OpExtension instruction.	2019-09-13 14:59:02 -04:00
Steven Perron	c7a39bc40f	Don't inline function containing OpKill (#2842 ) If an OpKill instruction is inlined into a continue construct, then the spir-v is no longer valid. To avoid this issue, we do inline into an OpKill at all. This method was chosen because it is difficult to keep track of whether or not you are in a continue construct while changing the function that is being inlined into. This will work well with wrap OpKill because every will still be inlined except for the OpKill instruction itself. Fixes #2554 Fixes #2433 This reverts commit `aa9e8f5380`.	2019-09-11 13:26:55 -04:00
Steven Perron	4f9256db35	Handle id overflow in wrap op kill. (#2851 ) Fixes https://crbug.com/997729	2019-09-11 13:26:42 -04:00
David Neto	9f188e3374	Assembler: Can't set an ID in instruction without result ID (#2852 ) Fix tests that violated this rule. Fixes #2257	2019-09-11 13:15:25 -04:00
Alastair Donaldson	7ee8f443ea	Fix add-dead-break and add-dead-continue passes to respect dominance (#2838 ) The implementation of these passes had overlooked the fact that adding a new edge to a control flow graph can change dominance information. Adding a dead break/continue risks causing uses to no longer be dominated by their definitions. This change introduces various tests to expose such scenarios, and augments the preconditions for these transformations with checks to guard against the situation.	2019-09-10 14:48:27 +01:00
Steven Perron	35c9518c4e	Handle id overflow in the ssa rewriter. (#2845 ) * Handle id overflow in the ssa rewriter. Remove LocalSSAElim pass at the same time. It does the same thing as the SSARewrite pass. Then even share almost all of the same code. Fixes crbug.com/997246	2019-09-10 09:38:23 -04:00
Steven Perron	7f7236f1eb	Handle id overflow in the constant manager. (#2844 ) Fixes crbug.com/997246	2019-09-09 15:12:26 -04:00
alan-baker	a464ac1a27	Add generic builtin validation of target (#2843 ) * Validate the target's opcode is acceptable * Update tests * New tests * move early exit for builtins a bit later in the pass	2019-09-09 14:53:30 -04:00
Steven Perron	76261e2a7d	Replace CubeFaceCoord and CubeFaceIndexAMD (#2840 ) Part of #2814.	2019-09-06 17:11:37 -04:00
Steven Perron	b218ad1994	Fold Min, Max, and Clamp instructions. (#2836 ) Fixes #2830.	2019-09-05 13:30:03 -04:00
Steven Perron	a41520eaa4	Replace uses of SPV_AMD_shader_trinary_minmax extension (#2835 ) Part of #2814	2019-09-05 09:29:04 -04:00
Ryan Harrison	19b256616d	For WebGPU<->Vulkan optimization, set correct execution environment (#2834 ) Fixes #2833	2019-09-04 13:08:58 -04:00
greg-lunarg	d11725b1d4	Add --relax-float-ops and --convert-relaxed-to-half (#2808 ) The first pass applies the RelaxedPrecision decoration to all executable instructions with float32 based type results. The second pass converts all executable instructions with RelaxedPrecision result to the equivalent float16 type, inserting converts where necessary.	2019-09-03 13:22:13 -04:00
Steven Perron	b54d950298	Fold Fmix should accept vector operands. (#2826 ) Fixes #2819	2019-09-03 09:17:18 -04:00
Steven Perron	d67130caca	Replace SwizzleInvocationsAMD extended instruction. (#2823 ) Part of #2814	2019-08-30 14:07:24 -04:00
Steven Perron	ad71c057c7	Replace SwizzleInvocationsMaskedAMD extended instruction. (#2822 ) Part of #2814	2019-08-30 10:48:42 -04:00
Steven Perron	35d98be3bc	Amd ext to khr (#2811 ) Add the first steps to removing the AMD extension VK_AMD_shader_ballot. Splitting up to make the PRs smaller. Adding utilities to add capabilities and change the version of the module. Replaces the instructions: OpGroupIAddNonUniformAMD = 5000 OpGroupFAddNonUniformAMD = 5001 OpGroupFMinNonUniformAMD = 5002 OpGroupUMinNonUniformAMD = 5003 OpGroupSMinNonUniformAMD = 5004 OpGroupFMaxNonUniformAMD = 5005 OpGroupUMaxNonUniformAMD = 5006 OpGroupSMaxNonUniformAMD = 5007 and extentend instructions WriteInvocationAMD = 3 MbcntAMD = 4 Part of #2814	2019-08-29 12:48:17 -04:00
Steven Perron	73422a0a5e	Check feature mgr in context consistency check (#2818 ) We add a check that the feature manager is correcter after each pass. This resulted in a couple failing tests cases. Those are fixed. Part of #2814	2019-08-28 11:49:16 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not tests what they were suppose to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
Alastair Donaldson	8336d1925f	Extend reducer to remove relaxed precision decorations (#2797 ) Adds a reduction pass that removes OpDecorate and OpMemberDecorate instructions that annotate instructions and members with RelaxedPrecision. As well as being useful in its own right, removing such references allows other passes to remove further instructions.	2019-08-22 23:33:09 +01:00
Steven Perron	b00ef0d26e	Handle Id overflow in private-to-local (#2807 ) We need to handle id overflow in the private to local pass. Fixes https://crbug.com/962295	2019-08-22 09:14:48 -04:00
Steven Perron	aef8f92b2b	Even more id overflow in sroa (#2806 ) Now we need to handle id overflow when we overflow while replacing uses of the variable. While looking at this code, I noticed an error in the way we handle access chains that cannot be replaced because of overflow. Name it will make some change, and then give up by returning SuccessWithoutChange. But it was changed. This is fixed up by returning Failure if we notice the error at the time of rewriting the users. This is for both id overflow or out-of-bounds accesses. Code is added to "CheckUses" to remove variables that have out-of-bounds accesses from the candidate list, so we don't even try to rewrite its uses. Fixes https://crbug.com/995032	2019-08-21 13:12:42 -04:00
Steven Perron	c5d1dab99e	Add name for variables in desc sroa (#2805 ) Fixes #2802.	2019-08-21 10:55:02 -04:00
Steven Perron	bc62722b80	Handle overflow in wrap-opkill (#2801 ) Fixes https://crbug/994203	2019-08-18 19:00:18 -04:00
Steven Perron	9cd07272a6	More handle overflow in sroa (#2800 ) If we run out of ids when creating a new variable, sroa does not recognize the error, and continues doing work. This leads to segmentation faults. Fixes https://crbug/969655	2019-08-16 13:15:17 -04:00
greg-lunarg	06407250a1	Instrument: Add support for Buffer Device Address extension (#2792 )	2019-08-16 09:18:34 -04:00
Toomas Remmelg	7b4e5bd5ec	Update remquo validation to match the OpenCL Extended Instruction Set Specification (#2791 )	2019-08-15 09:38:37 -04:00
alan-baker	bbd80462f5	Fix validation of constant matrices (#2794 ) Fixes #2793 * Don't special case matrix validation compared to other composites * just check the constituents are constants or undefs * later checking validates the column type * new test	2019-08-14 11:26:41 -04:00
Steven Perron	60043edfa1	Replace OpKill With function call. (#2790 ) We are no able to inline OpKill instructions into a continue construct. See #2433. However, we have to be able to inline to correctly do legalization. This commit creates a pass that will wrap OpKill instructions into a function of its own. That way we are able to inline the rest of the code. The follow up to this will be to not inline any function that contains an OpKill. Fixes #2726	2019-08-14 09:27:12 -04:00
Steven Perron	f701237f2d	Remove useless semi-colons (#2789 ) Later versions of clang seem to pick up more useless semi-colons. I've removed them.	2019-08-12 08:52:39 -04:00
greg-lunarg	95386f9e45	Instrument: Fix version 2 output record write for tess eval shaders. (#2782 ) Fix output record write for tess eval shaders. Also change command line for bindless instrumentation to use use output record version 2.	2019-08-09 08:22:41 -04:00
Steven Perron	4b64beb1ae	Add descriptor array scalar replacement (#2742 ) Creates a pass that will replace a descriptor array with individual variables. See #2740 for details. Fixes #2740.	2019-08-08 10:53:19 -04:00
greg-lunarg	29af42df12	Add SPV_EXT_physical_storage_buffer to opt whitelists (#2779 ) This also fixes ADCE to not remove possibly needed OpTypeForwardPointer. The bug, its fix and the corresponding test have a circular dependency with the extension, so they are packaged together.	2019-08-08 09:45:59 -04:00
Steven Perron	b029d3697e	Handle RelaxedPrecision in SROA (#2788 ) If a member of a struct has a relaxed precision, sroa will not split the struct. This means we do not get all cases. This commit handles these cases. The other part is that the decoration needs to be passed on to the new variables. Fixes #2786	2019-08-07 12:17:26 -04:00
Alastair Donaldson	698b56a8f0	Add 'copy object' transformation (#2766 ) This transformation can introduce an instruction that uses OpCopyObject to make a copy of some other result id. This change introduces the transformation, but does not yet introduce a fuzzer pass to actually apply it.	2019-08-05 18:00:13 +01:00
Ryan Harrison	5ada98d0bb	Update WebGPU validation rules of OpAtomic*s (#2777 ) Fixes #2723	2019-07-31 17:15:47 -04:00
alan-baker	3726b500b1	Treat access chain indexes as signed in SROA (#2776 ) Fixes #2768 * In scalar replacement, interpret access chain indexes as signed counts * Use Constant::GetSignExtendedValue and Constant::GetZeroExtendedValue where appropriate * new tests	2019-07-31 15:39:33 -04:00
David Neto	31590104ec	Add pass to inject code for robust-buffer-access semantics (#2771 ) spirv-opt: Add --graphics-robust-access Clamps access chain indices so they are always in bounds. Assumes: - Logical addressing mode - No runtime-array-descriptor-indexing - No variable pointers Adds stub code for clamping coordinate and samples for OpImageTexelPointer. Adds SinglePassRunAndFail optimizer test fixture. Android.mk: add source/opt/graphics_robust_access_pass.cpp Adds Constant::GetSignExtendedValue, Constant::GetZeroExtendedValue	2019-07-30 19:52:46 -04:00
Ryan Harrison	4a28259cc8	Update OpMemoryBarriers rules for WebGPU (#2775 ) Part of #2724	2019-07-30 14:50:55 -04:00
David Neto	7621034aae	Add opt test fixture method SinglePassRunAndFail (#2770 ) Checks for failure status code and matches against the expected error message.	2019-07-30 10:38:46 -04:00
Diego Novillo	49797609b7	Protect against out-of-bounds references when folding OpCompositeExtract (#2774 ) This fixes #2608. The original test case had an out-of-bounds reference that ended up folding into OpCompositeExtract that was indexing right outside the constant composite. The returned constant would then cause a segfault during constant propagation.	2019-07-29 13:27:40 -07:00
alan-baker	7fd2365b06	Don't move debug or decorations when folding (#2772 ) Fixes #2764 * Don't replace all uses when simplifying instructions, instead only update non-debug, non-decoration uses * added a test * Add a new version of RAUW that takes a predicate to decide whether to replace the use or not * used in simplification pass	2019-07-29 16:20:43 -04:00
Ryan Harrison	7bafeda284	Update OpControlBarriers rules for WebGPU (#2769 ) * Update OpControlBarriers rules for WebGPU Part of #2724	2019-07-29 12:53:27 -04:00
Diego Novillo	9559cdbdf0	Fix #2609 - Handle out-of-bounds scalar replacements. (#2767 ) * Fix #2609 - Handle out-of-bounds scalar replacements. When SROA tries to do a replacement for an OpAccessChain that is exactly one element out of bounds, the code was trying to access its internal array of replacements and segfaulting. This protects the code from doing this, and it additionally fixes the way SROA works by not returning failure when it refuses to do a replacement. Instead of failing the optimization pass, SROA will now simply refuse to do the replacement and keep going. Additionally, this patch fixes the SROA logic to now return a proper status so we can correctly state that the pass made no changes to the IR if it only found invalid references.	2019-07-26 12:33:40 -04:00
Alastair Donaldson	f54b8653dd	Limit fuzzer tests so that they take less time to run (#2763 ) The recently added fuzzer_replayer and fuzzer_shrinker tests were rather heavyweight and were leading to CI timeouts. This change reduces the runtime of those tests by having them do fewer iterations.	2019-07-25 13:09:49 -04:00
Steven Perron	bb0e2f65bb	Fix check for unreachable blocks in merge-return (#2762 ) Merge return expects unreachable merge block to look a certain way, and unreachable continue blocks to look a certain way. What if an unreachable block is both a merge and a continue? The continue is suppose to take precedent, but merge-return implements it with the merge taking precedent. This change flips that around. Fixes #2746	2019-07-25 09:34:18 -04:00
Alastair Donaldson	1a89ac8b28	Transformation and fuzzer pass to add dead continues (#2758 ) Similar to the existing 'add dead breaks' pass, this adds a pass to add dead continues to blocks in loops where such a transformation is viable. Various functionality common to this new pass and 'add dead breaks' has been factored into 'fuzzer_util', and some small improvements to 'add dead breaks' that were identified while reviewing that code again have been applied. Fixes #2719.	2019-07-25 13:50:33 +01:00
Ryan Harrison	65f49dfc39	Remove unneeded future imports (#2739 ) Also, adds explicitly setting python executable in the NDK build script, rewrites some Python2-isms to 3isms, and formats some code. Fixes #2738	2019-07-24 15:29:38 -04:00
Steven Perron	c7fcb8c3b9	Process OpDecorateId in ADCE (#2761 ) * Process OpDecorateId in ADCE When there is an OpDecorateId instruction that is live, the ids that is references must be kept live. This change adds them to the worklist. I've also updated a validator check to allow OpDecorateId to be able to apply to decoration groups. Fixes #1759. * Remove dead code.	2019-07-24 14:43:49 -04:00
Steven Perron	fb83b6fbb5	Record correct dominators in merge return (#2760 ) In merge return, we need to know the original dominator for a block in order to traverse code from the original dominator to the new dominator and add appropriate Phi nodes. The current code gets this wrong because the dominator tree is build as needed. The first time we get the immediate dominator for a function we just built the dominator tree and it takes into account that a block has been split. The second time it does not. This inconsistency needs to be fixed. We do that by recording the original dominator for all blocks at the start of the pass. If we were to record just the basic block, that could change if the block is split. We want to traverse the code in the body of the original dominator, whatever block it ends up in. To make this easy to track, we not save the terminator instruction to represent the original dominator. Fixes #2745	2019-07-24 13:56:54 -04:00
Steven Perron	c9190a54da	SSA rewriter: Don't use trivial phis (#2757 ) When a phi candidate is marked as trivial, we are suppose to update all of its uses to the reference the value that it is being folded to. However, the code updates the uses misses `defs_at_block_`. So at a later time, the id for the trivial phi can reemerge. Fixes #2744	2019-07-23 17:59:30 -04:00
alan-baker	aea4e6b1b9	Fix block depth rule priority (#2755 ) Fixes #2743 * Continue depth calculation should take precedence over merge calculation	2019-07-23 13:57:44 -04:00
alan-baker	a94ddc267c	Case validation with repeated labels (#2689 ) Fixes #2686 * Update validation to handle the default case being mentioned multiple times * new tests	2019-07-23 11:23:32 -04:00
greg-lunarg	3855447d93	Bindless Instrument: Make init check depend solely on input_init_enabled (#2753 ) * Bindless Instrument: Make init check depend solely on input_init_enabled Previously was dependent on presense of descriptor_indexing extension in SPIR-V, but this missed some cases. Tests updated to refect this new policy. * Fix format.	2019-07-22 13:51:39 -04:00
Kévin Petit	11516c0b9a	Validate storage class OpenCL environment rules for atomics (#2750 ) This change refactors all storage class validation for atomics to reflect the similar refactoring in the specification. It is currently not possible to write a test for the check rejecting Generic in an OpenCL 1.2 environment as the required GenericPointer capability isn't allowed there. I've decided to keep the check nonetheless to guard against the capability becoming available without the rules for atomics being updated. The ID changes in existing tests aren't ideal but introducing names drags in a substantial refactoring of this file. Contributes to #2595. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-07-22 08:38:42 -04:00
Jason Macnak	bac82f49aa	Allow LOD ops in compute shaders with derivative group execution modes (#2752 ) Also update existing derivative check to be based on the execution mode instead of just the extension being present. More info about extension: - https://github.com/KhronosGroup/SPIRV-Registry/blob/master/extensions/NV/SPV_NV_compute_shader_derivatives.asciidoc	2019-07-22 08:37:44 -04:00
Steven Perron	aa9e8f5380	Revert "Do not inline OpKill Instructions (#2713 )" (#2749 ) This reverts commit `fe7cc9c612`.	2019-07-17 14:59:05 -04:00
Jeff Bolz	58e2ec25ba	For Vulkan, disallow structures containing opaque types (#2546 )	2019-07-16 16:16:19 -04:00
Steven Perron	230c9e4371	Fix bug in merge return (#2734 ) * Fix bug in merge return The merge return pass seems to assume that the only new edges in the cfg are from return block to merge blocks. However, it is possible that a merge block branches to a merge block when it did not before. This change add a new variable to track all of the new edges. It also renames some other variables and cleans us the code to make it a bit easier to read. Fixes #2702.	2019-07-16 09:11:22 -04:00
Jason Macnak	1fedf72e50	Allow ray tracing shaders in inst bindle check pass. (#2733 ) Adds the ray tracing stages (ray gen, intersection, any hit, closest hit, miss, and callable) to the allowed stages in pass instrumentation and add debug records for these stages to output the global launch id. More information for ray tracing shaders: - https://github.com/KhronosGroup/GLSL/blob/master/extensions/nv/GLSL_NV_ray_tracing.txt	2019-07-15 16:24:42 -04:00
Ryan Harrison	032adc4d7e	Correctly implement WebGPU related flag exclusions (#2737 ) Fixes #2736	2019-07-12 14:14:46 -04:00
greg-lunarg	92c41ff1e7	Remove Common Uniform Elimination Pass (#2731 ) Remove Common Uniform Elimination Pass Fixes #2520.	2019-07-12 11:02:10 -04:00
Ryan Harrison	55adf4cf70	Update execution scope rules for WebGPU (#2730 ) Fixes #2722	2019-07-11 14:37:36 -04:00
alan-baker	1a2de48a12	Extra small storage validation (#2732 ) Fixes #2729 * Check acceptable uses of small type generators	2019-07-11 13:05:14 -04:00
Jeff Bolz	327963765b	Add validation for SPV_EXT_demote_to_helper_invocation (#2707 )	2019-07-11 10:33:22 -04:00
Steven Perron	5ce8cf781f	Change the order branches are simplified in dead branch elim (#2728 ) Dead branch elimination needs to know about the constructs that a block is contained it when determining what to do with its merge instruction. We currently fold branches in block as we see them, which is parent constructs before their children. This causes the struct cfg analysis to crash because it tries to get the parent construct for a block after the parent has been folded. This can be fixed by folding the branch of the children before the parents. Fixes #2667.	2019-07-10 14:59:44 -04:00
Thomas Roughton	cd153db8ed	Add —preserve-bindings and —preserve-spec-constants (#2693 ) Add optimizer options to for preservation of spec constants and variable with binding decorations. They are to be preserved even if they are unused.	2019-07-10 14:12:19 -04:00
Steven Perron	86e45efe15	Handle decorations better in some optimizations (#2716 ) There are a couple spots where we are not looking at decorations when we should. 1. Value numbering is suppose to assign a different value number to ids if they have different decorations. However that is not being done for OpCopyObject and OpPhi. 1. Instruction simplification is propagating OpCopyObject instruction without checking for decorations. It should only do that if no decorations are being lost. Add a new function to the decoration manager to check if the decorations of one id are a subset of the decorations of another. Fixes #2715.	2019-07-10 11:37:16 -04:00
Ryan Harrison	3a252a267b	Update memory scope rules for WebGPU (#2725 ) Fixes #2721	2019-07-10 10:34:50 -04:00
alan-baker	0c4feb643b	Remove extra semis (#2717 ) * Remove extra semi-colons * Update re2 dep	2019-07-08 15:07:36 -04:00
alan-baker	456cc598af	Validate usage of 8- and 16-bit types with only storage capabilities (#2704 ) Fixes #2669 * Check capabilities when validating variables * validate load and store types * Constant check * Don't checks pointers for stores, constants and loads * Validate composite instructions * Validate conversions for 8- and 16-bit limited types * Unified tests and expanded them * Disallow OpCopyMemory * new tests and update old tests	2019-07-08 14:10:13 -04:00
Alastair Donaldson	b8ab80843f	Shrinker for spirv-fuzz (#2708 ) Adds to spirv-fuzz the option to shrink a sequence of transformations that lead to an interesting binary to be generated, to find a smaller sub-sequence of transformations that still lead to an interesting (but hopefully simpler) binary being generated. The notion of what counts as "interesting" comes from a user-provided script, the "interestingness function", similar to the way the spirv-reduce tool works. The shrinking process will give up after a maximum number of steps, which can be configured on the command line. Tests for the combination of fuzzing and shrinking are included, using a variety of interestingness functions.	2019-07-07 08:55:30 +01:00
Steven Perron	37e8f79946	Perform merge return with single return in loop. (#2714 ) Inlining does not inline functions that have a single return that is in a loop. This is because the return cannot be replaced by a branch outside of the loop easily. Merge return knows how to rewrite the function so the return is replaced by a branch. Fixes #2038.	2019-07-04 14:14:49 -04:00
Steven Perron	fe7cc9c612	Do not inline OpKill Instructions (#2713 ) It is illegal to inline an OpKill instruction into a continue construct because the continue header will no longer dominate the backedge. This commit adds a check for this, and does not inline. If we still want to be able to inline a function that contains an OpKill, we can add a new pass that will wrap OpKill instructions into its own function with just the single instruction. I do not believe that this is a common case right now, so I will not do that yet. Fixes #2433.	2019-07-04 12:08:23 -04:00
Jason Macnak	e6e3e2ccc6	Update type for loaded builtin GlobalInvocationID in pass instrumentation (#2705 ) When working on descriptor indexing validation for compute shaders, the gl_GlobalInvocationID builtin was being loaded as uint which would cause compute shaders instrumented by the bindless check pass to have: %83 = OpLoad %uint %gl_GlobalInvocationID %84 = OpCompositeExtract %uint %83 0 %85 = OpCompositeExtract %uint %83 1 %86 = OpCompositeExtract %uint %83 2 which results in validation failures: error: line 127: Reached non-composite type while indexes still remain to be traversed. %84 = OpCompositeExtract %uint %83 0 for trying to extract a uint from a uint.	2019-06-28 09:46:16 -04:00
Alastair Donaldson	efde682369	Disallow movement of unreachable blocks. (#2700 ) Fixes #2695. Allowing unreachable blocks to be moved can lead to an unreachable block A getting placed after an unreachable successor B, which is a problem if B uses ids that A generates.	2019-06-26 15:32:25 +01:00
Alastair Donaldson	dfcb5a1e10	Refactor fuzzer transformations (#2694 ) Introduced abstract class for transformations, and refactored all transformations to inherit from this abstract class.	2019-06-25 20:49:46 +01:00
Józef Kucia	888aeef8a9	Fix Component decoration validation for arrays (#2697 )	2019-06-25 13:28:16 -04:00
Józef Kucia	7c294608ca	Basic validation for Component decorations (#2679 ) * Add basic validation for Component decoration * Add validator tests for Component decoration	2019-06-20 18:16:12 -04:00
alan-baker	2b84d25f10	Fix store to uniform Vulkan check (#2688 ) * Wrong operands were used for pointer and array types * added tests to catch the wierd number corner	2019-06-20 14:22:41 -04:00
Alastair Donaldson	51b0d5ce50	Represent uniform facts via descriptor set and binding. (#2681 ) * Represent uniform facts via descriptor set and binding. Previously uniform facts were expressed with resepect to the id of a uniform variable. Describing them with respect to a descriptor set and binding is more convenient from the point of view of expressing facts about a shader without requiring analysis of its SPIR-V. * Fix equality testing for uniform buffer element descriptors. The equality test now checks that the lengths of the index vectors match. Added a test that exposes the previous omission.	2019-06-19 20:45:14 +01:00
Alastair Donaldson	001e823b65	Add fuzzer pass to obfuscate constants. (#2671 ) Adds a new transformation that can replace a constant with a uniform known to have the same value, and adds a fuzzer pass that (a) replaces a boolean with a comparison of literals (e.g. replacing "true" with "42 > 24"), and then (b) obfuscates the literals appearing in this comparison by replacing them with identically-valued uniforms, if available. The fuzzer_replayer test file has also been updated to allow initial facts to be provided, and to do error checking of the status results returned by the fuzzer and replayer components.	2019-06-18 18:41:08 +01:00
alan-baker	2090d7a2d2	Handle volatile memory semantics in upgrade (#2674 ) * If an atomic is decorated with volatile add the volatile bit to its memory semantics	2019-06-17 16:01:37 -04:00
alan-baker	3d5fb7b908	Validate Volatile memory semantics bit (#2672 ) * Can only be used with Vulkan memory model * Can only be used with atomics * Bit setting must match for compare exchange opcodes * Updated memory semantics checks to allow constant instructions generally with CooperativeMatrixNV	2019-06-17 13:35:40 -04:00
alan-baker	400dbde0ba	Disallow stores to UBOs (#2651 ) Fixes #2638 * Adds a check that errors out if there is a store to a UBO in the Vulkan environment * tests * Function to trace pointers	2019-06-17 13:13:07 -04:00
alan-baker	59983a6010	Validate variable initializer type (#2668 ) Fixes #249 * The pointed to type of Result Type must match the initializer type * Had to update some opt tests to be valid	2019-06-15 00:34:18 -04:00
Alastair Donaldson	42830e5a68	Add replayer tool for spirv-fuzz. (#2664 ) The replayer takes an existing sequence of transformations and applies them to a module. Replaying a sequence of transformations that were obtained via fuzzing should lead to an identical module to the module that was fuzzed. Tests have been added to check for this.	2019-06-13 14:08:33 +01:00
alan-baker	b4bf7bcf0a	Add validation for Subgroup builtins (#2637 ) Fixes #2611 * Validates builtins in the Vulkan environment: * NumSubgroups * SubgroupId * SubgroupEqMask * SubgroupGeMask * SubgroupGtMask * SubgroupLeMask * SubgroupLtMask * SubgroupLocalInvocationId * SubgroupSize	2019-06-13 08:47:05 -04:00
Alastair Donaldson	9c0830133b	Add constant == uniform facts. (#2660 ) Adds a new (and first) kind of fact to the fact manager, which is that a specific uniform value is guaranteed to be equal to a specific constant. The point of this is that such information (if known to be true by some external source) can be used by spirv-fuzz to transform the module in interesting ways that a static compiler cannot reverse via compile-time analysis. This change introduces protobuf messages for the fact, and adds capabilities to the fact manager to store this kind of fact and provide information about it.	2019-06-11 15:56:08 +01:00
Alastair Donaldson	a8ae579f7a	Add transformation to replace a boolean constant with a numeric comparison (#2659 ) The transformation can, for example, replace "true" with "12.0 > 6.0", if constants for those floating-point values are available. This introduces a new 'id use descriptor' structure, which provides a way to describe a particular use of an id, and which will be heavily used in future transformations. Describing an id use is trivial if the use occurs in an instruction that itself generates an id, but is less straightforward if the id of interest is used by an instruction such as OpStore that does not have a result id. The 'id use descriptor' structure caters for such cases.	2019-06-06 22:22:35 +01:00
Daniel Koch	0755d6ce82	Add builtin validation for SPV_NV_shader_sm_builtins (#2656 ) Also add a Builtin test generator variant that takes capabilities and extensions. Tests - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in Vertex, Fragment, TessControl, TessEval, Geometry, and Compute. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in MeshNV and TaskNV shaders. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in the 6 ray tracing stages - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as Outputs. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as non-scalar integers (f32, uvec3) - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as non-32-bit integers (u64)	2019-06-06 14:53:48 -04:00
greg-lunarg	43fb2403a6	Instrument: Fix code for version 2 output format. (#2655 ) Correct record size. Also bring version 2 tests up to version 1 equivalence.	2019-06-06 11:35:34 -04:00
Alastair Donaldson	08cc49ec59	Fix bug in 'split blocks', and add tests for fuzzer. (#2658 ) There turned out to be a bug in the 'split blocks' transformation due to blocks being split while they were being iterated over. This change fixes that issue, and adds tests that were able to expose the issue by running the fuzzer on some example shaders.	2019-06-05 21:54:47 +01:00
David Neto	d01a3c3b4b	Optimizer: Handle array type with OpSpecConstantOp length (#2652 ) When it's an OpConstant or OpSpecConstant, then the literal values are compared. If the OpSpecConstant also has a SpecId decoration, then that's also compared. Otherwise, it's an OpSpecConstantOp and we only compare the ID of the OpSpecConstantOp instruction itself. Fixes #2649	2019-06-05 16:35:50 -04:00
Alastair Donaldson	4a00a80c40	Add fuzzer pass to add dead breaks. (#2654 ) This pass randomly add breaks to the merge blocks of selection and loop constructs, such that the breaking edges will not be dynamically reachable.	2019-06-05 08:02:16 +01:00
Alastair Donaldson	620197bd65	Add fuzzer pass that adds useful constructs to a module (#2647 ) This new pass adds some basic ingredients to a module on which future passes are likely to depend, such as boolean constants and some specfic integer and floating-point values. This is not a fuzzer pass in the true sense in that it does not employ randomization, but it makes sense to define it as a fuzzer pass since it is the first of a number of transformations passes that the fuzzer will run on a module.	2019-06-04 14:55:00 +01:00
Jeff Bolz	2c0111e6eb	Add validation for SPV_EXT_fragment_shader_interlock (#2650 )	2019-06-03 10:55:07 -04:00
Ryan Harrison	699e167d78	Remove asserts from GetUnderlyingType (#2646 ) Fixes #2463	2019-05-31 08:57:41 -07:00
Kévin Petit	f99d7ad5c0	Validate OpenCL rules for ImageRead and OpImageSampleExplicitLod (#2643 ) Fixes #2594. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-31 10:05:34 -04:00
Alastair Donaldson	209ff0ce90	Add spirv-fuzz pass to permute blocks. (#2642 ) The blocks within each function in the module will be permuted in a randomized manner that respects dominance.	2019-05-31 09:59:06 +01:00
Pierre Moreau	e7866de4b1	Linker: Better type comparison for OpTypeArray and OpTypeForwardPointer (#2580 ) * Types: Avoid comparing IDs for in Type::IsSameImpl When linking, we end up with duplicate types for imported and exported types, that needs to be removed. The current code would reject valid import/export pairs of symbols due to IDs mismatch, even if the types or constants behind those ID were the same. Enabled remaining type_match_test Fixes #2442	2019-05-29 16:12:02 -04:00
Ryan Harrison	0125b28ed4	Add compact ids to WebGPU <-> Vulkan transformations (#2639 ) Fixes #2634	2019-05-29 12:58:37 -07:00
greg-lunarg	3d62cb8148	Instrument: Add version 2 of record formats (#2630 ) New version has additional word in stage-specific section. Also some changes in content for tesselation and compute shaders. Either version can be invoked at pass creation. This is done to ease integration and updating of validation layers. Version 1 is deprecated and eventually will go away. Also sneaking in fix to version 1 compute shaders.	2019-05-29 15:08:21 -04:00
Alastair Donaldson	1b71e45338	Add "split block" transformation. (#2633 ) With this pass, the fuzzer can split blocks in the input module. This is mainly useful in order to give other (future) transformations more opportunities to apply.	2019-05-29 16:42:46 +01:00
Ryan Harrison	f051812343	Add WebGPU specific fuzzer for validation (#2628 ) Fixes #2627	2019-05-28 11:51:52 -07:00
Ryan Harrison	5a06fa4661	Add fuzzer for Vulkan->WebGPU spirv-opt passes (#2626 ) Fixes #2622	2019-05-28 10:11:43 -07:00
Ryan Harrison	78b2b18661	Add fuzzer for WebGPU->Vulkan spirv-opt passes (#2625 ) Fixes #2623	2019-05-28 07:18:03 -07:00
Steven Perron	6c7db9c630	Handle nested breaks from switches. (#2624 ) * Handle nested breaks from switches. There was a recent decision made to allow branches to the merge node of a switch even if the switch is not the first enclosing construct. They can be generated by glslang from break statements in switches. Dead branch elimination seems to be the only optimization that will break because of this change, so I will update that optimizations. The change made are: - Track switches in structured cfg analysis. - In Dead branch elimination: - Look for nested breaks that will require a switch instruction. - Rewrite, but don't delete, switchs that are required even if it could be replaced by an unconditional branch. - When looking for the first break, consider the merge of a switch as well. See #2612. * Fix variable names and comments. * Add tests for the struct cfg analysis and switches. * Fix typos in comments.	2019-05-27 16:28:14 -04:00
dan sinclair	42abaa099a	Remove MarkV and Stats code. (#2576 ) * Remove MarkV and Stats code. This Cl removes the MarkV and Stats code from SPIRV-Tools. This code was unused and currently un-maintained.	2019-05-24 15:43:59 -04:00
Jonathon Anderson	3b5ab540ca	linker: Add tests for various type comparisons (#2613 ) This adds a number of tests that check that all types will match to identically written clones during linking, including nearly every Type and some combinations (e.g. Functions of Arrays of Floats). Intent is for use with https://github.com/KhronosGroup/SPIRV-Tools/pull/2580, however that PR focuses on issues with TypeArray whereas these tests are (more) comprehensive and test more subtle (and possibly incorrect) cases. A number of these tests fail, many are fixed by the aforementioned PR. Some additional tests involving TypeForwardPointer are currently disabled as they cause assertion failures.	2019-05-24 15:40:28 -04:00
Sahil Parmar	b8fe7211c4	Allow arrays of out per-primitive builtins for mesh shaders (#2617 ) - PrimitiveID, Layer, ViewportIndex * Add validation tests for mesh builtins	2019-05-23 15:08:59 -04:00
Kévin Petit	07a1019717	Validate OpenCL environment rules for OpImageWrite (#2619 ) Fixes #2593. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-23 08:35:14 -04:00
Toomas Remmelg	13f61bf859	Update vloadn and vstoren validation to match the OpenCL Extended Instruction Set Specification (#2599 )	2019-05-22 08:09:50 -04:00
Steven Perron	d9c00e1d2d	Add folding rules for OpQuantizeToF16 (#2614 ) Adding the folding rules for OpQuantizeToF16, and fixed some matching tests to check identify new lines.	2019-05-21 23:15:01 -07:00
alan-baker	713da30b63	Disallow merge targeting block with OpLoopMerge (#2610 ) Fixes #2588 * Add a check that the merge block of OpLoopMerge may not be the block that contains the OpLoopMerge * add a test	2019-05-21 23:02:53 -07:00
alan-baker	60aaafbc70	Allows breaks selection breaks to switches (#2605 ) Fixes #2604 * Allow selection constructs to branch to the nearest selection merge whose header is terminated by an OpSwitch * Cleanup break and continue checks generally * add tests	2019-05-21 22:49:37 -07:00
Steven Perron	0982f0212e	Using the instruction folder to fold OpSpecConstantOp (#2598 ) In order to try to reduce code duplication and to be able to fold more cases, we want to use the instruction folder when folding an OpSpecConstantOp with constant operands. A couple other changes are need to make this work. First GetDefiningInstruction\| in the constant manager is able to handle \|type_id\| being logically equivalent to another type, so we updated the interface, and removed the assert. Some tests were also updated because we not generate better code because constants are not duplicated as much as before. No need for new tests. The functionality of the instruction folder is already tested. There are tests check that the instruction folder is being used correctly for OpCompositeExtract and OpVectorShuffle in the existing test cases. Fixes #2585.	2019-05-21 12:45:00 -04:00
Kévin Petit	9f035269d6	Validate OpenCL environment rules for OpTypeImage (#2606 ) It is currently not possible to use an Image Format that is not Unknown without requiring a capability forbidden by the OpenCL environment. As such the validation of Image Format currently leans on capability validation entirely. Fixes #2592. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-21 09:17:50 -04:00
Kévin Petit	47741f0504	Validate OpenCL memory and addressing model environment rules (#2589 ) Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-17 08:25:20 -04:00
alan-baker	ff4feb44b4	Validate construct exits (#2459 ) Validate structured exits from constructs * Add checks that exits from a construct are valid * Add Construct::IsStructuredExit() * uses specific rules for each type of construct * Added a test and check for #2213 * Adding tests for bad loop and continue exits * Fix identification of continue block that prevented some selections from having any blocks	2019-05-16 14:59:30 -07:00
greg-lunarg	9dfd4b8358	Bindless Validation: Instrument descriptor-based loads and stores (#2583 ) Essentially, support UBOs and SSBOs, scalar and array (sized and unsized).	2019-05-15 19:43:23 -04:00
alan-baker	7e7745fce8	Validate loop merge (#2579 ) Fixes #2559 * Validate OpLoopMerge including loop controls * add tests * fix some bad tests	2019-05-15 19:38:41 -04:00
alan-baker	fc7b5d8c6a	Mem model spv 1.4 (#2565 ) * Update memory model support for SPIR-V 1.4 Fixes #2552 * Upgrade memory model now supports two memory access operands for OpCopyMemory* * in all cases the pass will first generate two operands by either adding them or copying * updates accounts for multiple operands * tests	2019-05-15 19:06:37 -04:00
Steven Perron	84503583c6	Handle id overflow in sroa better. (#2582 ) There is a case where sroa is not handling id overflow gracefully. It is handled and an error message is output when the ids overflow. Fixes https://crbug.com/961030.	2019-05-15 09:29:28 -04:00
Steven Perron	e935dac9ef	Make pointers to isomorphic type interchangeable with option. (#2570 ) * Make pointers to logically matching types interchangeable with option. DXC will be generating code where the function parameters will be a more generic type that the actual parameter. They should be logically matching and the decorations of the actual parameter must be a superset of the decorations of the formal parameter. We want to accept this code with an options so that spirv-opt can then inline and fix the type mismatch. We will accept this under a new options `--before-hlsl-legalization`. The new option will also imply `relax-logical-pointer` so that HLSL frontends will need to use just the one more generic option. Moved the \|LogicallyMatches\| to the validation state to make it available in more places. Also added a parameter to have it check the decorations. I did not do a separate function for the decorations because checking the decorations involves making sure the types logically match anyway. Fixes #2535	2019-05-13 13:48:17 -04:00
alan-baker	2947e88f79	Update instrumentation passes to handle 1.4 interfaces (#2573 ) Fixes #2556 Added variables get added to entry point interfaces Add to input buffer too	2019-05-10 11:08:28 -04:00
alan-baker	87c4ef8a9c	Do not fold floating point if float controls used (#2569 ) Fixes #2558 * Mark floating point instructions as non-foldable if any SPV_KHR_float_controls capabilities are present * tests	2019-05-10 11:03:22 -04:00
alan-baker	45fb696668	Use last version (#2578 ) * Use grammar last version Fixes #2560 * Parse last version and use it in checks * Update grammar header generation * Fix NonWritable tests * Fix check and add specific tests	2019-05-10 11:02:01 -04:00
Ryan Harrison	f6d9a17843	Add pass to fix some invalid unreachable blocks for WebGPU (#2563 ) Attempts to split up unreachable blocks that are used both as a merge-block and a continue-target. Fixes #2429	2019-05-09 12:56:10 -04:00
David Neto	f2803c4a7f	VK_KHR_uniform_buffer_standard_layout validation (#2562 ) Add a command-line option to enable validating SPIR-V for implementations that support VK_KHR_uniform_buffer_standard_layout.	2019-05-08 18:01:10 -04:00
alan-baker	cc3e93c4e6	Add tests for folding 1.4 selects (#2568 ) Fixes #2554 * Folding rules already handle 1.4 selects so I simply added some tests	2019-05-08 14:06:04 -04:00
alan-baker	ea5e1b62e1	Update priv-to-local for SPIR-V 1.4 (#2567 ) Fixes #2555 * Fix a bug in validation where interfaces were considered non-unique between different entry points targeting the same function * added a test * Update private to local pass to remove localized private variables from entry point interfaces * added tests	2019-05-08 12:38:49 -04:00
David Neto	d0a1f5a05a	spvtest::Validate::CompileFailure: Don't leak the diagnostic (#2564 )	2019-05-07 22:01:06 -04:00
alan-baker	b74d92a8c3	ADCE support for SPIR-V 1.4 entry points (#2561 ) Fixes #2551 * Add support for 1.4 entry point interface lists * only input and output variables are automatically live * can clean up interfaces after DCE * added tests * allow opt tests to specify a target environment	2019-05-07 14:52:22 -04:00
David Neto	63f57d95d6	Support SPIR-V 1.4 (#2550 ) * SPIR-V 1.4 headers, add SPV_ENV_UNIVERSAL_1_4 * Support --target-env spv1.4 in help for command line tools * Support asm/dis of UniformId decoration * Validate UniformId decoration * Fix version check on instructions and operands Also register decorations used with OpDecorateId * Extension lists can differ between enums that match Example: SubgroupMaskEq vs SubgroupMaskEqKHR * Validate scope value for Uniform decoration, for SPIR-V 1.4 * More unioning of exts * Preserve grammar order within an enum value * 1.4: Validate OpSelect over composites * Tools default to 1.4 * Add asm/dis test for OpCopyLogical * 1.4: asm/dis tests for PtrEqual, PtrNotEqual, PtrDiff * Basic asm/Dis test for OpCopyMemory * Test asm/dis OpCopyMemory with 2-memory access Add asm/dis tests for OpCopyMemorySized Requires grammar update to add second optional memory access operand to OpCopyMemory and OpCopyMemorySized * Validate one or two memory accesses on OpCopyMemory* * Check av/vis on CopyMemory source and target memory access This is a proposed rule. See https://gitlab.khronos.org/spirv/SPIR-V/issues/413 * Validate operation for OpSpecConstantOp * Validate NonWritable decoration Also permit NonWritable on members of UBO and SSBO. * SPIR-V 1.4: NonWrtiable can decorate Function and Private vars * Update optimizer CLI tests for SPIR-V 1.4 * Testing tools: Give expected SPIR-V version in message * SPIR-V 1.4 validation for entry point interfaces * Allow only unique interfaces * Allow all global variables * Check that all statically used global variables are listed * new tests * Add validation fixture CompileFailure * Add 1.4 validation for pointer comparisons * New tests * Validate with image operands SignExtend, ZeroExtend Since we don't actually know the image texel format, we can't fully validate. We need more context. But we can make sure we allow the new image operands in known-good cases. * Validate OpCopyLogical * Recursively checks subtypes * new tests * Add SPIR-V 1.4 tests for NoSignedWrap, NoUnsignedWrap * Allow scalar conditions in 1.4 with OpSelect * Allows scalar conditions with vector operands * new tests * Validate uniform id scope as an execution scope * Validate the values of memory and execution scopes are valid scope values * new test * Remove SPIR-V 1.4 Vulkan 1.0 environment * SPIR-V 1.4 requires Vulkan 1.1 * FIX: include string for spvLog * FIX: validate nonwritable * FIX: test case suite for member decorate string * FIX: test case for hlsl functionality1 * Validation test fixture: ease debugging * Use binary version for SPIR-V 1.4 specific features * Switch checks based on the SPIR-V version from the target environment to instead use the version from the binary * Moved header parsing into the ValidationState_t constructor (where version based features are set) * Added new versions of tests that assemble a 1.3 binary and validate a 1.4 environment * Fix test for update to SPIR-V 1.4 headers * Fix formatting * Ext inst lookup: Add Vulkan 1.1 env with SPIR-V 1.4 * Update spirv-val help * Operand version checks should use module version Use the module version instead of the target environment version. * Fix comment about two-access form of OpCopyMemory	2019-05-07 12:27:18 -04:00
Steven Perron	106c98d0fa	Validate sign of int types. (#2549 ) Fixes https://crbug.com/959011.	2019-05-06 13:05:31 -04:00
Steven Perron	6d04da22c6	Fix up type mismatches. (#2545 ) Add functionality to fix-storage-class so that it can fix up mismatched data types for pointers as well. Fixes bugs in when fixing up storage class. Move GenerateCopy to the Pass class to be reused. The spirv-opt change for #2535.	2019-05-02 09:31:46 -04:00
Ryan Harrison	c8b09744c6	Add validation specific to OpExecutionModeId (#2536 ) Fixes #1565	2019-05-01 13:29:39 -04:00
Steven Perron	32af42616a	Change implementation of post order CFG traversal (#2543 ) * Change implementation of post order CFG traversal It seems like the recursion is going very deep, and causing some problem is particular situations. I've reimplemented the CFG post order traversal to not use recursion. Fixes #2539.	2019-04-29 17:09:20 -04:00
Ryan Harrison	b68af7ca8e	Add support for Private & Output to initializer decompose flag (#2537 ) Fixes #2388	2019-04-25 16:24:32 -04:00
Ryan Harrison	736376dbf9	Remove Acquire, Release, and Relaxed from allowed Mem Sem bits for WebGPU (#2526 ) Fixes #2524	2019-04-23 13:27:40 -04:00
alan-baker	07c4dd4b9e	Reduce runtime of array layout checks (#2534 ) Fixes #2533 * Stop checking layouts once the offset gets back to a 16 byte alignment	2019-04-23 10:33:00 -04:00
alan-baker	ac878fcbdd	Remove unreachable block validation (#2525 ) * Remove the check that blocks terminated by OpUnreachable are not statically reachable in the CFG * Updated tests	2019-04-17 18:21:19 -04:00
Ryan Harrison	21712068fe	Validate that SPIR-V binary is encoded as little endian for WebGPU (#2523 ) Fixes #2522	2019-04-17 12:44:54 -04:00
Ryan Harrison	3aad3e9228	Change validation of memory semantics for OpAtomics* in WebGPU (#2519 ) Recent change to the spec restricted the valid values for Memory Semantics in OpAtomics* in the WebGPU env. Implementing enforcing these changes. Fixes #2499	2019-04-16 14:49:07 -04:00
Ryan Harrison	048dcd38ce	Implement WebGPU->Vulkan initializer conversion for 'Function' variables (#2513 ) WebGPU requires certain variables to be initialized, whereas there are known issues with using initializers in Vulkan. This PR is the first of three implementing a pass to decompose initialized variables into a variable declaration followed by a store. This has been broken up into multiple PRs, because there 3 distinct cases that need to be handled, which require separate implementations. This first PR implements the basic infrastructure that is needed, and handling of Function storage class variables. Private and Output will be handled in future PRs. This is part of resolving #2388	2019-04-16 14:31:36 -04:00
Paul Thomson	3335c61147	reduce: Add two branch reduction passes (#2507 ) * Fix #2320. `conditional_branch_to_simple_conditional_branch` reduction pass changes conditional branches so both targets point to the same block id (creating a "simple" conditional branch). * Fix #2501. `simple_conditional_branch_to_branch` reduction pass changes "simple" conditional branches to branches. * Fix #2503. `conditional_branch_to_simple_conditional_branch` proper handling of back-edges.	2019-04-15 19:54:36 +01:00
Ryan Harrison	102e430a88	Add pass to legalize OpVectorShuffle for WebGPU (#2509 ) In WebGPU, the component operand 0xFFFFFFFF is forbidden, but in Vulkan it is used to indicate a value is undefined. When converting to WebGPU, 0xFFFFFFFF needs to converted to a legal value, though the specific one does not matter, since it was used to indicate an undefined entry in the original code. Choosing to use 0, since the operands are required to be on [0, N-1], so 0 is guaranteed to always be valid. Fixes #2349	2019-04-12 12:14:23 -04:00
alan-baker	98b3f26c2f	Gate formatless checks on Vulkan env (#2486 ) Fixes #2470 * Only require the WithoutFormat capabilities for Unknown image reads and writes in the Vulkan environment update tests and add new vulkan specific tests	2019-04-11 16:39:50 -04:00
Steven Perron	9047de51cb	Accept OpBitCast in fix storage class. (#2505 ) Fixes http://crbug.com/950889.	2019-04-09 14:10:35 -04:00
Paul Thomson	d90aae9a5a	reduce: miscellaneous fixes (#2494 ) * Fix .gitignore * Add missing reduction pass: RemoveBlockReductionOpportunityFinder * Add DumpShader functions in test_reduce for debugging * Add DumpShader functions in spirv-reduce for debugging * Fix include style * Don't use "using namespace"	2019-04-08 19:37:17 +01:00
Ryan Harrison	0cb2d4079e	Add WebGPU->Vulkan and Vulkan->WebGPU flags in spirv-opt (#2496 ) Renames the existing flag '--webgpu-mode' to '--vulkan-to-webgpu' for the Vulkan->WebGPU operation, and adds a new flag '--webgpu-to-vulkan' for the WebGPU->Vulkan operation. Currently '--webgpu-to-vulkan' doesn't have any passes associated with it yet, but further patches will implement them. Fixes #2495	2019-04-05 15:12:26 -04:00
Steven Perron	3a0bc9e724	Add fix storage class code. (#2434 ) This pass tries to fix validation error due to a mismatch of storage classes in instructions. There is no guarantee that all such error will be fixed, and it is possible that in fixing these errors, it could lead to other errors. Fixes #2430.	2019-04-05 13:12:08 -04:00
alan-baker	236bdc0065	Change prioritization of unreachable merge and continue (#2460 ) Fixes #2452 Swaps priority of handling unreachable merge and continues so that the back-edge is retained in the case a block is both a loop continue and loop merge	2019-04-03 12:50:08 -04:00
Steven Perron	12e4a7b649	Handle variable pointer in some optimizations (#2490 ) * Check var pointer capability in ADCE. * Check var ptr capability for common uniform. * Check var ptr capability in access chain convert. Since we want this pass to run even if there are variable pointer on storage buffers, we had to remove asserts that assumed there were no variable pointers. The functions with the asserts will now work, it becomes the responsibility of the callers to deal with the output as appropriate. * Single block elimination and variable pointers. It seems like the code in local single block elimination is able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. * Single store elimination and variable pointers. It seems like the code in local single stroe elimination is able to handle cases with variable pointers already. This is because the function `FindSingleStoreAndCheckUses` ensures that variables that feed a variable pointer are not candidates. * SSA rewriter and variable pointers. It seems like the code in the two passes that call the SSA rewriter are able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. Fixes #2458.	2019-04-03 12:47:51 -04:00
Ryan Harrison	01964e325f	Add pass to generate needed initializers for WebGPU (#2481 ) Fixes #2387	2019-04-03 11:44:09 -04:00
alan-baker	4bd106b089	Handle dead infinite loops in DCE (#2471 ) Fixes #2456 * When eliminating a structured construct that has an unreachable merge, replace that unreachable terminator with an appropriate return * New tests	2019-04-03 10:30:12 -04:00
alan-baker	8129cf2f99	Remove merge assert in block calculation (#2489 ) Fixes #2488 * Validator doesn't identify back-edge of the loop, so the merge is never set * Construct::blocks() has safe uses of `merge` so the assert can be removed * Added a test	2019-04-02 14:37:05 -04:00
Paul Thomson	e2ddb9371e	reduce: add remove_selection_reduction_opportunity (#2485 ) Fix #2484	2019-04-02 16:50:15 +01:00
alan-baker	c9874e5090	Fix merge return in the face of breaks (#2466 ) Fixes #2453 * Enable addition of OpPhi instructions when the loop has multiple predecessors of the merge due to a break * This can result in some values no longer dominating their uses * Track return blocks in structured flow to produce OpPhis that have multiple undef and non-undef arguments * New tests to catch the bug * When a block is predicated, mark the new body as a return if the old block as already a return	2019-04-02 10:05:28 -04:00
alan-baker	0300a464a4	Maintain inst to block mapping in merge return (#2469 ) Fixes #2455 Properly maintains instruction to block mapping for newly created phi instructions in merge return	2019-04-01 13:14:10 -04:00
alan-baker	320a7de5c9	Validate that OpUnreacahble is not statically reachable (#2473 ) * Adds a validator check that ensures no block reachable from the entry block is terminated by OpUnreachable * Updated tests * Added new tests	2019-03-29 10:49:37 -04:00
Paul Thomson	fcb8453104	reduce: fix loop to selection pass for loops with combined header/continue block (#2480 ) * Fix #2478. The fix is to just not try to simplify such loops. * Also added `BasicBlock::MergeBlockId()` and `BasicBlock::ContinueBlockId()`. * Some minor changes to `structured_loop_to_selection_reduction_opportunity.cpp`. * Added test.	2019-03-29 11:29:24 +00:00
alan-baker	2ff54e34ed	Handle function decls in Structured CFG analysis (#2474 ) Fixes #2451 * Structured cfg analysis now handles functions with no basic blocks * Added a test	2019-03-26 14:39:16 -04:00
Paul Thomson	fb0753640a	reduce: fix loop to selection dominance query (#2477 ) Fix #2457	2019-03-26 16:37:08 +00:00
Paul Thomson	7d1b176c1d	Improve reducer algorithm and other changes (#2472 ) Fix #2475. Fix #2476. * Improve reducer algorithm: shrink granularity, remove an early return, no lazy initialization, notify pass if binary is interesting, add comments. * Add fail-on-validation-error option to fail a reduction if an invalid state is reached; useful for tests. * Set fail-on-validation-error in tests. * Improve some documentation comments. * Add Reducer::AddDefaultReductionPasses so tests (and other library consumers) can add the default reduction passes. * Add CLIMessageConsumer in test_reduce so we can see messages for tricky tests. * Remove test RemoveUnreferencedInstructionReductionPassTest_ApplyReduction because it was indirectly testing the reduction algorithm, not the RemoveUnreferencedInstruction pass. * Tweak tests where needed.	2019-03-26 13:22:31 +00:00
Ryan Harrison	ffbecae56a	Check OpSampledImage is only passed into valid instructions (#2467 ) Fixes #1528	2019-03-25 15:44:57 -04:00
Paul Thomson	2d52cbee49	Add some val options to reduce (#2401 ) Fix #2396 * Check that initial state is valid. Add kInitialStateInvalid. * Fix RemoveOpnameAndRemoveUnreferenced test; turns out the original shader is invalid, but we never notice because we don't check this and the reduced shader is valid; fix original shader. Assert reduction status is kComplete. * Always check return value from `Reducer::Run`. * Change Reducer::Run to not immediately copy the input binary.	2019-03-21 14:28:06 +00:00

... 3 4 5 6 7 ...

1691 Commits