SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-10-21 04:20:05 +00:00

Author	SHA1	Message	Date
alan-baker	510ca9d616	Only allow previously declared forward refs in structs (#2920 ) Fixes https://crbug.com/1008130 * Restore a missing check that the only valid forward references in structs are previously declared forward pointers	2019-09-25 18:11:22 -04:00
Steven Perron	2a11f365bc	Handle id overflow in wrap-opkill (#2916 ) New code in wrap-opkill does not handle id overflow correctly. We fix that up. Fixes https://crbug.com/1007144	2019-09-25 17:42:58 -04:00
Alastair Donaldson	70097c7761	spirv-fuzz: do not replace struct indices with synonyms (#2915 ) This change introduces a robust check for whether an index in an access chain is indexing into a struct, in which case the index needs to be an OpConstant and cannot be replaced with a synonym. Fixes #2906.	2019-09-25 16:52:35 +01:00
Alastair Donaldson	c1e03834e3	spirv-fuzz: Fixes to preconditions for adding dead break/continue edges (#2904 ) Issues #2898 and #2900 identify some cases where adding a dead continue would lead to an invalid module, and these turned out to be due to the lack of sensible dominance information when a continue target is unreachable. This change requires that the header of a loop dominates the loop's continue target if a dead continue is to be added. Furthermore, issue #2905 identified a shortcoming in the algorithm being used to identify when it is OK, from a dominance point of view, to add a new break/continue edge to a control flow graph. This change replaces that algorithm with a simpler and more obviously correct algorithm (that incidentally does not require the new edge to be a break/continue edge in particular). Fixes #2898. Fixes #2900. Fixes #2905.	2019-09-25 16:51:41 +01:00
Alastair Donaldson	7bc114ba2f	spirv-fuzz: do not replace a pointer argument to a function call with a synonym (#2901 ) Before this change, spirv-fuzz would replace a pointer argument to a function call with a synonym, which is problematic when the synonym is not a memory object declaration, since function call arguments are required to be memory object declarations. This change adds a check to ensure that such a replacement is not made. Fixes #2896.	2019-09-25 12:17:29 +01:00
Alastair Donaldson	290f6a820d	spirv-fuzz: do not replace boolean constant argument to OpPhi instruction (#2903 ) Before this change, spirv-fuzz would replace a constant boolean argument to an OpPhi with the result of a binary operation, inserting the instruction to compute the binary operation right before the OpPhi, leading to an invalid module. This change conservatively disallows replacing OpPhi arguments. Issue #2902 notes that there is scope for being less conservative. Fixes #2897.	2019-09-25 12:16:25 +01:00
alan-baker	527a689307	Remove validate_datarules.cpp (#2911 ) * Checks moved into individual opcode validation * removes duplicated checks * Add check that forward pointer points to struct	2019-09-24 17:55:12 -04:00
Steven Perron	55ea57a785	Handle extract with no indexes (#2910 ) * Handle extract with no indexes It is possible that OpCompositeExtract instructions will not have any indexes. This is not handled well by scalar replacement and instruction folding. Fixes https://crbug.com/1006435 * Fix typo.	2019-09-24 16:19:31 -04:00
Steven Perron	6f26d9ad81	Handle id overflow in convert local access chains (#2908 ) Fixes https://crbug.com/1004453	2019-09-24 14:04:54 -04:00
Alastair Donaldson	958f7e72a7	Employ the "swarm testing" idea in spirv-fuzz (#2890 ) This change to spirv-fuzz uses ideas from "Swarm Testing" (Groce et al. 2012), so that a random subset of fuzzer passes are enabled. These passes are then applied repeatedly in a randomized fashion, with the aggression with which they are applied being randomly chosen per pass. There is plenty of scope for refining the probabilities introduce in this change; this is just meant to be a reasonable first effort.	2019-09-23 16:29:19 +01:00
Alastair Donaldson	b83535da53	Fix operand index in spirv-fuzz (#2895 ) This change rectifies a problem where an absolute operand index was being used when an index restricted to input operands was required. Fixes #2893.	2019-09-23 16:28:25 +01:00
David Neto	8d0ca43da5	Add method comment for opt::Function::WhileEachInst (#2867 ) Also, say that ForEachInst and ForEachParam process instructions/parameters in order.	2019-09-23 09:36:48 -04:00
Steven Perron	6b07212659	Use OpReturn* in wrap-opkill (#2886 ) * Use OpReturn* in wrap-opkill The warp-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-20 10:32:27 -04:00
Alastair Donaldson	7275a71654	Allow validation during spirv-fuzz replay (#2873 ) To aid in debugging issues in spirv-fuzz, this change adds an option whereby the SPIR-V module is validated after each transformation is applied during replay. This can assist in finding a transformation that erroneously makes the module invalid, so that said transformation can be debugged.	2019-09-20 10:54:09 +01:00
Steven Perron	61edde52a0	Revert "Use OpReturn* in wrap-opkill" This reverts commit `87f0fa432f`.	2019-09-19 22:39:56 -04:00
Steven Perron	87f0fa432f	Use OpReturn* in wrap-opkill The warp-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-19 22:34:57 -04:00
Ehsan	08fcf8a4ab	Fix header include syntax. (#2882 )	2019-09-19 09:26:24 -05:00
Steven Perron	248c80b049	Handle OpConstantNull in copy-prop-arrays. (#2870 ) Many of the places in copy propagate arrays assumes that integer constant will be defined by an OpConstant instruction. That is not always true. We fix these spots by allowing for an OpConstantNull.	2019-09-19 10:24:00 -04:00
David Neto	d06fe08489	Fix comment typo found by protobufs linter (#2884 )	2019-09-19 09:47:46 -04:00
Alastair Donaldson	e59b60de07	Fix detection of blocks bypassed by new edge (#2874 ) Fixes an issue where the blocks that would be bypassed by a new break or continue control flow edge were not properly detected. Fixes #2871.	2019-09-18 20:50:08 +01:00
Alastair Donaldson	ccd7bf1675	Fix CMake issue related to spirv-fuzz (#2877 ) spirv-fuzz generates protobuf sources in a 'protobuf' directory. When building with Unix Makefiles, compilation would fail due to to this directory not existing. This change causes the directory to be created when the build is prepared.	2019-09-18 20:47:58 +01:00
Alastair Donaldson	0a07cd1c9a	Add fuzzer pass to replace ids with synonyms (#2857 ) If the fuzzer's fact manager knows that ids A and B are synonymous, it can replace a use of A with a use of B, so long as various conditions hold (e.g. the definition of B must dominate the use of A, and it is not legal to replace a use of an OpConstant in a struct's access chain with a synonym that is not an OpConstant). This change adds a fuzzer pass to sprinke such synonym replacements through the module.	2019-09-18 20:47:08 +01:00
alan-baker	bbb29870b5	Relaxed bitcast with pointers (#2878 ) * When input or result is a pointer type also allow 32-bit integer vectors for the other type * Relaxation only applies to SPIR-V 1.5 or in the presence of SPV_KHR_physical_storage_buffer * new tests	2019-09-18 11:55:39 -04:00
Raun Krisch	99793fa67d	Adding valilidation checks for OpEntryPoint duplicate names and execution mode (#2862 )	2019-09-16 19:13:30 -04:00
alan-baker	9325619353	Extra resource interface validation (#2864 ) * Vulkan specific checks * storage buffer variables must be structs or arrays of structs * storage buffer struct must be Block decorated * uniform struct must be Block or BufferBlock decorated * new tests	2019-09-16 10:46:31 -04:00
alan-baker	5a48c0da15	SPIRV-Tools support for SPIR-V 1.5 (#2865 ) * Ensure same enum values have consistent extension lists * val: fix checking of capabilities The operand for an OpCapability should only be checked for the extension or core version. The InstructionPass registers a capability, and all its implied sub-capabilities before actually checking the operand to an OpCapability. * Add basic support for SPIR-V 1.5 - Adds SPV_ENV_UNIVERSAL_1_5 - Command line tools default to spv1.5 environment - SPIR-V 1.5 incorporates several extensions. Now the disassembler prefers outputing the non-EXT or non-KHR names. This requires updates to many tests, to make strings match again. - Command line tests: Expect SPIR-V 1.5 by default * Test validation of SPIR-V 1.5 incorporated extensions Starting with 1.5, incorporated features no longer require the associated OpExtension instruction.	2019-09-13 14:59:02 -04:00
Alastair Donaldson	ad7f2c5c4c	Add fuzzer pass to copy objects (#2853 ) A new fuzzer pass that randomly introduces OpCopyObject instructions that make copies of ids, and uses the fact manager to record the fact that an id %id is synonymous with an id generated by an OpCopyObject applied to %id. (A future pass will exploit such synonym facts.)	2019-09-11 23:45:20 +01:00
Ryan Harrison	67b87f22cf	Handle another case where creating a constant can fail (#2854 ) Fixes #2847	2019-09-11 17:18:05 -04:00
Steven Perron	c7a39bc40f	Don't inline function containing OpKill (#2842 ) If an OpKill instruction is inlined into a continue construct, then the spir-v is no longer valid. To avoid this issue, we do inline into an OpKill at all. This method was chosen because it is difficult to keep track of whether or not you are in a continue construct while changing the function that is being inlined into. This will work well with wrap OpKill because every will still be inlined except for the OpKill instruction itself. Fixes #2554 Fixes #2433 This reverts commit `aa9e8f5380`.	2019-09-11 13:26:55 -04:00
Steven Perron	4f9256db35	Handle id overflow in wrap op kill. (#2851 ) Fixes https://crbug.com/997729	2019-09-11 13:26:42 -04:00
David Neto	9f188e3374	Assembler: Can't set an ID in instruction without result ID (#2852 ) Fix tests that violated this rule. Fixes #2257	2019-09-11 13:15:25 -04:00
Ryan Harrison	c0e9807094	Handle creating a new constant failing gracefully (#2848 ) Fixes #2847	2019-09-10 12:51:19 -04:00
Alastair Donaldson	e2e95172df	Rework management of probabilities in spirv-fuzz (#2839 ) Before this change there was quite a lot of duplication in the code being used to choose random percentages, and some of it was incorrect so that a percentage chance of (100-N)% instead of N% was being used. Also there was a lot of duplicate code to choose a random index into a vector. This change eliminates that duplication (fixing up the percentage problem), and gets rid of direct access to the random number generator being used for fuzzing, so that all randomization requests must go through the FuzzerContext class, discouraging future ad-hoc uses of the random number generator.	2019-09-10 15:02:25 +01:00
Alastair Donaldson	7ee8f443ea	Fix add-dead-break and add-dead-continue passes to respect dominance (#2838 ) The implementation of these passes had overlooked the fact that adding a new edge to a control flow graph can change dominance information. Adding a dead break/continue risks causing uses to no longer be dominated by their definitions. This change introduces various tests to expose such scenarios, and augments the preconditions for these transformations with checks to guard against the situation.	2019-09-10 14:48:27 +01:00
Steven Perron	35c9518c4e	Handle id overflow in the ssa rewriter. (#2845 ) * Handle id overflow in the ssa rewriter. Remove LocalSSAElim pass at the same time. It does the same thing as the SSARewrite pass. Then even share almost all of the same code. Fixes crbug.com/997246	2019-09-10 09:38:23 -04:00
Steven Perron	7f7236f1eb	Handle id overflow in the constant manager. (#2844 ) Fixes crbug.com/997246	2019-09-09 15:12:26 -04:00
alan-baker	a464ac1a27	Add generic builtin validation of target (#2843 ) * Validate the target's opcode is acceptable * Update tests * New tests * move early exit for builtins a bit later in the pass	2019-09-09 14:53:30 -04:00
Steven Perron	6797173cf6	Don't register duplicate decoration in validator. (#2841 ) As far as I know, it is legal to have multiple decoration adding the same decoration to the same id. The validator registers all of these decoration as if they were distinct decorations. This can cause poor memory usage and performance in some cases. This fix is to make sure that duplicates are not registers. I keep the type of the decoration list as an std::vector because I expect it to be small enough in most cases that the linear search will still be faster that using some type of map. No tests are added because we do not have a mechanism to test memory usage in our unit tests. Fixes #2837. The total memory usage drop to 14,236KB.	2019-09-09 12:55:44 -04:00
Steven Perron	76261e2a7d	Replace CubeFaceCoord and CubeFaceIndexAMD (#2840 ) Part of #2814.	2019-09-06 17:11:37 -04:00
Steven Perron	b218ad1994	Fold Min, Max, and Clamp instructions. (#2836 ) Fixes #2830.	2019-09-05 13:30:03 -04:00
Steven Perron	a41520eaa4	Replace uses of SPV_AMD_shader_trinary_minmax extension (#2835 ) Part of #2814	2019-09-05 09:29:04 -04:00
rumblehhh	1dfb5fc12e	Export SPIRV-Tools targets on installation (#2785 ) This allows the targets to be used in other cmake projects. See the following for more details: https://cmake.org/cmake/help/latest/manual/cmake-packages.7.html#creating-packages https://foonathan.net/blog/2016/07/07/cmake-dependency-handling.html	2019-09-04 12:45:26 -04:00
greg-lunarg	c77045b4a0	Instrument: Be sure Float16 capability on when generating float16 null (#2831 )	2019-09-03 15:19:36 -04:00
greg-lunarg	d11725b1d4	Add --relax-float-ops and --convert-relaxed-to-half (#2808 ) The first pass applies the RelaxedPrecision decoration to all executable instructions with float32 based type results. The second pass converts all executable instructions with RelaxedPrecision result to the equivalent float16 type, inserting converts where necessary.	2019-09-03 13:22:13 -04:00
Steven Perron	b54d950298	Fold Fmix should accept vector operands. (#2826 ) Fixes #2819	2019-09-03 09:17:18 -04:00
Alastair Donaldson	2c5ed16ba9	Fix end comments in header files (#2829 ) The end comments for the #ifndef ... #endif macros in various header files containd a stray #define.	2019-09-02 17:31:27 -04:00
Ben Clayton	65e362b7ae	AggressiveDCEPass: Set modified to true when appending to to_kill_ (#2825 ) Also add an assertion that these `modified` is true if to_kill_ has a non-zero size to catch this sort of issue in the pass. Fixes: #2824	2019-08-30 16:27:22 -04:00
Steven Perron	d67130caca	Replace SwizzleInvocationsAMD extended instruction. (#2823 ) Part of #2814	2019-08-30 14:07:24 -04:00
Steven Perron	ad71c057c7	Replace SwizzleInvocationsMaskedAMD extended instruction. (#2822 ) Part of #2814	2019-08-30 10:48:42 -04:00
Steven Perron	35d98be3bc	Amd ext to khr (#2811 ) Add the first steps to removing the AMD extension VK_AMD_shader_ballot. Splitting up to make the PRs smaller. Adding utilities to add capabilities and change the version of the module. Replaces the instructions: OpGroupIAddNonUniformAMD = 5000 OpGroupFAddNonUniformAMD = 5001 OpGroupFMinNonUniformAMD = 5002 OpGroupUMinNonUniformAMD = 5003 OpGroupSMinNonUniformAMD = 5004 OpGroupFMaxNonUniformAMD = 5005 OpGroupUMaxNonUniformAMD = 5006 OpGroupSMaxNonUniformAMD = 5007 and extentend instructions WriteInvocationAMD = 3 MbcntAMD = 4 Part of #2814	2019-08-29 12:48:17 -04:00
Ben Clayton	5a581e738c	spvtools::Optimizer - don't assume original_binary and optimized_binary are aliased (#2799 ) If they are not aliased, the function will always print the message: "Binary unexpectedly changed despite optimizer saying there was no change" Which is (usually) totally bogus. Fixes #2798	2019-08-29 10:04:55 -04:00
Steven Perron	73422a0a5e	Check feature mgr in context consistency check (#2818 ) We add a check that the feature manager is correcter after each pass. This resulted in a couple failing tests cases. Those are fixed. Part of #2814	2019-08-28 11:49:16 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not tests what they were suppose to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
Alastair Donaldson	8336d1925f	Extend reducer to remove relaxed precision decorations (#2797 ) Adds a reduction pass that removes OpDecorate and OpMemberDecorate instructions that annotate instructions and members with RelaxedPrecision. As well as being useful in its own right, removing such references allows other passes to remove further instructions.	2019-08-22 23:33:09 +01:00
Steven Perron	b00ef0d26e	Handle Id overflow in private-to-local (#2807 ) We need to handle id overflow in the private to local pass. Fixes https://crbug.com/962295	2019-08-22 09:14:48 -04:00
Steven Perron	aef8f92b2b	Even more id overflow in sroa (#2806 ) Now we need to handle id overflow when we overflow while replacing uses of the variable. While looking at this code, I noticed an error in the way we handle access chains that cannot be replaced because of overflow. Name it will make some change, and then give up by returning SuccessWithoutChange. But it was changed. This is fixed up by returning Failure if we notice the error at the time of rewriting the users. This is for both id overflow or out-of-bounds accesses. Code is added to "CheckUses" to remove variables that have out-of-bounds accesses from the candidate list, so we don't even try to rewrite its uses. Fixes https://crbug.com/995032	2019-08-21 13:12:42 -04:00
Steven Perron	c5d1dab99e	Add name for variables in desc sroa (#2805 ) Fixes #2802.	2019-08-21 10:55:02 -04:00
David Neto	0cbdc7a2c3	Remove unimplemented method declaration (#2804 )	2019-08-20 08:53:27 -04:00
Steven Perron	bc62722b80	Handle overflow in wrap-opkill (#2801 ) Fixes https://crbug/994203	2019-08-18 19:00:18 -04:00
Steven Perron	9cd07272a6	More handle overflow in sroa (#2800 ) If we run out of ids when creating a new variable, sroa does not recognize the error, and continues doing work. This leads to segmentation faults. Fixes https://crbug/969655	2019-08-16 13:15:17 -04:00
greg-lunarg	06407250a1	Instrument: Add support for Buffer Device Address extension (#2792 )	2019-08-16 09:18:34 -04:00
Toomas Remmelg	7b4e5bd5ec	Update remquo validation to match the OpenCL Extended Instruction Set Specification (#2791 )	2019-08-15 09:38:37 -04:00
Jaebaek Seo	ff872dc6bf	Change the way to include header (#2795 ) `#include <source/util/string_utils.h>` works only when we specify `include_directories(${CMAKE_CURRENT_SOURCE_DIR}/)` in cmake. It is hard to set the source directory as a include path in some build systems e.g., bazel. Using the relative path easily solves this issue. This commit uses `#include "source/util/string_utils.h"` instead of `#include <source/util/string_utils.h>`.	2019-08-14 18:09:20 -04:00
alan-baker	bbd80462f5	Fix validation of constant matrices (#2794 ) Fixes #2793 * Don't special case matrix validation compared to other composites * just check the constituents are constants or undefs * later checking validates the column type * new test	2019-08-14 11:26:41 -04:00
Steven Perron	60043edfa1	Replace OpKill With function call. (#2790 ) We are no able to inline OpKill instructions into a continue construct. See #2433. However, we have to be able to inline to correctly do legalization. This commit creates a pass that will wrap OpKill instructions into a function of its own. That way we are able to inline the rest of the code. The follow up to this will be to not inline any function that contains an OpKill. Fixes #2726	2019-08-14 09:27:12 -04:00
greg-lunarg	95386f9e45	Instrument: Fix version 2 output record write for tess eval shaders. (#2782 ) Fix output record write for tess eval shaders. Also change command line for bindless instrumentation to use use output record version 2.	2019-08-09 08:22:41 -04:00
Steven Perron	4b64beb1ae	Add descriptor array scalar replacement (#2742 ) Creates a pass that will replace a descriptor array with individual variables. See #2740 for details. Fixes #2740.	2019-08-08 10:53:19 -04:00
greg-lunarg	29af42df12	Add SPV_EXT_physical_storage_buffer to opt whitelists (#2779 ) This also fixes ADCE to not remove possibly needed OpTypeForwardPointer. The bug, its fix and the corresponding test have a circular dependency with the extension, so they are packaged together.	2019-08-08 09:45:59 -04:00
Steven Perron	b029d3697e	Handle RelaxedPrecision in SROA (#2788 ) If a member of a struct has a relaxed precision, sroa will not split the struct. This means we do not get all cases. This commit handles these cases. The other part is that the decoration needs to be passed on to the new variables. Fixes #2786	2019-08-07 12:17:26 -04:00
Alastair Donaldson	698b56a8f0	Add 'copy object' transformation (#2766 ) This transformation can introduce an instruction that uses OpCopyObject to make a copy of some other result id. This change introduces the transformation, but does not yet introduce a fuzzer pass to actually apply it.	2019-08-05 18:00:13 +01:00
Geoff Lang	0b70972a29	Remove extra ';' after member function definition. (#2780 ) This fixes a clang compiler warning about extra semicolons.	2019-08-01 19:33:55 -04:00
Ryan Harrison	5ada98d0bb	Update WebGPU validation rules of OpAtomic*s (#2777 ) Fixes #2723	2019-07-31 17:15:47 -04:00
alan-baker	3726b500b1	Treat access chain indexes as signed in SROA (#2776 ) Fixes #2768 * In scalar replacement, interpret access chain indexes as signed counts * Use Constant::GetSignExtendedValue and Constant::GetZeroExtendedValue where appropriate * new tests	2019-07-31 15:39:33 -04:00
David Neto	31590104ec	Add pass to inject code for robust-buffer-access semantics (#2771 ) spirv-opt: Add --graphics-robust-access Clamps access chain indices so they are always in bounds. Assumes: - Logical addressing mode - No runtime-array-descriptor-indexing - No variable pointers Adds stub code for clamping coordinate and samples for OpImageTexelPointer. Adds SinglePassRunAndFail optimizer test fixture. Android.mk: add source/opt/graphics_robust_access_pass.cpp Adds Constant::GetSignExtendedValue, Constant::GetZeroExtendedValue	2019-07-30 19:52:46 -04:00
Ryan Harrison	4a28259cc8	Update OpMemoryBarriers rules for WebGPU (#2775 ) Part of #2724	2019-07-30 14:50:55 -04:00
David Neto	ac3d131054	Element type is const for analysis::Vector,Matrix,RuntimeArray (#2765 ) This makes it symmetric with the result type of ...->element_type which returns a const Type. So now we can write code like this: analysis::Vector v = ... analysis::Vector(v->element_type(), 2);	2019-07-29 22:55:18 -04:00
Diego Novillo	49797609b7	Protect against out-of-bounds references when folding OpCompositeExtract (#2774 ) This fixes #2608. The original test case had an out-of-bounds reference that ended up folding into OpCompositeExtract that was indexing right outside the constant composite. The returned constant would then cause a segfault during constant propagation.	2019-07-29 13:27:40 -07:00
alan-baker	7fd2365b06	Don't move debug or decorations when folding (#2772 ) Fixes #2764 * Don't replace all uses when simplifying instructions, instead only update non-debug, non-decoration uses * added a test * Add a new version of RAUW that takes a predicate to decide whether to replace the use or not * used in simplification pass	2019-07-29 16:20:43 -04:00
Ryan Harrison	7bafeda284	Update OpControlBarriers rules for WebGPU (#2769 ) * Update OpControlBarriers rules for WebGPU Part of #2724	2019-07-29 12:53:27 -04:00
Diego Novillo	9559cdbdf0	Fix #2609 - Handle out-of-bounds scalar replacements. (#2767 ) * Fix #2609 - Handle out-of-bounds scalar replacements. When SROA tries to do a replacement for an OpAccessChain that is exactly one element out of bounds, the code was trying to access its internal array of replacements and segfaulting. This protects the code from doing this, and it additionally fixes the way SROA works by not returning failure when it refuses to do a replacement. Instead of failing the optimization pass, SROA will now simply refuse to do the replacement and keep going. Additionally, this patch fixes the SROA logic to now return a proper status so we can correctly state that the pass made no changes to the IR if it only found invalid references.	2019-07-26 12:33:40 -04:00
Steven Perron	bb0e2f65bb	Fix check for unreachable blocks in merge-return (#2762 ) Merge return expects unreachable merge block to look a certain way, and unreachable continue blocks to look a certain way. What if an unreachable block is both a merge and a continue? The continue is suppose to take precedent, but merge-return implements it with the merge taking precedent. This change flips that around. Fixes #2746	2019-07-25 09:34:18 -04:00
Alastair Donaldson	1a89ac8b28	Transformation and fuzzer pass to add dead continues (#2758 ) Similar to the existing 'add dead breaks' pass, this adds a pass to add dead continues to blocks in loops where such a transformation is viable. Various functionality common to this new pass and 'add dead breaks' has been factored into 'fuzzer_util', and some small improvements to 'add dead breaks' that were identified while reviewing that code again have been applied. Fixes #2719.	2019-07-25 13:50:33 +01:00
Steven Perron	c7fcb8c3b9	Process OpDecorateId in ADCE (#2761 ) * Process OpDecorateId in ADCE When there is an OpDecorateId instruction that is live, the ids that is references must be kept live. This change adds them to the worklist. I've also updated a validator check to allow OpDecorateId to be able to apply to decoration groups. Fixes #1759. * Remove dead code.	2019-07-24 14:43:49 -04:00
Steven Perron	fb83b6fbb5	Record correct dominators in merge return (#2760 ) In merge return, we need to know the original dominator for a block in order to traverse code from the original dominator to the new dominator and add appropriate Phi nodes. The current code gets this wrong because the dominator tree is build as needed. The first time we get the immediate dominator for a function we just built the dominator tree and it takes into account that a block has been split. The second time it does not. This inconsistency needs to be fixed. We do that by recording the original dominator for all blocks at the start of the pass. If we were to record just the basic block, that could change if the block is split. We want to traverse the code in the body of the original dominator, whatever block it ends up in. To make this easy to track, we not save the terminator instruction to represent the original dominator. Fixes #2745	2019-07-24 13:56:54 -04:00
Steven Perron	c9190a54da	SSA rewriter: Don't use trivial phis (#2757 ) When a phi candidate is marked as trivial, we are suppose to update all of its uses to the reference the value that it is being folded to. However, the code updates the uses misses `defs_at_block_`. So at a later time, the id for the trivial phi can reemerge. Fixes #2744	2019-07-23 17:59:30 -04:00
alan-baker	aea4e6b1b9	Fix block depth rule priority (#2755 ) Fixes #2743 * Continue depth calculation should take precedence over merge calculation	2019-07-23 13:57:44 -04:00
alan-baker	a94ddc267c	Case validation with repeated labels (#2689 ) Fixes #2686 * Update validation to handle the default case being mentioned multiple times * new tests	2019-07-23 11:23:32 -04:00
greg-lunarg	3855447d93	Bindless Instrument: Make init check depend solely on input_init_enabled (#2753 ) * Bindless Instrument: Make init check depend solely on input_init_enabled Previously was dependent on presense of descriptor_indexing extension in SPIR-V, but this missed some cases. Tests updated to refect this new policy. * Fix format.	2019-07-22 13:51:39 -04:00
Kévin Petit	11516c0b9a	Validate storage class OpenCL environment rules for atomics (#2750 ) This change refactors all storage class validation for atomics to reflect the similar refactoring in the specification. It is currently not possible to write a test for the check rejecting Generic in an OpenCL 1.2 environment as the required GenericPointer capability isn't allowed there. I've decided to keep the check nonetheless to guard against the capability becoming available without the rules for atomics being updated. The ID changes in existing tests aren't ideal but introducing names drags in a substantial refactoring of this file. Contributes to #2595. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-07-22 08:38:42 -04:00
Jason Macnak	bac82f49aa	Allow LOD ops in compute shaders with derivative group execution modes (#2752 ) Also update existing derivative check to be based on the execution mode instead of just the extension being present. More info about extension: - https://github.com/KhronosGroup/SPIRV-Registry/blob/master/extensions/NV/SPV_NV_compute_shader_derivatives.asciidoc	2019-07-22 08:37:44 -04:00
David Neto	76b75c40a1	Document opt::Instruction::InsertBefore methods (#2751 )	2019-07-18 11:37:28 -04:00
Steven Perron	aa9e8f5380	Revert "Do not inline OpKill Instructions (#2713 )" (#2749 ) This reverts commit `fe7cc9c612`.	2019-07-17 14:59:05 -04:00
Jeff Bolz	58e2ec25ba	For Vulkan, disallow structures containing opaque types (#2546 )	2019-07-16 16:16:19 -04:00
Steven Perron	230c9e4371	Fix bug in merge return (#2734 ) * Fix bug in merge return The merge return pass seems to assume that the only new edges in the cfg are from return block to merge blocks. However, it is possible that a merge block branches to a merge block when it did not before. This change add a new variable to track all of the new edges. It also renames some other variables and cleans us the code to make it a bit easier to read. Fixes #2702.	2019-07-16 09:11:22 -04:00
Jason Macnak	1fedf72e50	Allow ray tracing shaders in inst bindle check pass. (#2733 ) Adds the ray tracing stages (ray gen, intersection, any hit, closest hit, miss, and callable) to the allowed stages in pass instrumentation and add debug records for these stages to output the global launch id. More information for ray tracing shaders: - https://github.com/KhronosGroup/GLSL/blob/master/extensions/nv/GLSL_NV_ray_tracing.txt	2019-07-15 16:24:42 -04:00
greg-lunarg	92c41ff1e7	Remove Common Uniform Elimination Pass (#2731 ) Remove Common Uniform Elimination Pass Fixes #2520.	2019-07-12 11:02:10 -04:00
Ryan Harrison	55adf4cf70	Update execution scope rules for WebGPU (#2730 ) Fixes #2722	2019-07-11 14:37:36 -04:00
alan-baker	1a2de48a12	Extra small storage validation (#2732 ) Fixes #2729 * Check acceptable uses of small type generators	2019-07-11 13:05:14 -04:00
Jeff Bolz	327963765b	Add validation for SPV_EXT_demote_to_helper_invocation (#2707 )	2019-07-11 10:33:22 -04:00
Steven Perron	5ce8cf781f	Change the order branches are simplified in dead branch elim (#2728 ) Dead branch elimination needs to know about the constructs that a block is contained it when determining what to do with its merge instruction. We currently fold branches in block as we see them, which is parent constructs before their children. This causes the struct cfg analysis to crash because it tries to get the parent construct for a block after the parent has been folded. This can be fixed by folding the branch of the children before the parents. Fixes #2667.	2019-07-10 14:59:44 -04:00
Thomas Roughton	cd153db8ed	Add —preserve-bindings and —preserve-spec-constants (#2693 ) Add optimizer options to for preservation of spec constants and variable with binding decorations. They are to be preserved even if they are unused.	2019-07-10 14:12:19 -04:00
Steven Perron	86e45efe15	Handle decorations better in some optimizations (#2716 ) There are a couple spots where we are not looking at decorations when we should. 1. Value numbering is suppose to assign a different value number to ids if they have different decorations. However that is not being done for OpCopyObject and OpPhi. 1. Instruction simplification is propagating OpCopyObject instruction without checking for decorations. It should only do that if no decorations are being lost. Add a new function to the decoration manager to check if the decorations of one id are a subset of the decorations of another. Fixes #2715.	2019-07-10 11:37:16 -04:00
Ryan Harrison	3a252a267b	Update memory scope rules for WebGPU (#2725 ) Fixes #2721	2019-07-10 10:34:50 -04:00
alan-baker	0c4feb643b	Remove extra semis (#2717 ) * Remove extra semi-colons * Update re2 dep	2019-07-08 15:07:36 -04:00
alan-baker	456cc598af	Validate usage of 8- and 16-bit types with only storage capabilities (#2704 ) Fixes #2669 * Check capabilities when validating variables * validate load and store types * Constant check * Don't checks pointers for stores, constants and loads * Validate composite instructions * Validate conversions for 8- and 16-bit limited types * Unified tests and expanded them * Disallow OpCopyMemory * new tests and update old tests	2019-07-08 14:10:13 -04:00
Alastair Donaldson	b8ab80843f	Shrinker for spirv-fuzz (#2708 ) Adds to spirv-fuzz the option to shrink a sequence of transformations that lead to an interesting binary to be generated, to find a smaller sub-sequence of transformations that still lead to an interesting (but hopefully simpler) binary being generated. The notion of what counts as "interesting" comes from a user-provided script, the "interestingness function", similar to the way the spirv-reduce tool works. The shrinking process will give up after a maximum number of steps, which can be configured on the command line. Tests for the combination of fuzzing and shrinking are included, using a variety of interestingness functions.	2019-07-07 08:55:30 +01:00
Steven Perron	37e8f79946	Perform merge return with single return in loop. (#2714 ) Inlining does not inline functions that have a single return that is in a loop. This is because the return cannot be replaced by a branch outside of the loop easily. Merge return knows how to rewrite the function so the return is replaced by a branch. Fixes #2038.	2019-07-04 14:14:49 -04:00
Steven Perron	fe7cc9c612	Do not inline OpKill Instructions (#2713 ) It is illegal to inline an OpKill instruction into a continue construct because the continue header will no longer dominate the backedge. This commit adds a check for this, and does not inline. If we still want to be able to inline a function that contains an OpKill, we can add a new pass that will wrap OpKill instructions into its own function with just the single instruction. I do not believe that this is a common case right now, so I will not do that yet. Fixes #2433.	2019-07-04 12:08:23 -04:00
Alastair Donaldson	5a93e07392	Refactor reducer options (#2709 ) Avoids polluting the global namespace with a constant, and moves constructor to .cpp file as is done for spirv-reduce's options.	2019-07-04 11:11:42 +01:00
Caio Marcelo de Oliveira Filho	9702d47c6f	Validate that in OpenGL env block variables have Binding (#2685 ) * Add spvIsOpenGLEnv helper * Validate that in OpenGL env block variables have Binding	2019-07-02 08:11:20 -04:00
Jason Macnak	e6e3e2ccc6	Update type for loaded builtin GlobalInvocationID in pass instrumentation (#2705 ) When working on descriptor indexing validation for compute shaders, the gl_GlobalInvocationID builtin was being loaded as uint which would cause compute shaders instrumented by the bindless check pass to have: %83 = OpLoad %uint %gl_GlobalInvocationID %84 = OpCompositeExtract %uint %83 0 %85 = OpCompositeExtract %uint %83 1 %86 = OpCompositeExtract %uint %83 2 which results in validation failures: error: line 127: Reached non-composite type while indexes still remain to be traversed. %84 = OpCompositeExtract %uint %83 0 for trying to extract a uint from a uint.	2019-06-28 09:46:16 -04:00
Alastair Donaldson	6ccb52b864	Warn when input facts are invalid. (#2699 ) Fixes #2621. Instead of aborting when an invalid input fact is provided, the tool now warns about the invalid fact and then ignores it. This is convenient for example if facts are specified about uniforms with descriptor sets and bindings that happen to not be present in the input binary.	2019-06-26 16:40:19 +01:00
Alastair Donaldson	efde682369	Disallow movement of unreachable blocks. (#2700 ) Fixes #2695. Allowing unreachable blocks to be moved can lead to an unreachable block A getting placed after an unreachable successor B, which is a problem if B uses ids that A generates.	2019-06-26 15:32:25 +01:00
Alastair Donaldson	dfcb5a1e10	Refactor fuzzer transformations (#2694 ) Introduced abstract class for transformations, and refactored all transformations to inherit from this abstract class.	2019-06-25 20:49:46 +01:00
Józef Kucia	888aeef8a9	Fix Component decoration validation for arrays (#2697 )	2019-06-25 13:28:16 -04:00
Kévin Petit	df86bb44fe	Replace global static map with an array of pairs (#2691 ) * Replace global static map with an array of pairs \#2687 introduced a global static map, which isn't allowed by the style guide and caused an issue in DXC. This change replaces it with an array of pairs. Signed-off-by: Kévin Petit <kpet@free.fr> * Replace constexpr with const Signed-off-by: Kévin Petit <kpet@free.fr>	2019-06-21 08:47:27 -04:00
Józef Kucia	7c294608ca	Basic validation for Component decorations (#2679 ) * Add basic validation for Component decoration * Add validator tests for Component decoration	2019-06-20 18:16:12 -04:00
alan-baker	2b84d25f10	Fix store to uniform Vulkan check (#2688 ) * Wrong operands were used for pointer and array types * added tests to catch the wierd number corner	2019-06-20 14:22:41 -04:00
Kévin Petit	bec7e0393f	Add all accepted target environments to the tools' help texts (#2687 ) Several tools take a --target-env option to specify the SPIR-V environment to use. They all use spvParseTargetEnv to parse the user-specified string and select the appropriate spv_target_env but all tools list only _some_ of the valid values in their help text. This change makes the help text construction automatic from the full list of valid values, establishing a single source of truth for the values printed in the help text. The new utility function added allows its user to specify padding and wrapping constraints so the produced strings fits well in the various help texts. Signed-off-by: Kévin Petit <kpet@free.fr>	2019-06-20 09:41:28 -04:00
Alastair Donaldson	51b0d5ce50	Represent uniform facts via descriptor set and binding. (#2681 ) * Represent uniform facts via descriptor set and binding. Previously uniform facts were expressed with resepect to the id of a uniform variable. Describing them with respect to a descriptor set and binding is more convenient from the point of view of expressing facts about a shader without requiring analysis of its SPIR-V. * Fix equality testing for uniform buffer element descriptors. The equality test now checks that the lengths of the index vectors match. Added a test that exposes the previous omission.	2019-06-19 20:45:14 +01:00
Ehsan	a132c9b640	Whitelist SPV_GOOGLE_user_type. (#2673 )	2019-06-19 12:18:13 -04:00
Alastair Donaldson	001e823b65	Add fuzzer pass to obfuscate constants. (#2671 ) Adds a new transformation that can replace a constant with a uniform known to have the same value, and adds a fuzzer pass that (a) replaces a boolean with a comparison of literals (e.g. replacing "true" with "42 > 24"), and then (b) obfuscates the literals appearing in this comparison by replacing them with identically-valued uniforms, if available. The fuzzer_replayer test file has also been updated to allow initial facts to be provided, and to do error checking of the status results returned by the fuzzer and replayer components.	2019-06-18 18:41:08 +01:00
alan-baker	2090d7a2d2	Handle volatile memory semantics in upgrade (#2674 ) * If an atomic is decorated with volatile add the volatile bit to its memory semantics	2019-06-17 16:01:37 -04:00
alan-baker	3d5fb7b908	Validate Volatile memory semantics bit (#2672 ) * Can only be used with Vulkan memory model * Can only be used with atomics * Bit setting must match for compare exchange opcodes * Updated memory semantics checks to allow constant instructions generally with CooperativeMatrixNV	2019-06-17 13:35:40 -04:00
alan-baker	400dbde0ba	Disallow stores to UBOs (#2651 ) Fixes #2638 * Adds a check that errors out if there is a store to a UBO in the Vulkan environment * tests * Function to trace pointers	2019-06-17 13:13:07 -04:00
alan-baker	59983a6010	Validate variable initializer type (#2668 ) Fixes #249 * The pointed to type of Result Type must match the initializer type * Had to update some opt tests to be valid	2019-06-15 00:34:18 -04:00
Alastair Donaldson	42830e5a68	Add replayer tool for spirv-fuzz. (#2664 ) The replayer takes an existing sequence of transformations and applies them to a module. Replaying a sequence of transformations that were obtained via fuzzing should lead to an identical module to the module that was fuzzed. Tests have been added to check for this.	2019-06-13 14:08:33 +01:00
alan-baker	b4bf7bcf0a	Add validation for Subgroup builtins (#2637 ) Fixes #2611 * Validates builtins in the Vulkan environment: * NumSubgroups * SubgroupId * SubgroupEqMask * SubgroupGeMask * SubgroupGtMask * SubgroupLeMask * SubgroupLtMask * SubgroupLocalInvocationId * SubgroupSize	2019-06-13 08:47:05 -04:00
Alastair Donaldson	9c0830133b	Add constant == uniform facts. (#2660 ) Adds a new (and first) kind of fact to the fact manager, which is that a specific uniform value is guaranteed to be equal to a specific constant. The point of this is that such information (if known to be true by some external source) can be used by spirv-fuzz to transform the module in interesting ways that a static compiler cannot reverse via compile-time analysis. This change introduces protobuf messages for the fact, and adds capabilities to the fact manager to store this kind of fact and provide information about it.	2019-06-11 15:56:08 +01:00
Steven Perron	208d3132e6	Cast __LINE__ to size_t (#2661 ) Fixes #2648	2019-06-07 13:06:42 -04:00
Alastair Donaldson	a8ae579f7a	Add transformation to replace a boolean constant with a numeric comparison (#2659 ) The transformation can, for example, replace "true" with "12.0 > 6.0", if constants for those floating-point values are available. This introduces a new 'id use descriptor' structure, which provides a way to describe a particular use of an id, and which will be heavily used in future transformations. Describing an id use is trivial if the use occurs in an instruction that itself generates an id, but is less straightforward if the id of interest is used by an instruction such as OpStore that does not have a result id. The 'id use descriptor' structure caters for such cases.	2019-06-06 22:22:35 +01:00
Daniel Koch	0755d6ce82	Add builtin validation for SPV_NV_shader_sm_builtins (#2656 ) Also add a Builtin test generator variant that takes capabilities and extensions. Tests - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in Vertex, Fragment, TessControl, TessEval, Geometry, and Compute. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in MeshNV and TaskNV shaders. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are accepted as Inputs in the 6 ray tracing stages - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as Outputs. - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as non-scalar integers (f32, uvec3) - verify that the SMCountNV, SMIDNV, WarpsPerSMNV, and WarpIDNV Builtins are NOT accepted as non-32-bit integers (u64)	2019-06-06 14:53:48 -04:00
greg-lunarg	43fb2403a6	Instrument: Fix code for version 2 output format. (#2655 ) Correct record size. Also bring version 2 tests up to version 1 equivalence.	2019-06-06 11:35:34 -04:00
Alastair Donaldson	08cc49ec59	Fix bug in 'split blocks', and add tests for fuzzer. (#2658 ) There turned out to be a bug in the 'split blocks' transformation due to blocks being split while they were being iterated over. This change fixes that issue, and adds tests that were able to expose the issue by running the fuzzer on some example shaders.	2019-06-05 21:54:47 +01:00
David Neto	d01a3c3b4b	Optimizer: Handle array type with OpSpecConstantOp length (#2652 ) When it's an OpConstant or OpSpecConstant, then the literal values are compared. If the OpSpecConstant also has a SpecId decoration, then that's also compared. Otherwise, it's an OpSpecConstantOp and we only compare the ID of the OpSpecConstantOp instruction itself. Fixes #2649	2019-06-05 16:35:50 -04:00
Alastair Donaldson	4a00a80c40	Add fuzzer pass to add dead breaks. (#2654 ) This pass randomly add breaks to the merge blocks of selection and loop constructs, such that the breaking edges will not be dynamically reachable.	2019-06-05 08:02:16 +01:00
Alastair Donaldson	620197bd65	Add fuzzer pass that adds useful constructs to a module (#2647 ) This new pass adds some basic ingredients to a module on which future passes are likely to depend, such as boolean constants and some specfic integer and floating-point values. This is not a fuzzer pass in the true sense in that it does not employ randomization, but it makes sense to define it as a fuzzer pass since it is the first of a number of transformations passes that the fuzzer will run on a module.	2019-06-04 14:55:00 +01:00
Jeff Bolz	2c0111e6eb	Add validation for SPV_EXT_fragment_shader_interlock (#2650 )	2019-06-03 10:55:07 -04:00
Ryan Harrison	699e167d78	Remove asserts from GetUnderlyingType (#2646 ) Fixes #2463	2019-05-31 08:57:41 -07:00
Kévin Petit	f99d7ad5c0	Validate OpenCL rules for ImageRead and OpImageSampleExplicitLod (#2643 ) Fixes #2594. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-31 10:05:34 -04:00
Alastair Donaldson	209ff0ce90	Add spirv-fuzz pass to permute blocks. (#2642 ) The blocks within each function in the module will be permuted in a randomized manner that respects dominance.	2019-05-31 09:59:06 +01:00
Pierre Moreau	e7866de4b1	Linker: Better type comparison for OpTypeArray and OpTypeForwardPointer (#2580 ) * Types: Avoid comparing IDs for in Type::IsSameImpl When linking, we end up with duplicate types for imported and exported types, that needs to be removed. The current code would reject valid import/export pairs of symbols due to IDs mismatch, even if the types or constants behind those ID were the same. Enabled remaining type_match_test Fixes #2442	2019-05-29 16:12:02 -04:00
Ryan Harrison	0125b28ed4	Add compact ids to WebGPU <-> Vulkan transformations (#2639 ) Fixes #2634	2019-05-29 12:58:37 -07:00
greg-lunarg	3d62cb8148	Instrument: Add version 2 of record formats (#2630 ) New version has additional word in stage-specific section. Also some changes in content for tesselation and compute shaders. Either version can be invoked at pass creation. This is done to ease integration and updating of validation layers. Version 1 is deprecated and eventually will go away. Also sneaking in fix to version 1 compute shaders.	2019-05-29 15:08:21 -04:00
Alastair Donaldson	1b71e45338	Add "split block" transformation. (#2633 ) With this pass, the fuzzer can split blocks in the input module. This is mainly useful in order to give other (future) transformations more opportunities to apply.	2019-05-29 16:42:46 +01:00
Steven Perron	6c7db9c630	Handle nested breaks from switches. (#2624 ) * Handle nested breaks from switches. There was a recent decision made to allow branches to the merge node of a switch even if the switch is not the first enclosing construct. They can be generated by glslang from break statements in switches. Dead branch elimination seems to be the only optimization that will break because of this change, so I will update that optimizations. The change made are: - Track switches in structured cfg analysis. - In Dead branch elimination: - Look for nested breaks that will require a switch instruction. - Rewrite, but don't delete, switchs that are required even if it could be replaced by an unconditional branch. - When looking for the first break, consider the merge of a switch as well. See #2612. * Fix variable names and comments. * Add tests for the struct cfg analysis and switches. * Fix typos in comments.	2019-05-27 16:28:14 -04:00
Alastair Donaldson	fe9f870130	Add library for spirv-fuzz (#2618 ) Adds a library for spirv-fuzz, consisting of a Fuzzer class that will transform a module with respect to (a) facts about the module provided via a FactManager class, and (b) a source of random numbers and parameters to control the transformation process provided via a FuzzerContext class. Transformations will be applied via classes that implement a FuzzerPass interface, and both facts and transformations will be represented via protobuf messages. Currently there are no concrete facts, transformations nor fuzzer passes; these will follow.	2019-05-27 14:34:55 +01:00
dan sinclair	42abaa099a	Remove MarkV and Stats code. (#2576 ) * Remove MarkV and Stats code. This Cl removes the MarkV and Stats code from SPIRV-Tools. This code was unused and currently un-maintained.	2019-05-24 15:43:59 -04:00
Sahil Parmar	b8fe7211c4	Allow arrays of out per-primitive builtins for mesh shaders (#2617 ) - PrimitiveID, Layer, ViewportIndex * Add validation tests for mesh builtins	2019-05-23 15:08:59 -04:00
Kévin Petit	07a1019717	Validate OpenCL environment rules for OpImageWrite (#2619 ) Fixes #2593. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-23 08:35:14 -04:00
Ryan Harrison	4557d08584	Add in individual flags for Vulkan <-> WebGPU passes (#2615 ) Adds flags and/or documentation for individual transformation passes that had been missed in previous patches. Fixes #2574	2019-05-22 10:06:53 -07:00
Toomas Remmelg	13f61bf859	Update vloadn and vstoren validation to match the OpenCL Extended Instruction Set Specification (#2599 )	2019-05-22 08:09:50 -04:00
Steven Perron	d9c00e1d2d	Add folding rules for OpQuantizeToF16 (#2614 ) Adding the folding rules for OpQuantizeToF16, and fixed some matching tests to check identify new lines.	2019-05-21 23:15:01 -07:00
alan-baker	713da30b63	Disallow merge targeting block with OpLoopMerge (#2610 ) Fixes #2588 * Add a check that the merge block of OpLoopMerge may not be the block that contains the OpLoopMerge * add a test	2019-05-21 23:02:53 -07:00
alan-baker	60aaafbc70	Allows breaks selection breaks to switches (#2605 ) Fixes #2604 * Allow selection constructs to branch to the nearest selection merge whose header is terminated by an OpSwitch * Cleanup break and continue checks generally * add tests	2019-05-21 22:49:37 -07:00
Steven Perron	0982f0212e	Using the instruction folder to fold OpSpecConstantOp (#2598 ) In order to try to reduce code duplication and to be able to fold more cases, we want to use the instruction folder when folding an OpSpecConstantOp with constant operands. A couple other changes are need to make this work. First GetDefiningInstruction\| in the constant manager is able to handle \|type_id\| being logically equivalent to another type, so we updated the interface, and removed the assert. Some tests were also updated because we not generate better code because constants are not duplicated as much as before. No need for new tests. The functionality of the instruction folder is already tested. There are tests check that the instruction folder is being used correctly for OpCompositeExtract and OpVectorShuffle in the existing test cases. Fixes #2585.	2019-05-21 12:45:00 -04:00
Kévin Petit	9f035269d6	Validate OpenCL environment rules for OpTypeImage (#2606 ) It is currently not possible to use an Image Format that is not Unknown without requiring a capability forbidden by the OpenCL environment. As such the validation of Image Format currently leans on capability validation entirely. Fixes #2592. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-21 09:17:50 -04:00
Kévin Petit	47741f0504	Validate OpenCL memory and addressing model environment rules (#2589 ) Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-05-17 08:25:20 -04:00
alan-baker	ff4feb44b4	Validate construct exits (#2459 ) Validate structured exits from constructs * Add checks that exits from a construct are valid * Add Construct::IsStructuredExit() * uses specific rules for each type of construct * Added a test and check for #2213 * Adding tests for bad loop and continue exits * Fix identification of continue block that prevented some selections from having any blocks	2019-05-16 14:59:30 -07:00
greg-lunarg	9dfd4b8358	Bindless Validation: Instrument descriptor-based loads and stores (#2583 ) Essentially, support UBOs and SSBOs, scalar and array (sized and unsized).	2019-05-15 19:43:23 -04:00
alan-baker	7e7745fce8	Validate loop merge (#2579 ) Fixes #2559 * Validate OpLoopMerge including loop controls * add tests * fix some bad tests	2019-05-15 19:38:41 -04:00
alan-baker	fc7b5d8c6a	Mem model spv 1.4 (#2565 ) * Update memory model support for SPIR-V 1.4 Fixes #2552 * Upgrade memory model now supports two memory access operands for OpCopyMemory* * in all cases the pass will first generate two operands by either adding them or copying * updates accounts for multiple operands * tests	2019-05-15 19:06:37 -04:00
Steven Perron	84503583c6	Handle id overflow in sroa better. (#2582 ) There is a case where sroa is not handling id overflow gracefully. It is handled and an error message is output when the ids overflow. Fixes https://crbug.com/961030.	2019-05-15 09:29:28 -04:00
Steven Perron	e935dac9ef	Make pointers to isomorphic type interchangeable with option. (#2570 ) * Make pointers to logically matching types interchangeable with option. DXC will be generating code where the function parameters will be a more generic type that the actual parameter. They should be logically matching and the decorations of the actual parameter must be a superset of the decorations of the formal parameter. We want to accept this code with an options so that spirv-opt can then inline and fix the type mismatch. We will accept this under a new options `--before-hlsl-legalization`. The new option will also imply `relax-logical-pointer` so that HLSL frontends will need to use just the one more generic option. Moved the \|LogicallyMatches\| to the validation state to make it available in more places. Also added a parameter to have it check the decorations. I did not do a separate function for the decorations because checking the decorations involves making sure the types logically match anyway. Fixes #2535	2019-05-13 13:48:17 -04:00
alan-baker	2947e88f79	Update instrumentation passes to handle 1.4 interfaces (#2573 ) Fixes #2556 Added variables get added to entry point interfaces Add to input buffer too	2019-05-10 11:08:28 -04:00
greg-lunarg	06ce59b0b0	Instrument: Fix load type of pre-existing builtin (#2575 ) Builtins may be declared int, so load with its pointee type and cast to uint if needed.	2019-05-10 11:06:00 -04:00
alan-baker	87c4ef8a9c	Do not fold floating point if float controls used (#2569 ) Fixes #2558 * Mark floating point instructions as non-foldable if any SPV_KHR_float_controls capabilities are present * tests	2019-05-10 11:03:22 -04:00
alan-baker	45fb696668	Use last version (#2578 ) * Use grammar last version Fixes #2560 * Parse last version and use it in checks * Update grammar header generation * Fix NonWritable tests * Fix check and add specific tests	2019-05-10 11:02:01 -04:00
Ryan Harrison	f6d9a17843	Add pass to fix some invalid unreachable blocks for WebGPU (#2563 ) Attempts to split up unreachable blocks that are used both as a merge-block and a continue-target. Fixes #2429	2019-05-09 12:56:10 -04:00
Diego Novillo	89fe836fe2	Fix clang-tidy warning about definition/declaration mismatch. (#2571 ) Fix clang-tidy warning about definition/declaration mismatch.	2019-05-09 00:15:08 -04:00
David Neto	f2803c4a7f	VK_KHR_uniform_buffer_standard_layout validation (#2562 ) Add a command-line option to enable validating SPIR-V for implementations that support VK_KHR_uniform_buffer_standard_layout.	2019-05-08 18:01:10 -04:00
alan-baker	ea5e1b62e1	Update priv-to-local for SPIR-V 1.4 (#2567 ) Fixes #2555 * Fix a bug in validation where interfaces were considered non-unique between different entry points targeting the same function * added a test * Update private to local pass to remove localized private variables from entry point interfaces * added tests	2019-05-08 12:38:49 -04:00
alan-baker	b74d92a8c3	ADCE support for SPIR-V 1.4 entry points (#2561 ) Fixes #2551 * Add support for 1.4 entry point interface lists * only input and output variables are automatically live * can clean up interfaces after DCE * added tests * allow opt tests to specify a target environment	2019-05-07 14:52:22 -04:00
David Neto	63f57d95d6	Support SPIR-V 1.4 (#2550 ) * SPIR-V 1.4 headers, add SPV_ENV_UNIVERSAL_1_4 * Support --target-env spv1.4 in help for command line tools * Support asm/dis of UniformId decoration * Validate UniformId decoration * Fix version check on instructions and operands Also register decorations used with OpDecorateId * Extension lists can differ between enums that match Example: SubgroupMaskEq vs SubgroupMaskEqKHR * Validate scope value for Uniform decoration, for SPIR-V 1.4 * More unioning of exts * Preserve grammar order within an enum value * 1.4: Validate OpSelect over composites * Tools default to 1.4 * Add asm/dis test for OpCopyLogical * 1.4: asm/dis tests for PtrEqual, PtrNotEqual, PtrDiff * Basic asm/Dis test for OpCopyMemory * Test asm/dis OpCopyMemory with 2-memory access Add asm/dis tests for OpCopyMemorySized Requires grammar update to add second optional memory access operand to OpCopyMemory and OpCopyMemorySized * Validate one or two memory accesses on OpCopyMemory* * Check av/vis on CopyMemory source and target memory access This is a proposed rule. See https://gitlab.khronos.org/spirv/SPIR-V/issues/413 * Validate operation for OpSpecConstantOp * Validate NonWritable decoration Also permit NonWritable on members of UBO and SSBO. * SPIR-V 1.4: NonWrtiable can decorate Function and Private vars * Update optimizer CLI tests for SPIR-V 1.4 * Testing tools: Give expected SPIR-V version in message * SPIR-V 1.4 validation for entry point interfaces * Allow only unique interfaces * Allow all global variables * Check that all statically used global variables are listed * new tests * Add validation fixture CompileFailure * Add 1.4 validation for pointer comparisons * New tests * Validate with image operands SignExtend, ZeroExtend Since we don't actually know the image texel format, we can't fully validate. We need more context. But we can make sure we allow the new image operands in known-good cases. * Validate OpCopyLogical * Recursively checks subtypes * new tests * Add SPIR-V 1.4 tests for NoSignedWrap, NoUnsignedWrap * Allow scalar conditions in 1.4 with OpSelect * Allows scalar conditions with vector operands * new tests * Validate uniform id scope as an execution scope * Validate the values of memory and execution scopes are valid scope values * new test * Remove SPIR-V 1.4 Vulkan 1.0 environment * SPIR-V 1.4 requires Vulkan 1.1 * FIX: include string for spvLog * FIX: validate nonwritable * FIX: test case suite for member decorate string * FIX: test case for hlsl functionality1 * Validation test fixture: ease debugging * Use binary version for SPIR-V 1.4 specific features * Switch checks based on the SPIR-V version from the target environment to instead use the version from the binary * Moved header parsing into the ValidationState_t constructor (where version based features are set) * Added new versions of tests that assemble a 1.3 binary and validate a 1.4 environment * Fix test for update to SPIR-V 1.4 headers * Fix formatting * Ext inst lookup: Add Vulkan 1.1 env with SPIR-V 1.4 * Update spirv-val help * Operand version checks should use module version Use the module version instead of the target environment version. * Fix comment about two-access form of OpCopyMemory	2019-05-07 12:27:18 -04:00
Steven Perron	106c98d0fa	Validate sign of int types. (#2549 ) Fixes https://crbug.com/959011.	2019-05-06 13:05:31 -04:00
Steven Perron	6d04da22c6	Fix up type mismatches. (#2545 ) Add functionality to fix-storage-class so that it can fix up mismatched data types for pointers as well. Fixes bugs in when fixing up storage class. Move GenerateCopy to the Pass class to be reused. The spirv-opt change for #2535.	2019-05-02 09:31:46 -04:00
Ryan Harrison	c8b09744c6	Add validation specific to OpExecutionModeId (#2536 ) Fixes #1565	2019-05-01 13:29:39 -04:00
Ryan Harrison	a5da68d446	Remove stale comment (#2542 ) Fixes #1121	2019-05-01 10:56:39 -04:00
Steven Perron	32af42616a	Change implementation of post order CFG traversal (#2543 ) * Change implementation of post order CFG traversal It seems like the recursion is going very deep, and causing some problem is particular situations. I've reimplemented the CFG post order traversal to not use recursion. Fixes #2539.	2019-04-29 17:09:20 -04:00
Steven Perron	64faf6d9cb	Fix undefined bit shift in sroa. (#2532 ) There was a bit shift done on 32-bit values, but they should have been done on 64-bit values. This is fixed. At the same time, uses of size_t are repalaced by uint64_t to ensure these values are 64-bit. A test case cannot be created because the code that was change is not run at the moment since we do not split up vectors or matricies. I do not want to delete the code because I like to experitment with it every once in a while. Fixes #2528.	2019-04-26 12:52:23 -04:00
Ryan Harrison	b68af7ca8e	Add support for Private & Output to initializer decompose flag (#2537 ) Fixes #2388	2019-04-25 16:24:32 -04:00
Ryan Harrison	736376dbf9	Remove Acquire, Release, and Relaxed from allowed Mem Sem bits for WebGPU (#2526 ) Fixes #2524	2019-04-23 13:27:40 -04:00
alan-baker	07c4dd4b9e	Reduce runtime of array layout checks (#2534 ) Fixes #2533 * Stop checking layouts once the offset gets back to a 16 byte alignment	2019-04-23 10:33:00 -04:00
Ryan Harrison	7aad9653f9	Remove legacy utility functions (#2530 ) These are not called/referenced by anything, and are marked as being unused. They were brought to my attention by a coverity based bug report. Fixes #2537	2019-04-18 14:57:19 -04:00
Steven Perron	d754b70592	Shorten names of cmake targets (#2531 ) Window still had a limit of 260 chars for file paths. Visual C++ create directories and file names based on the cmake target names, so if they are too long, the windows build will fail. This is not a problem for spirv-tools on its own, but the files names currently go up to 220 characters for some spirv-tools files when built as part of VK-GL-CTS. This change will get it back down to 190, leaving more space for the directory that will contain VK-GL-CTS. This is fixing an issue reported against the VK-GL-CTS.	2019-04-18 13:22:28 -04:00
alan-baker	ac878fcbdd	Remove unreachable block validation (#2525 ) * Remove the check that blocks terminated by OpUnreachable are not statically reachable in the CFG * Updated tests	2019-04-17 18:21:19 -04:00
Ryan Harrison	21712068fe	Validate that SPIR-V binary is encoded as little endian for WebGPU (#2523 ) Fixes #2522	2019-04-17 12:44:54 -04:00
Ryan Harrison	3aad3e9228	Change validation of memory semantics for OpAtomics* in WebGPU (#2519 ) Recent change to the spec restricted the valid values for Memory Semantics in OpAtomics* in the WebGPU env. Implementing enforcing these changes. Fixes #2499	2019-04-16 14:49:07 -04:00
Ryan Harrison	048dcd38ce	Implement WebGPU->Vulkan initializer conversion for 'Function' variables (#2513 ) WebGPU requires certain variables to be initialized, whereas there are known issues with using initializers in Vulkan. This PR is the first of three implementing a pass to decompose initialized variables into a variable declaration followed by a store. This has been broken up into multiple PRs, because there 3 distinct cases that need to be handled, which require separate implementations. This first PR implements the basic infrastructure that is needed, and handling of Function storage class variables. Private and Output will be handled in future PRs. This is part of resolving #2388	2019-04-16 14:31:36 -04:00
Paul Thomson	3335c61147	reduce: Add two branch reduction passes (#2507 ) * Fix #2320. `conditional_branch_to_simple_conditional_branch` reduction pass changes conditional branches so both targets point to the same block id (creating a "simple" conditional branch). * Fix #2501. `simple_conditional_branch_to_branch` reduction pass changes "simple" conditional branches to branches. * Fix #2503. `conditional_branch_to_simple_conditional_branch` proper handling of back-edges.	2019-04-15 19:54:36 +01:00
Ryan Harrison	102e430a88	Add pass to legalize OpVectorShuffle for WebGPU (#2509 ) In WebGPU, the component operand 0xFFFFFFFF is forbidden, but in Vulkan it is used to indicate a value is undefined. When converting to WebGPU, 0xFFFFFFFF needs to converted to a legal value, though the specific one does not matter, since it was used to indicate an undefined entry in the original code. Choosing to use 0, since the operands are required to be on [0, N-1], so 0 is guaranteed to always be valid. Fixes #2349	2019-04-12 12:14:23 -04:00
alan-baker	98b3f26c2f	Gate formatless checks on Vulkan env (#2486 ) Fixes #2470 * Only require the WithoutFormat capabilities for Unknown image reads and writes in the Vulkan environment update tests and add new vulkan specific tests	2019-04-11 16:39:50 -04:00
Steven Perron	9047de51cb	Accept OpBitCast in fix storage class. (#2505 ) Fixes http://crbug.com/950889.	2019-04-09 14:10:35 -04:00
Paul Thomson	d90aae9a5a	reduce: miscellaneous fixes (#2494 ) * Fix .gitignore * Add missing reduction pass: RemoveBlockReductionOpportunityFinder * Add DumpShader functions in test_reduce for debugging * Add DumpShader functions in spirv-reduce for debugging * Fix include style * Don't use "using namespace"	2019-04-08 19:37:17 +01:00
Steven Perron	7ce37d66a8	Fix use of Logf to avoid format security warning (#2498 ) When -Wformat-security is enabled, we are getting an error. I do not claim to fully understand when the warning is triggered or not, but this one can be avoided by calling "Log" instead of "Logf" because the formating string is not needed.	2019-04-08 11:06:48 -04:00
Ryan Harrison	0cb2d4079e	Add WebGPU->Vulkan and Vulkan->WebGPU flags in spirv-opt (#2496 ) Renames the existing flag '--webgpu-mode' to '--vulkan-to-webgpu' for the Vulkan->WebGPU operation, and adds a new flag '--webgpu-to-vulkan' for the WebGPU->Vulkan operation. Currently '--webgpu-to-vulkan' doesn't have any passes associated with it yet, but further patches will implement them. Fixes #2495	2019-04-05 15:12:26 -04:00
JasperNV	9766b22b33	spirv-opt: Behave a bit better in the face of unknown instructions (#2487 ) * opt/ir_loader: Don't silently drop unknown instructions on the floor Currently, if spirv-opt sees an instruction it does not know, it will silently ignore it and move to the next one. This changes it to be an error, as dropping it on the floor is likely to generate invalid SPIR-V output. * opt/optimizer: Complain a bit louder for unexpected binary changes If a binary change happens despite a pass saying that the binaries should be identical, this is indicative of a bug in the pass itself. This does not change behavior for it to be an error, but simply emits a warning in this case.	2019-04-05 13:36:42 -04:00
Steven Perron	3a0bc9e724	Add fix storage class code. (#2434 ) This pass tries to fix validation error due to a mismatch of storage classes in instructions. There is no guarantee that all such error will be fixed, and it is possible that in fixing these errors, it could lead to other errors. Fixes #2430.	2019-04-05 13:12:08 -04:00
alan-baker	236bdc0065	Change prioritization of unreachable merge and continue (#2460 ) Fixes #2452 Swaps priority of handling unreachable merge and continues so that the back-edge is retained in the case a block is both a loop continue and loop merge	2019-04-03 12:50:08 -04:00
Steven Perron	12e4a7b649	Handle variable pointer in some optimizations (#2490 ) * Check var pointer capability in ADCE. * Check var ptr capability for common uniform. * Check var ptr capability in access chain convert. Since we want this pass to run even if there are variable pointer on storage buffers, we had to remove asserts that assumed there were no variable pointers. The functions with the asserts will now work, it becomes the responsibility of the callers to deal with the output as appropriate. * Single block elimination and variable pointers. It seems like the code in local single block elimination is able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. * Single store elimination and variable pointers. It seems like the code in local single stroe elimination is able to handle cases with variable pointers already. This is because the function `FindSingleStoreAndCheckUses` ensures that variables that feed a variable pointer are not candidates. * SSA rewriter and variable pointers. It seems like the code in the two passes that call the SSA rewriter are able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. Fixes #2458.	2019-04-03 12:47:51 -04:00
Ryan Harrison	01964e325f	Add pass to generate needed initializers for WebGPU (#2481 ) Fixes #2387	2019-04-03 11:44:09 -04:00
alan-baker	4bd106b089	Handle dead infinite loops in DCE (#2471 ) Fixes #2456 * When eliminating a structured construct that has an unreachable merge, replace that unreachable terminator with an appropriate return * New tests	2019-04-03 10:30:12 -04:00
alan-baker	8129cf2f99	Remove merge assert in block calculation (#2489 ) Fixes #2488 * Validator doesn't identify back-edge of the loop, so the merge is never set * Construct::blocks() has safe uses of `merge` so the assert can be removed * Added a test	2019-04-02 14:37:05 -04:00
Paul Thomson	e2ddb9371e	reduce: add remove_selection_reduction_opportunity (#2485 ) Fix #2484	2019-04-02 16:50:15 +01:00
alan-baker	c9874e5090	Fix merge return in the face of breaks (#2466 ) Fixes #2453 * Enable addition of OpPhi instructions when the loop has multiple predecessors of the merge due to a break * This can result in some values no longer dominating their uses * Track return blocks in structured flow to produce OpPhis that have multiple undef and non-undef arguments * New tests to catch the bug * When a block is predicated, mark the new body as a return if the old block as already a return	2019-04-02 10:05:28 -04:00
alan-baker	0300a464a4	Maintain inst to block mapping in merge return (#2469 ) Fixes #2455 Properly maintains instruction to block mapping for newly created phi instructions in merge return	2019-04-01 13:14:10 -04:00
alan-baker	320a7de5c9	Validate that OpUnreacahble is not statically reachable (#2473 ) * Adds a validator check that ensures no block reachable from the entry block is terminated by OpUnreachable * Updated tests * Added new tests	2019-03-29 10:49:37 -04:00
Paul Thomson	fcb8453104	reduce: fix loop to selection pass for loops with combined header/continue block (#2480 ) * Fix #2478. The fix is to just not try to simplify such loops. * Also added `BasicBlock::MergeBlockId()` and `BasicBlock::ContinueBlockId()`. * Some minor changes to `structured_loop_to_selection_reduction_opportunity.cpp`. * Added test.	2019-03-29 11:29:24 +00:00
alan-baker	2ff54e34ed	Handle function decls in Structured CFG analysis (#2474 ) Fixes #2451 * Structured cfg analysis now handles functions with no basic blocks * Added a test	2019-03-26 14:39:16 -04:00
alan-baker	42e6f1aa62	Add option to validate after each pass (#2462 ) * New command-line option to opt: --validate-after-all * Pass manager will validate after each pass it runs	2019-03-26 14:38:59 -04:00
Paul Thomson	fb0753640a	reduce: fix loop to selection dominance query (#2477 ) Fix #2457	2019-03-26 16:37:08 +00:00
Paul Thomson	7d1b176c1d	Improve reducer algorithm and other changes (#2472 ) Fix #2475. Fix #2476. * Improve reducer algorithm: shrink granularity, remove an early return, no lazy initialization, notify pass if binary is interesting, add comments. * Add fail-on-validation-error option to fail a reduction if an invalid state is reached; useful for tests. * Set fail-on-validation-error in tests. * Improve some documentation comments. * Add Reducer::AddDefaultReductionPasses so tests (and other library consumers) can add the default reduction passes. * Add CLIMessageConsumer in test_reduce so we can see messages for tricky tests. * Remove test RemoveUnreferencedInstructionReductionPassTest_ApplyReduction because it was indirectly testing the reduction algorithm, not the RemoveUnreferencedInstruction pass. * Tweak tests where needed.	2019-03-26 13:22:31 +00:00
Ryan Harrison	ffbecae56a	Check OpSampledImage is only passed into valid instructions (#2467 ) Fixes #1528	2019-03-25 15:44:57 -04:00
Paul Thomson	2d52cbee49	Add some val options to reduce (#2401 ) Fix #2396 * Check that initial state is valid. Add kInitialStateInvalid. * Fix RemoveOpnameAndRemoveUnreferenced test; turns out the original shader is invalid, but we never notice because we don't check this and the reduced shader is valid; fix original shader. Assert reduction status is kComplete. * Always check return value from `Reducer::Run`. * Change Reducer::Run to not immediately copy the input binary.	2019-03-21 14:28:06 +00:00
Paul Thomson	1f60f98964	reduce: remove unreferenced blocks pass (#2398 )	2019-03-21 13:32:21 +00:00
Ryan Harrison	08b54d9e45	Convert sampled consumers to being Instructions instead of IDs (#2464 ) Changing the stored value for a sampled image consumer to be the instruction instead of result ID, since not all instructions have result IDs. Using result IDs led to a potential crash when using OpReturnValue, which doesn't have a result ID. OpReturnValue is not a legal consumer, but the validator needs to look at the instruction to determine this, thus storing the pointer to the instruction, instead of trying to fetch the pointer using the instruction. Issue #1528 covers fixing the check. Fixes #2463	2019-03-19 12:39:37 -04:00
greg-lunarg	e1a76269b6	Bindless Validation: Descriptor Initialization Check (#2419 ) If SPV_EXT_descriptor_indexing is enabled, add check that for a descriptor-based reference, the descriptor is initialized. Initialization data is stored in the debug input buffer, added to the length information already there. This feature must be seperately enabled on the pass creation routine. NOTE: Currently just supports image references; buffer references are still TODO.	2019-03-19 09:53:43 -04:00
Alan Baker	9244e6ff62	Reverting commit `da5a780ff9`	2019-03-18 15:14:41 -04:00
SarahM0	da5a780ff9	Variable pointers cannot be an operand to OpArrayLength	2019-03-18 14:07:36 -04:00
Ryan Harrison	e545522146	Add --strip-atomic-counter-memory (#2413 ) Adds an optimization pass to remove usages of AtomicCounterMemory bit. This bit is ignored in Vulkan environments and outright forbidden in WebGPU ones. Fixes #2242	2019-03-14 13:34:33 -04:00
alan-baker	bdcb155163	Relax function call parameter check (#2448 ) Fixes #2447 * Allow sub-objects for UniformConstant storage class * Updated tests	2019-03-14 12:45:31 -04:00
Steven Perron	5186ffedb3	Remove duplicates from list of interface IDs in OpEntryPoint instruction (#2449 ) * Remove duplicates from list of interface IDs in OpEntryPoint instruction Fixes #2002.	2019-03-13 15:46:31 -04:00
Ryan Harrison	6df8a917a4	Add validation of storage classes for WebGPU (#2446 ) Fixes #2445	2019-03-13 13:01:25 -04:00
Jaebaek Seo	a5c06c903c	Validator: no Storage comparison for pointer param (#2428 ) If relax-logical-pointer is enabled, this commit makes Validator accept function param even when its Storage Class is different from the expected one. Related to #2423, #2430	2019-03-13 12:25:24 -04:00
Steven Perron	9d29c37ac5	Removing decorations when doing constant propagation. (#2444 ) In constant propagation, decoration are transfered from the original expression to the constant that will replace it. This can be wrong because there are no decorations that apply to constants. We choose to simply delete the decorations. Fixes #2441	2019-03-13 10:40:49 -04:00
Ryan Harrison	b75f4362f0	Add validation for ExecutionMode in WebGPU (#2443 ) Fixes #2437	2019-03-12 14:50:25 -04:00
Ryan Harrison	b1ff15f220	Add missing DepthGreater case to Fragment only check (#2440 ) Fixes #2439	2019-03-12 11:27:40 -04:00
Ryan Harrison	b12e7338ee	Implement WebGPU specific CFG validation (#2386 ) In WebGPU all blocks are required to be reachable, unless they are one of two specific degenerate cases for merge-block or continue-target. This PR adds in checking for these conditions. Fixes #2068	2019-03-08 13:01:09 -05:00
Ehsan	5fb83a9708	Allow NonWritable to target struct members. (#2420 ) It should be allowed for the NonWritable decoration to be applied to structure type members.	2019-02-27 16:11:50 -05:00
Steven Perron	32b0f6739f	Use correct option in spvTextToBinary. (#2416 ) Fixes #2414.	2019-02-26 16:52:33 -05:00
Steven Perron	d800bbbac9	Handle back edges better in dead branch elim. (#2417 ) * Handle back edges better in dead branch elim. Loop header must have exactly one back edge. Sometimes the branch with the back edge can be folded. However, it should not be folded if it removes the back edge. The code to check this simply avoids folding the branch in the continue block. That needs to be changed to not fold the back edge, wherever it is. At the same time, the branch can be folded if it folds to a branch to the header, because the back edge will still exist. Fixes #2391.	2019-02-26 09:06:51 -05:00
Jeff Bolz	002ef361ca	Add validation for SPV_NV_cooperative_matrix (#2404 )	2019-02-25 17:43:11 -05:00
Sarah	fc3897b5f5	Validate: (data) Block can't appear within a Block (#2410 ) A Block or BufferBlock cannot be nested within another Block or BufferBlock	2019-02-25 10:37:43 -05:00
François Bertel	37b584a736	Fixed undefined reference to 'clock_gettime' by linking rt library (#2409 )	2019-02-25 08:57:34 -05:00
Steven Perron	a006cbc1d0	Non memory object as parameters. (#2415 ) In relaxed addressing mode, we want to accept non memory objects because this is a very natural translation of hlsl. It should be fixed by legalization by inlining the calls.	2019-02-22 12:51:22 -05:00
Sarah	4c43afcade	It is invalid to apply both Restrict and Aliased to the same <id> (#2408 ) to fix #2408 - It is invalid to apply both Restrict and Aliased to the same	2019-02-21 12:03:52 -05:00
Steven Perron	fde69dcd80	Fix OpDot folding of half float vectors. (#2411 ) * Fix OpDot folding of half float vectors. The code that folds OpDot does not handle half floats correctly. After trying to multiple the first components, we get a nullptr because we don't fold half float values. This nullptr gets passed to the code that does the addition, and causes an assert. Fixes #2405.	2019-02-20 20:05:08 -05:00
Steven Perron	8eddde2e70	Don't change type of input and output var in dead member elim (#2412 ) The types of input and output variables must match for the pipeline. We cannot see the uses in all of the shader, so dead member elimination cannot safely change the type of input and output variables.	2019-02-20 18:59:41 -05:00
Sarah	76730a46a1	In Vulkan, disallow BufferBlock on StorageBuffer variables (#2380 ) To fix #2168.	2019-02-20 11:50:57 -05:00
greg-lunarg	2f84b5de9a	Bindless: Fix computation of set and binding for runtime bounds check (#2384 ) Also fix test to use non-zero set and binding which will make error more obvious.	2019-02-19 11:43:30 -05:00
dan sinclair	528fea2b1e	Fixup unused variables (#2402 )	2019-02-19 11:11:04 -05:00
Steven Perron	78ac954c41	Mark type id of unknown instructions at fully used. (#2399 )	2019-02-15 10:49:49 -05:00
greg-lunarg	9540f2d981	Instrumentation: Fix instruction index when multiple functions (#2389 )	2019-02-15 09:49:18 -05:00
Steven Perron	1b0047f210	Add pass to remove dead members. (#2379 ) Add a pass that looks for members of structs whose values do not affects the output of the shader. Those members are then removed and just treated like padding in the struct.	2019-02-14 13:42:35 -05:00
Ryan Harrison	0167a20b0a	Move usage detection to after all instructions are registered (#2378 ) This is required to properly handle uses of forward declared ids. Since forward declared ids were not being properly covered by the validator this uncovered a bunch of small issues that needed to be resolved to get tests passing again. Fixes #2373	2019-02-13 14:06:56 -05:00
alan-baker	354205b3dc	Don't merge unreachable blocks (#2375 ) Fixes #2374 * Block merging no longer merges unreachable blocks into their successors * added a test	2019-02-12 09:24:01 -05:00
Paul Thomson	40a7940e05	Fix merge blocks opportunity to check if still enabled (#2370 ) Fix MergeBlocksReductionOpportunity so it checks whether it is still enabled Fixes #2369. Added tests.	2019-02-11 16:26:37 -05:00
Ryan Harrison	12b3d7e9d6	Add strip-debug to webgpu-mode passes (#2368 ) Fixes #2366	2019-02-08 14:26:17 -05:00
Alastair Donaldson	34c5ac614c	Fixes #2358 . Added to the reducer the ability to remove a function t… (#2361 ) * Fixes #2358. Added to the reducer the ability to remove a function that is not directly called. Factored out some code from the optimizer to help with this.	2019-02-08 16:20:29 +00:00
dan sinclair	39bfb6b978	Make spvParseTargetEnv public (#2362 ) This CL moves the method to parse the SPIRV environment into the public headers. This will allow other applications to re-use the same parsing logic.	2019-02-07 14:49:15 -05:00
greg-lunarg	cf21146137	Expand bindless bounds checking to runtime-sized descriptor arrays (#2316 )	2019-02-07 14:00:36 -05:00
alan-baker	9b6ba4d1c5	Allow arrayed storage images for NonWritable decoration (#2358 ) Fixes #2354 * Storage image pointer registration allows optional level of arraying * Added a test	2019-02-06 15:20:19 -05:00
alan-baker	117a1fd11f	Validate variable pointer related function call rules (#2270 ) Fixes #2105 * Check storage class validity * Check memory object declaration validity	2019-02-06 14:10:40 -05:00
Ryan Harrison	0f4bf0720a	Add flatten-decorations flag to webgpu-mode flags (#2348 ) Fixes #2272	2019-02-05 14:07:53 -05:00
Alastair Donaldson	37861ac106	Merge blocks in reducer (#2353 ) Fixes #2120 Enhanced the reducer so that it can merge blocks together, leveraging the functionality extracted from the block_merge pass in the optimizer.	2019-02-01 14:56:54 +00:00
Ryan Harrison	846d12afed	Add whitelist for decorations in WebGPU (#2346 ) Fixes #2273	2019-01-31 16:25:46 -05:00
alan-baker	63e032f910	Remove unused lambda capture (#2350 )	2019-01-31 15:57:45 -05:00
Alastair Donaldson	3b6fee3dae	Fixes #2338 . Added functionality to remove OpPhi instructions (replacing their uses) when merging blocks (#2339 ) * Fixes #2338. Added check for phi node before merging blocks. * Added functionality to merge blocks A and B even when B starts with OpPhi instructions, by replacing uses of the OpPhi results with the definitions coming from A. Added some tests for this. * Fixed assertion.	2019-01-31 09:36:05 -05:00
Ryan Harrison	2acbf488b8	Add WebGPU specific validation for WorkgroupSize BuiltIn decoration (#2334 ) Part of resolving #2276	2019-01-30 17:01:17 -05:00
Ryan Harrison	e2f4622627	Add WebGPU specific validation for multiple BuiltIn decorations (#2333 ) Covers NumWorkgroups, LocalInvocationId & GlobalInvocationId Part of resolving #2276	2019-01-30 17:00:58 -05:00
Ryan Harrison	3d2afb78c2	Add whitelist of allowed BuiltIn decorations for WebGPU (#2337 ) Part of resolving #2276	2019-01-30 15:46:02 -05:00
Ryan Harrison	d17fcf8abd	Add WebGPU validation for LocalInvocationIndex BuiltIn decoration (#2335 ) Part of resolving #2276	2019-01-30 15:45:31 -05:00
Ryan Harrison	837153ccdd	Add WebGPU specific validation for FragDepth BuiltIn decoration (#2332 ) Part of resolving #2276	2019-01-30 15:27:04 -05:00
Ryan Harrison	0c14583f15	Add WebGPU specific validation for FragCoord BuiltIn decoration (#2331 ) Part of resolving #2276	2019-01-30 14:53:43 -05:00
Ryan Harrison	b6698e0d83	Add WebGPU specific validation for FrontFacing BuiltIn decoration (#2330 ) Part of resolving #2276	2019-01-30 14:48:43 -05:00
Ryan Harrison	734def1447	Add WebGPU specific validation for InstanceIndex BuiltIn decoration (#2329 ) Part of resolving #2276	2019-01-30 14:20:55 -05:00
Ryan Harrison	b947ecfe79	Add WebGPU specific validation for VertexIndex BuiltIn decoration (#2328 ) Part of resolving #2276	2019-01-30 12:22:30 -05:00
David Neto	7f3679a8b6	Validate NonWritable decoration (#2263 ) Also permit NonWritable on members of structs used for UBO and SSBO. (That seems inadvertently removed in recent revisions of the spec.)	2019-01-28 12:44:13 -08:00
Steven Perron	9ab1c0ddd0	Remove code sinking for -O. (#2340 ) Community feedback says it is not generaly benificial, so we will remove it from the standard optimization set.	2019-01-28 11:50:50 -05:00
Alastair Donaldson	98c67d3850	Fixed names in ifdefs and GetName functions that had been forgotten in a previous refactoring. Also shortened names of test files as those files test both the new 'finder' classes introduced in the refactoring, as well as the 'reduction pass' class; the shorter names capture both. (#2336 )	2019-01-25 11:37:03 -05:00
Alastair Donaldson	3345fe6a9d	Extracted block merging functionality into its own utility file (#2325 ) * Extracted useful functionality from block merger and exposed it as stand-alone methods. * Separated these methods into a utility file.	2019-01-25 10:57:13 +00:00
alan-baker	cf011f9901	More layout check fixes (#2315 ) * check array strides for multidimensional arrays * check layouts of structs in arrays for multiple indices * new tests	2019-01-24 14:24:31 -08:00
Steven Perron	e2279da714	Remove the static maps from CheckDecorationsCompatibility (#2327 ) * Remove the static maps from CheckDecorationsCompatibility There are a few data structures in the function `CheckDecorationsCompatibility` that are allocated using `new` and their address is stored in a static pointer. This code pattern causes the MSVC memory leak checker to say there is a memory leak. Some people are interested in keeping that clean. To work around it, I have replaced them with either a function or an array of POD types. The array can be kept as a static directly because it has a trivial destructor, and we don't have to worry about it being destroyed too early. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/2317.	2019-01-24 14:50:58 -05:00
JasperNV	8915a7c8f1	spirv-val: Emit an error when an OpSwitch target is not an OpLabel (#2298 ) Fixes #1628. * spirv-val: Emit an error when an OpBranch target is not an OpLabel	2019-01-24 12:11:49 -05:00
Ryan Harrison	1e3c589a6d	Add WebGPU specific validation for Position BuiltIn decoration (#2309 ) This CL adds in the specific checks required for WebGPU, enables running the builtin checks for WebGPU, and refactors the existing testing infrastructure to support testing the new checks. This PR is part of resolving #2276	2019-01-24 12:08:25 -05:00
fjhenigman	20b2e2b9f5	Add SpirvTools::IsValid(). (#2326 ) * Add SpirvTools::IsValid(). Add a method to determine if a SpirvTools object was successfully constructed and can be used. It might not be depending on the parameter to the constructor. This is something a fuzzer wants to know before trying to use an SpirvTools object constructed with a fuzzed parameter.	2019-01-24 09:45:09 -05:00
Alastair Donaldson	86d0d9be25	Refactored reducer so that the 'finding' functionality of a reduction pass are separated from the generic functionality for tracking progress of a pass. With this change, we now have a ReductionOpportunityFinder abstract class, with many subclasses for each type of reduction, and just one ReductionPass class, which has an associated finder. (#2321 ) Sounds good.	2019-01-23 17:07:58 -05:00
Ryan Harrison	b1be6763f6	Add helper for 'is Vulkan or WebGPU' (#2324 ) Fixes #2323	2019-01-23 13:07:03 -05:00
David Neto	4a405eda53	Fix layout checks for nested struct in relaxed layout; and descriptor arrays (#2312 ) * Fixed layout checks for nested structures Fixes #2303 * Incoming offsets accumulate through nested structures * Check layouts through arrays * Perform layout checks in the presence of descriptor arrays (and runtime arrays) * Fix formatting	2019-01-22 15:15:24 -08:00
Ryan Harrison	3a3ad2ec50	Add utility to generate a logging string for a given environment (#2314 ) Fixes #2313	2019-01-22 15:18:14 -05:00
greg-lunarg	a64c651e18	Fix Constants Analyses bug inserted by #2302 (#2306 ) Need to also remove Constants from the valid_analyses set when invalidated, otherwise Constants is not reinitialized before used.	2019-01-21 12:34:12 -05:00
Steven Perron	eab06d669e	Check forward reference in OpTypeArray. (#2307 ) In a recent PR, we allowed a forward reference for the element type in an array declaration. However, we do not have other check to make sure the forward reference is a pointer type first reference in OpTypeForawrdPointer. We add that check. Fixes https://crbug.com/920074.	2019-01-21 12:10:25 -05:00
Steven Perron	8df947d2d6	Handle instructions not in blocks in code sinking. (#2308 ) When looking at the uses of the result of an instruction, code sinking assumes that all uses are in a basic block. However, this is not true if there is a decoration or name for the result of that insturction. This commit checks for this. Fixes https://crbug.com/923243.	2019-01-21 12:09:56 -05:00
greg-lunarg	d14db341b8	Invalidate ConstantManager if TypeManager is invalidated... (#2302 ) ...as the ConstantManager contains pointers into the TypeManager.	2019-01-18 15:49:00 -05:00
Steven Perron	d6c067630d	Handle extract with no index in VDCE. (#2305 ) It is legal, but not generated by any SPIR-V producer: an OpCompositeExtract with no indexes. This is essentially just a copy of the object, so we treat them that way. We simply propagate the live variables of the result to the operand. Fixes https://crbug.com/919181.	2019-01-18 15:43:36 -05:00
Steven Perron	81fb2649bf	Handle access chain with no index in SROA. (#2304 ) It is legal, but not generated by any SPIR-V producer: an OpAccessChain with no indexes. This is essentially just a copy of the pointer. I have decided to treat it like an OpCopyObject. In CheckUses, we return that it is not okay. When looking at this I realized that we had code in GetUsedComponents that cannot be reached. If there is a use in an OpCopyObject the it will not call GetUsedComponents. I removed that dead code. Fixes https://crbug.com/918311.	2019-01-18 14:19:43 -05:00
Steven Perron	213e15e100	Fix overflow when negating INT_MIN. (#2293 ) When doing (-INT_MIN) is considered overflow, so we cannot fold it by actually performing the negation. Fixes https://crbug.com/917991	2019-01-17 17:01:55 -05:00
Steven Perron	99c2c21cf4	Fix memory leak in unrolling. (#2301 ) During unrolling a new loop is created, but its ownership is not clear as it gets passed through the code. Changed something to unique_ptr to make that clearer. Fixes #2299. Fixing other memory leaks at the same time. Fixes #2296 Fixes #2297	2019-01-17 16:02:43 -05:00
Steven Perron	dd4157dcee	Sink (#2284 ) Add code sinking pass. It will move OpLoad and OpAccessChain instructions as close as possible to their uses. Part of #1611.	2019-01-17 15:56:36 -05:00
Ryan Harrison	7577415cc7	Add in WebGPU specific memory scope validation (#2288 ) Fixes #2278	2019-01-17 10:39:35 -05:00
Ryan Harrison	b6150e5170	Add WebGPU specific RTA validation rules (#2287 ) Fixes #2066	2019-01-17 10:39:12 -05:00
greg-lunarg	8d2d66f30c	Fix vertex instrumentation to use VertexIndex and InstanceIndex (#2294 ) ...instead of VertexId and InstanceId	2019-01-16 18:02:07 -05:00
Steven Perron	49b5b0abc6	Fix up bit shifts by 32. (#2292 ) In C++, a bit shift of the same size as the type is undefined, but it is defined in spir-v. When folding those cases, we have to be careful. We cannot simply do the shift in C++. Fixes https://crbug.com/917697.	2019-01-16 15:52:23 -05:00
greg-lunarg	83bfdc976a	Instrumentation: Add ArrayStride decoration to debug output buffer array (#2290 )	2019-01-16 10:01:40 -05:00
Ryan Harrison	cb27ffdcd8	Ensure that required storage classes have initializer for WebGPU (#2285 ) Fixes #2279	2019-01-15 10:24:58 -05:00
Ryan Harrison	9d8534e329	Enforce rules for OpTypeRuntimeArray on Vulkan (#2191 ) Fixes #1936	2019-01-14 16:44:44 -05:00
Ryan Harrison	68f2af9f7d	Removing unused const version of id_decorations (#2283 ) Fixes #2282	2019-01-14 13:52:50 -05:00
Ryan Harrison	16a0da370b	Ensure that entry point names are unique for WebGPU (#2281 ) Fixes #2275	2019-01-14 13:52:28 -05:00
David Neto	6958d11bc2	Validate decorations from SPV_KHR_no_integer_wrap (#2271 ) Validates NoSignedWrap, NoUnsignedWrap. We are permissive by allowing any extended instruction.	2019-01-09 10:36:17 -05:00
David Neto	df5bd2d05a	Permit UConvert spec-constant op for SPV_AMD_gpu_shader_int16 (#2264 ) See https://github.com/KhronosGroup/glslang/issues/848	2019-01-08 19:00:18 -05:00
Jeff Bolz	5eab6df648	SPV_EXT_physical_storage_buffer (#2267 )	2019-01-07 13:19:24 -05:00
alan-baker	06c9dc07bd	Upgrade modf and frexp (#2266 ) Fixes #2138 * Modf and frexp are upgraded to use the struct version of the instruction and generate an explicit store whose flags can be upgraded separately * Fixed major bug where availability and visibility were reversed for non-copy memory instructions * Fixed bug where availability and visibility scope operands were reversed for copy memory * Upgraded all opt tests to use SPV_ENV_UNIVERSAL_1_3 * Upgrade tests moved into unified tests and removed standalone test	2019-01-07 12:36:38 -05:00
David Neto	a87d3ce48e	Validate operation for OpSpecConstantOp (#2260 )	2019-01-03 14:28:00 -05:00
alan-baker	a900bacb58	Broader check for ids that require a type (#2259 ) Broader check for ids that require a type Fixes https://crbug.com/911700 * Adds a broader check for when id operands require a type * updated a few tests * added a test to catch the original issue	2019-01-03 13:55:43 -05:00
Steven Perron	241644a5a3	Have replace load size handle extact with no index. (#2261 ) Fixes https://crbug.com/917774	2019-01-03 13:02:10 -05:00
Steven Perron	9f36c8bb72	Handle CompositeInsert with no indices in VDCE (#2258 ) * Handle CompositeInsert with no indices in VDCE In the spec, there it nothing that forces an OpCompositeInsert to have an index, but VDCE assumes there is at least 1 in a couple places. This commit updates VDCE to handle these cases.	2019-01-02 14:00:04 -05:00
kholtnv	980ae1d1cd	Added NVIDIA ray tracing storage classes in ValidateVariable. (#2254 ) * Added additional changes for the new AccelerationStructureNV type. * Added NVIDIA ray tracing storage classes for checking in ValidateVariable. * For NVIDIA ray tracing storage classes added test to load bool type (allowed) in new storage class.	2018-12-27 15:08:11 -05:00
dan sinclair	167f1270a9	Output disassembly line number for binary parse errors. (#2195 ) This Cl changes the binary parser to keep track of the instruction count being processed. The parser will then use that instruction number as the error number, instead of the binary word. This should make it easier to match the error up to what the disassembler would output for the error. Issue #2091	2018-12-21 16:24:15 -05:00
Steven Perron	bdc2ab9356	In LICM don't place code between merge instruction and branch. (#2252 ) Fixes #2210.	2018-12-20 18:33:52 -05:00
Steven Perron	5e19d3febc	Add custom target to wrap around custom commands. (#2198 ) In CMake, we are not suppose to have multiple targets depend on the same custom command. To avoid this, we have to add a custom target around the command. Then we have add the appropriate dependencies. Fixes #1941.	2018-12-20 20:02:53 +00:00
Steven Perron	c2013e248b	Make the constant and type manager analyses. (#2250 ) Currently it is impossible to invalidate the constnat and type manager. However, the compact ids pass changes the ids for the types and constants, which makes them invalid. This change will make them analyses that have to been explicitly marked as preserved by passes. This will allow compact ids to invalidate them. Fixes #2220.	2018-12-20 18:00:05 +00:00
kholtnv	e49bd96f2c	Added additional changes for the new AccelerationStructureNV type. (#2218 ) * Added additional changes for the new AccelerationStructureNV type. * Added additional changes for the new AccelerationStructureNV type. Change tabs to space... * Added additional changes for the new accelerationStructureNV type -- add proper type name. Fix TypeManager.TypeStrings test: [----------] 29 tests from TypeManager [ RUN ] TypeManager.TypeStrings [ OK ] TypeManager.TypeStrings (7 ms)	2018-12-19 21:42:39 +00:00
Steven Perron	68b69e16aa	Update the continue target in merge return. (#2249 ) When we are predicating the continue target for a loop, it can no longer be the continue target because it will have a branch that exits the loop and is not the bach edge. The continue target will have to be the target of that branch that is still in the loop. Fixes #2211.	2018-12-19 21:24:49 +00:00
Steven Perron	ac7feace90	Fix missing OpPhi after merge return. (#2248 ) The function `UpdatePhiNodes` was being called inconsistently. In one case, the cfg had already been updated to include the new edge, and in another place the cfg was not updated. This caused the function to miss flagging a block as needing new phi nodes. I picked that the cfg should not be updated before making the call. I documented it, and change the call sites to match. Fixes #2207.	2018-12-19 18:17:42 +00:00
Steven Perron	9d04f82bef	Ensure SROA gets the correct pointer type. (#2247 ) We initially assumed that if the type manager returned the correct id for the pointee type, that we would get the correct pointer type back, but that is not true. See the unit test added with this commit. We need to fall back to the linear search any time we are looking for a pointer to a type that may not be unique. At the same time, SROA considered an OpName on a variable to be a use of the entire variable. That has been fixed. Fixes #2209.	2018-12-19 17:07:29 +00:00
Steven Perron	9e81c337f9	Place load after OpPhi instructions in block. (#2246 ) We currently place the load instructions at the start of the basic block that dominates all of the loads. If that basic block contains OpPhi instructions, then this will generate invalid code. We just need to search for a location that comes after all of the OpPhi instructions. Fixes #2204.	2018-12-19 15:18:22 +00:00
Paul Thomson	71aa48f91d	spirv-reduce: add OperandToUndefReductionPass (#2200 ) * Add OperandToUndefReductionPass. Fixes #2115. Also added some tests that are similar to those in OperandToConstantReductionPassTest. In addition, refactor FindOrCreateGlobalUndef into reduction_util.cpp. Fixes #2184. Removed many documentation comments that were identical or very similar to the overridden function's documentation comment.	2018-12-19 13:25:56 +00:00
Steven Perron	5ec2d1a8cd	Don't fold specialized branches in loop unswitch (#2245 ) * Don't fold specialized branchs in loop unswitch Folding branches can have a lot of special cases, and can be a little error prone. So I only want it in one place. That will be in dead branch elimination. I will change loop unswitching to set the branches that were being folded to have a constant condition. Then subsequent pass of dead branch elimination will be able to remove the code. At the same time, I added a check that loop unswitching will not unswitch a branch with a constant condition. It is not useful to do it because dead branch elimination will simple fold the branch anyway. Also it avoid an infinite loop that would other wise be introduced by my first change. Fixes #2203.	2018-12-19 04:40:30 +00:00
Ryan Harrison	47c08a79c4	Implement initial --webgpu-mode flag (#2217 ) Fixes #2166	2018-12-18 15:10:34 -05:00
Steven Perron	acd2781952	Handle id overflow in inlining. (#2196 ) Have inlining return Failure if the ids overflow. Part of #1841.	2018-12-18 19:34:03 +00:00
Ryan Harrison	7f57887e05	Remove check for SpvCapabilityAtomicStorage (#2243 ) Per conversation on https://github.com/KhronosGroup/glslang/issues/1618 and other places.	2018-12-18 13:34:30 -05:00
Steven Perron	1254335d13	Don't unswitch the latch block. (#2205 ) Loop unswitching is unswitching the conditional branch that creates the back-edge. In the version of the loop, where the bachedge is not taken, there is no back-edge. This is what causes the validator to complain. The solution I will go with will be to now unswitch a condition with a back-edge. At this time we do not now if loop unswitching is used. We do not include it in the optimization sets provided, nor is it used in glslang's set. When there are opportunities and no breaks from the loop, the loop with either be a single iteration loop, or an infinite loop. There is no performance advantage to performing loop unswitching in either of those cases. If there is a break, maintaining structured control flow will be tricky. Unless we see a clear advantage to handling these case, I would go with the safer simpler solution. Fixes #2201.	2018-12-18 18:15:00 +00:00
Steven Perron	ff07c6df83	SSA-rewriter: make sure phi entries are unique. (#2206 ) If there are multiple edges to a basic block, then the ssa rewriter will create OpPhi instructions with duplicate entries. This is invalid, and it is fixed in this commit. Fixes #2202.	2018-12-18 18:14:27 +00:00
Ryan Harrison	e0292c269d	Add --target-env flag to spirv-opt (#2216 ) Fixes #2199	2018-12-17 16:54:23 -05:00
Steven Perron	c512c68640	Avoid GCC8 warning in text_handler.cpp. (#2197 ) In the function `AssemblyContext::binaryEncodeString`, we want to copy a nul terminated string to an instruction. When coping the string, we did not copy the nul at the end of the source. It was added by setting the entire last word to 0, which is mandated by the spir-v spec. This is not a bug, but it does trigger a warning in GCC8 when doing a release build. To avoid the warning, we will copy the nul character at the end of the string too. Fixes #1541.	2018-12-13 15:03:28 -05:00
Alastair Donaldson	1cba9942bd	Validate during reduction (#2194 ) * Run validator during reduction. * Added functionality to validate modules after each reduction step, and some tests to check this is working. Also fixed an issue where reduction passes were not guaranteed to be executed at their minimum granularities.	2018-12-12 09:06:13 -05:00
Jeff Bolz	24328a0554	Recognize OpTypeAccelerationStructureNV as a type instruction (#2190 )	2018-12-11 19:03:55 -05:00
Ryan Harrison	a719fc18a5	Disable checking that AtomicStorage capability is present (#2193 ) There is inconsistencies between the different specs about whether or not this capability is required/allowed, so tooling like glslang currently ignores it. Once this is resolved the check and test can be re-enabled.	2018-12-11 14:19:44 -05:00
Steven Perron	e07dabc25f	Invalidate the decoration manager at the start of ADCE. (#2189 ) * Invalidate the decoration manager at the start of ADCE. If the decoration manager is kept live the the contex will try to keep it up to date. ADCE deals with group decorations by changing the operands in \|OpGroupDecorate\| instructions directly without informing the decoration manager. This puts it in an invalid state, which will cause an error when the context tries to update it. To Avoid this problem, we will invalidate the decoration manager upfront. At the same time, the decoration manager is now considered when checking the consistency of the decoration manager.	2018-12-10 13:24:33 -05:00
Hugues Evrard	4aeadc0199	Add RemoveOpNameInstruction reduction pass (#2187 ) Add a spirv-reduce pass which removes OpName and OpMemberName instructions. This is useful to enable other reduction passes, e.g. RemoveUnreferencedInstruction may not be able to remove an instruction creating an id whose only usage is an OpName for this id.	2018-12-10 11:53:31 -05:00
Steven Perron	0bc66a8ba9	Fix invalid OpPhi generated by merge-return. (#2172 ) * Fix invalid OpPhi generated by merge-return. When we create a new phi node for a value say %10, we have to replace all of the uses of %10 that are no longer dominated by the def of %10 by the result id of the new phi. However, if the use is in a phi node, it is possible that the bb contains the use is not dominated by either. In this case, needs to be handled differently. * Split loop headers before add a new branch to them. In merge return, Phi node in loop header that are also merges for loop do not get updated correctly. Those cases do not fit in with our current analysis. Doing this will simplify the code by reducing the number of cases that have to be handled.	2018-12-07 14:10:30 -05:00
Alejandro Lopez	de797ddcb5	Check that certain decorations cannot be used more than once and/or are mutually exclusive (#2171 ) Fixes #1636 * Add a hash functor for decoration types for c++11 compliance * Change non-POD static variables and add test for Block+BufferBlock	2018-12-07 12:46:27 -05:00
Alastair Donaldson	6679d5df89	Replace loop with selection (#2164 ) Add a pass for spirv-reduce that will turn a loop into a selection.	2018-12-07 12:44:46 -05:00
Ryan Harrison	7c38fee64a	Restrict mask bits for memory semantics in WebGPU (#2180 ) Fail to validate memory semantics value if it includes set bits that are not on the whitelist from the spec. Fixes #2070	2018-12-07 10:38:52 -05:00
David Neto	6df6194db8	Validate Uniform decoration (#2181 )	2018-12-07 09:32:57 -05:00
Ryan Harrison	cf37ab7213	Merge two implementations of ValidateMemorySemantics (#2175 ) Fixes #2170	2018-12-06 14:38:15 -05:00
Steven Perron	2e4563d94f	Document in the context what happens with id overflow. (#2159 ) Added documentation to the ir context to indicates that TakeNextId() returns 0 when the max id is reached. TODOs were added to each call sight so that we know where we have to start to handle this case. Handle id overflow in \|SplitLoopHeader\|. Handle id overflow in \|GetOrCreatePreHeaderBlock\|. Handle failure to create preheader in LICM. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/1841.	2018-12-06 09:07:00 -05:00
Ryan Harrison	378b7f3a29	Check for recursion in Vulkan and WebGPU entry points (#2161 ) Fixes #2061 Fixes #2160	2018-12-05 13:58:43 -05:00
Alejandro Lopez	2f5f5308b6	Validate that there is at most one push constant block (#2163 ) Fixes #2006 Validates that there is at most one PushConstant interface per entry point for Vulkan environment.	2018-12-05 13:30:04 -05:00
Ryan Harrison	3e645b9d67	Check that if A calls B, B is defined before A for WebGPU (#2169 ) Fixes #2067	2018-12-05 11:47:24 -05:00
alan-baker	68d1dc66d2	Loosen binding and descriptor check (#2167 ) * Only check for binding and descriptor set on variables that are statically used by an entry point * updated tests and added a couple new ones * new method for collecting entry points that statically reference an id	2018-12-05 08:10:02 -05:00
Steven Perron	a0816d03e9	Validate OpForwardPointer (#2156 ) * Validate OpForwardPointer The validator does not have a a check that OpForwardPointer is giving a forward reference to a pointer type. We add that check. https://crbug.com/910852 * Remove more specialized check. There was a check that the forward pointer is actually a poiner type, but it was only done if it was used in a struct. This was too specific. Remove it in favour of the more general check that was added. * Format * Check the storage type in OpTypeForwardPointer * Fix typo is test case epxected results.	2018-12-04 13:35:49 -05:00
Alejandro Lopez	a1439604ea	Check binding annotations in resource variables (#2151 ) Fixes #2007 Add checks that all uniform, uniform constant and storage buffer variables have descriptor set and binding decorations	2018-12-04 10:05:41 -05:00
Steven Perron	17cba4695c	Remove undefined behaviour when folding shifts. (#2157 ) We currently simulate all shift operations when the two operand are constants. The problem is that if the shift amount is larger than 32, the result is undefined. I'm changing the folder to return 0 if the shift value is too high. That way, we will have defined behaviour. https://crbug.com/910937.	2018-12-04 10:04:02 -05:00
alan-baker	b1ff8ba5b9	Check device scope for Vulkan memory model (#2149 ) Fixes #2147 * Checks that device scope is not used for availability and visibility operations unless VulkanMemoryModelDeviceScopeKHR capability is present * implemented for atomics, barriers and memory instructions currently	2018-12-03 17:15:47 -05:00
dan sinclair	d835d664bd	[val] Fixup id name output (#2158 ) This CL changes the id/name output from the validator to always use a consistent id[%name] style. This removes the need for getIdOrName. The name lookup is changed to use the NameMapper so the output is consistent with what the disassembler will produce. Fixes #2137	2018-12-03 17:01:30 -05:00
David Neto	0c172a6b74	Allow Float16/Int8 for Vulkan 1.0 (#2153 )	2018-12-03 12:50:12 -05:00
Steven Perron	ae1826154e	Validate uses of ids defined in unreachable blocks. (#2146 ) * Validate uses of ids defined in unreachable blocks. For some reason we do not make sure the uses of ids that are defined in unreachable blocks are dominated by their def. This is causing invalid code to pass the validator. Fixes #2143 * Add test for unreachable code after a return. We want to allow code like: ``` void foo() { a = ...; ... return; // for debugging <use of a>; ... } ``` I added a test to make sure that something like this is still accepted by the validator. * Add test for unreachable def used in phi.	2018-12-03 12:49:27 -05:00
alan-baker	d80259d35e	Strict validation of where type ids are acceptable (#2142 ) Fixes https://crbug.com/910239 * IdPass catches many instances of invalid references to types * Test updates * Added test to catch OpArrayLength issue	2018-12-03 11:03:52 -05:00
Ryan Harrison	b9f9a3bc9f	Add WebGPU Execution scope check (#2148 ) Fixes #2069	2018-12-03 10:56:55 -05:00
alan-baker	e510b1bac5	Update memory model (#1904 ) Upgrade to VulkanKHR memory model * Converts Logical GLSL450 memory model to Logical VulkanKHR * Adds extension and capability * Removes deprecated decorations and replaces them with appropriate flags on downstream instructions * Support for Workgroup upgrades * Support for copy memory * Adding support for image functions * Adding barrier upgrades and tests * Use QueueFamilyKHR scope instead of device	2018-11-30 14:15:51 -05:00
alan-baker	6af3c5cbe4	Clean uses of EvalInt32IfConst (#2145 ) Fixes #2133 * Don't return OpSpecConstant* as constants in that method * cleaned up uses * added tests to catch shader semantics and scope bugs	2018-11-30 14:00:56 -05:00
Alejandro Lopez	b8e2a9f258	Validate PushConstants annotation and type (#2140 ) * Validate PushConstants have Block annotation and are struct or array of structs * Add passing test and split into universal/vulkan environment tests	2018-11-30 13:12:05 -05:00
Ryan Harrison	625db3890d	Add check for QueueFamilyKHMR memory scope (#2144 ) This also fixes a small typo that was causing my test case to fail. Fixes #2136	2018-11-30 12:52:31 -05:00
Ryan Harrison	2cd040b0d3	Merging two ValidateMemoryScope implementations (#2132 ) Fixes #2125	2018-11-29 14:51:17 -05:00
Steven Perron	2d2a512691	Don't inline recursive functions. (#2130 ) * Move ProcessFunction* function from pass to the context. There are a few functions that are used to traverse the call tree. They currently live in the Pass class, but they have nothing to do with a pass, and may be needed outside of a pass. They would be better in the ir context, or in a specific call tree class if we ever have a need for it. * Don't inline recursive functions. Inlining does not check if a function is recursive or not. This has been fine as long as the shader was a Vulkan shader, which forbid recursive functions. However, not all shaders are vulkan, so either we limit inlining to Vulkan shaders or we teach it to look for recursive functions. I prefer to keep the passes as general as is reasonable. The change does not require much new code in inlining and gives a reason to refactor some other code. The changes are to add a member function to the Function class that checks if that function is recursive or not. Then this is used in inlining to not inlining a function call if it calls a recursive function. * Add id to function analysis There are a few places that build a map from ids to Function whose result is that id. I decided to add an analysis to the context for this to reduce that code, and simplify some of the functions. * Add missing file.	2018-11-29 14:24:58 -05:00
Ryan Harrison	8ce3dbabb8	Merge two implementations of ValidateExecutionScope (#2131 )	2018-11-29 13:48:42 -05:00
Ryan Harrison	3ee605d7cc	Ensure that only whitelisted extensions are used in WebGPU (#2127 ) Fixes #2058	2018-11-28 10:49:05 -05:00
Ryan Harrison	525e36d1cd	Move OpExtInst validation into validate_extensions.cpp (#2124 ) Fixes #2123	2018-11-27 17:05:54 -05:00
alan-baker	3d56cddb75	Validate pointer variables (#2111 ) Fixes #2104 * Checks the rules for logical addressing and variable pointers * Has an out for relaxed logical pointers * Updated PassFixture to expose validator options * enabled relaxed logical pointers for some tests * New validator tests	2018-11-27 16:47:10 -05:00
Ryan Harrison	4759082bbc	Ensure that imported extended instructions for WebGPU are only "GLSL.std.450" (#2119 ) Ensure that imported extended instructions for WebGPU are GLSL.std.450 Fixes #2059	2018-11-27 16:20:01 -05:00
Ryan Harrison	dab634da93	Ensure that function parameter's type is not void (#2118 ) Fixes #2094	2018-11-27 09:40:19 -05:00
Ryan Harrison	48d923907b	Restrict capabilities to WebGPU spec (#2113 ) Restrict capabilities to WebGPU spec This covers whitelisting Matrix, Shader, Sampled1D, Image1D, DerivativeControl, and ImageQuery. These are the allowed capabilities that don't require an extension. Whitelisting VulkanMemoryModelKHR will be handled by whitelisting its extension in a seperate patch. Fixes #2101	2018-11-27 09:39:37 -05:00
alelenv	f989b2dbd7	Add precise check for allowing use of gl_InstanceID for specific vulkan raytracing stages . (#2096 ) * Checks that gl_InstanceID is only used in specific execution models	2018-11-27 08:35:29 -05:00
Steven Perron	4e22b60122	Add validation for OpArrayLength. (#2117 ) The validation rules for OpArrayLength are not checked by the validator. This with add them. Fixes https://crbug.com/907451.	2018-11-26 19:46:08 -05:00
Alastair Donaldson	3b13040cf9	New spirv-reduce reduction pass: operand to dominating id. (#2099 ) * Added a reduction pass to replace ids with ids of the same type that dominate them. * Introduce helper method for querying whether an operand type is an input id.	2018-11-26 17:06:21 -05:00
alan-baker	e799bfb923	Prevent diagnostic memory leak (#2110 ) Fixes https://crbug.com/906669 * Don't free diagnostics in spvBinaryParse * When invoking the parser we wish to ignore the error messages from, instead create a hijacked context and replace the message consumer with a null consumer	2018-11-26 16:58:09 -05:00
Steven Perron	72d4e5414b	Change HexFloat to work with gcc8. (#2109 ) When we want to set a the value of a HexFloat to inf or nan, we construct the specific bit pattern in an appropriately sized integer. That integer is copied to a FloatProxy object through a memcpy. GCC8 complains about the memcpy because it is overwriting a private member of the class. The original solution worked well because the template to the HexFloat could be anything. However, we only used some instantiation of FloatProxy, which has a construction from that takes its uint_type, so I decided to use that constructor instead of the memcpy. This puts an extra requirement on the templace for HexFloat, but it will be fine for us. Part of #1541.	2018-11-26 15:47:48 -05:00
Michał Janiszewski	d543f7dfed	Don't use CMake's own property as variable name (#2112 ) ``` $ cmake --help-property-list \| grep ^VERSION$ VERSION ```	2018-11-26 10:37:30 -05:00
Daniel Koch	3b210d6a63	Add basic support for EXT_fragment_invocation_density (#2100 ) Whitelisting the extension in optimizations * copying what was done for NV_shading_rate	2018-11-23 10:21:19 -05:00
Minmin Gong	095cc6722f	Fix the missing pch files in spirv-reduce (#2097 )	2018-11-22 18:00:08 -05:00
dan sinclair	78c951b3f6	Add newline at end of file (#2098 )	2018-11-22 14:35:40 -05:00
Ryan Harrison	7a3493e887	Make sure that initialized variable have correct storage class (#2092 ) Make sure that initialized variable have correct storage class For WebGPU and Vulkan environments, variables must have the storage class; Output, Private, or Function, if they have an initializer. Fixes #2071	2018-11-22 12:52:04 -05:00
Ryan Harrison	981763ec74	Ensure correct Addressing and Memory model set for WebGPU (#2093 ) Adding validation that the addressing declared by OpMemoryModel is Logical and the memory model declared is VulkanKHR. Updating a bunch of tests that were broken by this. Fixes #2060	2018-11-21 16:41:59 -05:00
Alastair Donaldson	f3acb955c2	Initial commit for spirv-reduce. (#2056 ) Creates a new tool that can be used to reduce failing testcases, similar to creduce.	2018-11-21 14:03:09 -05:00
Ryan Harrison	3adb7977da	Check forbidden Annotation instructions for WebGPU env (#2090 ) Check forbidden Annotation instructions for WebGPU env From the WebGPU SPIR-V Execution Enviroment spec: OpDecorationGroup, OpGroupDecorate, OpGroupMemberDecorate are not allowed. Fixes #2062	2018-11-20 16:40:38 -05:00
Ryan Harrison	11c7a9e067	Validate that debugging instructions are not present for WebGPU (#2089 ) Validate that debugging instructions are not present for WebGPU For WebGPU execution environments, check that all of the debug instructions have already been stripped before validation. Fixes #2063	2018-11-20 16:12:28 -05:00
alan-baker	d41ff27f17	Add support for VK_EXT_Transform_feedback capabilities (#2088 ) * Added support for Transform Feedback capabilities. * Fix tests	2018-11-20 12:41:03 -05:00
dan sinclair	15fdcf94d7	Add missing override to ProcessLinesPass	2018-11-19 19:24:48 -05:00
alan-baker	f5b4a8eee3	Catch invalid input type to OpConvertUToPtr (#2078 ) Fixes https://crbug.com/906426 * Fails validation if the input operand is a type * Added a test	2018-11-19 15:08:38 -05:00
Ryan Harrison	8cd2a9d187	Validate component literals for OpVectorShuffle in WebGPU environment (#2077 ) Validate component literals for OpVectorShuffle in WebGPU environment Fixes #2072	2018-11-19 14:32:18 -05:00
Alan Baker	d652ed3029	Vulkan memory model: semantics validation Ban sequentially consistent with VulkanKHR * Added validation check that SequentiallyConsistent memory semantics are not used if the memory model is VulkanKHR * Added tests * Fixed a bug in evaluating constant 32-bit integers and updated some handling to avoid inferring a value from a spec constant default Remaining memory semantics validation * Adds checks that OutputMemoryKHR, MakeAvailableKHR and MakeVisibleKHR are only used if the VulkanMemoryModelKHR capabailty is present * Added checks that MakeAvailableKHR requires release semantics * Added checks that MakeVisibleKHR requires acquire semantics * Added checks that MakeAvailableKHR and MakeVisibleKHR require a storage class	2018-11-19 11:44:20 -05:00
Alan Baker	cd22b31557	Catch branch condition being a type Fixes https://crbug.com/903691 * Added a test	2018-11-16 16:40:39 -05:00
David Neto	8e9be303b0	Validator: Support VK_EXT_scalar_block_layout Adds validator option to specify scalar block layout rules. Both VK_KHR_relax_block_layout and VK_EXT_scalar_block_layout can be enabled at the same time. But scalar block layout is as permissive as relax block layout. Also, scalar block layout does not require padding at the end of a struct. Add test for scalar layout testing ArrayStride 12 on array of vec3s Cleanup: The internal getSize method does not need a round-up argument, so remove it.	2018-11-16 15:55:30 -05:00
alan-baker	28d8d7bc67	Fix min base alignment (#2075 ) Fixes #2073 * Added a test	2018-11-16 14:22:42 -05:00
Ryan Harrison	d7cd1203a4	Ensure for OpVariable that result type and storage class operand agree (#2052 ) From SPIR-V spec, section 3.32.8 on OpVariable: Its Storage Class operand must be the same as the Storage Class operand of the result type. Fixes #941	2018-11-16 11:22:11 -05:00
greg-lunarg	c37388f1ad	Add passes to propagate and eliminate redundant line instructions (#2027 ). (#2039 ) These are bookend passes designed to help preserve line information across passes which delete, move and clone instructions. The propagation pass attaches a debug line instruction to every instruction based on SPIR-V line propagation rules. It should be performed before optimization. The redundant line elimination pass eliminates all line instructions which match the previous line instruction. This pass should be performed at the end of optimization to reduce physical SPIR-V file size. Fixes #2027.	2018-11-15 14:06:17 -05:00
fjhenigman	ab76e332de	Validate uniform variable type in Vulkan (#1949 ) (#2055 ) From the Vulkan 1.1 spec 14.5.2: Variables identified with the Uniform storage class are used to access transparent buffer backed resources. Such variables must be typed as OpTypeStruct, or an array of this type. Fixes #1949	2018-11-15 13:42:17 -05:00
David Neto	a29a9947ac	UniformConstant variables can have RuntimeArray, TypeAccelerationStructureNV	2018-11-14 21:50:09 -05:00
Greg Fischer	d4a10590b7	Fix Instruction::IsFloatingPointFoldingAllowed() Was looking for decorations based on opcode. Should use result_id.	2018-11-14 15:25:51 -07:00
alan-baker	5c334514d6	Allow InstanceId for NV ray tracing (#2049 ) * Allow InstanceId for NV ray tracing Fixes #2046 * Allows InstanceId in the Vulkan environment if RayTracingNV capability is specified	2018-11-14 15:03:40 -05:00
Ryan Harrison	a362e60d5a	Validate variable types for UniformConstant storage in Vulkan (#2008 ) (#2044 ) Validate variable types for UniformConstant storage in Vulkan (#2008) From the Vulkan 1.1 spec 14.5.2: Variables identified with the UniformConstant storage class are used only as handles to refer to opaque resources. Such variables must be typed as OpTypeImage, OpTypeSampler, OpTypeSampledImage, or an array of one of these types. Fixes #2008	2018-11-14 15:00:03 -05:00
Steven Perron	dc9d155d62	Fix folding of volatile store. (#2048 ) When looking for the Volatile mask on a store, the instruction folder accesses an out-of-bounds element. We fix that up. Fixes crbug.com/903530.	2018-11-14 13:52:18 -05:00
Steven Perron	a6150a3fe7	Don't assert on void function parameters. (#2047 ) The type manager in spirv-opt currently asserts if a function parameter has type void. It is not exactly clear from the spec that this is disallowed, even if it probably will be disallowed. In either case, asserts should be used to verify assumptions that will actually make a difference to the code. As far as the optimizer is concerned, a void parameter does not matter. I don't see the point of the assert. I'll just remove it and let the validator decide whether to accept it or not. No test was added because it is not clear that it is legal, and should not force us to accept it in the future unless the spec make it clear that it is legal. Fixes crbug.com/903088.	2018-11-14 12:43:43 -05:00
Steven Perron	ec5574a9c6	Instruction::GetBaseAddress to handle OpPtrAccessChain (#2050 ) That function currently only handled OpPtrAccessChain if it was in the middle of the chain, but not at the start. Fixing that up. Fixes crbug.com/905271.	2018-11-14 12:42:25 -05:00
Neil Henning	2b1f6b373c	Validate that VertexId and InstanceId are not allowed in Vulkan. (#2036 ) The Vulkan specification does not permit use of the VertexId and InstanceId BuiltIn decorations, so add a check to ensure they are not being used when the target environment is Vulkan.	2018-11-13 09:22:48 -05:00
dan sinclair	f343a15764	Add missing overrides (#2041 )	2018-11-12 15:11:32 -05:00
dan sinclair	75999d9b71	Remove asserts around environment determination. (#2040 ) This CL removes several asserts around determining the SPIR-V environment. In each case we already return a default value if assertions are compiled out, so just return the default value.	2018-11-12 14:24:47 -05:00
greg-lunarg	1e9fc1aac1	Add base and core bindless validation instrumentation classes (#2014 ) * Add base and core bindless validation instrumentation classes * Fix formatting. * Few more formatting fixes * Fix build failure * More build fixes * Need to call non-const functions in order. Specifically, these are functions which call TakeNextId(). These need to be called in a specific order to guarantee that tests which do exact compares will work across all platforms. c++ pretty much does not guarantee order of evaluation of operands, so any such functions need to be called separately in individual statements to guarantee order. * More ordering. * And more ordering. * And more formatting. * Attempt to fix NDK build * Another attempt to address NDK build problem. * One more attempt at NDK build failure * Add instrument.hpp to BUILD.gn * Some name improvement in instrument.hpp * Change all types in instrument.hpp to int. * Improve documentation in instrument.hpp * Format fixes * Comment clean up in instrument.hpp * imageInst -> image_inst * Fix GetLabel() issue.	2018-11-08 13:54:54 -05:00
greg-lunarg	6721478ef1	Don't assume one return means function can be inlined. (#2018 ) (#2025 ) If there is only 1 return and it is in a loop, then the function cannot be inlined. Fix condition when inlined code needs one-trip loop wrapper. The dummy loop is needed when there is a return inside a selection construct. Even if there is only 1 return.	2018-11-08 09:11:20 -05:00
Jeff Bolz	c06a35b902	Rename PCH macro to spvtools_pch to avoid conflicts with other projects. Also add pch to test/opt. (#2034 )	2018-11-07 09:15:04 -05:00
Steven Perron	91f33503fc	Validate the id bound. (#2031 ) * Validate the id bound. Validates that the id bound for the module is not larger than the max id bound. Also adds an option to set the max id bound. Allows the optimizer option to set the max id bound to also set the id bound for the validation run done by the optimizer. Fixes #2030.	2018-11-06 11:30:19 -05:00
James Jones	398f37a2e0	Add explicit void parameter in libspirv.h again (#2032 ) When building C code with gcc and the -Wstrict-prototypes option, function declarations and definitions that don't specify their argument types generate warnings. Functions that don't take parameters need to specify (void) as their parameter list, rather than leaving it empty. Note this only applies to C, so only the functions exported in C-compatible headers need fixing. In C++ functions can't be declared/defined without a parameter list, so C++ can safely allow an empty parameter list to imply (void).	2018-11-06 11:12:26 -05:00
Jeff Bolz	60fac96c6b	Enable precompiled headers for spirv-tools(-shared) and some unit tests (#2026 )	2018-11-06 09:26:23 -05:00
Steven Perron	f2cc71e5cb	Handle OpMemberDecorateStringGOOGLE in ACDE (#2029 ) Add missing case to the switch statement for the annotation instructions. See https://github.com/KhronosGroup/glslang/issues/1561.	2018-11-02 13:42:45 -04:00
Jeff Bolz	fb996dce75	Add /Zm flag as a workaround for VS2013 build (#2023 )	2018-10-31 07:59:43 -04:00
Steven Perron	6647884a13	Remove MemberDecorateStringGOOGLE during stript-refect. (#2021 ) The strip-reflect pass is not removing the reflection decorations that are decorating members. With this commit, they will now be removed. Fixes #2019.	2018-10-30 16:17:35 -04:00
alelenv	1c1e749f0b	Add support for nv-raytracing-final (#2010 ) Add support for nv-raytracing (non-experimental)	2018-10-25 14:07:46 -04:00
Steven Perron	18fe6d59e5	Fix dead branch elim infinite loop. (#2009 ) When looking for a break from a selection construct, we do not realize that a jump to the continue target of a loop containing the selection is a break. This causes and infinit loop, or possibly other failures. Fixes #2004.	2018-10-24 09:10:30 -04:00
Steven Perron	0ba35798c3	Fix dead branch elim infinite loop. (#1997 ) When looking for a break from a selection construct, we do not need to look inside nested constructs. However, if a loop header has an unconditional branch, then we enter the loop. Entering the loop causes an infinite loop because we keep going through the loop. The solution is to look for a merge block, if one exsits, even for block terminated by an OpBranch. Fixes #1979.	2018-10-22 13:59:20 -04:00
alan-baker	20bbfb6f4d	Layout checks should recurse through runtime arrays (#1999 ) Fixes #1985 * Added test to catch bug * Tested aginst Vulkan CTS	2018-10-22 08:50:45 -04:00
alan-baker	89b8e238eb	Better checking of the index operand (#1992 ) Fixes https://crbug.com/897069 * Code previously assumed the index instruction had a type * Added a test to reproduce	2018-10-22 08:47:56 -04:00
alan-baker	6e85d1a6fc	Fix restrictions in if conversion (#1998 ) Fixes #1991 * Improved identification of potential conditional branches * Pass changed to only work for shaders * added a test to catch the bug	2018-10-19 15:16:46 -04:00
Jeff Bolz	dd1e837e1c	Use per-configuration location for pch file (#1989 )	2018-10-19 14:58:26 -04:00
Steven Perron	8edf3557ca	Revert "Add custom target to wrap around custom commands. (#1986 )" (#1996 ) Breaks the build when using makefiles. The ninja build is fine. This reverts commit `67ebe3f7ae`.	2018-10-19 14:05:19 -04:00
Neil Henning	d29a1f98f3	Add validaton for SPV_KHR_8bit_storage + convert to/from floats. (#1990 ) The SPV_KHR_8bit_storage extension does not permit 8-bit integers to be cast directly to floating point types. We are seeing shaders in the wild, being produced by toolchains like glslang, that are generating invalid SPIR-V. This change adds validation to check for the patterns not permitted, and some tests that expose the failure.	2018-10-19 13:45:26 -04:00
Steven Perron	715afb0cea	Add a nullptr check to array copy propagation. (#1987 ) We are missing a check for a nullptr that is causing things to fail. Added an extra test case, and fixed up others. This is the fix for https://github.com/Microsoft/DirectXShaderCompiler/issues/1598.	2018-10-19 12:53:40 -04:00
Steven Perron	67ebe3f7ae	Add custom target to wrap around custom commands. (#1986 ) In CMake, we are not suppose to have multiple targets depend on the same custom command. To avoid this, we have to add a custom target around the command. Fixes #1941.	2018-10-19 10:17:47 -04:00
greg-lunarg	c4687889b7	Fix ADCE to treat OpUnreachable correctly during liveness analysis (#1984 ) ADCE liveness algorithm should treat OpUnreachable at least like other branch instructions. It was being treated as always live which was preventing useless structured constructs from being eliminated. OpUnreachable is generated by dead branch elimination which is now being required by merge return, so this fix should accompany that change.	2018-10-19 10:16:35 -04:00
Steven Perron	0e68bb3632	Only run merge-returnon reachable functions. (#1983 ) We currently run merge-return on all functions, but dead-branch-elimination only runs on function reachable from an entry point or exported function. Since dead-branch-elimination is needed for merge-return, they have to match. Fixes #1976.	2018-10-18 08:48:27 -04:00
alan-baker	9aa14a38f4	OpGroupDecorate may not target OpDecorationGroup (#1977 ) Fixes https://crbug.com/896200 * Adds a check to validation of OpGroupDecorate that OpDecorationGroup cannot be targeted	2018-10-17 13:45:05 -04:00
Steven Perron	b407163ef3	Checks for variable pointers (#1976 ) In logical addressing mode, we are not allowed to generate variables pointers. There is already a check for OpSelect. However, OpPhi and OpPtrAccessChain are not checked to make sure it does not generate an variable pointer. I've added those checks. Fixes #1957.	2018-10-16 14:57:55 -04:00
greg-lunarg	ab45d69154	Fix ADCE liveness to include all enclosing control structures. (#1975 ) Was removing control structures which didn't have data dependency with enclosed live loop and otherwise did not contain live code. An example is a counting loop around a live loop. Fixes #1967.	2018-10-16 08:00:07 -04:00
David Neto	eea449a1e8	validator: FPRoundingMode can apply to vector conversions Fixes #1972	2018-10-15 17:22:50 -04:00
Jeff Bolz	339d23275d	Enable precompiled headers for MSVC (#1969 )	2018-10-15 11:12:02 -04:00
alan-baker	72bac04d73	Memory access checks for vulkan mem model (#1909 ) * MakePointerVisibleKHR cannot be used with OpStore * MakePointerAvailableKHR cannot be used with OpLoad * MakePointerAvailableKHR and MakePointerVisibleKHR both require NonPrivatePointerKHR * NonPrivatePointerKHR is limited to a subset of storage classes * many tests	2018-10-15 09:30:47 -04:00
David Neto	bdecee8c86	Validator: TaskNV can use LocalSize or LocalSizeId (#1970 ) Correponds to the update to Rev2 of SPV_NV_mesh_shader Fixes #1968	2018-10-12 08:54:52 -04:00
greg-lunarg	e545564887	Consider atomics that load when analyzing live stores in ADCE (#1956 ) (#1958 ) Consider atomics that load when analyzing live stores in ADCE. Previously it asserted that the base of an OpImageTexelPointer should be an image. It is actually a pointer to an image, so IsValidBasePointer should suffice.	2018-10-12 08:46:35 -04:00
Alan Baker	1c128aa9ef	Validating for new image operands * Validation checks for new image operands MakeTexelAvailableKHR and MakeTexelVisibleKHR * added tests * Tests that NonPrivateTexelKHR is accepted for all image operands Updating test environments * fixed build errors * changed image types for *FetchSuccess tests to use a type defined in 1.3 shader body	2018-10-11 17:47:18 -04:00
Steven Perron	82663f34c9	Check for unreachable blocks in merge-return. (#1966 ) Merge return assumes that the only unreachable blocks are those needed to keep the structured cfg valid. Even those must be essentially empty blocks. If this is not the case, we get unpredictable behaviour. This commit add a check in merge return, and emits an error if it is not the case. Added a pass of dead branch elimination before merge return in both the performance and size passes. It is a precondition of merge return. Fixes #1962.	2018-10-10 15:18:15 -04:00
alan-baker	bc09f53c96	Fix calculation of case fall through (#1965 ) Fixes #1959 * Code erroneously concluded that the target's fall through was itself * Added a test	2018-10-10 13:25:48 -04:00
Steven Perron	4e266f775a	Fold divisions by 0. (#1963 ) The current implementation in the folder when seeing a division by zero is to assert. In the release build, the compiler will attempt to compute the value, which causes its own problems. The solution I will go with is to fold the division, and just give it the value of 0. The same goes for remainder and mod operations. Fixes #1961.	2018-10-10 11:17:26 -04:00
alan-baker	fae1e61ab8	Fix bug in construct block calculation (#1964 ) Fixes #1960 * Only allows blocks that are dominated by the header * Fixed a bad loop fusion test * Added a test derived from the reported bug	2018-10-10 11:14:01 -04:00
Ben Ashbaugh	d3f88b0841	allow atomics on Function pointers for OpenCL (#1955 )	2018-10-09 11:33:01 -04:00
Jaebaek Seo	03cbf33a69	Validator: FPRoundingMode decoration (#1482 ) This commit checks the following when Shader capability exists: "The FPRoundingMode decoration can be applied only to a width-only conversion instruction that is used as the Object operand of an OpStore storing through a pointer to a 16-bit floating-point object in the StorageBuffer, Uniform, PushConstant, Input, or Output Storage Classes.".	2018-10-05 13:33:03 -04:00
Steven Perron	497958d899	Removing HLSLCounterBuffer decorations when not needed. (#1954 ) The HlslCounterBufferGOOGLE that was introduced changed the OpDecorateId so that is can now reference an id other than the target. If that other id is used only in the decoration, then the definition of the id will be removed because decoration do not count as real uses. However, if the target of the decoration is still live the decoration will not be removed. This leaves a reference to an id that is not defined. There are two solutions to consider. The first is that is the decoration is kept, then the definition of the id should be kept live. Implementing this change would be involved because the way ADCE handles decorations will have to be reimplemented. The other solution is to remove the decoration the id is otherwise dead. This works for this specific case. Also this is the more desirable behaviour in this case. The id will always be the id of a variable that belongs to a descriptor set. If that variable is not bound and we do not remove it, the driver will complain. I chose to implement the second solution. The first will be left to when a case for it comes up. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1885.	2018-10-05 08:23:09 -04:00
Jaebaek Seo	ebcc58b5f8	Validator: function scope variable at start of entry block #1923 All OpVariable instructions in a function must be the first instructions in the first block.	2018-10-04 15:05:47 -04:00
Alan Baker	3b5960174f	Don't scalarize spec constant sized arrays Fixes #1952 * Prevent scalarization of arrays that are sized by a specialization constant	2018-10-04 11:58:23 -04:00
Steven Perron	19c07731fc	Change handling of unknown extentions in validtor. (#1951 ) This commit will change the message for unknown extensions from an error to a warning. Code was added to limit the number of warning messages so that consummer of the messages are not overwhelmed. This is standard practice in compilers. Many other issues were found at while looking into this. They have been documented in #1950. Fixes http://crbug.com/875547.	2018-10-03 15:59:40 -04:00
Jaebaek Seo	d73b9d8dfb	[Validator] AMD_gpu_shader_half_float_fetch allow float16 (#1393 ) SPV_AMD_gpu_shader_half_float_fetch extension should implicitly allow declaring 16bit float.	2018-10-02 16:06:13 -04:00
Jaebaek Seo	37c99ab7e5	Validator: OpImageQuerySize validation (#1538 ) Validation of OpImageQuerySize is missing that is a TODO. This commit implements its validation based on the spec.	2018-10-02 15:53:52 -04:00
Alan Baker	a77bb2e54b	Add validation for execution modes * Check rules from Execution Mode tables, 2.16.2 and the Vulkan environment spec * Allows MeshNV execution model with the following execution modes * LocalSize, LocalSizeId, OutputPoints and OutputVertices * Done to not break their validation	2018-10-02 10:22:23 -04:00
Steven Perron	146eb3bdcf	Fix erroneous uses of the type manager in copy-prop-arrays. (#1942 ) There are a few spots where copy propagate arrays is trying to go from a Type to an id, but the type is not unique. When generating code this pass needs specific ids, otherwise we get type mismatches. However, the ambigous types means we can sometimes get the wrong type and generate invalid code. That code has been rewritten to not rely on the type manager, and just look at the instructions instead. I have opened https://github.com/KhronosGroup/SPIRV-Tools/issues/1939 to try to get a way to make this more robust.	2018-10-01 14:45:44 -04:00
Jeff Bolz	fe90a1d2dc	Enable /MP4 (parallel build across 4 cores for MSVC) for SPIRV-Tools/source[/opt] (#1930 )	2018-10-01 10:47:39 -04:00
Steven Perron	ddc705933d	Analyze uses for all instructions. (#1937 ) * Analyze uses for all instructions. The def-use manager needs to fill in the `inst_to_used_ids_` field for every instruction. This means we have to analyze the uses for every instruction, even if they do not have any uses. This mistake was not found earlier because there was a typo in the equality check for def-use managers. No new tests are needed. While looking into this I found redundant work in block merge. Cleaning that up at the same time. * Fix other transformations Aggressive dead code elimination did not update the OpGroupDecorate and the OpGroupMemberDecorate instructions properly when they are updated. That is fixed. Dead branch elimination did not analyze the OpUnreachable instructions that is would add. That is taken care of.	2018-09-28 14:39:06 -04:00
Steven Perron	32381e30ef	Handle decoration groups with no decorations. (#1921 ) In DecorationManager::RemoveDecorationsFrom, we do not remove the id from a decoration group if the group has no decorations. This causes problems because KillNamesAndDecorates is suppose to remove all references to the id, but in this case, there is still a reference. This is fixed by adding a special case. Also, there is the possibility of a double free because RemoveDecorationsFrom will delete the instructions defining \|id\| when \|id\| is a decoration group. Later, KillInst would later write to memory that has been deleted when trying to turn it into a Nop. To fix this, we will only remove the decorations that use \|id\| and not its definition in RemoveDecorationsFrom.	2018-09-28 14:16:04 -04:00
Jaebaek Seo	f0aa6f4e3a	Fixed Validator adjacency bug for OpPhi (#1922 ) OpPhi instruction must appear before all non-OpPhi instructions except for OpLine. Without this commit, Validator does not check the case that an OpPhi is preceeded by an OpLine and the OpLine is preceeded by a non-OpPhi instruction that is not OpLine.	2018-09-28 12:40:57 -04:00
alan-baker	ad0232dee5	Unify memory instruction validation style (#1934 ) * Rename ValidateMemoryInstructions to MemoryPass * Changed functions to take pointer to an instruction instead of reference	2018-09-27 12:34:14 -04:00
Jaebaek Seo	4b4bd4c53a	Validator: Validate OpImageTexelPointer (#487 ) Checked all instructions whose object is OpTypeSampledImage or OpTypeImage as suggested in #487. OpImageTexelPointer instruction is missing and others look good. This commit adds only OpImageTexelPointer.	2018-09-27 09:53:30 -04:00
Steven Perron	80564a56ec	Keep analyses live in unrolling (#1929 ) Add code to keep the def-use manger and the inst-to-block mapping up-to-date. This means we do not have to rebuild them later. To make this work, we will have to have to find places to update the def-use manager. Updating the def-use manager is not straight forward because we are unrolling loops, and we have circular references. This forces one pass to register all of the definitions. A second one to analyze the uses. Also because there will be references to the new instructions in the old code, we want to register the definitions of the new instructions early, so we can update the uses of the older code as we go along. The inst-to-block mapping is not too difficult. It can be done as instructions are created. Fixes #1928.	2018-09-26 17:36:27 -04:00
Lei Zhang	1225324ae2	VK_KHR_shader_atomic_int64 covers OpAtomic{Load\|Store}	2018-09-26 16:45:37 -04:00
Jaebaek Seo	026309ab27	Validator: OpGroupNonUniformBallotBitCount validation (#1486 )	2018-09-26 15:52:39 -04:00
Steven Perron	0e5fc7d75e	Allow 0 as argument to scalar replacement. (#1917 ) A limit of 0 for the scalar replacement options it used to indicate that there is no limit. The current implementation does not allow 0. This should be fixed.	2018-09-26 09:58:28 -04:00
Steven Perron	b85fb4a300	Get KillNameAndDecorates to handle group decorations. (#1919 ) It seems like the current implementation of KillNameAndDecorates does not handle group decorations correctly. The id being removed is not removed from the OpGroupDecorate instructions. Even worst, any decorations that apply to that group are removed. The solution is to use the function in the decoration manager that will remove the decorations and update the instructions instead of doing the work itself.	2018-09-25 12:57:44 -04:00
Alan Baker	90a12b3d4d	Decoration validation for Vulkan memory model * Adds a check that using Coherent or Volatile decorations with the Vulkan memory model is a validation error * Adds tests	2018-09-21 21:55:01 -04:00
Alan Baker	1492111332	Validate vulkan mem model capabilty * Check that if the VulkanMemoryModelKHR capability is specified that the memory model must be VulkanKHR * added tests	2018-09-21 21:50:20 -04:00
Chao Chen	6e2dab2ffd	Add support for Nvidia Turing extensions	2018-09-19 20:46:14 -04:00
Steven Perron	9fbcce4ca1	Add unrolling to the legalization passes (#1903 ) Adds unrolling to the legalization passes. After enabling unrolling I found a bug when there is a self-referencing phi node. That has been fixed. The test that checks for that the order of optimizations is correct also needed to be updated.	2018-09-19 16:40:09 -04:00
Jaebaek Seo	0cd3e599ae	Validator: correct out of bound check for OpMemberDecorate (#1881 ) The number that indicates a member in OpMemberDecorate must be less than the number of total members of struct.	2018-09-18 10:16:46 -04:00
Steven Perron	7f0a8877a2	Move the registration of decorations. (#1895 ) We currently register decorations in the first pass through the instructions. This is a problem because the validator has not even checked if the decoration instructions are valid yet. This can lead to unexpected behaviour from these side table. For example, in https://github.com/KhronosGroup/SPIRV-Tools/issues/1882, we use 5GB of data to store 1 decoration for ids that are not even defined. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1882.	2018-09-18 08:53:09 -04:00
Steven Perron	7075c49923	Add dummy loop in merge-return. (#1896 ) The current implementation of merge return can create bad, but correct, code. When it is not in a loop construct, it will insert a lot of extra branch around code. The potentially large number of branches are bad. At the same time, it can separate code store to variables from its uses hiding the fact that the store dominates the load. This hurts the later analysis because the compiler thinks that multiple values can reach a load, when there is really only 1. This poorer analysis leads to missed optimizations. The solution is to create a dummy loop around the entire body of the function, then we can break from that loop with a single branch. Also only new merge nodes would be those at the end of loops meaning that most analysies will not be hurt. Remove dead code for cases that are no longer possible. It seems like some drivers expect there the be an OpSelectionMerge before conditional branches, even if they are not strictly needed. So we add them.	2018-09-18 08:52:47 -04:00
Steven Perron	5f599e700e	Fix infinite loop in dead-branch-elimination (#1891 ) * Create structed cfg analysis. There are lots of optimization that have to traverse the CFG in a structured order just because it wants to know which constructs a basic block in contained in. This adds extra complexity to these optimizations, for causes too much refactoring of older optimizations. To help with this problem, I have written an analysis that can give this information. * Identify branches breaking from loops. Dead branch elimination does a search for a conditional branch to the end of the current selection construct. This search assumes that the only way to leave the construct is through the merge node. But that is not true. The code can jump to the merge node of a loop that contains the construct. The search needs to take this into consideration.	2018-09-17 13:00:24 -04:00
Diego Novillo	4a4632264e	Add IR dumping functions to use during debugging. When using lldb and/or gdb I frequently get odd std::string failures when using the IR printing instructions we have now. This adds the methods Instruction::Dump(), BasicBlock::Dump() and Function::Dump() to emit the output of the pretty print to stderr. With this I can now reliably print IR from gdb and lldb sessions.	2018-09-14 14:28:34 -04:00
Lei Zhang	63265097e5	Add support for VK_KHR_shader_atomic_int64 in validator	2018-09-14 14:07:25 -04:00
Steven Perron	6d5f1bc2e8	Allow merge blocks to merge two header blocks in some cases. (#1890 ) In merge blocks, we do not allow the merging of two blocks with merge instructions. This is because if the two block are merged only 1 of those instructions can exists. However, if the successor block is the merge block of the predecessor, then we can delete the merge instruction in the predecessor. In this case, we are able to merge the blocks.	2018-09-14 13:37:18 -04:00
Jaebaek Seo	2c2fee7979	Validator: check OpTypeBool inside Blocks (#1405 ) OpTypeBool can only be used with non-externally visible shader Storage Classes: Workgroup, CrossWorkgroup, Private, and Function.	2018-09-10 13:33:13 -04:00
Steven Perron	75c1bf2843	Add option for the max id bound. (#1870 ) * Create a new entry point for the optimizer Creates a new struct to hold the options for the optimizer, and creates an entry point that take the optimizer options as a parameter. The old entry point that takes validator options are now deprecated. The validator options will be one of the optimizer options. Part of the optimizer options will also be the upper bound on the id bound. * Add a command line option to set the max value for the id bound. The default is 0x3FFFFF. * Modify `TakeNextIdBound` to return 0 when the limit is reached.	2018-09-10 11:49:41 -04:00
Steven Perron	f62d7978fc	Add validation check for arrays of void type. (#1880 ) In the definition of an array (https://www.khronos.org/registry/spir-v/specs/1.2/SPIRV.html#Array), it specfically mentions that array elements have non-void type. I've added a check for that in this PR. http://crbug.com/879016	2018-09-10 09:21:32 -04:00
David Neto	571251c8f8	Support SPV_KHR_vulkan_memory_model rev2 Support collapsed into one commit: - Asm/Dis support for SPV_KHR_vulkan_memory_model - Add Vulkan mem model image operands to switch - Add TODO for source/validate_image.cpp - val: Image operands NonPrivateTexelKHR, VolatileTexelKHR have no operands This is required for memory model tests to pass SPIR-V validation. - Round trip tests: Test new flags on OpCopyMemory*	2018-09-06 13:30:32 -04:00
Alan Baker	cb0f1f565b	Remove struct member offset monotonicity check Fixes #1822 * Remove check that struct member offsets must be monotonic * All environments match Vulkan behaviour now * updated offending tests	2018-08-31 09:45:45 -04:00
Steven Perron	482b1744ca	Validate all type ids. (#1868 ) * Validate all type ids. The validator does not check if the type of an instruction is actually a type unless the OpCode has a specific requirement. For example, OpFAdd is checked, but OpUndef is not. The commit add a generic check that if there is a type id then the id defines a type. http://crbug.com/876694 * Merge other checks for type into new one. There are a couple check that the type id is a type for specific opcodes. Those have been mereged into 1. Small changes to other test cases to make them valid enough for the purpose of the test.	2018-08-27 23:45:32 -04:00
Steven Perron	06b42949b6	Validate uses of OpTypeFunction. (#1867 ) In the specification of `OpTypeFunction`, it says > OpFunction is the only valid use of OpTypeFunction. This commit add a check in the validator for this rule. A test started to fail because the new check happens before the check the test case is testing. Updated the test case to still fail the check it was suppose to fail originally. http://crbug.com/874571	2018-08-27 11:41:25 -04:00
alan-baker	d94a2077d6	Remove idUsage * Moved remaining validation out of idUsage and deleted it * Deleted unused functions	2018-08-27 11:06:09 -04:00
Steven Perron	416b1ab4f3	Have the constant manager take ownership of constants. (#1866 ) * Have the constant manager take ownership of constants. Right now the owner of an object of type contant that is in the \|const_pool_\| of the constant manager is unclear. The constant manager does not delete them, there is no other reasonable owner. This causes memory leaks. This change fixes the memory leaks by having the constant manager take ownership of the constant that is stores in \|const_pool_\|. Other changes include interface changes to make it explicit that the constant manager takes ownership of the object when a constant is registered with the constant manager. Fixes #1865.	2018-08-27 09:53:47 -04:00
Steven Perron	47ee776a2c	Revert "Have the constant manager take ownership of constants." This reverts commit `b938b74bac`.	2018-08-24 15:12:49 -04:00
Steven Perron	b938b74bac	Have the constant manager take ownership of constants. Right now the owner of an object of type contant that is in the \|const_pool_\| of the constant manager is unclear. The constant manager does not delete them, there is no other reasonable owner. This causes memory leaks. This change fixes the memory leaks by having the constant manager take ownership of the constant that is stores in \|const_pool_\|. Other changes include interface changes to make it explicit that the constant manager takes ownership of the object when a constant is registered with the constant manager.	2018-08-24 15:08:12 -04:00
Steven Perron	d746681fe9	Copy decorations when creating new ids. (#1843 ) * Copy decorations when creating new ids. When creating a new value based on an old value, we need to copy the decorations to the new id. This change does this in 3 places: 1) The variable holding the return value of the function generated by merge return should get decorations from the function. 2) The results of the OpPhi instructions should get decorations from the variable they are replacing in the ssa writer. 3) In local access chain convert the intermediate struct (result of OpCompositeInsert) generated for the store replacement should get its decorations from the variable being stored to. Fixes #1787.	2018-08-24 11:55:39 -04:00
Alan Baker	6d27a8350f	Fixing instances of iteration over unordered containers * There were several instances found in the validator * validate_id.cpp * validate_decorations.cpp * validate_interfaces.cpp	2018-08-23 14:49:10 -04:00
Steven Perron	b4d3618f77	Don't "break" from selection constructs. (#1862 ) If seems like at least 1 driver does not like a condition jump to the end of a selection construct. We are generating these in the merge return pass. This change stops merge return from generating this sequence. Part of #1861.	2018-08-23 14:38:25 -04:00
Steven Perron	6c73b1fb70	Update the order when predicating blocks. (#1859 ) When doing predicate blocks, we need to traverse every block in structured order in order to keep track of which construct a block is contained in. The standard way of traversing code in structured order is to create a list with all of the nodes in order. However, when predicating blocks, new blocks are created, and those blocks are missed. This causes branches that go too far. The solution is to update the order as new blocks are created. Since we are using an std::list, we do not have to worry about invalidation of iterators when changing the list.	2018-08-23 12:59:31 -04:00
Alan Baker	c5b38062ec	Moving constant opcode validation into a new file * Split constant opcode validation out of idUsage and into validate_constants.cpp * minor style fixes * reduced duplication * fixed an issue with array sizing	2018-08-21 17:30:26 -04:00
Steven Perron	d91d34e150	Fix VS2013 build break. (#1853 )	2018-08-21 13:50:47 -04:00
Steven Perron	19264ef42c	Have PredicateBlocks jump the existing merge blocks. (#1849 ) * Refactor PredicateBlocks Refactor PredicateBlocks so that we know which constructs a return is contained in. Will be used later. * Have PredicateBlocks jump the existing merge blocks. In PredicateBlocks, we currently skip instructions with side effects, but it still follows the same control flow (sort-of). This causes a problem, when we are trying to predicate code in a loop. We skip all of the code with side effects (IV increment), but still follow the same control flow (jump back the start of the loop). This creates an infinite loop because the code will keep jumping back to the start of the loop without changing the values that effect the exit condition. This is a large change to merge-return. When predicating a block that is in a loop or merge construct, it will jump to the merge block of the construct. Once out of all constructs we will generate code as we did before.	2018-08-21 12:04:08 -04:00
Alan Baker	197b4597a0	Fix EvalInt32IfConst to fail on type instructions. Fixes https://crbug.com/875842 * EvalInt32IfConst dereferenced a null pointer if a type instruction was sent as the id	2018-08-21 11:59:00 -04:00
Steven Perron	d693a83e36	Handle breaks from structured-ifs in DCE. (#1848 ) * Handle breaks from structured-ifs in DCE. dead code elimination assumes that are conditional branches except for breaks and continues in loops will have an OpSelectionMerge before them. That is not true when breaking out of a selection construct. The fix is to look for breaks in selection constructs in the same place we look for breaks and continues for loops.	2018-08-21 11:54:44 -04:00
Steven Perron	45c235d41f	Have dead-branch-elim handle conditional exits from selections. (#1850 ) When dead-branch-elim folds a conditional branch, it also deletes the OpSelectionMerge instruction. If that construct contains a conditional branch to the merge node, it will not have its own OpSelectionMerge. When the headers merge instruction is deleted, the the inner conditional branch will no longer be legal. It will be a selection to a node that is not a merge node. We fix this up by moving the OpSelectionMerge to a new location if it is still needed.	2018-08-21 11:49:56 -04:00
Diego Novillo	03000a3a38	Add testing framework for tools. This forks the testing harness from https://github.com/google/shaderc to allow testing CLI tools. New features needed for SPIRV-Tools include: 1- A new PlaceHolder subclass for spirv shaders. This place holder calls spirv-as to convert assembly input into SPIRV bytecode. This is required for most tools in SPIRV-Tools. 2- A minimal testing file for testing basic functionality of spirv-opt. Add tests for all flags in spirv-opt. 1. Adds tests to check that known flags match the names that each pass advertises. 2. Adds tests to check that -O, -Os and --legalize-hlsl schedule the expected passes. 3. Adds more functionality to Expect classes to support regular expression matching on stderr. 4. Add checks for integer arguments to optimization flags. 5. Fixes #1817 by modifying the parsing of integer arguments in flags that take them. 6. Fixes -Oconfig file parsing (#1778). It reads every line of the file into a string and then parses that string by tokenizing every group of characters between whitespaces (using the standard cin reading operator). This mimics shell command-line parsing, but it does not support quoting (and I'm not planning to).	2018-08-17 15:03:14 -04:00
Steven Perron	36d675a404	Change when instruction is registered in validator. (#1840 ) When doing the validator checks, an instruction is currently registered at the end of IdPass. This creates an inconsistency. In IdPass, an instruction that uses its own result will treat that use as a forward reference. Then in the following passes it will not because the definition can be found. It seems best to update the state after all of the check have been done for the current instruction. This makes it consistent for all of the passes. This makes a different when trying to verify OpTypeStruct. Fixes https://crbug.com/874372.	2018-08-15 13:18:47 -04:00
Steven Perron	e065cc208f	Keep decorations when replacing loads in access-chain-convert. (#1829 ) In local-access-chain-convert, we replace loads by load the entire variable, then doing the extract. The extract will have the same value as the load. However, if the load has a decoration on it, the decoration is lost because we do not copy any them to the new id. This is fixed by rewritting the load into the extract and keeping the same result id. This change has the effect that we do not call DCEInst on the loads because the load is not being deleted, but replaced. This could leave OpAccessChain instructions around that are not used. This is not a problem for -O and -Os. They run local_single_*_elim passes and then dead code elimination. The dce will remove the unused access chains, and the load elimination passes work even if there are unused access chains. I have added test to them to ensure they will not loss opportunities. Fixes #1787.	2018-08-15 09:14:21 -04:00
dan sinclair	ef678672fb	Remove source/message.h (#1838 ) The code in source/message was only used in a single set of tests to format the output results. This CL changes the test to verify the message instead of all the error values and removes the source/message code.	2018-08-14 15:41:21 -04:00
dan sinclair	1963a2dbda	Use MakeUnique. (#1837 ) This CL replaces instances of reset(new ..) with MakeUnique.	2018-08-14 15:01:50 -04:00
dan sinclair	1553025f4c	Move make_unique to source/util. (#1836 ) This MakeUnique code is used in places other then source/opt so move it to source/utils.	2018-08-14 12:44:54 -04:00
Steven Perron	bf24d9b4ac	Don't copy decorations twice when rebuilding a type. (#1835 ) In `TypeManager::RebuildType`, the base cases call `Clone`, which will copy the decorations for the type. After that it breaks out of the switch statement and copies the decorations again. This has not causes any real problems yet because none of those types are allowed to have decorations. However to make the code more robust it is best to not copy twice because it should be empty. This way if a new base type or decoration is added that changes this rule the code will be correct.	2018-08-14 11:26:14 -04:00
Alan Baker	8cb949ad34	Validate correct opcode uses of OpFunction Fixes https://crbug.com/873457 * Filed Khronos SPIR-V issue 352 * Updated bad tests * Added new test	2018-08-14 10:13:06 -04:00
dan sinclair	5fc011b453	Move bit_stream, move_to_front and huffman_codec. (#1833 ) bit_stream, move_to_front and huffman_codec are only used by source/tools. Move into that directory to make the usage clearer.	2018-08-14 09:52:05 -04:00
alan-baker	ce4547bdc7	Disallow void types in structs (#1832 ) Fixes #1831 * Adds validation check that void is not a member of a struct * added a test	2018-08-14 08:55:49 -04:00
Alan Baker	e7fdcdba75	Split function opcode validation into new files. * Moved function opcode validation out of idUsage and into new files * minor style changes * General opcode checking is in validate_function.cpp * Execution limitation checking is in validate_execution_limitations.cpp * Execution limitations was split into a new pass as it requires other validation to register those limitations first.	2018-08-13 17:04:57 -04:00
Alan Baker	397e02442e	Fixing heap overflow in validation. * Changed entry point validation to check storage class of variable instead of pointer * added a test * Moved several checks after opcode validation * These checks should be able to guarantee individual instructions are ok * Updated tests due to reordered checks	2018-08-13 15:23:30 -04:00
Steven Perron	bcb0b6935c	Reenable --skip-validation. (#1820 ) In previous changes, the option `--skip-validation` was disabled. This change is to reenable it.	2018-08-13 13:18:46 -04:00
dan sinclair	da0f1dcccc	Move spirv_stats into tools/stats. (#1826 ) The spirv_stats code is only used by the tools/stats module. This CL moves the code to that module.	2018-08-13 11:48:25 -04:00
Alan Baker	6cd4441c87	Move cfg opcode validation to another file. * Moved cfg opcode validation out of idUsage and into validate_cfg.cpp * minor style updates	2018-08-13 11:30:08 -04:00
dan sinclair	b6319c3a43	Split MarkV into multiple files (#1809 ) This CL breaks the monolithic markv_codec file into files for the base class, encoder, decoder and logger.	2018-08-09 17:07:19 -04:00
Alan Baker	714bf84e58	Split mode setting opcode validation into new file. * Moved mode setting opcode validation out of idUsage and into a new pass * minor style updates	2018-08-08 15:45:53 -04:00
Alan Baker	7d4b0464a3	Split annotation opcode validation into new file. * Moves annotation opcode checks from idUsage into a new pass * minor style updates	2018-08-08 15:43:11 -04:00
Alan Baker	983f8f02de	Replace asserts with returns * Changes to satisfy fuzzer	2018-08-08 15:13:04 -04:00
Alan Baker	ca7278cff7	Split debug opcode validation into new file * Removes debug opcode validation from idUsage and puts it in a separate file * minor updates	2018-08-08 13:47:09 -04:00
Alan Baker	f2a990022a	Move type instruction validation into separate file * Moved type instruction validation out of validation idUsage into a new file * Consolidate type unique pass into new file * Removed one bad test * Reworked validation ordering	2018-08-08 12:55:39 -04:00
Steven Perron	5c8b4f5a1c	Validate the input to Optimizer::Run (#1799 ) * Run the validator in the optimization fuzzers. The optimizers assumes that the input to the optimizer is valid. Since the fuzzers do not check that the input is valid before passing the spir-v to the optimizer, we are getting a few errors. The solution is to run the validator in the optimizer to validate the input. For the legalization passes, we need to add an extra option to the validator to accept certain types of variable pointers, even if the capability is not given. At the same time, we changed the option "--legalize-hlsl" to relax the validator in the same way instead of turning it off.	2018-08-08 11:16:19 -04:00
Alan Baker	3a20879f4d	Unify validation of OpCopyMemory* Fixes #1800 * Refactored duplication of code between OpCopyMemory and OpCopyMemorySized validation * Fixed some bugs in OpCopyMemorySized validation * Replaced asserts with checks * Added new tests	2018-08-07 19:01:58 -04:00
Alan Baker	2896b8f0e5	Refactor where opcodes are validated * Replaced uses in opcode validation of current_function() * Added non-const accessor to function lookup in ValidationState_t * Updated a couple bad tests due to check reordering	2018-08-07 10:29:30 -04:00
dan sinclair	508df9a387	Remove unused bit stream methods. (#1807 ) This CL deletes methods from bit stream which are never used and moves several to the anonymous namespace in the bit_stream test file.	2018-08-07 09:10:54 -04:00
dan sinclair	e3ea909ebe	Simplify MoveToFront (#1806 ) This CL removes the templating from the MoveToFront code as all non-test code uses uint32_t as the variable.	2018-08-07 09:10:25 -04:00
dan sinclair	9991d661f8	Fix readbility/braces warnings (#1804 )	2018-08-07 09:09:47 -04:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
dan sinclair	58a6876cee	Rewrite include guards (#1793 ) This CL rewrites the include guards to make PRESUBMIT.py include guard check happy.	2018-08-03 08:05:33 -04:00
dan sinclair	d38a0a3b44	Validation within function body when doing a FunctionCall. (#1790 ) When validating a FunctionCall we can trigger an assert if we are not currently within a function body. This CL adds verification that we are within a function before attempting to add a function call. Issue 1789.	2018-08-02 16:58:45 -04:00
dan sinclair	6aa8a59415	Simplify validation ProcessInstruction (#1786 ) This CL moves most of the logic out of validation ProcessInstruction and groups it into validate. This places all of the validation logic in the same place making it clearer what is running. The Instruction class is changed to allow setting the function and block after creation.	2018-08-02 15:12:06 -04:00
dan sinclair	1946fb4ddb	Remove ValidateInstructionAndUpdateValidationState (#1784 ) This CL changes the stats aggregator to use ValidateBinaryAndKeepValidationState to process the binary. This means we can remove ValidateInstructionAndUpdateValidationState which expects to be able to call ProcessInstruction in the validate anonymous namespace. This decouples the stats aggregator from how validation processes the binary.	2018-08-02 12:01:26 -04:00
Steven Perron	ce644d4a24	Update OpPhi instructions after splitting block. (#1783 ) In the merge return pass, we will split a block, but not update the phi instructions that reference the block. Since the branch in the original block is now part of the block with the new id, the phi nodes must be updated. This commit will change this. I have also considered other places where an id of a basic block could be referenced, and I don't think any of them need to change. 1) Branch and merge instructions: These jump to the start of the original block, and so we want them to jump to the block that uses the original id. Nothing needs to change. 2) Names and decorations: I don't think it matters with block keeps the name, and there are no decorations that apply to basic blocks. Fixes #1736.	2018-08-02 11:02:50 -04:00
dan sinclair	53afb3b77b	Combine ordered_instruction loops in validation. (#1782 ) There are several validation passes which loop over all ordered instructions. This CL combines those into a single loop, calling each pass as needed.	2018-08-02 10:00:52 -04:00
dan sinclair	c9cd73b33a	Remove instruction_counter from ValidationState. (#1781 ) The instruction counter is the same as the size of the ordered_instruction list when we insert a new instruction. This Cl removes instruction_counter_ and uses that instead.	2018-08-01 16:12:07 -04:00
Alan Baker	d49bedcaa6	Move memory class instructions to new pass * Refactored the Memory class of instructions in the spec out Id validation and into a new pass * Tests unmodified * some minor disassembly changes * minor style changes	2018-08-01 16:10:11 -04:00
dan sinclair	a5a5ea0e2d	Remove using std::<foo> statements. (#1756 ) Many of the files have using std::<foo> statements in them, but then the use of <foo> will be inconsistently std::<foo> or <foo> scattered through the file. This CL removes all of the using statements and updates the code to have the required std:: prefix.	2018-08-01 14:58:12 -04:00
dan sinclair	ebd6c75a71	Remove diag() overloads. (#1776 ) This CL removes the two diag() overloads and leaves only the version which accepts an Instruction. This is safer as we never use the implicit location from the validation state.	2018-08-01 14:55:20 -04:00
dan sinclair	aa81e62cbe	Update diag() calls in validate_capability. (#1759 ) This CL updates the diag() call in validate_capability to provide the instruction.	2018-08-01 13:48:16 -04:00
Steven Perron	c8c724cba7	Don't change decorations and names in merge return. (#1777 ) When creating a new phi for a value in the function, merge return will rewrite all uses of an id that are no longer dominated by its definition. Uses that are not in a basic block, like OpName or decorations, are not dominated, but they should not be replaced. Fixes #1736.	2018-08-01 13:47:09 -04:00
dan sinclair	ab061afc83	Update diag() calls in validate_type_unique. (#1775 ) This CL updates the diag() calls in validate_type_unique to pass the relevant instruction.	2018-08-01 13:13:44 -04:00
dan sinclair	78335c927a	Update diag() calls in validate_primitives. (#1774 ) This CL updates the diag() calls in validate_primitives to provide the relevant instruction.	2018-08-01 13:00:38 -04:00
dan sinclair	6bb9ab48b8	Update diag() calls in validate_non_uniform. (#1773 ) This CL upldates diag() calls in validate_non_uniform to provide the relevant instruction.	2018-08-01 12:49:43 -04:00
dan sinclair	7c9a73fc30	Update diag() calls in validate_logicals. (#1772 ) This CL updates the diag() calls in validate_logicals to provide the Instruction.	2018-08-01 12:41:57 -04:00
dan sinclair	72766d9e88	Update diag() calls in validate_literals. (#1771 ) This CL updates the diag() call in validate_literals to provide the relevant instruction.	2018-08-01 12:41:46 -04:00
dan sinclair	e1e20f1abe	Update diag() calls in validate_layout. (#1770 ) This CL updates the diag() calls in validate_layout to pass the relevant instruction.	2018-08-01 12:01:35 -04:00
dan sinclair	f37e8d74e7	Update diag() call in validate_interface. (#1769 ) This CL upldates validate_interface to pass the instruction to the diag() method.	2018-08-01 11:58:37 -04:00
dan sinclair	d792ccd1ee	Update diag() calls in validate_instruction. (#1768 ) This CL updates validate_instruction to pass the Instruction to diag().	2018-08-01 11:37:02 -04:00
dan sinclair	176cb5e593	Update diag() calls in validate_image. (#1767 ) This CL updates the diag() calls in validate_image to provide the relvant instruction.	2018-08-01 11:30:28 -04:00
dan sinclair	c64bad70d9	Update diag() calls in validate_ext_inst. (#1766 ) This CL updates the diag() usage in validate_ext_inst to provide the relevant instruction.	2018-08-01 11:11:23 -04:00
dan sinclair	441c0190eb	Update diag() calls in validate_derivatives. (#1765 ) This CL updates diag() in validate_derivatives to provide the instruction of interest.	2018-08-01 11:04:22 -04:00
dan sinclair	83b7f2b674	Update diag() calls in validate_decorations. (#1764 ) Several of the diag() calls in validate_decorations do not provide the line number, and will output the last line in the file. This CL updates the diag() calls to provide the instruction of interest.	2018-08-01 10:44:27 -04:00
dan sinclair	a504656dad	Remove std::deque in favour of std::vector. (#1755 ) This CL removes the two deque's from ValidationState and converts them into std::vectors. In order to maintain the stability of instructions we walk over the binary and counter the instructions and functions in the ValidationState constructor and reserve the required number of items in the module_functions_ and ordered_instructions_ vectors. Issue #1176.	2018-08-01 10:37:36 -04:00
dan sinclair	fae987b470	Update diag() calls in validate_datarules. (#1763 ) This CL updates validate_datarules to provide the instruction to diag().	2018-08-01 10:35:19 -04:00
dan sinclair	5a59a06e24	Update diag() calls in validate_conversion. (#1762 ) This CL updates validate_conversion to provide the instruction to diag() calls.	2018-08-01 10:18:06 -04:00
dan sinclair	eb03b152da	Update diag() calls in validate_composites. (#1761 ) This CL updates the diag() calls in validate_composites to provide the instruction directly.	2018-08-01 10:07:53 -04:00
dan sinclair	2c5f1b01d8	Update diag() calls in validate_cfg. (#1760 ) This CL updates the diag() calls in validate_cfg to provide the associated instruction. This fixes a couple places where we output the last line of the file instead of the instruction as the disassembly.	2018-08-01 09:52:16 -04:00
dan sinclair	3619de9ad5	Update diag() use in validate_builtin. (#1758 ) This CL updates the calls to diag() in vlidate_builtings to provide the instruction.	2018-08-01 09:31:31 -04:00
dan sinclair	12c1f2b603	Update diag() usage in validate_bitwise. (#1757 ) This Cl upldates the diag() calls to pass the instruction in validate_bitwise.	2018-08-01 09:19:37 -04:00
dan sinclair	111933537b	Update diag() in validate_barriers (#1754 ) This CL updates validate_barriers to provide an explicit instruction when calling diag().	2018-07-31 18:44:35 -04:00
dan sinclair	32ccf0d04c	Update diag() in validate_atomics (#1753 ) This CL updates validate_atomics to explicitly provide the instruction when caling diag().	2018-07-31 17:20:43 -04:00
dan sinclair	a4fe771da7	Pass the instruction to diag in arithmetic validation (#1752 ) This CL updates the diag() calls in validate_arithmetics to explicitly provide the instruction the diagnostic is attached too.	2018-07-31 16:26:58 -04:00
dan sinclair	dfb53f9f1a	Fix disassembly line for adjacency validations. (#1751 ) Previously the adjacency messages would output the last line of the file as the disassembly. This is incorrect, as we have an instruction they can be attached too. This CL fixes the messages to attach to the correct line number.	2018-07-31 15:31:09 -04:00
dan sinclair	b7afe4e7ae	Switch validate to use explicit diag() method. (#1750 ) This CL changes validate.cpp to use diag providing an explicit instruction. This changes the result of the function end checks to not output a disassembly anymore as printing the last line of the module didn't seem to make sense.	2018-07-31 14:53:10 -04:00
dan sinclair	a9d8fceec9	Change ValidationState::diag to accept an Instruction. (#1749 ) This CL changes the signature of diag() to accept an Instruction instead of the instructions position. A deprecated variant that accepts the position is available but will be removed in the near future.	2018-07-31 14:19:34 -04:00
Alan Baker	755e5c9420	Transform to combine consecutive access chains * Combines OpAccessChain, OpInBoundsAccessChain, OpPtrAccessChain and OpInBoundsPtrAccessChain * New folding rule to fold add with 0 for integers * Converts to a bitcast if the result type does not match the operand type V	2018-07-31 13:42:47 -04:00
Dan Sinclair	89901a8a48	Wrap entire timer.cpp in SPIRV_TIMER_ENABLED. This CL moves the SPIRV_TIMER_ENABLED preprocesser guard to encompass the includes along with the source. Currently we will try to pull in sys/resource.h on machines which may not have the file available and the build will fail. If we don't need timers, then we don't need the includes as well.	2018-07-31 10:38:18 -04:00
Dan Sinclair	f28ed82fd9	Make sure all instructions are in the ordered list. Currently, some instructions will be missing from the list of ordered_instructions. This will cause issues due to the debug change which passed the last instruction into subsequent passes. This CL moves the addition to the ordered list out of the RegisterInstruction method into AddOrderedInstruction. This method is called first in ProcessInstruction and the CapabilitiesPass and IdPass are updated to take an Instruction parameter.	2018-07-31 09:55:57 -04:00
dan sinclair	dcea11fa03	Update error messages in validate_composites. (#1743 ) This CL removes the redundant operator name from the error messages in validate_composites. The operator will be printed on the next line with the disassembly.	2018-07-31 09:52:14 -04:00
dan sinclair	dcb0dc21de	Split ImagePass into individual methods. (#1742 ) This CL splits the switch in ImagePass into individual validate functions. The error messages have been updated to drop the suffix/prefix of the opcode name since it will be displayed in the disassembly.	2018-07-30 16:59:29 -04:00
dan sinclair	673483d6a7	Move OpVectorShuffle check into validate_composites (#1741 ) This CL moves the OpVectorShuffle ID check out of validate_id and into validate_composites with the rest of the composite checks.	2018-07-30 16:12:49 -04:00
dan sinclair	ee22928bd9	Move CompositePass code into methods. (#1740 ) This Cl splits the CompositePass switch to have one method per case label. This makes the code a lot simpler to follow.	2018-07-30 13:06:03 -04:00
Diego Novillo	99fe61e724	Add API to create passes out of a list of command-line flags. This re-implements the -Oconfig=<file> flag to use a new API that takes a list of command-line flags representing optimization passes. This moves the processing of flags that create new optimization passes out of spirv-opt and into the library API. Useful for other tools that want to incorporate a facility similar to -Oconfig. The main changes are: 1- Add a new public function Optimizer::RegisterPassesFromFlags. This takes a vector of strings. Each string is assumed to have the form '--pass_name[=pass_args]'. It creates and registers into the pass manager all the passes specified in the vector. Each pass is validated internally. Failure to create a pass instance causes the function to return false and a diagnostic is emitted to the registered message consumer. 2- Re-implements -Oconfig in spirv-opt to use the new API.	2018-07-27 15:10:08 -04:00
Alan Baker	b49f76fd62	Handle undef literal value in vector shuffle Fixes #1731 * Updated folding rules related to vector shuffle to account for the undef literal value: * FoldVectorShuffleFeedingShuffle * FoldVectorShuffleFeedingExtract * FoldVectorShuffleWithConstants * These rules would commit memory violations due to treating the undef literal value as an accessible composite component	2018-07-20 11:32:43 -04:00
dan sinclair	effafedcee	Replace opt::Instruction type and result cache with flags. (#1718 ) Currentlty opt::Instruction class holds a cache of the result_id and type_id for the instruction. That cache needs to be updated if the underlying operand values are changes. This CL changes the cache to being a flag if there is a type or result id for the instruction. We then retrieve the value if needed from the operands.	2018-07-20 11:09:30 -04:00
Alan Baker	3c19651733	Add variable pointer support to IsValidBasePointer Fixes #1729 * Adds supported opcodes to IsValidBasePointer() enable by VariablePointers and VariablePointersStorageBuffer capabilities * Added tests	2018-07-19 14:43:59 -04:00
Alan Baker	28199b80b7	Fix block ordering in dead branch elim Fixes #1727 * If the pass finds any dead branches it can optimize then at the end of the pass it reorders basic blocks to ensure they satisfy block ordering requirements * Added some new tests * While investigating this issue, found and fixed a non-deterministic ordering of dominators * Now the edges used to construct the dominator tree are sorted according to posorder traversal indices	2018-07-19 11:17:57 -04:00
Dan Sinclair	8c7dab5caa	Fixup line number for OpVectorShuffle ID error. This CL updates the code to pull a valid instruction for the line number when outputting a component error in OpVectorShuffle. The error line isn't the best at this point as it points at the component, but it's better then a -1 (turning to max<size_t>) that was being output. The error messages has been updated to better reflect what the error is attempting to say. Issue 1719.	2018-07-16 14:18:53 -04:00
Steven Perron	208921efe8	Fix finding constant with particular type. (#1724 ) With current implementation, the constant manager does not keep around two constant with the same value but different types when the types hash to the same value. So when you start looking for that constant you will get a constant with the wrong type back. I've made a few changes to the constant manager to fix this. First off, I have changed the map from constant to ids to be an std::multimap. This way a single constant can be mapped to mutiple ids each representing a different type. Then when asking for an id of a constant, we can search all of the ids associated with that constant in order to find the one with the correct type.	2018-07-16 12:36:53 -04:00
Steven Perron	95b4d47e34	Fix infinite loop while folding OpVectorShuffle (#1722 ) When folding an OpVectorShuffle where the first operand is defined by an OpVectorShuffle, is unused, and is equal to the second, we end up with an infinite loop. This is because we think we change the instruction, but it does not actually change. So we keep trying to folding the same instruction. This commit fixes up that specific issue. When the operand is unused, we replace it with Null.	2018-07-13 12:43:00 -04:00
Steven Perron	63c1d8fb15	Fix size error when folding vector shuffle. (#1721 ) When folding a vector shuffle that feeds another vector shuffle causes the size of the first operand to change, when other indices have to be adjusted reletive to the new size.	2018-07-13 11:20:02 -04:00
dan sinclair	7603944a10	Remove dead code (#1720 ) Remove commented out code from validate_id.	2018-07-12 20:26:44 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	e477e7573e	Remove the module from opt::Function. (#1717 ) The function class provides a {Set\|Get}Parent call in order to provide the context to the LoopDescriptor methods. This CL removes the module from Function and provides the needed context directly to LoopDescriptor on creation.	2018-07-12 14:42:05 -04:00
dan sinclair	3ded745f21	Cleanup CFG header. (#1715 ) This CL removes some unused methods from CFG, makes the constructor explicit and moves the using statement to the cpp file where it's used.	2018-07-12 14:40:40 -04:00
dan sinclair	6803e42bb5	Cleanup some pass code to get context directly. (#1714 ) Instead of going through the instruction we can access the context() directly from the pass. Issue #1703	2018-07-12 11:13:32 -04:00
dan sinclair	a5e4a53217	Remove context() method from opt::Function (#1700 ) This CL removes the context() method from opt::Function. In the places where the context() was used we can retrieve, or provide, the context in another fashion.	2018-07-12 10:16:15 -04:00
dan sinclair	4cc6cd184a	Pass the IRContext into the folding rules. (#1709 ) This CL updates the folding rules to receive the IRContext as a paramter instead of retrieving off of the Instruction. Issue #1703	2018-07-12 09:12:23 -04:00
dan sinclair	f96b7f1cb9	use Pass::Run to set the context on each pass. (#1708 ) Currently the IRContext is passed into the Pass::Process method. It is then up to the individual pass to store the context into the context_ variable. This CL changes the Run method to store the context before calling Process which no-longer receives the context as a parameter.	2018-07-12 09:08:45 -04:00
Lei Zhang	4db9c789ff	Add option to skip verifying block layout We need this to avoid emitting errors on DirectX layout rules.	2018-07-11 18:00:54 -04:00
Steven Perron	e63551deac	Add folding rule to merge a vector shuffle feeding another one.	2018-07-11 14:44:46 -04:00
David Neto	2c6185e6bf	Enforce block layout rules even when relaxed - Vulkan 1.0 uses strict layout rules - Vulkan 1.0 with relaxed-block-layout validator option enforces all rules except for the relaxation of vector offset. - Vulkan 1.1 and later always supports relaxed block layout Add spot check tests for the relaxed-block-layout scenarios. Fixes #1697	2018-07-11 10:38:36 -04:00
dan sinclair	e70a412609	Move validation files to val/ directory (#1692 ) This CL moves the various validate files into the val/ directory with the rest of the validation infrastructure. This matches how opt/ is setup with the passes with the infrastructure.	2018-07-11 10:27:34 -04:00
dan sinclair	2cce2c5b97	Move tests into namespaces (#1689 ) This CL moves the test into namespaces based on their directories.	2018-07-11 09:24:49 -04:00
David Neto	fec6315fad	Vulkan permits non-monotonic offsets for block members Other environments do not. Add tests for OpenGL 4.5 and SPIR-V universal 1.0 to ensure they still check monotonic layout. For universal 1.0, we're assuming it otherwise follows Vulkan rules for block layout. Fixes #1685	2018-07-10 17:16:54 -04:00
Arseny Kapoulkine	ead54bbd91	Use spv_result_t instead of bool Using bool is confusing here, and results in an MSVC warning about an implicit cast to bool.	2018-07-10 14:24:39 -04:00
Steven Perron	cbdbbe9a26	Fix up code to make ClangTidy happy. Just a few changes to pass `std::function` objects by const reference instead of by value.	2018-07-10 13:59:01 -04:00
dan sinclair	84846b7e76	Cleanup whitespace lint warnings. (#1690 ) This CL cleans up the whitespace warnings and enables the check when running 'git cl presubmit --all -uf'.	2018-07-10 13:09:46 -04:00
dan sinclair	a3e3869540	Convert validation to use libspriv::Instruction where possible. (#1663 ) For the instructions which execute after the IdPass check we can provide the Instruction instead of the spv_parsed_instruction_t. This Instruction class provides a bit more context (like the source line) that is not available from spv_parsed_instruction_t.	2018-07-10 10:57:52 -04:00
dan sinclair	43144e36c1	Move the validation code into the val:: namespace (#1682 ) This CL moves the validation code to the val:: namespace. This makes it clearer which instance of the Instruction and other classes are being referred too.	2018-07-09 23:18:44 -04:00
dan sinclair	48326d443e	Move link/ code to anonymous namespace (#1679 ) Most of the link code is marked as static. This CL introduces an anonymous namespace and removes the static methods. The last two methods are exposed in the public API and have been left in the spvtools namespace.	2018-07-09 14:32:31 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
dan sinclair	3dad1cda11	Change libspirv to spvtools namespace (#1678 ) This CL changes all of the libspirv namespace code to spvtools to match the rest of the code base.	2018-07-07 09:38:00 -04:00
dan sinclair	76e0bde196	Move utils/ to spvtools::utils Currently the utils/ folder uses both spvutils:: and spvtools::utils. This CL changes the namespace to consistenly be spvtools::utils to match the rest of the codebase.	2018-07-06 16:47:46 -04:00
dan sinclair	9836b05acd	Move comp code into comp namespace This CL moves the code in the comp/ directories into the comp namespace.	2018-07-06 16:38:41 -04:00
David Neto	5e0276bdc9	validator: use RowMajor, ArrayStride, MatrixStride Implement rules for row-major matrices Use ArrayStride and MatrixStride to compute sizes Propagate matrix stride and RowMajor/ColumnMajor through array members of structs. Fixes #1637 Fixes #1668	2018-07-06 13:35:16 -04:00
David Neto	1a283f41ed	Layout validation: Permit {vec3; float} tight packing Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1666	2018-07-06 13:11:07 -04:00
Alan Baker	c460f44fbc	Add a check for invalid exits from case construct. Fixes #1618. Adds a check that validates acceptable exits from case constructs. Case constructs may only exit to another case construct, the corresponding merge, an outer loop continue or outer loop merge.	2018-07-06 11:52:13 -04:00
David Neto	a069499032	Fix layout checks for StorageBuffer and PushConstant storage classes Fixes #1664 : PushConstant with Block follows storage buffer rules PushConstant variables were being checked with block rules, which are too strict. Fixes #1606 : StorageBuffer with Block layout follows buffer rules StorageBuffer variables were not being checked before. Fix layout messages: say storage class and decoration We need to provide more information about storage class and decoration.	2018-07-06 11:04:23 -04:00
Steven Perron	a45d4cac61	Move folding routines into a class The folding routines are currently global functions. They also rely on data in an std::map that holds the folding rules for each opcode. This causes that map to not have a clear owner, and therefore never gets deleted. There has been a request to delete this map. To implement this, we will create a InstructionFolder class that owns the maps. The IRContext will own the InstructionFolder instance. Then the global functions will become public memeber functions of the InstructionFolder. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1659.	2018-07-05 17:52:43 -04:00
Steven Perron	9ecbcf5fc8	Make sure the constant folder get the correct type. There are a few locations where we need to handle duplicate types. We cannot merge them because they may be needed for reflection. When this happens we need do some extra lookups in the type manager. The specific fixes are: 1) When generating a constant through `GetDefiningInstruction` accept and use an id for the desired type of the constant. This will make sure you get the type that is needed. 2) In Private-to-local, make sure we to update the def-use chains when a new pointer type is created. 3) In the type manager, make sure that `FindPointerToType` returns a pointer that points to the given type and not a duplicate type. 4) In scalar replacment, make sure the null constants that are created are the correct type.	2018-07-05 14:34:30 -04:00
Steven Perron	101a9bcbb0	Add private to local to optimization and size passes. Many optimization will run on function scope symbols only. When symbols are moved from private scope to function scople, then these optimizations can do more. I believe it is a good idea to run this pass with both -O and -Os. To get the most out of it it should be run ASAP after inlining and something that remove all of the dead functions.	2018-07-04 21:26:09 -04:00
David Neto	30a9cefa1d	Support SPV_KHR_8bit_storage - Add asm/dis test for SPV_KHR_8bit_storage - validator: SPV_KHR_8bit_storage capabilities enable declaration of 8bit int TODO: - validator: ban arithmetic on 8bit unless Int8 is enabled Covered by https://github.com/KhronosGroup/SPIRV-Tools/issues/1595	2018-07-03 15:53:19 -04:00
dan sinclair	51091045fe	Produce better error diagnostics in the CFG validation. (#1660 ) Produce better error diagnostics in the CFG validation. This CL fixes up several issues with the diagnostic error line output in the CFG validation code. For the cases where we can determine a better line it has been output. For other cases, we removed the diagnostic line and the error line number from the results. Fixes #1657	2018-07-03 15:06:54 -04:00
Steven Perron	465f2815cb	Revert change and stop running remove duplicates. Revert "Don't merge types of resources" This reverts commit `f393b0e480`, but leaves the tests that were added. Added new test. These test are the so that, if someone tries the same change I made, they will see the test that they need to handle. Don't run remove duplicates in -O and -Os Romve duplicates was run to help reduce compile time when looking for types in the type manager. I've run compile time test on three sets of shaders, and the compile time does not seem to change. It should be safe to remove it.	2018-06-29 14:09:44 -04:00
Steven Perron	2eb9bfb5b6	Remove stores of undef. When storing an undef, any value is valid, including the one already in that memory location. So we can avoid the store.	2018-06-29 09:49:19 -04:00
David Neto	b67beca723	GLSL.std.450 Refract Eta can be any float scalar This is a decision from Khronos-internal SPIR-V spec issue 337.	2018-06-28 16:12:21 -04:00
Greg Roth	4717d24e24	Fix assert during compact IDs pass (#1649 ) During the compact IDs optimization pass, the result IDs of some basic blocks can change. In spite of this, GetPreservedAnalyses indicated that the CFG was preserved. But the CFG relies on the basic blocks having the same IDs. Simply removing this flag resolves the issue by preventing the CFG check. Also Removes combinators and namemap preserved analyses from compact IDs pass.	2018-06-27 19:29:08 -04:00
Steven Perron	f393b0e480	Don't merge types of resources When doing reflection users care about the names of the variable, the name of the type, and the name of the members. Remove duplicates breaks this because it removes the names one of the types when merging. To fix this we have to keep the different types around for each resource. This commit adds code to remove duplicates to look for the types uses to describe resources, and make sure they do not get merged. However, allow merging of a type used in a resource with something not used in a resource. Was done when the non resource type came second. This could have a negative effect on compile time, but it was not expected to be much. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1372.	2018-06-27 13:57:07 -04:00
David Neto	c2e3e67c31	validator: Fix storage buffer layout message	2018-06-27 09:54:40 -04:00
David Neto	8ecd833dbc	Block-decorated structs must list members in offset-order Additionally, implmentes code review feedback. Adds more detailed messages for Block and BufferBlock layout errors. Fixes #1638	2018-06-26 23:31:00 -04:00
Ari Suonpaa	29923409e9	Add validation for structs decorated as Block or BufferBlock. Fixes #937 Stop std140/430 validation when runtime array is encountered. Check for standard uniform/storage buffer layout instead of std140/430. Added validator command line switch to skip block layout checking. Validate structs decorated as Block/BufferBlock only when they are used as variable with storage class of uniform or push constant. Expose --relax-block-layout to command line. dneto0 modification: - Use integer arithmetic instead of floor.	2018-06-26 14:23:18 -04:00
Alan Baker	0d43e10b4a	Use type id when looking up vector type Fixes #1634 * Vector components of composite constructs used wrong accessor	2018-06-25 09:47:29 -04:00
Corentin Wallez	ba602c9059	Add a WIP WebGPU environment. It disallows OpUndef Add SPV_ENV_WEBGPU_0 for work-in-progress WebGPU. val: Disallow OpUndef in WebGPU env Silence unused variable warnings when !defined(SPIRV_EFFCE) Limit visibility of validate_instruction.cpp's symbols Only InstructionPass needs to be visible so all other functions are put in an anonymous namespace inside the libspirv namespace.	2018-06-21 15:53:15 -04:00
Alan Baker	e7ace1b280	Add Vulkan 1.1 capability sets Fixes #1597 * Classifies useable capabilities for Vulkan 1.1 * Updates tests	2018-06-21 14:12:02 -04:00
David Neto	8d65c89678	Instruction lookup succeeds if it's enabled by a capability Also add a corresponding check for capabilities in the validator. Update previously existing test cases where an instruction used to fail assembling because of a version check, but now they succeed because the instruction is also guarded by a capability. Now it should assemble. Add tests to ensure that capabilities are checked appropriately. The explicitly reserved instructions OpImageSparseSampleProj* now assemble, but they fail validation. Fixes #1624	2018-06-20 10:44:03 -04:00
dan sinclair	f80696eaf6	[val] Add extra context to error messages. (#1600 ) [val] Add extra context to error messages. This CL extends the error messages produced by the validator to output the disassembly of the errored line. The validation_id messages have also been updated to print the line number of the error instead of the word number. Note, the error number is from the start of the SPIR-V, it does not include any headers printed in the disassembled code. Fixes #670, #1581	2018-06-19 16:02:44 -04:00
dan sinclair	c4304ea0ac	Reland "Disallow array-of-arrays with DescriptorSets when validating. (#1586 )" This CL reverts the revert of 'Disallow array-of-arrays with DescriptorSets when validating." Other changes have been committed which should aleviate the AppVeryor resource constraints. This reverts commit `f2c93c6e12`. This CL adds validation to disallow using an array-of-arrays when attached to a DescriptorSet. Fixes #1522	2018-06-19 15:14:17 -04:00
dan sinclair	d3ed998222	Validate Ids before DataRules. (#1622 ) Validate Ids before DataRules. The DataRule validators call FindDefs with the assumption that they definitions being looked at can be found. This may not be true if we have not validated identifiers first. This CL flips the IdPass and DataRulesPass to fix this issue.	2018-06-19 09:32:20 -04:00
Alan Baker	ea7239fa73	Structured switch checks Fixes #491 * Basic blocks now have a link to the terminator * Check all case sepecific rules * Missing check for branching into the middle of a case (#1618)	2018-06-13 15:04:47 -04:00
Alan Baker	4f866abfd8	Validate static uses of interfaces Fixes #1120 Checks that all static uses of the Input and Output variables are listed as interfaces in each corresponding entry point declaration. * Changed validation state to track interface lists * updated many tests * Modified validation state to store entry point names * Combined with interface list and called EntryPointDescription * Updated uses * Changed interface validation error messages to output entry point name in addtion to ID	2018-06-13 10:56:14 -04:00
David Neto	b49cbf09c2	Fix buffer read overrun in linker Fixes an ASAN failure. Was occuring when generating the OpModuleProcessed instruction declaring that this module was processed by the linker.	2018-06-13 10:18:04 -04:00
Steven Perron	1f7b1f1bf7	Small vector optimization for operands. We replace the std::vector in the Operand class by a new class that does a small size optimization. This helps improve compile time on Windows. Tested on three sets of shaders. Trying various values for the small vector. The optimal value for the operand class was 2. However, for the Instruction class, using an std::vector was optimal. Size of "0" means that an std::vector was used. Instruction size 0 4 8 Operand Size 0 489 544 684 1 593 487 2 469 570 4 473 8 505 This is a single thread run of ~120 shaders. For the multithreaded run the results were the similar. The basline time was ~62sec. The optimal configuration was an 2 for the OperandData and an std::vector for the OperandList with a compile time of ~38sec. Similar expiriments were done with other sets of shaders. The compile time still improved, but not as much. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1609.	2018-06-12 13:41:08 -04:00
David Neto	363bfca2ed	Operand lookup succeeds if it's enabled by a capability - Fix tests for basic group operations (e.g. Reduce) to allow for new capabilities in SPIR-V 1.3 that enable them. - Refactor operand capability check to avoid code duplication and to put all checks that don't need table lookup before any table lookup. - Test round trip assembly/disassembly support for extension SPV_NV_viewport_array2 - Test assembly and validation of decoration ViewportRelativeNV Fixes #1596	2018-06-11 19:27:52 -04:00
Alan Baker	06de86863b	Check for invalid branches into construct body. Fixes #1281 * New structured cfg check: all non-construct header blocks' predecessors must come from within the construct * New function to calculate blocks in a construct * Fixed a bug in BasicBlock type bitset Relaxing check to not consider unreachable predecessors * Fixing broken common uniform elim test	2018-06-11 19:23:44 -04:00
dan sinclair	63c9bba59d	[val] Output id names along with numbers in validate_id (#1601 ) This CL updates the validate_id code to output the name of the object along with the id number. There were a few instances which already output the name, this just extends to all of them. Now, the output should say 123[obj] instead of just 123. Issue #1581	2018-06-06 22:08:27 -04:00
dan sinclair	f2c93c6e12	Revert "Disallow array-of-arrays with DescriptorSets when validating. (#1586 )" (#1607 ) This reverts commit `e3f1f3bda5`.	2018-06-06 20:27:43 -04:00
dan sinclair	e3f1f3bda5	Disallow array-of-arrays with DescriptorSets when validating. (#1586 ) * Disallow array-of-arrays with DescriptorSets when validating. This CL adds validation to disallow using an array-of-arrays when attached to a DescriptorSet. Fixes #1522	2018-06-05 09:11:35 -04:00
Steven Perron	a1f9e1342e	Preserve inst-to-block and def-use in passes. The following passes are updated to preserve the inst-to-block and def-use analysies: private-to-local aggressive dead-code elimination dead branch elimination local-single-block elimination local-single-store elimination reduce load size compact ids (inst-to-block only) merge block dead-insert elimination ccp The one execption is that compact ids still kills the def-use manager. This is because it changes so many ids it is faster to kill and rebuild. Does everything in https://github.com/KhronosGroup/SPIRV-Tools/issues/1593 except for the changes to merge return.	2018-06-04 13:48:30 -04:00
Steven Perron	fe2fbee294	Delete the insert-extract-elim pass. Replaces anything that creates an insert-extract-elim pass and create a simplifiation pass instead. Then delete the implementation of the pass. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-06-01 10:13:39 -04:00
Steven Perron	9a008835f4	Add store for var initializer in inlining. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1591.	2018-06-01 09:44:42 -04:00
Alan Baker	badcf73d00	Allow duplicate pointer types Fixes #1577 * Remove validation requiring unique pointer types unless variable pointers extension enabled * Modified scalar replacement to always look for an undecorated pointer	2018-05-31 09:14:38 -04:00
Steven Perron	93c4c184d5	Handle types with self references. By using forward pointers, we are able to define a struct that has a pointer to itself. This could be directly or indirectly. The current implementation of the type manager did not handle this case. There are three changes that are made in this commit inorder to handle this case: 1) Change the handling of OpTypeForwardPointer The current handling of OpTypeForwardsPointer is broken if there is a reference to the pointer before the real definition. When build the type that contain the forward delared pointer, the type manager will ask for the type for that ID, and will get a nullptr because it does not exists. This nullptr is not handleded very well. The change is to keep track of the incomplete types the first time through all of the types. An incomplete type is a ForwardPointer or any type that references an incomplete type. Then we implement a second pass through the incomplete types that will complete them. 2) Hashing types. When hashing a type, we want to uses all of the subtypes as part of the hash. However, with types that reference them selves, this creates an infinite recursion. To get around this, we keep track of which types have been seen on the path from the root type. If we have see the current type already then we can stop the recursion. 3) Comparing types. In order to check if two types are the same, we must check that all of their subtypes are the same as well. This also causes an infinit recursion. The solution is to stop comparing the subtypes if we are trying to compare two pointer types that we are already in the middle of comparing. The ideas is that if the two pointer are different, then in progress compare will return false itself. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1578.	2018-05-30 15:48:38 -04:00
Steven Perron	745dd00af9	Fold FMix feeding Extract, and use the simplification pass. We add a new rule to the folding rules to fold an FMix feeding an extract when the alpha value for the element being extracted is either 0 or 1. In those case, we can simple extract from one of the operands to the FMix. With that change the simplification pass completely subsumes the insert-extract elimination pass. So we remove the insert-extract elimination passes and replce them with calls to the simplification pass. In a follow up PR, we should delete the insert-extract elimination pass. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-05-25 14:42:59 -04:00
Arseny Kapoulkine	f765d16bd9	Add external interface for creating a pass token Currently it's impossible for external code to register a pass because the only source file that can create pass tokens is optimizer.cpp. This makes it hard to add passes that can't be upstreamed since you can't run them from the usual pass sequence without reimplementing Optimizer. This change adds a PassToken constructor that takes unique_ptr to opt::Pass; if out-of-tree code implements opt::Pass it can register a custom pass without having to add it to SPIRV-Tools source code.	2018-05-25 09:19:43 -04:00
dan sinclair	0a14a1f748	Validate that only a single OpMemoryModel is provided. This CL adds validation that only a single OpMemoryModel is provided in the SPIR-V binary. Fixes #1574	2018-05-24 08:43:14 -04:00
dan sinclair	3b87dac56b	Validate presence of OpMemoryModel. According to the SPIR-V Spec, section 2.4 Logical Layout of a Module there should be a single required OpMemoryModel instruction provided. This CL adds validation that OpMemoryModel is provided to the SPIR-V validator. Fixes #1207	2018-05-23 08:17:39 -04:00
Steven Perron	a579e720a8	Remove the limit on struct size in SROA. Removes the limit on scalar replacement for the lagalization passes. This is done by adding an option to the pass (and command line option) to set the limit on maximum size of the composite that scalar replacement is willing to divide. Fixes #1494.	2018-05-18 10:03:46 -04:00
Steven Perron	f1f7cc870e	Get ADCE to handle OpCopyMemory ADCE does not treat OpCopyMemory as an instruction that references memory. Because of that stores are removed that should not be. This change teaches ADCE that OpCopyMemory and OpCopyMemorySize both loads from and stores to memory. This will keep other stores live when needed, and will allows ADCE to remove OpCopyMemory instructions as well. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1556.	2018-05-16 13:50:47 -04:00
Lei Zhang	b09e3ce842	Allow ViewportIndex & Layer to be used in VS/DS with extension SPV_EXT_shader_viewport_index_layer enables using ViewportIndex and Layer in vertex and tessellation shaders. Also, as per the Vulkan spec: > The ViewportIndex decoration must be used only within vertex, > tessellation evaluation, geometry, and fragment shaders. > In a vertex, tessellation evaluation, or geometry shader, any > variable decorated with ViewportIndex must be declared using > the Output storage class. > In a fragment shader, any variable decorated with ViewportIndex > must be declared using the Input storage class. Similarly for Layer.	2018-05-16 13:16:27 -04:00
Steven Perron	9b1a938ea1	SROA: Only create symbols that are loaded. Currently in scalar replacement, we create a new variable for every memeber of the composite being divided. It is often overkill, because not all of those members will be used. This change will check which elements are used and only create variable for the members that are used. This reduces the compile time for one set of shader from 248s to 165s. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/1494.	2018-05-16 10:48:25 -04:00
Steven Perron	0e1b7e5aef	Fix getting operand without checking opcode. Fixes https://github.com/KhronosGhttps://github.com/KhronosGroup/SPIRV-Tools/issues/1559roup/SPIRV-Tools/issues/1559. There is an load of an operand of an instruction that was suppose to be only for the OpCompositeExtract case. However, an error caused it to be loaded for every opcode, even those that do not have an operand in that position. We fix up that bug, and a couple other things noticed that the same time.	2018-05-16 09:34:43 -04:00
Lei Zhang	efcc33e8a9	Support SpvOpExecutionModeId in SPIR-V logical layout	2018-05-16 08:43:50 -04:00
Steven Perron	f46f2d3e5d	Remove redundant stores. The code patterns generated by DXC around function calls can cause many store to be storing the same value that was just loaded from the same location: ``` %10 = OpLoad %type %var OpStore %var %10 ``` We want to clean these up very early on because they can cause other transformations to do a lot of work. For the cases I see, they can be removed during local-single-block-elim. For one set of shaders the compile time goes from 248s to 182s. A 26% improvement. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/1494.	2018-05-15 10:24:05 -04:00
Steven Perron	af430ec822	Add pass to fold a load feeding an extract. We have already disabled common uniform elimination because it created sequences of loads an entire uniform object, then we extract just a single element. This caused problems in some drivers, and is just generally slow because it loads more memory than needed. However, there are other way to get into this situation, so I've added a pass that looks specifically for this pattern and removes it when only a portion of the load is used. Fixes #1547.	2018-05-14 15:40:34 -04:00
Steven Perron	804e8884c4	Fold fclamp feeding compare. An FClamp instruction forces a values to be within a certain interval. When the upper or lower bound of the FClamp is a constant and the value being compared with is a constant, then in some case we can fold the compared because the entire range is say less than the value. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1549.	2018-05-14 10:27:49 -04:00
Steven Perron	9ec3f81e5c	Remove dead Workgroup variables in ADCE. If there is a shader with a variable in the workgroup storage class that is stored to, but not loadeds, then we know nothing will read those loads. It should be safe to remove them. This is implemented in ADCE by treating workgroup variables the same way that private variables are treated. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1550.	2018-05-09 16:07:26 -04:00
Steven Perron	0856997df6	Allow ADCE to remove more instructions. At this time, DCE will only remove an instruction if it is a combinator. However, there are certain non-combinator instructions that can be safely removed if their results are not used. The derivative instructions are on example. We are also missing some instructions from the list of combinators those are added as the same time.	2018-05-05 09:15:28 -04:00
Steven Perron	7d01643132	Allow hoisting code in if-conversion. When doing if-conversion, we do not currently move code out of the side nodes. The reason for this is that it can increase the number of instructions that get executed because both side nods will have to be executed now. In this commit, we add code to move an instruction, and all of the instructions it depends on, out of a side node and into the header of the selection construct. However to keep the cost down, we only do it when the two values in the OpPhi node compute the same value. This way we have to move only one of the instructions and the other becomes unused most of the time. So no real extra cost. Makes the value number table an alalysis in the ir context. Added more opcodes to list of code motion safe opcodes. Fixes #1526.	2018-05-04 12:56:29 -04:00
Stephen McGroarty	1c2cbaf569	Add GetContinueBlock to loop class. Previously, the loop class used the terms latch and continue block interchangeably. This patch splits the two and corrects and tests some uses of the old uses of GetLatchBlock.	2018-05-03 14:30:41 -04:00
Steven Perron	70bb3c1cc2	Fold divide and multiply by same value. We want to fold code like (x*y)/x and other permutations of this. Fixes #1531.	2018-05-02 10:18:37 -04:00
Toomas Remmelg	1dc2458060	Add a loop fusion pass. This pass will look for adjacent loops that are compatible and legal to be fused. Loops are compatible if: - they both have one induction variable - they have the same upper and lower bounds - same initial value - same condition - they have the same update step - they are adjacent - there are no break/continue in either of them Fusion is legal if: - fused loops do not have any dependencies with dependence distance greater than 0 that did not exist in the original loops. - there are no function calls in the loops (could have side-effects) - there are no barriers in the loops It will fuse all such loops as long as the number of registers used for the fused loop stays under the threshold defined by max_registers_per_loop.	2018-05-01 15:40:37 -04:00
Stephen McGroarty	9a5dd6fe88	Support loop fission. Adds support for spliting loops whose register pressure exceeds a user provided level. This pass will split a loop into two or more loops given that the loop is a top level loop and that spliting the loop is legal. Control flow is left intact for dead code elimination to remove. This pass is enabled with the --loop-fission flag to spirv-opt.	2018-05-01 15:15:10 -04:00
Steven Perron	9ba0879ddf	Improve Vector DCE Track live scalars in VDCE as if they were single element vectors. Handle the extended instructions for GLSL in VDCE. Handle composite construct instructions in VDCE.	2018-04-30 11:55:50 -04:00
Steven Perron	a00a0a09ae	Revert "Improvements to vector dce." This reverts commit `2813722993`. A regression was found. Undoing the change until it is fixed.	2018-04-27 10:33:19 -04:00
Alan Baker	4246abdc74	Fixes handling of kill and unreachable ops in inlining. Fixes #1527 * Adds handling for copying OpKill and OpUnreachable and forces the generation of a new basic block * Adds tests to check	2018-04-27 09:42:37 -04:00
Steven Perron	e1bcd2b2d8	Fold OpVectorTimesScalar and OpPhi better. If one of the operands to an OpVectorTimesScalar instruction is zero, then the result will be the 0 vector. Currently we do not fold the insturction unless both operands are constants. This change fixes that. We also allow folding of OpPhi instructions where the incoming values are either an OpUndef or the OpPhi instruction itself. As with other cases, this can be simplified to the OpUndef.	2018-04-26 12:41:16 -04:00
Steven Perron	2813722993	Improvements to vector dce. Track live scalars in VDCE as if they were single element vectors. Handle the extended instructions for GLSL in VDCE. Handle composite construct instructions in VDCE. Fixes #1511.	2018-04-26 11:07:48 -04:00
Cort Stratton	72524db2de	Fixes #1521 : PadToWord() should use std::move() in && variant	2018-04-25 22:03:14 -04:00
Greg Fischer	268be6143d	LocalSingleBlockElim: Add store-store elimination Eliminate unused store to variable if followed by store to same variable in same block. Most significantly, this cleans up stores made unused by this pass. These useless stores can inhibit subsequent optimizations, specifically LocalSingleStoreElim. Eliminating them makes subsequent optimization more effective. The main effect of this pass is to simplify the work done by the SSA rewriter. It catches many local loads/stores that help speeding up the work done by the main rewriter.	2018-04-25 10:30:18 -04:00
Steven Perron	ee8cd5c847	Add Dead insert elmination back in.	2018-04-24 10:10:30 -04:00
Steven Perron	2c0ce87210	Vector DCE (#1512 ) Introduce a pass that does a DCE type analysis for vector elements instead of the whole vector as a single element. It will then rewrite instructions that are not used with something else. For example, an instruction whose value are not used, even though it is referenced, is replaced with an OpUndef.	2018-04-23 11:13:07 -04:00
Victor Lomuller	efc5061929	Dominator analysis interface clean. Remove the CFG requirement when querying a dominator/post-dominator from an IRContext. Updated all uses of the function and tests.	2018-04-20 15:41:59 -04:00
Jaebaek Seo	48802bad72	Constant folding for OpVectorTimesScalar	2018-04-20 13:43:04 -04:00
Victor Lomuller	0ec08c28c1	Add register liveness analysis. For each function, the analysis determine which SSA registers are live at the beginning of each basic block and which one are killed at the end of the basic block. It also includes utilities to simulate the register pressure for loop fusion and fission. The implementation is based on the paper "A non-iterative data-flow algorithm for computing liveness sets in strict ssa programs" from Boissinot et al.	2018-04-20 09:45:15 -04:00
Alan Baker	09c206b6fb	Fixes #1480 . Validate group non-uniform scopes. * Adds new pass for validating non-uniform group instructions * Currently on checks execution scope for Vulkan 1.1 and SPIR-V 1.3 * Added test framework	2018-04-20 09:25:00 -04:00
David Neto	e7c2e91ded	Fix for old XCode: std::set has explicit ctor	2018-04-19 16:33:12 -04:00
Greg Fischer	df7f00f60e	DeadInsertElim: Don't revisit select phi nodes during MarkInsertChain Fixes #1487.	2018-04-19 14:40:00 -04:00
Jaebaek Seo	430a29335e	Fix broken pointer of CommonUniformElimPass	2018-04-19 09:36:10 -04:00
Steven Perron	c20a718e00	Rewrite local-single-store-elim to not create large data structures. The local-single-store-elim algorithm is not fundamentally bad. However, when there are a large number of variables, some of the maps that are used can become very large. These large data structures then take a very long time to be destroyed. I've seen cases around 40% if the time. I've rewritten that algorithm to not use as much memory. This give a significant improvement when running a large number of shader through DXC. I've also made a small change to local-single-block-elim to delete the loads that is has replaced. That way local-single-store-elim will not have to look at those. local-single-store-elim now does the same thing. The time for one set goes from 309s down to 126s. For another set, the time goes from 102s down to 88s.	2018-04-18 16:38:18 -04:00
Jaebaek Seo	0fa42996b5	Merge pull request #1461 from jaebaek/fnegate Add constant folding for OpFNegate Contributes to #709	2018-04-18 13:46:10 -04:00
Toomas Remmelg	0f335cf87e	Add support for MIV and Delta test dependence analysis. GCD MIV test as described in Chapter 3 of "Optimizing Compilers for Modern Architectures: A Dependence-Based Approach" by Randy Allen, and Ken Kennedy. Delta test as described in Figure 3 of "Practical Dependence Testing" by Gina Goff, Ken Kennedy, and Chau-Wen Tseng from PLDI '91.	2018-04-17 13:57:02 -04:00
Jaebaek Seo	d8b9306a4f	Add more unit tests	2018-04-17 12:08:45 -04:00
Jaebaek Seo	79491259e0	Add constant folding for FNegate	2018-04-17 12:08:45 -04:00
Alan Baker	38359ba800	Fixes #1483 . Validating Vulkan 1.1 barrier execution scopes * Reworked how execution model limitations are checked * Now OpFunction checks which entry points call it and checks its registered limitations instead of building a call stack in the entry point * New tests * Moving function to entry point mapping into VState	2018-04-17 10:26:38 -04:00
David Neto	152b9a681e	ADCE: Remove OpDecorateStringGOOGLE Also fix a few failures to set "modified" status when removing global values. Add OpDecorateStringGOOGLE to decoration ordering Fixes #1492	2018-04-17 10:24:30 -04:00
Alan Baker	0e80b86dbe	Fixes #1472 . Per-vertex variable validation fixes. Relaxs checks for per-vertex builtin variables. If the builtin decoration is applied to a variable, then those checks now allow a level of arraying on the variable before checking the type consistency. * Allows arrays of variables to be present for the per-vertex variables: * Position * PointSize * ClipDistance * CullDistance * Updated tests	2018-04-16 12:58:35 -04:00
Rex Xu	7fe186476a	Fix validation issues relevant to SPV_AMD_gpu_shader_int16. Frexp/FrexpStruct allows exp to be either 16-bit or 32 bit integer if SPV_AMD_gpu_shader_int16 is enabled.	2018-04-16 10:49:01 -04:00
David Neto	e8814be732	Add validator test for OpBranch Add test for case where OpBranch branches to a value (a function value). Previous tests only checked a label value (name of a block.). Update validate_id.cpp to remove the TODO for OpBranch and say that it is already checked in validate_cfg.cpp	2018-04-16 10:27:51 -04:00
Steven Perron	d42f65e7c1	Use a bit vector in ADCE The unordered_set in ADCE that holds all of the live instructions takes a very long time to be destroyed. In some shaders, it takes over 40% of the time. If we look at the unique ids of the live instructions, I believe they are dense enough make a simple bit vector a good choice for to hold that data. When I check the density of the bit vector for larger shaders, we are usually using less than 4 bytes per element in the vector, and almost always less than 16. So, in this commit, I introduce a simple bit vector class, and use it in ADCE. This help improve the compile time for some shaders on windows by the 40% mentioned above. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1328.	2018-04-13 16:38:02 -04:00
Steven Perron	8190c26270	Change parameter to Mempass::RemovePhiOperands Pass a hashtable by const ref instead of by value. Big impact on compile time.	2018-04-13 09:53:37 -04:00
Alan Baker	e805d1f8d7	Fixes #1469 . Allow subgroup memory scope for Vulkan 1.1 * New error that prevents CrossDevice memory scope for all vulkan * Old error specifically references Vulkan 1.0 * New tests	2018-04-12 13:16:04 -04:00
Alan Baker	c522b697bf	Fixes #1470 . Don't restrict WGS storage class * Removed restriction that workgroup size can only be on Input storage class * added test	2018-04-12 09:22:34 -04:00
Steven Perron	bc648fd76a	Delete unused code in MemPass Since the SSA rewriter was added, the code old phi insertion code is no longer used. It is going stale and should be deleted.	2018-04-11 15:40:33 -04:00
Steven Perron	c584ac4fc6	Don't allow an instance of a pass to be run multiple times.	2018-04-11 12:02:30 -04:00
Victor Lomuller	10e5d7cf13	Add a loop peeling pass. For each loop in a function, the pass walks the loops from inner to outer most loop and tries to peel loop for which a certain amount of iteration can be done before or after the loop. To limit code growth, peeling will not happen if the growth in code size goes above a configurable threshold.	2018-04-11 15:41:29 +01:00
Alexander Johnston	61b50b3bfa	ZIV and SIV loop dependence analysis. Provides functionality to perform ZIV and SIV dependency analysis tests between a load and store within the same loop. Dependency tests rely on scalar analysis to prove and disprove dependencies with regard to the loop being analysed. Based on the 1990 paper Practical Dependence Testing by Goff, Kennedy, Tseng Adds support for marking loops in the loop nest as IRRELEVANT. Loops are marked IRRELEVANT if the analysed instructions contain no induction variables for the loops, i.e. the loops induction variable is not relevent to the dependence of the store and load.	2018-04-11 09:32:42 -04:00
Steven Perron	53bc1623ec	Fold OpDot Adding three rules to fold OpDot (implemented as two). - When an OpDot has two constants, then fold to the resulting const. - When one of the inputs is the 0 vector, then fold to zero. - When one of the inputs is a single 1 with 0s, then rewrite to an OpCompositeExtract of the appropriate element. This will help find even more folding opportunities. Contributes to #709.	2018-04-10 13:09:37 -04:00
Alan Baker	42840d15e4	Fixes #1433 . Validate binary version * Validates SPIR-V binary version against target environment	2018-04-06 22:41:50 -04:00
Lei Zhang	26a698c347	Fix PrimitiveId builtin check for Vulkan According to Vulkan spec 1.1.72: > The PrimitiveId decoration must be used only within fragment, > tessellation control, tessellation evaluation, and geometry shaders. > In a tessellation control or tessellation evaluation shader, any > variable decorated with PrimitiveId must be declared using the Input > storage class. We were enforcing that PrimitiveId can only be used with Output storage class for TCS and TES before.	2018-04-06 22:38:32 -04:00
David Neto	a91cbfbf75	Optimizer: update extension whitelists Add two new extensions: - SPV_NV_shader_subgroup_partitioned - SPV_EXT_descriptor_indexing	2018-04-06 15:56:20 -04:00
GregF	6fbfe1c016	Fix SSA rewrite for nested loops. From the test case, the slice of the CFG that is interesting for the bug is 25 \| v 30 \| v 31<-+ \| \| v \| 34--+ 1. In block 25, we have a Phi candidate for %f with arguments %47 = Phi[%float_0, %0]. This merges %float_0 and a yet unknown argument from the external loop backedge. 2. We are now processing block 34: i. The load %35 = OpLoad %f triggers a Phi candidate to be placed in block 31. ii. The Phi candidate %50 = Phi needs two arguments. The one coming from block 30 is %47. But the one coming from block 34 (which we are now processing and have marked sealed), finds %50 itself as the reaching def for %f. 3. This wrongfully marks %50 as a copy-of Phi, which ultimately makes both %47 and %50 copy-of Phis that get eliminated.	2018-04-06 15:17:52 -04:00
Alan Baker	e66e305b46	Re-enabled checks for UConvert	2018-04-06 10:51:57 -04:00
Pierre Moreau	caf7da87e1	linker: Properly remove FuncParamAttr from imported symbols Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/898	2018-04-06 09:55:54 -04:00
Lei Zhang	43ca2112b8	Stop asking for extensions if feature avaiable in core SPIR-V Migrating to unified grammar means we sometimes have two fields for a certain feature: version and extensions. It means the feature in question can be used either in SPIR-V of advanced-enough versions or in any SPIR-V with with the specified extensions. Validator now respects the above rules.	2018-04-05 15:14:07 -04:00
Andrey Tuganov	d7fff408e3	Fix bug validate_builtins (additional def checks) At every definition of a builtin id, run at-reference-check rules on the defining instruction as well. Previosly the validation was missing the case when invalid storage class was defined in the instruction which defines the built-in, and not in the instruction which references the built-in.	2018-04-05 13:55:18 -04:00
Andrey Tuganov	691eed92cb	Fix major bug in validate_builtins Fixed an early return in the loop, resulting in only one decoration being checked.	2018-04-05 13:45:45 -04:00
Andrey Tuganov	da332cf332	Execution mode/model available in validation state Refactored validate built-ins to make GetExecutionModels(entry_point) and GetExecutionModes(entry_point) available in validation state. Entry points are allowed to have multiple execution modes and execution models. Finished the last missing feature in Vulkan built-ins validation: FragDepth requires DepthReplacing.	2018-04-05 11:55:42 -04:00
Steven Perron	742454968d	OpName and decorations should not stop array copy prop.	2018-04-04 22:24:10 -04:00
Steven Perron	7c5d49bf2a	Teach ADCE about OpImageTexelPointer Currently OpImageTexelPointer operations are treat like a use of the pointer, but it does not look for the memory being referenced to make sure stores are not removed. This change teaches it so identify the memory being accessed, and treats it as if that memory is loaded. Fixes to #1445.	2018-04-04 13:45:29 -04:00
Steven Perron	c33af63264	Teach array copy propagation about OpImageTexelPointer. OpImageTexelPointer acts like a special kind of load. It is not an array load, but it also cannot be removed the same way a regular load can. The type of propagation that needs to be done is similar to what we do for arrays, so I want to merge that code into that optmization. Contributers to #1445.	2018-04-04 13:42:51 -04:00
Steven Perron	e64a4656b3	Teach the private to local about OpImageTexelPointer. OpImageTexelPointer acts like a special kind of load. It is still safe to change the storage class of a variable used in a OpImageTexalPointer instruction. Contributes to #1445.	2018-04-04 13:42:35 -04:00
Neil Roberts	57a2441791	hex_float: Use max_digits10 for the float precision CPPreference.com has this description of digits10: “The value of std::numeric_limits<T>::digits10 is the number of base-10 digits that can be represented by the type T without change, that is, any number with this many significant decimal digits can be converted to a value of type T and back to decimal form, without change due to rounding or overflow.” This means that any number with this many digits can be represented accurately in the corresponding type. A change in any digit in a number after that may or may not cause it a different bitwise representation. Therefore this isn’t necessarily enough precision to accurately represent the value in text. Instead we need max_digits10 which has the following description: “The value of std::numeric_limits<T>::max_digits10 is the number of base-10 digits that are necessary to uniquely represent all distinct values of the type T, such as necessary for serialization/deserialization to text.” The patch includes a test case in hex_float_test which tries to do a round-robin conversion of a number that requires more than 6 decimal places to be accurately represented. This would fail without the patch. Sadly this also breaks a bunch of other tests. Some of the tests in hex_float_test use ldexp and then compare it with a value which is not the same as the one returned by ldexp but instead is the value rounded to 6 decimals. Others use values that are not evenly representable as a binary floating fraction but then happened to generate the same value when rounded to 6 decimals. Where the actual value didn’t seem to matter these have been changed with different values that can be represented as a binary fraction.	2018-04-03 12:53:10 -04:00
James Jones	6dd5e955f5	Add missing function parameters in libspirv.h When building C code with gcc and the -Wstrict-prototypes option, function declarations and definitions that don't specify their argument types generate warnings. Functions that don't take parameters need to specify (void) as their parameter list, rather than leaving it empty. Note this only applies to C, so only the functions exported in C-compatible headers need fixing. In C++ functions can't be declared/defined without a parameter list, so C++ can safely allow an empty parameter list to imply (void).	2018-04-03 10:10:43 -04:00
Lei Zhang	fc9f621e8b	Add missing <iterator> header for std::back_inserter	2018-03-30 11:30:25 -04:00
Lei Zhang	ddbaf32460	Use standard SPIR-V version scheme for version requirement Previously we use symbols in spv_target_env as the minimum version requirements for features. That makes version check implicitly relies on the order of entries in the spv_target_env enum, which also contains client APIs. Instead, we should use the standard scheme for constructing SPIR-V version; and by doing that we can also map client API entries to universial SPIR-V versions.	2018-03-29 12:06:54 -04:00
Steven Perron	cbceeceab4	In copy-prop-arrays, indentify copies via OpCompositeInsert When the original code copies an entire array or struct one element at a time, this turns into a series of OpCompositeInsert instruction followed by a store of the whole array. We currently miss opportunities in copy propagate arrays because we do not recognize this as a copy. This commit adds code to copy propagate arrays to identify this code pattern. Also updates the performance passed to run array copy propagation.	2018-03-29 09:39:55 -04:00
Steven Perron	d8ca09821d	Handle non-constant accesses in memory objects (copy prop arrays) The first implementation of MemroyObject, which is used in copy propagate arrays, forced the access chain to be like the access chains in OpCompositeExtract. This excluded the possibility of the memory object from representing an array element that was extracted with a variable index. Looking at the code, that restriction is not neccessary. I also see some opportunities for doing this in some real shaders. Contributes to #1430.	2018-03-28 20:23:47 -04:00
Stephen McGroarty	ad7e4b8401	Initial patch for scalar evolution analysis This patch adds support for the analysis of scalars in loops. It works by traversing the defuse chain to build a DAG of scalar operations and then simplifies the DAG by folding constants and grouping like terms. It represents induction variables as recurrent expressions with respect to a given loop and can simplify DAGs containing recurrent expression by rewritting the entire DAG to be a recurrent expression with respect to the same loop.	2018-03-28 16:34:23 -04:00
Steven Perron	c26866ee74	Preserve analyses after copy propagate arrays Contributes to #1430.	2018-03-28 10:38:52 -04:00
Alan Baker	0a2ee65f57	Fixes #1403 . Don't validate composite insert, extract and construct instructions against spec constant sized arrays. * Added predicate for spec constant opcodes * Added tests	2018-03-28 09:04:08 -04:00
Alan Baker	97c8fdccd2	Adding OpPhi validation rules. * Added tests * Fixes SSA check for unreachable phi parents * Fixes invalid cfg cleanup test	2018-03-27 17:26:26 -04:00
Andrey Tuganov	95843d7bd0	New spirv-1.3 rules for control barrier Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1427 Adjusting validation to the new rule: "Before version 1.3, it is only valid to use this instruction with TessellationControl, GLCompute, or Kernel execution models. There is no such restriction starting with version 1.3." Also fixed wrong version numbers in source/spirv_target_env.cpp.	2018-03-27 12:29:50 -04:00
Steven Perron	5e07ab1358	Handle more cases in copy propagate arrays. When we change the type of an object that gets stored, we do not want to change the type of the memory location being stored to. In order to still be able to do the rewrite, we will decompose and rebuild the object so it is the type that can be stored. Fixes #1416.	2018-03-27 11:04:49 -04:00
Steven Perron	c4dc046399	Copy propagate arrays The sprir-v generated from HLSL code contain many copyies of very large arrays. Not only are these time consumming, but they also cause problems for drivers because they require too much space. To work around this, we will implement an array copy propagation. Note that we will not implement a complete array data flow analysis in order to implement this. We will be looking for very simple cases: 1) The source must never be stored to. 2) The target must be stored to exactly once. 3) The store to the target must be a store to the entire array, and be a copy of the entire source. 4) All loads of the target must be dominated by the store. The hard part is keeping all of the types correct. We do not want to have to do too large a search to update everything, which may not be possible, do we give up if we see any instruction that might be hard to update. Also in types.h, the element decorations are not stored in an std::map. This change was done so the hashing algorithm for a Struct is consistent. With the std::unordered_map, the traversal order was non-deterministic leading to the same type getting hashed to different values. See \|Struct::GetExtraHashWords\|. Contributes to #1416.	2018-03-26 14:44:41 -04:00
Andrey Tuganov	9cf87ecbc8	Add Vulkan specific atomic result type restriction Atomic instructions must declare a scalar 32-bit integer type for the “Result Type”.	2018-03-26 12:06:25 -04:00
Andrey Tuganov	fe9121f721	Add Vulkan validation rules for BuiltIn variables Added a framework for validation of BuiltIn variables. The framework allows implementation of flexible abstract rules which are required for built-ins as the information (decoration, definition, reference) is not in one place, but is scattered all over the module. Validation rules are implemented as a map id -> list<functor(instrution)> Ids which are dependent on built-in types or objects receive a task list, such as "this id cannot be referenced from function which is called from entry point with execution model X; propagate this rule to your descendants in the global scope". Also refactored test/val/val_fixtures. All built-ins covered by tests	2018-03-23 14:02:42 -04:00
Eleni Maria Stea	045cc8f75b	Fixes compile errors generated with -Wpedantic This patch fixes the compile errors generated when the options SPIRV_WARN_EVERYTHING and SPIRV_WERROR (that force -Wpedantic) are set to cmake.	2018-03-22 09:40:11 -04:00
Steven Perron	dbb35c4260	Fixed remaining review comments from #1380	2018-03-21 16:47:01 -04:00
Diego Novillo	2e644e4578	Fix VS2013 build failures.	2018-03-20 21:44:17 -04:00
Jaebaek Seo	3b594e1630	Add --time-report to spirv-opt This patch adds a new option --time-report to spirv-opt. For each pass executed by spirv-opt, the flag prints resource utilization for the pass (CPU time, wall time, RSS and page faults) This fixes issue #1378	2018-03-20 21:30:06 -04:00
Diego Novillo	735d8a579e	SSA rewrite pass. This pass replaces the load/store elimination passes. It implements the SSA re-writing algorithm proposed in Simple and Efficient Construction of Static Single Assignment Form. Braun M., Buchwald S., Hack S., Leißa R., Mallon C., Zwinkau A. (2013) In: Jhala R., De Bosschere K. (eds) Compiler Construction. CC 2013. Lecture Notes in Computer Science, vol 7791. Springer, Berlin, Heidelberg https://link.springer.com/chapter/10.1007/978-3-642-37051-9_6 In contrast to common eager algorithms based on dominance and dominance frontier information, this algorithm works backwards from load operations. When a target variable is loaded, it queries the variable's reaching definition. If the reaching definition is unknown at the current location, it searches backwards in the CFG, inserting Phi instructions at join points in the CFG along the way until it finds the desired store instruction. The algorithm avoids repeated lookups using memoization. For reducible CFGs, which are a superset of the structured CFGs in SPIRV, this algorithm is proven to produce minimal SSA. That is, it inserts the minimal number of Phi instructions required to ensure the SSA property, but some Phi instructions may be dead (https://en.wikipedia.org/wiki/Static_single_assignment_form).	2018-03-20 20:56:55 -04:00
Victor Lomuller	bdf421cf40	Add loop peeling utility The loop peeler util takes a loop as input and create a new one before. The iterator of the duplicated loop then set to accommodate the number of iteration required for the peeling. The loop peeling pass that decided to do the peeling and profitability analysis is left for a follow-up PR.	2018-03-20 10:21:10 -04:00
Steven Perron	b3daa93b46	Change merge return pass to handle structured cfg. We are seeing shaders that have multiple returns in a functions. These functions must get inlined for legalization purposes; however, the inliner does not know how to inline functions that have multiple returns. The solution we will go with it to improve the merge return pass to handle structured control flow. Note that the merge return pass will assume the cfg has been cleanedup by dead branch elimination. Fixes #857.	2018-03-19 13:49:04 -04:00
Lei Zhang	1ef6b19260	Migrate to use unified grammar tables Previously we keep a separate static grammar table for opcodes/ operands per SPIR-V version. This commit changes that to use a single unified static grammar table for opcodes/operands. This essentially changes how grammar facts are queried against a certain target environment. There are only limited filtering according to the desired target environment; a symbol is considered as available as long as: 1. The target environment satisfies the minimal requirement of the symbol; or 2. There is at least one extension enabling this symbol. Note that the second rule assumes the extension enabling the symbol is indeed requested in the SPIR-V code; checking that should be the validator's work. Also fixed a few grammar related issues: * Rounding mode capability requirements are moved to client APIs. * Reserved symbols not available in any extension is no longer recognized by assembler.	2018-03-17 15:25:26 -04:00
David Neto	844e186cf7	Add --strip-reflect pass Strips reflection info. This is limited to decorations and decoration instructions related to the SPV_GOOGLE_hlsl_functionality1 extension. It will remove the OpExtension for SPV_GOOGLE_hlsl_functionality1. It will also remove the OpExtension for SPV_GOOGLE_decorate_string if there are no further remaining uses of OpDecorateStringGOOGLE. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1398	2018-03-15 21:20:42 -04:00
David Neto	2e3aec23ca	Add recent Google extensions to optimizer whitelists Optimizations should work in the presence of recent SPV_GOOGLE_decorate_string and SPV_GOOGLE_hlsl_functionality1 SPV_GOOGLE_decorate_string: - Adds operation OpDecorateStringGOOGLE to decorate an object with decorations having string operands. SPV_GOOGLE_hlsl_functionality1: - Adds HlslSemanticGOOGLE, used to decorate an interface variable with an HLSL semantic string. Optimizations already preserve those variables as required because they are interface variables (with uses), independent of whether they have HLSL decorations. - Adds HlslCounterBufferGOOGLE, used to associate a buffer with a counter variable. Fixes #1391	2018-03-15 11:16:20 -04:00
Alan Baker	9f3a1c85cc	NFC: Speed up dead insert phi traversal on Windows.	2018-03-14 17:45:47 -04:00
David Neto	884933366b	Teach DecorationManager about OpDecorateStringGOOGLE Also add more decoration manager test coverage for OpDecorateId. Fixes #1396	2018-03-13 22:18:33 -04:00
Alan Baker	7e03e76a5f	Fixes #1402 . Don't merge non-branch terminators into loop header. Added tests	2018-03-13 22:16:17 -04:00
Alan Baker	43d1609183	Fixes #1407 . Removing assertion against void pointer Added test	2018-03-13 19:45:20 -04:00
Alan Baker	4065adf05d	Fixes #1404 . Don't DCE workgroup size Added test.	2018-03-13 19:38:31 -04:00
Greg Fischer	077249b67f	Fix InsertFeedingExtract rule when extract remains.	2018-03-12 22:06:23 -04:00
Pierre Moreau	5bd55f10cd	Reimplement the DecorationManager This reimplementation fixes several issues when removing decorations associated to an ID (partially addresses #1174 and gives tools for fixing #898), as well as making it easier to remove groups; a few additional tests have been added. DecorationManager::RemoveDecoration() will still not delete dead decorations it created, but I do not think it is its job either; given the following input ``` OpCapability Shader OpCapability Linkage OpMemoryModel Logical GLSL450 OpDecorate %2 Restrict %2 = OpDecorationGroup OpGroupDecorate %2 %1 %3 OpDecorate %4 Invariant %4 = OpDecorationGroup OpGroupDecorate %4 %2 %uint = OpTypeInt 32 0 %1 = OpVariable %uint Uniform %3 = OpVariable %uint Uniform ``` which of the following two outputs would you expect RemoveDecoration(2) to produce: ``` OpCapability Shader OpCapability Linkage OpMemoryModel Logical GLSL450 %uint = OpTypeInt 32 0 %1 = OpVariable %uint Uniform %3 = OpVariable %uint Uniform ``` or ``` OpCapability Shader OpCapability Linkage OpMemoryModel Logical GLSL450 OpDecorate %4 Invariant %4 = OpDecorationGroup %uint = OpTypeInt 32 0 %1 = OpVariable %uint Uniform %3 = OpVariable %uint Uniform ``` Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/924 Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1174	2018-03-12 09:56:14 -04:00
David Neto	340370eddb	Remove extension whitelist from some transforms Remove extension whitelists from transforms that are essentially combinatorial (and avoiding pointers) or which affect only control flow. It's very very unlikely an extension will add a new control flow construct. Remove from: - dead branch elimination - dead insertion elimination - insert extract elimination - block merge Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1392	2018-03-08 12:25:49 -05:00
Rex Xu	314cfa29b2	Add missing SPV extension strings	2018-03-08 21:54:00 +08:00
Alan Baker	bc9cfee6fa	Fixes #1385 . Grab correct input to calculate indices. * Added tests to catch the bug	2018-03-07 16:07:40 -05:00
Andrey Tuganov	03b8a3fe54	AMD_gpu_shader_half_float enables float16 Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1375 Hardcoded float16 feature enabling if extension SPV_AMD_gpu_shader_half_float is present.	2018-03-07 11:07:58 -05:00
David Neto	00fa39318f	Support SPIR-V 1.3 and Vulkan 1.1 The default target is SPIR-V 1.3. For example, spirv-as will generate a SPIR-V 1.3 binary by default. Use command line option "--target-env spv1.0" if you want to make a SPIR-V 1.0 binary or validate against SPIR-V 1.0 rules. Example: # Generate a SPIR-V 1.0 binary instead of SPIR-V 1.3 spirv-as --target-env spv1.0 a.spvasm -o a.spv spirv-as --target-env vulkan1.0 a.spvasm -o a.spv # Validate as SPIR-V 1.0. spirv-val --target-env spv1.0 a.spv # Validate as Vulkan 1.0 spirv-val --target-env vulkan1.0 a.spv	2018-03-06 15:17:31 -05:00
Alan Baker	5f50e6209c	Fixes #1376 . Don't handle half folding gracefully. * Added early returns to folding rules to prevent half attempts * Added some tests	2018-03-06 14:00:02 -05:00
David Neto	5f69f75126	Support SPV_GOOGLE_decorate_string and SPV_GOOGLE_hlsl_functionality1 This commit add assembling, disassembling, and basic validation for two Google extensions to better support HLSL translation.	2018-03-05 13:34:13 -05:00
Steven Perron	9ba50e34f2	Avoid generating duplicate names when merging types The merging types we do not remove other information related to the types. We simply leave it duplicated, and hope it is removed later. This is what happens with decorations. They are removed in the next phase of remove duplicates. However, for OpNames that is not the case. We end up with two different names for the same id, which does not make sense. The solution is to remove the names and decorations for the type being removed instead of rewriting them to refer to the other type. Note that it is possible that if the first type does not have a name, then the types will end up with no name. That is fine because the names should not have any semantic significance anyway. The was identified in issue #1372, but this does not fix that issue.	2018-03-05 12:02:50 -05:00
Alan Baker	52bceb3569	Handles more cases of redundant selects * Handles OpConstantNull and vector types * vector selects (except against a null) are converted to vector shuffles * Added tests	2018-03-02 14:28:08 -05:00
Alan Baker	824625760b	Fixes #1361 . Mark all non-constant global values as varying in CCP * Also mark function parameters as varying * Conservatively mark assignment instructions as varying if any input is varying after attempting to fold * Added a test to catch this case	2018-03-01 15:24:41 -05:00
Arseny Kapoulkine	8b27ba834d	Vulkan BuiltIn variables can't have Location/Component decorations As per Vulkan spec, BuiltIn variables can't have Location or Component decorations. On some drivers, these can lead to driver crashing when compiling the shader pipeline; for example, NVidia/AMD desktop drivers: https://github.com/KhronosGroup/glslang/issues/1182. This change adds validation and tests to catch this.	2018-03-01 15:00:08 -05:00
Alan Baker	ce5941a642	Fixes #1357 . Support null constants better in folding * getFloatConstantKind() now handles OpConstantNull * PerformOperation() now handles OpConstantNull for vectors * Fixed some instances where we would attempt to merge a division by 0 * added tests	2018-02-28 23:12:27 -05:00
GregF	bdaf8d56fb	Opt: Add constant folding for FToI and IToF	2018-02-28 23:08:52 -05:00
Alan Baker	9457cabbce	Fixes #1354 . Do not merge integer division. * Removes merging of div with a div or mul for integers * Updated tests	2018-02-28 13:33:21 -05:00
Steven Perron	588f4fcc95	Add more folding rules for vector shuffle. Adds rule to fold OpVectorShuffle with constant inputs. Adds rules to fold OpCompositeExtrac being fed by an OpVectorShuffle.	2018-02-27 21:20:22 -05:00
Victor Lomuller	90e1637ce4	Remove Function::GetBlocks pushed by accident	2018-02-27 21:07:10 -05:00
Steven Perron	2cb589cc14	Remove uses DCEInst and call ADCE The algorithm used in DCEInst to remove dead code is very slow. It is fine if you only want to remove a small number of instructions, but, if you need to remove a large number of instructions, then the algorithm in ADCE is much faster. This PR removes the calls to DCEInst in the load-store removal passes and adds a pass of ADCE afterwards. A number of different iterations of the order of optimization, and I believe this is the best I could find. The results I have on 3 sets of shaders are: Legalization: Set 1: 5.39 -> 5.01 Set 2: 13.98 -> 8.38 Set 3: 98.00 -> 96.26 Performance passes: Set 1: 6.90 -> 5.23 Set 2: 10.11 -> 6.62 Set 3: 253.69 -> 253.74 Size reduction passes: Set 1: 7.16 -> 7.25 Set 2: 17.17 -> 16.81 Set 3: 112.06 -> 107.71 Note that the third set's compile time is large because of the large number of basic blocks, not so much because of the number of instructions. That is why we don't see much gain there.	2018-02-27 21:06:08 -05:00
David Neto	0c13467161	Consistently include latest spirv.h header file. Use indirection through latest_version_spirv.h Also, when generating enum tables, use the unified1 JSON grammar since it now has FragmentFullyCoveredEXT but the other JSON grammars don't. They are starting to fall behind.	2018-02-27 18:47:29 -05:00
Alan Baker	802cf053c7	Merge arithmetic with non-trivial constant operands Adding basis of arithmetic merging * Refactored constant collection in ConstantManager * New rules: * consecutive negates * negate of arithmetic op with a constant * consecutive muls * reciprocal of div * Removed IRContext::CanFoldFloatingPoint * replaced by Instruction::IsFloatingPointFoldingAllowed * Fixed some bad tests * added some header comments Added PerformIntegerOperation * minor fixes to constants and tests * fixed IntMultiplyBy1 to work with 64 bit ints * added tests for integer mul merging Adding test for vector integer multiply merging Adding support for merging integer add and sub through negate * Added tests Adding rules to merge mult with preceding divide * Has a couple tests, but needs more * Added more comments Fixed bug in integer division folding * Will no longer merge through integer division if there would be a remainder in the division * Added a bunch more tests Adding rules to merge divide and multiply through divide * Improved comments * Added tests Adding rules to handle mul or div of a negation * Added tests Changes for review * Early exit if no constants are involved in more functions * fixed some comments * removed unused declaration * clarified some logic Adding new rules for add and subtract * Fold adds of adds, subtracts or negates * Fold subtracts of adds, subtracts or negates * Added tests	2018-02-27 13:02:13 -05:00
Pierre Moreau	9394272c98	linker: merge debug annotations from category c) Fixes: https://github.com/KhronosGroup/SPIRV-Tools/issues/1218	2018-02-27 12:31:50 -05:00
Pierre Moreau	bdd6617faa	linker: Allow modules to be partially linked Fixes: https://github.com/KhronosGroup/SPIRV-Tools/issues/1144	2018-02-27 12:21:13 -05:00
Victor Lomuller	3497a94460	Add loop unswitch pass. It moves all conditional branching and switch whose conditions are loop invariant and uniform. Before performing the loop unswitch we check that the loop does not contain any instruction that would prevent it (barriers, group instructions etc.).	2018-02-27 08:52:46 -05:00
Stephen McGroarty	e354984b09	Unroller support for multiple induction variables Support for multiple induction variables within a loop and support for loop condition operands <= and >=.	2018-02-27 11:50:08 +00:00
Steven Perron	94af58a350	Clean up variables before sroa In some shaders there are a lot of very large and deeply nested structures. This creates a lot of work for scalar replacement. Also, since commit `ca4457b` we have been very aggressive as rewriting variables. This has causes a large increase in compile time in creating and then deleting the instructions. To help low the costs, I want to run a cleanup of some of the easy loads and stores to remove. This reduces the number of symbols sroa has to work on. It also reduces the amount of code the simplifier has to simplify because it was not generated by sroa. To confirm the improvement, I ran numbers on three different sets of shaders: Time to run --legalize-hlsl: Set #1: 55.89s -> 12.0s Set #2: 1m44s -> 1m40.5s Set #3: 6.8s -> 5.7s Time to run -O Set #1: 18.8s -> 10.9s Set #2: 5m44s -> 4m17s Set #3: 7.8s -> 7.8s Contributes to #1328.	2018-02-22 21:40:58 -05:00
Steven Perron	3f19c2031a	Preserve analysies in the simplification pass Fixes a bug at the same time. In `UpdateDefUse`, if the definition already exists, we are not suppose to analyse it again. When you do the entries for the definition are deleted, and we don't want that. The check for this was wrong.	2018-02-22 16:06:30 -05:00
GregF	46a9ec9d23	Opt: Check for side-effects in DCEInst() This function now checks for side-effects before adding operand instructions to the dead instruction work list. Because this fix puts more pressure on IsCombinatorInstruction() to be correct, this commit adds all OpConstant* and OpType* instructions to combinator_ops_ set. Fixes #1341.	2018-02-22 12:24:13 -05:00
Alan Baker	01760d2f0f	Fixes #1338 . Handle OpConstantNull in branch/switch conditions * No longer assume the branch/switch condition must be bool or int constants (respectively) * Added a couple unit tests for each case	2018-02-21 10:22:39 -05:00
Steven Perron	51ecc7318f	Reduce instruction create and deletion during inlining. When inlining a function call the instructions in the same basic block as the call get cloned. The clone is added to the set of new blocks containing the inlined code, and the original instructions are deleted. This PR will change this so that we simply move the instructions to the new blocks. This saves on the creation and deletion of the instructions. Contributes to #1328.	2018-02-21 09:50:47 -05:00
Steven Perron	c1b936637e	Add Insert-extract elimination back into legalization passes. Fixes #1326.	2018-02-21 09:46:51 -05:00
Arseny Kapoulkine	309be423cc	Add folding for redundant add/sub/mul/div/mix operations This change implements instruction folding for arithmetic operations that are redundant, specifically: x + 0 = 0 + x = x x - 0 = x 0 - x = -x x * 0 = 0 * x = 0 x * 1 = 1 * x = x 0 / x = 0 x / 1 = x mix(a, b, 0) = a mix(a, b, 1) = b Cache ExtInst import id in feature manager This allows us to avoid string lookups during optimization; for now we just cache GLSL std450 import id but I can imagine caching more sets as they become utilized by the optimizer. Add tests for add/sub/mul/div/mix folding The tests cover scalar float/double cases, and some vector cases. Since most of the code for floating point folding is shared, the tests for vector folding are not as exhaustive as scalar. To test sub->negate folding I had to implement a custom fixture.	2018-02-20 18:29:27 -05:00
Steven Perron	fa3ac3cc33	Revert "Preserve analysies in the simplification pass" This reverts commit `ec3bbf093e`.	2018-02-20 18:21:25 -05:00
Steven Perron	ec3bbf093e	Preserve analysies in the simplification pass Building the def-use chains is very expensive, so we do not want to invalidate them it if is not necessary. At the moment, it seems like most optimizatoins are good at not invalidating the def-use chains, but simplification does. This PR get the simlification pass to keep the analysies valid. Contributes to #1328.	2018-02-20 14:45:08 -05:00
Diego Novillo	6c75050136	Speed up Phi insertion. On some shader code we have in our testsuite, Phi insertion is showing massive compile time slowdowns, particularly during destruction. The specific shader I was looking at has about 600 variables to keep track of and around 3200 basic blocks. The algorithm is currently O(var x blocks), which means maps with around 2M entries. This was taking about 8 minutes of compile time. This patch changes the tracking of stored variables to be more sparse. Instead of having every basic block contain all the tracked variables in the map, they now have only the variables actually stored in that block. This speeds up deallocation, which brings down compile time to about 1m20s. Note that this is not the definite fix for this. I will re-write Phi insertion to use a standard SSA rewriting algorithm (https://github.com/KhronosGroup/SPIRV-Tools/issues/893). This contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1328.	2018-02-20 12:04:06 -05:00
Steven Perron	9d95a91a9f	Fix folding insert feeding extract I mixed up two cases when folding an OpCompositeExtract that is feed by and OpCompositeInsert. The specific cases are demonstracted in the new test. I mixed up the conditions for the cases, and treated one like the other. Fixes #1323.	2018-02-20 11:22:51 -05:00
Alan Baker	c3f34d8bf3	Fixes #1300 . Adding checks for bad CCP transitions and unsettled values * Now track propagation status and assert on bad statuses * Added helper methods to access instruction propagation status * Modified the phi meet operator to properly reflect the paper it is based on * Modified SSA edge addition so that all edge are added, but only on state changes * Fixed a bug in instruction simulation where interesting conditional branches would not mark the interesting edge as executed * Added a test to catch this bug * Added an ostream operator for SSAPropagator::PropStatus	2018-02-18 19:41:34 -05:00
Andrew Woloszyn	e543b195df	Removed warnings from hex_float.h Bitcasting FloatProxy<->uint_type was hitting a warning with g++8.0.1. Replace bitcasts with new casting traits for FloatProxy.	2018-02-16 21:15:51 -05:00
Steven Perron	04cd63e5b9	Make better use of simplification pass The simplification pass works better after all of the dead branches are removed. So swapping them around in the legalization passes. Also adding the simplification pass to performance passes right after dead branch elimination. Added CCP to the legalization passes so we can propagate the constants into the branchs, and remove as many branches a possible. CCP is designed to still get opportunities even if the branches are dead, so it is a good place for it. Fixes #1118	2018-02-16 20:46:49 -05:00
Arseny Kapoulkine	1054413600	Add constant folding rules for floating-point comparison This change handles all 6 regular comparison types in two variations, ordered (true if values are ordered and comparison is true) and unordered (true if values are unordered or comparison is true). Ordered comparison matches the default floating-point behavior on host but we use std::isnan to check ordering explicitly anyway. This change also slightly reworks the floating-point folding support code to make it possible to define a folding operation that returns boolean instead of floating point. These tests exhaustively test ordered/unordered comparisons for float/double. Since for NaN inputs the comparison result doesn't depend on the comparison function, we just test == and !=; NaN inputs result in true unordered comparisons and false ordered comparisons.	2018-02-16 20:41:22 -05:00
Arseny Kapoulkine	27d23a92a0	Remove constants from constant manager in KillInst Registering a constant in constant manager establishes a relation between instruction that defined it and constant object. On complex shaders this could result in the constant definition getting removed as part of one of the DCE pass, and a subsequent simplification pass trying to use the defining instruction for the constant. To fix this, we now remove associated constant entries from constant manager when killing constant instructions; the constant object is still registered and can be remapped to a new instruction later. GetDefiningInstruction shouldn't ever return nullptr after this change so add an assertion to check for that.	2018-02-16 20:37:12 -05:00
Steven Perron	50f307f889	Simplify OpPhi instructions referencing unreachable continues In dead branch elimination, we already recognize unreachable continue blocks, and update OpPhi instruction accordingly. This change adds an extra check: if the head block has exactly 1 other incoming edge, then replace the OpPhi with the value from that edge. Fixes #1314.	2018-02-16 18:58:03 -05:00
Steven Perron	3756b387f3	Get CCP to use the constant floating point rules. Fixes #1311	2018-02-16 13:49:47 -05:00
Lei Zhang	f3a10470d3	Avoid using static unordered_map (#1304 ) unordered_map is not POD. Using it as static may cause problems when operator new() and operator delete() is customized. Also changed some function signatures to use const char* instead of std::string, which will give caller the flexibility to avoid creating a std::string.	2018-02-15 10:19:15 -05:00
Arseny Kapoulkine	32a8e04c7d	Add folding of redundant OpSelect insns We can fold OpSelect into one of the operands in two cases: - condition is constant - both results are the same Even if the original shader doesn't have either of these, if-conversion pass sometimes ends up generating instructions like %7127 = OpSelect %int %3220 %7058 %7058 And this optimization cleans them up.	2018-02-15 10:03:22 -05:00
Steven Perron	0e9f2f948a	Add id to name map Adding a map from an id to it set of OpName and OpMemberName instructions. This will be used in KillNameAndDecorates to kill the names for the ids that are being removed. In my test, the compile time for 50 shaders went from 1m57s to 55s. This was on linux using the release build. Fixes #1290.	2018-02-14 15:53:13 -05:00
Steven Perron	6669d8163d	Fold binary floating point operators. Adds the floating rules for FAdd, FDiv, FMul, and FSub. Contributes to #1164.	2018-02-14 15:48:15 -05:00
Stephen McGroarty	dd8400e150	Initial support for loop unrolling. This patch adds initial support for loop unrolling in the form of a series of utility classes which perform the unrolling. The pass can be run with the command spirv-opt --loop-unroll. This will unroll loops within the module which have the unroll hint set. The unroller imposes a number of requirements on the loops it can unroll. These are documented in the comments for the LoopUtils::CanPerformUnroll method in loop_utils.h. Some of the restrictions will be lifted in future patches.	2018-02-14 15:44:38 -05:00
Alan Baker	229ebc0665	Fixes #1295 . Mark undef values as varying in ccp. * Undef now marked as varying in ccp * this prevents incorrect meet operations since phis were always not interesting * added a test to catch the bug	2018-02-14 10:21:26 -05:00
Diego Novillo	08699920ad	Cleanup. Use proper #include guard. NFC.	2018-02-12 13:21:48 -05:00
Steven Perron	06b437dedc	Avoid using the def-use manager during inlining. There seems to only be a single location where the def-use manager is used. It is to get information about a type. We can do that with the type manager instead. Fixes #1285	2018-02-12 09:47:55 -05:00
Arseny Kapoulkine	70bf3514e8	Fix spirv.h include to rely on include paths This is important when SPIRV-Headers are not checked out to external/ folder and mirrors other places in the code where spirv.h is included.	2018-02-09 18:29:17 -08:00
Steven Perron	1d7b1423f9	Add folding of OpCompositeExtract and OpConstantComposite constant instructions. Create files for constant folding rules. Add the rules for OpConstantComposite and OpCompositeExtract.	2018-02-09 17:52:33 -05:00
David Neto	886859159e	Fix generation of Vim syntax file	2018-02-09 17:47:51 -05:00
Steven Perron	1a849ffb60	Add header files missing from CMakeLists.txt	2018-02-08 23:02:22 -05:00
Alexander Johnston	84ccd0b9ae	Loop invariant code motion initial implementation	2018-02-08 22:55:47 -05:00
GregF	ca4457b4b6	SROA: Do replacement on structs with no partial references.	2018-02-08 15:20:02 -05:00
Steven Perron	06cdb96984	Make use of the instruction folder. Implementation of the simplification pass. - Create pass that calls the instruction folder on each instruction and propagate instructions that fold to a copy. This will do copy propagation as well. - Did not use the propagator engine because I want to modify the instruction as we go along. - Change folding to not allocate new instructions, but make changes in place. This change had a big impact on compile time. - Add simplification pass to the legalization passes in place of insert-extract elimination. - Added test cases for new folding rules. - Added tests for the simplification pass - Added a method to the CFG to apply a function to the basic blocks in reverse post order. Contributes to #1164.	2018-02-07 23:01:47 -05:00
Andrey Tuganov	a61e4c1356	Disable check which fails Vulkan CTS	2018-02-07 13:31:35 -05:00
Andrey Tuganov	2f0c3aaa11	Add Vulkan-specific validation rules for atomics Added atomic instructions validation rules from https://www.khronos.org/registry/vulkan/specs/1.0/html/vkspec.html#spirvenv-module-validation	2018-02-07 13:31:35 -05:00
Józef Kucia	3013897556	Build SPIRV-Tools as shared library Add pkg-config file for shared libraries Properly build SPIRV-Tools DLL Test C interface with shared library Set PATH to shared library file for c_interface_shared test Otherwise, the test won't find SPIRV-Tools-shared.dll. Do not use private functions when testing with shared library Make all symbols hidden by default for shared library target	2018-02-07 10:43:32 -05:00
Alan Baker	871022772e	Registering a type now rebuilds it out of memory owned by the manager. * Added TypeManager::RebuildType * rebuilds the type and its constituent types in terms of memory owned by the manager. * Used by TypeManager::RegisterType to properly allocate memory * Adding an unit test to expose the issue * Added some tests to provide coverage of RebuildType * Added an accessor to the target pointer for a forward pointer	2018-02-06 10:17:56 -05:00
GregF	860b2ee5fc	ADCE: Fix combinator initialization The combinator initialization was only looking at the capabilities in the shader and not the inferred capabilities. Geometry and tessellation shaders were not setting the Shader capability which is inferred. So the combinator set was not initialized correctly causing problems for ADCE.	2018-02-05 16:54:03 -05:00
David Neto	9e19fc0f31	VS2013: LoopDescriptor LoopContainerType can't contain unique_ptr The loop descriptor must explicitly manage the storage for contained Loop objects. Fixes #1262	2018-02-05 14:19:21 -05:00
Andrey Tuganov	12e6860d07	Add barrier instructions validation pass	2018-02-05 13:14:55 -05:00
David Neto	3ef4bb600f	Avoid vector copies in range-for loops in opt/types.cpp Also be more explicit about iterated types in other range-for loops.	2018-02-05 13:08:39 -05:00
David Neto	87f9cfaba3	Disambiguate between const and nonconst ForEachSuccessorLabel This helps VisualStudio 2013 compile the code. Contributes to #1262	2018-02-02 17:54:40 -05:00
Steven Perron	bc1ec9418b	Add general folding infrastructure. Create the folding engine that will 1) attempt to fold an instruction. 2) iterates on the folding so small folding rules can be easily combined. 3) insert new instructions when needed. I've added the minimum number of rules needed to test the features above.	2018-02-02 12:24:11 -05:00
Alan Baker	abe113219e	Reordering performance passes ordering to produce better opts * Moved initial insert/extract passes later to cover more opportunities * Added an extra set of passes to clean up opportunities exposed later in the pipeline	2018-02-01 18:01:10 -05:00
Victor Lomuller	50e85c865c	Add LoopUtils class to gather some loop transformation support. This patch adds LoopUtils class to handle some loop related transformations. For now it has 2 transformations that simplifies other transformations such as loop unroll or unswitch: - Dedicate exit blocks: this ensure that all exit basic block (out-of-loop basic blocks that have a predecessor in the loop) have all their predecessors in the loop; - Loop Closed SSA (LCSSA): this ensure that all definitions in a loop are used inside the loop or in a phi instruction in an exit basic block. It also adds the following capabilities: - Loop::IsLCSSA to test if the loop is in a LCSSA form - Loop::GetOrCreatePreHeaderBlock that can build a loop preheader if required; - New methods to allow on the fly updates of the loop descriptors. - New methods to allow on the fly updates of the CFG analysis. - Instruction::SetOperand to allow expression of the index relative to Instruction::NumOperands (to be compatible with the index returned by DefUseManager::ForEachUse)	2018-02-01 15:35:09 -05:00
Steven Perron	61d8c0384b	Add pass to reaplce invalid opcodes Creates a pass that will remove instructions that are invalid for the current shader stage. For the instruction to be considered for replacement 1) The opcode must be valid for a shader modules. 2) The opcode must be invalid for the current shader stage. 3) All entry points to the module must be for the same shader stage. 4) The function containing the instruction must be reachable from an entry point. Fixes #1247.	2018-02-01 15:25:09 -05:00
Andrey Tuganov	d37869c842	Added OpenCL ExtInst validation rules	2018-02-01 14:14:13 -05:00
Jeremy Hayes	cd68f2b176	Add adjacency validation pass Validate OpPhi predecessors. Validate OpLoopMerge successors. Validate OpSelectionMerge successors. Fix collateral damage to existing tests. Remove ValidateIdWithMessage.OpSampledImageUsedInOpPhiBad.	2018-02-01 14:10:55 -05:00
Andrey Tuganov	905536c519	Fixed harmless uninit var warning	2018-01-31 17:49:01 -05:00
David Neto	ac537c71a8	Use SPIR-V headers from "unified1" directory	2018-01-31 15:36:50 -05:00
Alan Baker	2735e0851e	Remove constexpr from Analysis operators * Had to remove templating from InstructionBuilder as a result * now preserved analyses are specified as a constructor argument * updated tests and uses * changed static_assert to a runtime assert * this should probably get further changes in the future	2018-01-31 14:44:43 -05:00
GregF	0aa0ac52f7	Opt: Add ScalarReplacement to RegisterSizePasses	2018-01-31 10:19:17 -05:00
Andrey Tuganov	44d88c8d9c	Add memory semantics checks to validate atomics	2018-01-30 18:00:01 -05:00
Alan Baker	16949236fe	Prevent unnecessary changes to the IR in dead branch elim * When handling unreachable merges and continues, do not optimize to the same IR * pass did not check whether the unreachable blocks were in the optimized form before transforming them * added a test to catch this issue	2018-01-30 16:51:58 -05:00
Andrey Tuganov	c86cb76a22	Improved error message in val capabilities	2018-01-30 16:22:10 -05:00
Alan Baker	e661da7941	Enhancements to block merging * Should handle all possibilities * Stricter checks for what is disallowed: * header and header * merge and merge * Allow header and merge blocks to be merged * Erases the structured control declaration if merging header and merge blocks together.	2018-01-30 16:05:51 -05:00
Alan Baker	6704233d39	Fix dereference of possibly nullptr * If the dead branch elim is performed on a module without structured control flow, the OpSelectionMerge may not be present * Add a check for pointer validity before dereferencing * Added a test to catch the bug	2018-01-30 10:15:43 -05:00
GregF	f28b106173	InsertExtractElim: Split out DeadInsertElim as separate pass	2018-01-30 08:52:14 -05:00
Alan Baker	1b46f7ecad	Fixes in CCP for #1228 * Forces traversal of phis if the def has changed to varying * Mark a phi as varying if all incoming values are varying * added a test to catch the bug	2018-01-29 15:12:05 -05:00
Victor Lomuller	6018de81de	Add LoopDescriptor as an IRContext analysis. Move some function definitions from header to source to avoid circular definition.	2018-01-25 16:12:32 -05:00
Greg Fischer	684997eb72	DeadInsertElim: Detect and DCE dead Inserts This adds Dead Insert Elimination to the end of the --eliminate-insert-extract pass. See the new tests for examples of code that will benefit. Essentially, this removes OpCompositeInsert instructions which are not used, either because there is no instruction which uses the value at the index it is inserted, or because a subsequent insert intercepts any such use. This code has been seen to remove significant amounts of dead code from real-life HLSL shaders being ported to Vulkan. In fact, it is needed to remove dead texture samples which cause Vulkan validation layer errors (unbound textures and samplers) if not removed . Such DCE is thus required for fxc equivalence and legalization. This analysis operates across "chains" of Inserts which can also contain Phi instructions.	2018-01-25 16:07:21 -05:00
Alan Baker	2e93e806e4	Initial implementation of if conversion * Handles simple cases only * Identifies phis in blocks with two predecessors and attempts to convert the phi to an select * does not perform code motion currently so the converted values must dominate the join point (e.g. can't be defined in the branches) * limited for now to two predecessors, but can be extended to handle more cases * Adding if conversion to -O and -Os	2018-01-25 09:42:00 -08:00
Andrey Tuganov	b2eb840468	Validator: restricted some atomic ops for shaders Ban floating point case for OpAtomicLoad, OpAtomicExchange, OpAtomicCompareExchange. In graphics (Shader) environments, these instructions only operate on scalar integers. Ban the floating point case. OpenCL supports atomic_float.	2018-01-24 14:06:06 -08:00
Andrey Tuganov	bdc78377bc	Added Vulkan-specifc checks to image validation Implemented Vulkan-specific rules: - OpTypeImage must declare a scalar 32-bit float or 32-bit integer type for the “Sampled Type”. - OpSampledImage must only consume an “Image” operand whose type has its “Sampled” operand set to 1.	2018-01-24 14:05:42 -08:00
Steven Perron	c4835e1bd8	Use id_map in Fold*ToConstant The folding routines are suppose to use the id_map provided to map the ids in the instruction. The ones I just added are missing it.	2018-01-22 16:27:31 -05:00
Steven Perron	6c409e30a2	Add generic folding function and use in CCP The current folding routines have a very cumbersome interface, make them harder to use, and not a obvious how to extend. This change is to create a new interface for the folding routines, and show how it can be used by calling it from CCP. This does not make a significant change to the behaviour of CCP. In general it should produce the same code as before; however it is possible that an instruction that takes 32-bit integers as inputs and the result is not a 32-bit integer or bool will not be folded as before. It seems like andriod has a problem with INT32_MAX and the like. I'll explicitly define those if the are not already defined.	2018-01-22 14:26:49 -05:00
Alan Baker	3b780db7f8	Fixes infinite loop in ADCE * Addresses how breaks are indentified to prevent infinite loops when back to back loop share a merge and header * Added test to catch the bug	2018-01-19 11:08:46 -05:00
Victor Lomuller	cf3b2a58c4	Introduce an instruction builder helper class. The class factorize the instruction building process. Def-use manager analysis can be updated on the fly to maintain coherency. To be updated to take into account more analysis.	2018-01-19 10:17:45 -05:00
Alan Baker	73940aba1b	Simplifying code for adding instructions to worklist * AddToWorklist can now be called unconditionally * It will only add instructions that have not already been marked as live * Fixes a case where a merge was not added to the worklist because the branch was already marked as live * Added two similar tests that fail without the fix	2018-01-18 20:36:46 -05:00
Steven Perron	34d4294c2c	Create a pass to work around a driver bug related to OpUnreachable. We have come across a driver bug where and OpUnreachable inside a loop is causing the shader to go into an infinite loop. This commit will try to avoid this bug by turning OpUnreachable instructions that are contained in a loop into branches to the loop merge block. This is not added to "-O" and "-Os" because it should only be used if the driver being targeted has this problem. Fixes #1209.	2018-01-18 20:31:46 -05:00
Victor Lomuller	0b1372a8ca	CFG: force the creation of a predecessor entry for all basic block. This ensure that all basic blocks in a function have a valid entry the CFG object. The entry block has no predecessors but remains a valid basic block for which we might want to query the number of predecessors. Some unreachable basic blocks may not have predecessors as well.	2018-01-18 10:22:00 -05:00
Alan Baker	5e70d20d80	Fixing missing early exit from break identification	2018-01-17 14:09:24 -05:00
Alan Baker	80b743a570	Adding support for switch removal in ADCE * Updated code to handle switches * Enabled disabled test and added a couple new ones	2018-01-17 11:05:42 -05:00
Alan Baker	3a0eb44da3	Capturing value table by reference in local redundancy elim	2018-01-17 09:58:32 -05:00
Alan Baker	5ffe862f28	Fixes missing increment in common uniform elim * Addresses #1203 * Increments inIdx in IsConstantIndexAccessChain * added test to catch the bug	2018-01-16 14:47:35 -05:00
Steven Perron	6cc772c3ce	Skip SpecConstants in CCP. At the moment specialization constants look like constants to ccp. This causes a problem because they are handled differently by the constant manager. I choose to simply skip over them, and not try to add them to the value table. We can do specialization before ccp if we want to be able to propagate these values. Fixes #1199.	2018-01-15 09:53:23 -05:00
Greg Fischer	c2aadb02d9	Add MatrixConstant	2018-01-12 18:49:36 -05:00
Steven Perron	8cb0aec724	Remove redundant passes from legalization passes With work that Alan has done, some passes have become redundant. ADCE now removed unused variables. Dead branch elimination removes unreachable blocks. This means we can remove CFG Cleanup and dead variable elimination.	2018-01-12 17:47:50 -05:00
Alan Baker	6587d3f8a3	Adding early exit versions of several ForEach* methods * Looked through code for instances where code would benefit from early exit * Added a corresponding WhileEach* method and updated the code	2018-01-12 17:05:09 -05:00
Steven Perron	24f9947050	Move initialization of the const mgr to the constructor. The current code expects the users of the constant manager to initialize it with all of the constants in the module. The problem is that you do not want to redo the work multiple times. So I decided to move that code to the constructor of the constant manager. This way it will always be initialized on first use. I also removed an assert that expects all constant instructions to be successfully mapped. This is because not all OpConstant* instruction can map to a constant, and neither do the OpSpecConstant* instructions. The real problem is that an OpConstantComposite can contain a member that is OpUndef. I tried to treat OpUndef like OpConstantNull, but this failed because an OpSpecConstantComposite with an OpUndef cannot be changed to an OpConstantComposite. Since I feel this case will not be common, I decided to not complicate the code. Fixes #1193.	2018-01-12 13:53:21 -05:00
Alan Baker	672494da13	Adding ostream operators for IR structures * Added for Instruction, BasicBlock, Function and Module * Uses new disassembly functionality that can disassemble individual instructions * For debug use only (no caching is done) * Each output converts module to binary, parses and outputs an individual instruction * Added a test for whole module output * Disabling Microsoft checked iterator warnings * Updated check_copyright.py to accept 2018	2018-01-12 11:19:58 -05:00
Alan Baker	eb0c73dad6	Maintain instruction to block mapping in phi insertion * Changed MemPass::InsertPhiInstructions to set basic blocks for new phis * Local SSA elim now maintains instr to block mapping * Added a test and confirmed it fails without the updated phis * IRContext::set_instr_block no longer builds the map if the analysis is invalid * Added instruction to block mapping verification to IRContext::IsConsistent()	2018-01-12 10:16:53 -05:00
Greg Fischer	5eafc00ad5	InsertExtractElim: Optimize through VectorShuffle, Mix This improves Extract replacement to continue through VectorShuffle. It will also handle Mix with 0.0 or 1.0 in the a-value of the desired component. To facilitate optimization of VectorShuffle, the algorithm was refactored to pass around the indices of the extract in a vector rather than pass the extract instruction itself. This allows the indices to be modified as the algorithm progresses.	2018-01-12 09:41:45 -05:00
Steven Perron	1ebd860daa	Add generic folding function and use in CCP The current folding routines have a very cumbersome interface, make them harder to use, and not a obvious how to extend. This change is to create a new interface for the folding routines, and show how it can be used by calling it from CCP. This does not make a significant change to the behaviour of CCP. In general it should produce the same code as before; however it is possible that an instruction that takes 32-bit integers as inputs and the result is not a 32-bit integer or bool will not be folded as before.	2018-01-10 13:17:25 -05:00
Alan Baker	3a054e1ddc	Adding additional functionality to ADCE. Modified ADCE to remove dead globals. * Entry point and execution mode instructions are marked as alive * Reachable functions and their parameters are marked as alive * Instruction deletion now deferred until the end of the pass * Eliminated dead insts set, added IsDead to calculate that value instead * Ported applicable dead variable elimination tests * Ported dead constant elim tests Added dead function elimination to ADCE * ported dead function elim tests Added handling of decoration groups in ADCE * Uses a custom sorter to traverse decorations in a specific order * Simplifies necessary checks Updated -O and -Os pass lists.	2018-01-10 08:35:48 -05:00
Andrey Tuganov	d54a286c75	Fix validation rules for GLSL pack/unpack 2x32	2018-01-09 13:10:29 -05:00
Alan Baker	1b6cfd3409	Rewriting dead branch elimination. Pass now paints live blocks and fixes constant branches and switches as it goes. No longer requires structured control flow. It also removes unreachable blocks as a side effect. It fixes the IR (phis) before doing any code removal (other than terminator changes). Added several unit tests for updated/new functionality. Does not remove dead edge from a phi node: * Checks that incoming edges are live in order to retain them * Added BasicBlock::IsSuccessor * added test Fixing phi updates in the presence of extra backedge blocks * Added tests to catch bug Reworked how phis are updated * Instead of creating a new Phi and RAUW'ing the old phi with it, I now replace the phi operands, but maintain the def/use manager correctly. For unreachable merge: * When considering unreachable continue blocks the code now properly checks whether the incoming edge will continue to be live. Major refactoring for review * Broke into 4 major functions * marking live blocks * marking structured targets * fixing phis * deleting blocks	2018-01-09 12:21:39 -05:00
Diego Novillo	e5560d64de	Fix constant propagation of induction variables. This fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1143. When an instruction transitions from constant to bottom (varying) in the lattice, we were telling the propagator that the instruction was varying, but never updating the actual value in the values table. This led to incorrect value substitutions at the end of propagation. The patch also re-enables CCP in -O and -Os.	2018-01-08 15:34:35 -05:00
David Neto	a82a0ea886	Fix method comment for BasicBlock::MegeBlockIdIfAny Fixes #1177	2018-01-08 10:42:02 -05:00
Lei Zhang	44f27f9289	Allow relaxing validation of pointers in logical addressing mode In HLSL structured buffer legalization, pointer to pointer types are emitted to indicate a structured buffer variable should be treated as an alias of some other variable. We need an option to relax the check of pointer types in logical addressing mode to catch other validation errors.	2018-01-08 10:36:23 -05:00
Victor Lomuller	e8ad02f3dd	Add loop descriptors and some required dominator tree extensions. Add post-order tree iterator. Add DominatorTreeNode extensions: - Add begin/end methods to do pre-order and post-order tree traversal from a given DominatorTreeNode Add DominatorTree extensions: - Add begin/end methods to do pre-order and post-order tree traversal - Tree traversal ignore by default the pseudo entry block - Retrieve a DominatorTreeNode from a basic block Add loop descriptor: - Add a LoopDescriptor class to register all loops in a given function. - Add a Loop class to describe a loop: - Loop parent - Nested loops - Loop depth - Loop header, merge, continue and preheader - Basic blocks that belong to the loop Correct a bug that forced dominator tree to be constantly rebuilt.	2018-01-08 09:31:13 -05:00
David Neto	6e9ea2e584	AnalyzeInstUse: Reuse the instruction lookup	2018-01-07 11:30:48 -05:00
David Neto	3fbbd3c772	Remove CCP from size and performance recipes, pending bugfixes Currently CCP is incorrectly optimizing loops. See https://github.com/KhronosGroup/SPIRV-Tools/issues/1143	2018-01-05 14:01:18 -05:00
Pierre Moreau	7183ad526e	Linker code cleanups Turn `Linker::Link()` into free functions As very little information was kept in the Linker class, we can get rid of the whole class and have the `Link()` as free functions instead; the environment target as well as the consumer are passed along through an `spv_context` object. The resulting linked_binary is passed as a pointer rather than a reference to follow the Google C++ Style guidelines. Addresses remaining comments from https://github.com/KhronosGroup/SPIRV-Tools/pull/693 about the SPIR-V linker. Fix variable naming in the linker Some of the variables were using mixed case, which did not follow the Google C++ Style guidelines. Linker: Use EXPECT_EQ when possible and update some test * Replace occurrences of ASSERT_EQ by EXPECT_EQ when possible; * Reformulated some of the error messages; * Added the symbol name in the error message when there is a type or decoration mismatch between the imported and exported declarations. Opt: List all duplicates removed by RemoveDuplicatePass in the header Opt: Make the const version of GetLabelInst() return a pointer For consistency with the non-const version, as well as other similar functions. Opt: Rename function_end to EndInst() As pointed out by dneto0 the previous name was quite confusing and could be mistaken with a function returning an end iterator. Also change the return type of the const version to a pointer rather than a reference, for consistency. Opt: Add performance comment to RemoveDuplicateTypes and decorations This comment was requested during the review of https://github.com/KhronosGroup/SPIRV-Tools/pull/693. Opt: Add comments and fix variable naming in RemoveDuplicatePass * Add missing comments to private functions; * Rename variables that were using mixed case; * Add TODO for moving AreTypesEqual out. Linker: Remove commented out code and add TODOs Linker: Merged together strings that were too much splitted Implement a C++ RAII wrapper around spv_context	2018-01-05 13:28:44 -05:00
Steven Perron	ccb921dd2b	Allow getting the base pointer of an image load/store. In value numbering, we treat loads and stores of images, ie OpImageLoad, as a memory operation where it is interested in the "base address" of the instruction. In those cases, it is an image instruction. The problem is that `Instruction::GetBaseAddress()` does not account for the image instructions, so the assert at the end to make sure it found a valid base address for its addressing mode fails. The solution is to look at the load/store instruction to determine how the assertion should be done. Fixes #1160.	2018-01-05 13:26:10 -05:00
Diego Novillo	716718a5e9	Fix infinite simulation cycles in SSA propagator. This fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1159. I had missed a nuance in the original algorithm. When simulating Phi instructions, the SSA edges out of a Phi instruction should never be added to the list of edges to simulate. Phi instructions can be in SSA def-use cycles with other Phi instructions. This was causing the propagator to fall into an infinite loop when the same def-use edge kept being added to the queue. The original algorithm in the paper specifically separates the visit of a Phi instruction vs the visit of a regular instruction. This fix makes the implementation match the original algorithm.	2018-01-05 10:29:39 -05:00
David Neto	ac9a828e6e	dead branch elim: Track killed backedges When deleting branches and blocks, also remove them from the backedges set, in case they were there. This prevents us from keeping stale pointers to deleted Instruction objects. That memory could be used later by another instruction, incorrectly signaling that something has a backedge reference, and the dead branch eliminator could end up deleting live blocks. Adds accessor method ir::BasicBlock::terminator Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1168	2018-01-04 19:06:55 -05:00
David Neto	c32e79eeef	Add --print-all optimizer option Adds optimizer API to write disassembly to a given output stream before each pass, and after the last pass. Adds spirv-opt --print-all option to write disassembly to stderr before each pass, and after the last pass.	2018-01-04 18:34:18 -05:00
Pierre Moreau	702852bd22	Opt: Make DecorationManager::HaveTheSameDecorations symmetric Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1112 Also: Add SpvOpDecorateId to IsAnnotationInst()	2018-01-04 14:07:25 -05:00
Andrey Tuganov	a376b197ae	Validator checks out of bounds composite access 1. Added OpCompositeExtract/Insert out of bounds checks where possible (everything except RuntimeArray) 2. Moved validation of OpCompositeExtract/Insert from validate_id.cpp to validate_composites.cpp.	2018-01-04 14:02:38 -05:00
Diego Novillo	5b52626eaa	Address review comments from https://github.com/KhronosGroup/SPIRV-Tools/pull/985 .	2018-01-04 13:20:49 -05:00
Steven Perron	7834beea80	Update legalization passes I've a few passes the legalization passes. The first is to add the more specialized load-store removal passes to help improve the compile time, as was suggested in #1118. I've also added dead branch elimination while we wait for the behaviour of dead branch elimination to be folded into CFG cleanup. I did not add CCP because it seems like most of the constant propagation what is needed is already being done by the load-store removal passes, which call `ReplaceAllUsesWith`. We can reconsider this if needed.	2018-01-04 11:04:49 -05:00
Steven Perron	e8f2890c30	Replace calls to `ToNop` by `KillInst`. Calling `ToNop` leaves around instructions that are pointless. In general it is better to remove the instruction completely. That way other optimizations will not need to look at them. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1003.	2018-01-04 11:03:04 -05:00
Diego Novillo	5f100789fb	Handle execution termination instructions when building edges. This fixes issue https://github.com/KhronosGroup/SPIRV-Tools/issues/1153. When building CFG edges, edges out of a OpKill and OpUnreachable instruction should be directed to the CFG's pseudo exit block.	2018-01-03 15:25:03 -05:00
Diego Novillo	135150a1a8	Do not insert Phi nodes in CCP propagator. In CCP we should not need to insert Phi nodes because CCP never looks at loads/stores. This required adjusting two tests that relied on Phi instructions being inserted. I changed the tests to have the Phi instructions pre-inserted. I also added a new test to make sure that CCP does not try to look through stores and loads. Finally, given that CCP does not handle loads/stores, it's better to run mem2reg before it. I've changed the -O/-Os schedules to run local multi-store elimination before CCP. Although this is just an efficiency fix for CCP, it is also working around a bug in Phi insertion. When Phi instructions are inserted, they are never associated a basic block. This causes a segfault when the propagator tries to lookup CFG edges when analyzing Phi instructions.	2018-01-03 15:12:25 -05:00
Andrey Tuganov	25d396b4a2	Add ExtInst validation pass (GLSL only for now) Validates all GLSL.std.450 extended instructions.	2018-01-02 16:53:25 -05:00
Diego Novillo	1acce99255	Fix https://github.com/KhronosGroup/SPIRV-Tools/issues/1130 This addresses review feedback for the CCP implementation (which fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/889). This adds more protection around the folding of instructions that would not be supported by the folder.	2017-12-22 13:33:17 -05:00
Andrey Tuganov	a91aa53893	Disallow Dim=SubpassData for OpImageSparseRead	2017-12-22 09:45:15 -05:00
David Neto	59de6100b5	Add asm, dis support for DebugInfo extended instruction set Add grammar file for DebugInfo extended instruction set - Each new operand enum kind in extinst.debuginfo.grammar.json maps to a new value in spv_operand_type_t. - Add new concrete enum operand types for DebugInfo Generate a C header for the DebugInfo extended instruction set Add table lookup of DebugInfo extended instrutions Handle the debug info operand types in binary parser, disassembler, and assembler. Add DebugInfo round trip tests for assembler, disassembler Android.mk: Support DebugInfo extended instruction set The extinst.debuginfo.grammar.json file is currently part of SPIRV-Tools source. It contributes operand type enums, so it has to be processed along with the core grammar files. We also generate a C header DebugInfo.h. Add necessary grammar file processing to Android.mk.	2017-12-22 09:39:36 -05:00
Diego Novillo	4ba9dcc8a0	Implement SSA CCP (SSA Conditional Constant Propagation). This implements the conditional constant propagation pass proposed in Constant propagation with conditional branches, Wegman and Zadeck, ACM TOPLAS 13(2):181-210. The main logic resides in CCPPass::VisitInstruction. Instruction that may produce a constant value are evaluated with the constant folder. If they produce a new constant, the instruction is considered interesting. Otherwise, it's considered varying (for unfoldable instructions) or just not interesting (when not enough operands have a constant value). The other main piece of logic is in CCPPass::VisitBranch. This evaluates the selector of the branch. When it's found to be a known value, it computes the destination basic block and sets it. This tells the propagator which branches to follow. The patch required extensions to the constant manager as well. Instead of hashing the Constant pointers, this patch changes the constant pool to hash the contents of the Constant. This allows the lookups to be done using the actual values of the Constant, preventing duplicate definitions.	2017-12-21 14:29:45 -05:00
Steven Perron	756b277fb8	Store all enabled capabilities in the feature manger. In order to keep track of all of the implicit capabilities as well as the explicit ones, we will add them all to the feature manager. That is the object that needs to be queried when checking if a capability is enabled. The name of the "HasCapability" function in the module was changed to make it more obvious that it does not check for implied capabilities. Keep an spv_context and AssemblyGrammar in IRContext	2017-12-21 11:14:53 -05:00
Alan Baker	1ab8ad654a	Fixing bugs in type manager memory management * changed the way duplicate types are removed to stop copying instructions * Reworked RemoveDuplicatesPass::AreTypesSame to use type manager and type equality * Reworked TypeManager memory management to store a pool of unique pointers of types * removed unique pointers from id map * fixed instances where free'd memory could be accessed	2017-12-21 08:59:06 -05:00
Steven Perron	7505d24225	Update the legalization passes. Changes the set of optimizations done for legalization. While doing this, I added documentation to explain why we want each optimization. A new option "--legalize-hlsl" is added so the legalization passes can be easily run from the command line. The legalize option implies skip-validation.	2017-12-20 17:56:03 -05:00
Pierre Moreau	424f744db1	Opt: Fix implementation and comment of AreDecorationsTheSame Target should not be ignored when comparing decorations in RemoveDuplicates Opt: Remove unused code in RemoveDuplicateDecorations	2017-12-19 15:36:47 -05:00
Steven Perron	79a00649b4	Allow pointers to pointers in logical addressing mode. A few optimizations are updates to handle code that is suppose to be using the logical addressing mode, but still has variables that contain pointers as long as the pointer are to opaque objects. This is called "relaxed logical addressing". \|Instruction::GetBaseAddress\| will check that pointers that are use meet the relaxed logical addressing rules. Optimization that now handle relaxed logical addressing instead of logical addressing are: - aggressive dead-code elimination - local access chain convert - local store elimination passes.	2017-12-19 14:29:14 -05:00
Steven Perron	b86eb6842b	Convert private variables to function scope. When a private variable is used in a single function, it can be converted to a function scope variable in that function. This adds a pass that does that. The pass can be enabled using the option `--private-to-local`. This transformation allows other transformations to act on these variables. Also moved `FindPointerToType` from the inline class to the type manager.	2017-12-19 14:21:04 -05:00
David Neto	8135dd6375	More validation on primitive instructions - Test validation success for OpEmitVertex OpEndPrimitive - Test missing capabilities for primitive instructions - Primitive instructions require Geometry execution model	2017-12-19 13:26:07 -05:00
Jesus Carabano	4dbcef62ee	validate & test of literal's upper bits Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/660	2017-12-19 13:19:56 -05:00
Pierre Moreau	f35963588b	Opt: Remove commented out duplicated type_id function This code was wrongly added by #693.	2017-12-18 17:29:21 -05:00
Jeremy Hayes	0d8ea48652	Fix comment in primitives validation Also refactor type query for efficiency.	2017-12-18 17:27:06 -05:00
Andrey Tuganov	dbc3a662c6	Image Operand Sample allows sparse image opcodes @ehsannas had filed an issue against SPIR-V spec, concerning Image Operands section (3.14): Sample A following operand is the sample number of the sample to use. Only valid with OpImageFetch, OpImageRead, and OpImageWrite. Relaxing the check to allow OpImageSparseRead and OpImageSparseFetch to fix failing tests.	2017-12-18 11:21:38 -05:00
David Neto	0dbe184d32	Remove concept of FIRST_CONCRETE_* operand types	2017-12-18 09:48:51 -05:00
Alan Baker	616908503d	Improving the usability of the type manager. The type manager hashes types. This allows the lookup of type declaration ids from arbitrarily constructed types. Users should be cautious when dealing with non-unique types (structs and potentially pointers) to get the exact id if necessary. * Changed the spec composite constant folder to handle ambiguous composites * Added functionality to create necessary instructions for a type * Added ability to remove ids from the type manager	2017-12-18 08:20:56 -05:00
GregF	0f80406315	ADCE: Only mark true breaks and continues of live loops This fixes issue #1075 - Mark continue when conditional branch with merge block. Only mark if merge block is not continue block. - Handle conditional branch break with preceding merge	2017-12-15 11:53:57 -05:00
Jeremy Hayes	cdfbf26c13	Add primitive instruction validation pass	2017-12-15 09:53:29 -05:00
Andrey Tuganov	af7d5799a5	Refactor include of latest spir-v header versions	2017-12-14 11:18:20 -05:00
Andrey Tuganov	532b327d4d	Add validation rules for atomic instructions Validates all OpAtomicXXX instructions.	2017-12-13 18:29:38 -05:00
Diego Novillo	853a3d6c31	Fix uninitialized warning at -Os.	2017-12-12 15:46:09 -05:00
Greg Fischer	22faa2b083	ADCE: Empty Loop Elimination This entirely eliminates loops which do not contain live code.	2017-12-12 13:53:15 -05:00
Steven Perron	07ce16d1e7	Set the parent for basic blocks during inlining. Inlining is not setting the parent (function) for each basic block. This can cause problems for later optimizations. The solution is to set the parent for each new block just before it is linked into the function.	2017-12-12 13:39:08 -05:00
Andrey Tuganov	c520d43649	Add validator checks for sparse image opcodes	2017-12-12 12:04:23 -05:00
Pierre Moreau	12447d8465	Support OpenCL 1.2 and 2.0 target environments include: Add target environment enums for OpenCL 1.2 and 2.0 Validator: Validate OpenCL capabilities Update validate capabilities to handle embedded profiles Add test for OpenCL capabilities validation Update messages to mention the OpenCL profile used Re-format val_capability_test.cpp	2017-12-12 11:35:39 -05:00
Andrey Tuganov	dbd8d0e7b8	Reenable OpCopyObject validation rules Vulkan CTS fix has been submitted.	2017-12-11 12:33:11 -05:00
Alan Baker	867451f49e	Add scalar replacement Adds a scalar replacement pass. The pass considers all function scope variables of composite type. If there are accesses to individual elements (and it is legal) the pass replaces the variable with a variable for each composite element and updates all the uses. Added the pass to -O Added NumUses and NumUsers to DefUseManager Added some helper methods for the inst to block mapping in context Added some helper methods for specific constant types No longer generate duplicate pointer types. * Now searches for an existing pointer of the appropriate type instead of failing validation * Fixed spec constant extracts * Addressed changes for review * Changed RunSinglePassAndMatch to be able to run validation * current users do not enable it Added handling of acceptable decorations. * Decorations are also transfered where appropriate Refactored extension checking into FeatureManager * Context now owns a feature manager * consciously NOT an analysis * added some test * fixed some minor issues related to decorates * added some decorate related tests for scalar replacement	2017-12-11 10:51:13 -05:00
GregF	78c025abe9	MultiStore: Support OpVariable Initialization Treat an OpVariable with initialization as if it was an OpStore. With PR #1073, this completes work for issue #1017.	2017-12-11 10:37:14 -05:00
GregF	c6fdf68c2f	SingleStore: Support OpVariable Initialization Treat an OpVariable with initialization as if it was an OpStore. This fixes issue #1017.	2017-12-08 16:02:14 -05:00
Diego Novillo	241dcacc04	Add a new constant manager class. This patch adds a new constant manager class to interface with analysis::Constant. The new constant manager lives in ir::IRContext together with the type manager (analysis::TypeManager). The new analysis::ConstantManager is used by the spec constant folder and the constant propagator (in progress). Another cleanup introduced by this patch removes the ID management from the fold spec constant pass, and ir::IRContext and moves it to ir::Module. SSA IDs were maintained by IRContext and Module. That's pointless and leads to mismatch IDs. Fixed by moving all the bookkeeping to ir::Module.	2017-12-08 14:14:55 -05:00
Steven Perron	5d602abd66	Add global redundancy elimination Adds a pass that looks for redundant instruction in a function, and removes them. The algorithm is a hash table based value numbering algorithm that traverses the dominator tree. This pass removes completely redundant instructions, not partially redundant ones.	2017-12-07 18:35:38 -05:00
Steven Perron	851e1ad985	Kill names and decoration in inlining. Currently when inlining a call, the name and decorations for the result of the call is not deleted. This should be changed. Added a test for this as well. This fixes issue #622.	2017-12-07 12:20:45 -05:00
Victor Lomuller	731d1899b1	Add depth first iterator for trees - Add generic depth first iterator - Update the dominator tree to use this iterator instead of "randomly" iterate over the nodes	2017-12-07 10:07:56 -05:00
Diego Novillo	0c2396d20f	Revert extraneous changes from commit `8ec62deb2`. Commit `8ec62deb2` merged the code from PR #810, but it also re-introduces code that had been removed in #885. This patch removes the (now superfluous code).	2017-12-06 16:04:47 -05:00
Stephen McGroarty	8ba68fa9b9	Dominator Tree Analysis (#3 ) Support for dominator and post dominator analysis on ir::Functions. This patch contains a DominatorTree class for building the tree and DominatorAnalysis and DominatorAnalysisPass classes for interfacing and caching the built trees.	2017-12-05 22:59:43 -05:00
Andrey Tuganov	94e3e7b8ef	Add composite instruction validation pass Validates instructions in the opcode range from OpVectorExtractDynamic to OpTranspose.	2017-12-05 10:15:51 -05:00
Andrey Tuganov	bf184310b2	Fix some of the known issues in image validation Applied some of the spec clarifications made in conversation with @johnkslang.	2017-12-04 18:57:34 -05:00
Steven Perron	fd3a22042b	DCEInst kill the same instruction twice. In DCEInst, it is possible that the same instruction ends up in the queue multiple times, if the same id is used multiple times in the same instruction. The solution is to keep the ids in a set, to ensure no duplication in the list.	2017-12-04 18:15:35 -05:00
Diego Novillo	e9ecc0cbfd	Remove cfg_ field from SSAPropagator class - NFC. When I moved the CFG into IRContext (https://github.com/KhronosGroup/SPIRV-Tools/pull/1019), I forgot to update SSAPropagator to stop requiring one. Fixed with this patch.	2017-12-04 15:28:21 -05:00
Steven Perron	65046eca7c	Change IRContext::KillInst to delete instructions. The current method of removing an instruction is to call ToNop. The problem with this is that it leaves around an instruction that later passes will look at. We should just delete the instruction. In MemPass there is a utility routine called DCEInst. It can delete essentially any instruction, which can invalidate pointers now that they are actually deleted. The interface was changed to add a call back that can be used to update any local data structures that contain ir::Intruction*.	2017-12-04 11:07:45 -05:00
Steven Perron	b35b52f97b	Compute value number when the value table is constructed. Computing the value numbers on demand, as we do now, can lead to different results depending on the order in which the users asks for the value numbers. To make things more stable, we compute them ahead of time.	2017-12-04 11:02:04 -05:00
Daan Wendelen	b98254b282	Fixed typo that leaked to the binary The typo was found by lintian when I was packaging glslang	2017-12-03 20:42:14 -05:00
Lei Zhang	0dd4ee27b1	Fix Dref type check in validator Dref should be of 32-bit scalar floating type. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1012	2017-12-01 10:17:45 -05:00
Pierre Moreau	69043963e4	Opt: Remove unused lambda captures Those are reported as errors by clang 5.0.0, due to the flags -Werror and -Wunused-lambda-capture.	2017-12-01 09:54:37 -05:00
Lei Zhang	137953538a	Support outputting ANSI color escape sequences in library Previously we required _PRINT to enable _COLOR, which forbids outputting colored disassembly into a string in library. This commit will allow library users to request enabling ANSI color escape sequences.	2017-12-01 09:03:35 -05:00
David Neto	188cd3780d	Erase decorations removed from internal collections Fixes Android arm-64-v8a build with NDK r14. That's because we no longer ignore the result of the std::remove.	2017-11-30 11:35:02 -05:00
David Neto	3c2e4c7d99	Fix validation of group ops in SPV_AMD_shader_ballot This needs custom code since the rules from the extension are not encoded in the grammar. Changes are: - The new group instructions don't require Group capability when the extension is declared. - The Reduce, InclusiveScan, ExclusiveScan normally require the Kernel capability, but don't when the extension is declared. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/991	2017-11-30 10:26:04 -05:00
Diego Novillo	8cfa0c40e0	Fix #1034 - Give Edge::operator<() weak ordering semantics. This should fix #1034. It changes the predicate on operator< to use label IDs from each block and compares them as std:pair to define a weak ordering for std::set.	2017-11-29 17:29:17 -05:00
Andrey Tuganov	e1ceff9f54	Validate OpTypeImage and OpTypeSampleImage Added new validation rules to the validate image pass.	2017-11-29 13:21:04 -05:00
GregF	8dd3d93cf6	AggressiveDCE: Add merge and continue branches for live loop. This ensures that an if-break is not eliminated from a loop. This fixes issue #989	2017-11-29 09:56:21 -05:00
Diego Novillo	9f20799fb4	Convert the CFG to an on-demand analysis - NFC. This fixes some TODOs by moving the CFG into the IRContext as an analysis.	2017-11-28 13:25:41 -05:00
Diego Novillo	74327845aa	Generic value propagation engine. This class implements a generic value propagation algorithm based on the conditional constant propagation algorithm proposed in Constant propagation with conditional branches, Wegman and Zadeck, ACM TOPLAS 13(2):181-210. The implementation is based on A Propagation Engine for GCC Diego Novillo, GCC Summit 2005 http://ols.fedoraproject.org/GCC/Reprints-2005/novillo-Reprint.pdf The purpose of this implementation is to act as a common framework for any transformation that needs to propagate values from statements producing new values to statements using those values.	2017-11-27 23:32:06 -05:00
Diego Novillo	491b112fd2	Fix windows build. This fixes the lack of uint32_t definition in source/val/decoration.h.	2017-11-27 14:40:03 -05:00
Diego Novillo	83228137e1	Re-format source tree - NFC. Re-formatted the source tree with the command: $ /usr/bin/clang-format -style=file -i \ $(find include source tools test utils -name '.cpp' -or -name '.h') This required a fix to source/val/decoration.h. It was not including spirv.h, which broke builds when the #include headers were re-ordered by clang-format.	2017-11-27 14:31:49 -05:00
Andrey Tuganov	d8b2013ecf	Derivative opcodes require Fragment exec model Added validator check that all derivative opcodes require Fragment execution model.	2017-11-27 12:05:25 -05:00
Andrey Tuganov	c170afd93b	Relaxed OpImageWrite texel type check	2017-11-24 14:31:08 -05:00
Andrey Tuganov	f84f266977	Relaxed OpImageRead validation rules Removed the check that result type of OpImageRead should be a vector4. Will reenable/adapt once the spec is clarified on what the right dimension should be.	2017-11-24 10:12:24 -05:00
Alan Baker	0cae89e79e	Notify the context of instructions that are being erased. Fixes use-after-free error in RemoveDuplicatesPass Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1004	2017-11-23 23:43:25 -05:00
Andrey Tuganov	3e08a3f718	Add validation checks for Execution Model Currently checks that these instructions are called from entry points with Fragment execution model. OpImageImplicit* OpImageQueryLod OpKill	2017-11-23 23:38:03 -05:00
David Neto	d9129f00a5	Test for pollution of the global namespace Works on Linux only for now. That's a good start. Move ValidateBinaryUsingContextAndValidationState into anonymous namespace in source/validate.cpp.	2017-11-23 21:27:21 -05:00
Steven Perron	0b1cb27f83	Remove derivative instructions from the list of combinators. These instructions compute their value based the value of the immediate neighbours of the current fragment. This means the result is not defined purely by the operands of the instruction.	2017-11-23 18:37:43 -05:00
Lei Zhang	aec60b8158	Add RegisterLegalizationPasses() into the interface Add note to mention the use scenario. The original list came from Glslang.	2017-11-23 17:26:44 -05:00
Alan Baker	746bfd210a	Adding new def -> use mapping container Replaced representation of uses * Changed uses from unordered_map<uint32_t, UseList> to set<pairInstruction, Instruction>> * Replaced GetUses with ForEachUser and ForEachUse functions * updated passes to use new functions * partially updated tests * lots of cleanup still todo Adding an unique id to Instruction generated by IRContext Each instruction is given an unique id that can be used for ordering purposes. The ids are generated via the IRContext. Major changes: * Instructions now contain a uint32_t for unique id and a cached context pointer * Most constructors have been modified to take a context as input * unfortunately I cannot remove the default and copy constructors, but developers should avoid these * Added accessors to parents of basic block and function * Removed the copy constructors for BasicBlock and Function and replaced them with Clone functions * Reworked BuildModule to return an IRContext owning the built module * Since all instructions require a context, the context now becomes the basic unit for IR * Added a constructor to context to create an owned module internally * Replaced uses of Instruction's copy constructor with Clone whereever I found them * Reworked the linker functionality to perform clones into a different context instead of moves * Updated many tests to be consistent with the above changes * Still need to add new tests to cover added functionality * Added comparison operators to Instruction Adding tests for Instruction, IRContext and IR loading Fixed some header comments for BuildModule Fixes to get tests passing again * Reordered two linker steps to avoid use/def problems * Fixed def/use manager uses in merge return pass * Added early return for GetAnnotations * Changed uses of Instruction::ToNop in passes to IRContext::KillInst Simplifying the uses for some contexts in passes	2017-11-23 16:40:02 -05:00
Lei Zhang	b02c9a5802	Allow derived access chain without uses in access chain conversion	2017-11-23 16:00:28 -05:00
Andrey Tuganov	ab892f7bd6	Add derivatives validation pass Checks operands of instructions in opcode range from OpDPdx to OpFwidthCoarse.	2017-11-23 14:17:10 -05:00
David Neto	c2999273d9	Move SetContextMessageConsumer into libspirv namespace Avoid polluting the global namespace.	2017-11-23 13:56:12 -05:00
Steven Perron	28c415500d	Create a local value numbering pass Creates a pass that removes redundant instructions within the same basic block. This will be implemented using a hash based value numbering algorithm. Added a number of functions that check for the Vulkan descriptor types. These are used to determine if we are variables are read-only or not. Implemented a function to check if loads and variables are read-only. Implemented kernel specific and shader specific versions. A big change is that the Combinator analysis in ADCE is factored out into the IRContext as an analysis. This was done because it is being reused in the value number table.	2017-11-23 11:45:09 -05:00
Andrey Tuganov	f407ae2b50	Validator pass for image instructions Includes validation rules for OpImageXXX and ImageOperand. Doesn't include OpTypeImage and OpImageSparseXXX. Disabled an invalid test.	2017-11-22 14:34:15 -05:00
GregF	e28edd458b	Optimize loads/stores on nested structs Also fix LocalAccessChainConvert test: nested structs now convert Add InsertExtractElim test for nested struct	2017-11-21 17:56:03 -05:00
Andrey Tuganov	b14291581f	Fix move semantics in iterator make_range	2017-11-21 17:36:15 -05:00
Andrey Tuganov	250a235a8d	Add new compression algorithm and models Add new "short descriptor" algorithm to MARK-V codec. Add three shader compression models: lite - fast, poor compression mid - balanced max - best compression	2017-11-21 17:32:58 -05:00
Alan Baker	a771713e42	Adding an unique id to Instruction generated by IRContext Each instruction is given an unique id that can be used for ordering purposes. The ids are generated via the IRContext. Major changes: * Instructions now contain a uint32_t for unique id and a cached context pointer * Most constructors have been modified to take a context as input * unfortunately I cannot remove the default and copy constructors, but developers should avoid these * Added accessors to parents of basic block and function * Removed the copy constructors for BasicBlock and Function and replaced them with Clone functions * Reworked BuildModule to return an IRContext owning the built module * Since all instructions require a context, the context now becomes the basic unit for IR * Added a constructor to context to create an owned module internally * Replaced uses of Instruction's copy constructor with Clone whereever I found them * Reworked the linker functionality to perform clones into a different context instead of moves * Updated many tests to be consistent with the above changes * Still need to add new tests to cover added functionality * Added comparison operators to Instruction * Added an internal option to LinkerOptions to verify merged ids are unique * Added a test for the linker to verify merged ids are unique * Updated MergeReturnPass to supply a context * Updated DecorationManager to supply a context for cloned decorations * Reworked several portions of the def use tests in anticipation of next set of changes	2017-11-20 17:49:10 -05:00
Steven Perron	3214c3b0ca	Add dead function elimination to -O and -Os This pass is very useful in reducing the size of the code, and reducing the amount of work done by other optimizations.	2017-11-20 09:41:03 -05:00
Steven Perron	eb4653a67f	Add the decoration manager to the IRContext. To make the decoration manger available everywhere, and to reduce the number of times it needs to be build, I add one the IRContext. As the same time, I move code that modifies decoration instruction into the IRContext from mempass and the decoration manager. This will make it easier to keep everything up to date. This should take care of issue #928.	2017-11-15 12:48:03 -05:00
Alan Baker	a92d69b43d	Initial implementation of merge return pass. Works with current DefUseManager infrastructure. Added merge return to the standard opts. Added validation to passes. Disabled pass for shader capabilty.	2017-11-15 10:27:04 -05:00
Diego Novillo	98281ed411	Add analysis to compute mappings between instructions and basic blocks. This analysis builds a map from instructions to the basic block that contains them. It is accessed via get_instr_block(). Once built, it is kept up-to-date by the IRContext, as long as instructions are removed via KillInst. I have not yet marked passes that preserve this analysis. I will do it in a separate change. Other changes: - Add documentation about analysis values requirement to be powers of 2. - Force a re-build of the def-use manager in tests. - Fix AllPreserveFirstOnlyAfterPassWithChange to use the DummyPassPreservesFirst pass. - Fix sentinel value for IRContext::Analysis enum. - Fix logic for checking if the instr<->block mapping is valid in KillInst.	2017-11-13 13:21:48 -05:00
Daniel Schürmann	a76d0977ac	Fix decorations of inlined functions. Fixes issue #728. Currently the inliner is not generating decorations for inlined code which corresponds to function code which has decorations. An example of decorations that are relevant: RelaxedPrecision, NoContraction. The solution is to replicate the decoration during inlining.	2017-11-13 12:49:25 -05:00
Steven Perron	efe12ff5a1	Have all MemPasses preserve the def-use manager. Originally the passes that extended from MemPass were those that are of the def-use manager. I am assuming they would be able to preserve it because of that. Added a check to verify consistency of the IRContext. The IRContext relies on the pass to tell it if something is invalidated. It is possible that the pass lied. To help identify those situations, we will check if the valid analyses are correct after each pass. This will be enabled by default for the debug build, and disabled in the production build. It can be disabled in the debug build by adding "-DSPIRV_CHECK_CONTEXT=OFF" to the cmake command.	2017-11-10 11:17:12 -05:00
Diego Novillo	d2938e4842	Re-format files in source, source/opt, source/util, source/val and tools. NFC. This just makes sure every file is formatted following the formatting definition in .clang-format. Re-formatted with: $ clang-format -i $(find source tools include -name '.cpp') $ clang-format -i $(find source tools include -name '.h')	2017-11-08 14:03:08 -05:00
Steven Perron	f32d11f74b	Add the IRContext (part 2): Add def-use manager This change will move the instances of the def-use manager to the IRContext. This allows it to persists across optimization, and does not have to be rebuilt multiple times. Added test to ensure that the IRContext is validating and invalidating the analyses correctly.	2017-11-08 13:35:34 -05:00
GregF	ac04b2faea	Opt: Fix HasLoads to not report decoration as load.	2017-11-07 17:39:58 -05:00
GregF	d86c7ce808	Opt: Remove CommonUniformElimination from -O and -Os (for now) It is causing crashes for some drivers. Will try to re-enable it once existing drivers are able to deal better with it.	2017-11-07 16:55:12 -05:00
Nuno Subtil	2dddb8193b	Validate storage class of target pointer for OpStore	2017-11-02 13:44:11 -04:00
Diego Novillo	9d6cc26226	Move class CFG from namespace opt to namespace ir. It makes more sense to have the CFG inside the ir name space, as it is descriptive of the representation.	2017-11-02 11:51:07 -04:00
Diego Novillo	fef669f30f	Add a new class opt::CFG to represent the CFG for the module. This class moves some of the CFG-related functionality into a new class opt::CFG. There is some other code related to the CFG in the inliner and in opt::LocalSingleStoreElimPass that should also be moved, but that require more changes than this pure restructuring. I will move those bits in a follow-up PR. Currently, the CFG is computed every time a pass is instantiated, but this should be later moved to the new IRContext class that @s-perron is working on. Other re-factoring: - Add BasicBlock::ContinueBlockIdIfAny. Re-factored out of MergeBlockIdIfAny - Rewrite IsLoopHeader in terms of GetLoopMergeInst. - Run clang-format on some files.	2017-11-02 10:37:03 -04:00
Steven Perron	476cae6f7d	Add the IRContext (part 1) This is the first part of adding the IRContext. This class is meant to hold the extra data that is build on top of the module that it owns. The first part will simply create the IRContext class and get it passed to the passes in place of the module. For now it does not have any functionality of its own, but it acts more as a wrapper for the module. The functions that I added to the IRContext are those that either traverse the headers or add to them. I did this because we may decide to have other ways of dealing with these sections (for example adding a type pool, or use the decoration manager). I also added the function that add to the header because the IRContext needs to know when an instruction is added to update other data structures appropriately. Note that there is still lots of work that needs to be done. There are still many places that change the module, and do not inform the context. That will be the next step.	2017-10-31 13:46:05 -04:00
Nuno Subtil	d861ceffd4	Add validation for OpBranchConditional	2017-10-31 12:05:20 -04:00
Andrey Tuganov	7299fb5b7c	Lowered initial capacity of move-to-front sequence Also fixed outdated comments.	2017-10-31 12:00:42 -04:00
GregF	94bec26afe	ADCE: Dead if elimination Mark structured conditional branches live only if one or more instructions in their associated construct is marked live. After closure, replace dead structured conditional branches with a branch to its merge and remove dead blocks. ADCE: Dead If Elim: Remove duplicate StructuredOrder code Also generalize ComputeStructuredOrder so that the caller can specify the root block for the order. Phi insertion uses pseudo_entry_block and adce and dead branch elim use the first block of the function. ADCE: Dead If Elim: Pull redundant code out of InsertPhiInstructions ADCE: Dead If Elim: Encapsulate CFG Cleanup Initialization ADCE: Dead If Elim: Remove redundant code from ADCE initialization ADCE: Dead If: Use CFGCleanup to eliminate newly dead blocks Moved bulk of CFG Cleanup code into MemPass.	2017-10-31 11:51:30 -04:00
Diego Novillo	632e2068f3	More re-factoring to simplify pass initialization. This implements two cleanups suggested by @s-perron (https://github.com/KhronosGroup/SPIRV-Tools/pull/921): - Move FindNamedOrDecoratedIds() into MemPass::InitializeProcessing(). - Remove FinalizeNextId(). Always call SetIdBound() from Pass::TakeNextId().	2017-10-30 09:06:17 -04:00
Steven Perron	716138ee14	Add option to relax validation of store types. There are a number of users of spriv-opt that are hitting errors because of stores with different types. In general, this is wrong, but, in these cases, the types are the exact same except for decorations. The options is "--relax-store-struct", and it can be used with the validator or the optimizer. We assume that if layout information is missing it is consistent. For example if one struct has a offset of one of its members, and the other one does not, we will still consider them as being layout compatible. The problem will be if both struct has and offset decoration for corresponding members, and the offset are different.	2017-10-28 18:48:21 -04:00
Andrey Tuganov	6724c27251	Compression: removed 'presumed index' feature The feature used to improve compression of const integers which were presumed to be indices. Now obsolete as descriptor-based compression does this in a more generalized way.	2017-10-28 18:38:13 -04:00
Jesus Carabano	f063f91d24	Use std::lower_bound for opcode lookup Use std::lower_bound for opcode-to-string Stable sort the generated instruction table.	2017-10-28 18:34:01 -04:00
Diego Novillo	1040a95b3f	Re-factor Phi insertion code out of LocalMultiStoreElimPass Including a re-factor of common behaviour into class Pass: The following functions are now in class Pass: - IsLoopHeader. - ComputeStructuredOrder - ComputeStructuredSuccessors (annoyingly, I could not re-factor all instances of this function, the copy in common_uniform_elim_pass.cpp is slightly different and fails with the common implementation). - GetPointeeTypeId - TakeNextId - FinalizeNextId - MergeBlockIdIfAny This is a NFC (non-functional change)	2017-10-27 15:28:08 -04:00
Steven Perron	94dc66b74d	Change the sections in the module to use the InstructionList class. This change will replace a number of the std::vector<std::unique_ptr<Instruction>> member of the module to InstructionList. This is for consistency and to make it easier to delete instructions that are no longer needed.	2017-10-25 15:52:06 -04:00
Lei Zhang	063dbea0f1	Turn all function static non-POD variables into global POD variables Function static non-POD data causes problems with DLL lifetime. This pull request turns all static info tables into strict POD tables. Specifically, the capabilities/extensions field of opcode/operand/extended-instruction table are turned into two fields, one for the count and the other a pointer to an array of capabilities/extensions. CapabilitySet/EnumSet are not used in the static table anymore, but they are still used for checking inclusion by constructing on the fly, which should be cheap for the majority cases. Also moves all these tables into the global namespace to avoid C++11 function static thread-safe initialization overhead.	2017-10-25 15:44:19 -04:00
Józef Kucia	90862fe4b1	Validate SpvOpVectorShuffle	2017-10-24 11:45:03 -04:00
Jesus Carabano	13e6598947	restrict opcodes targeting OpDecorationGroup	2017-10-24 11:39:08 -04:00
Daniel Schürmann	97990dc907	Fixed --eliminate-common-uniform so that it does not eliminate loads of volatile variables.	2017-10-24 11:17:33 -04:00
David Neto	98072b749f	Optimizer: Line and NoLine are not debug1 or debug2 Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/911	2017-10-24 10:54:23 -04:00
Andrey Tuganov	cfd95f3d5a	Refactored compression debugger Markv codec now receives two optional callbacks: LogConsumer for internal codec logging DebugConsumer for testing if encoding->decoding produces the original results.	2017-10-23 22:12:40 -04:00
Steven Perron	8d6e4dbc72	Run dead variable elimination when using -O and -Os We want to run the optimization when using -O and -Os, but it was not added at part of https://github.com/KhronosGroup/SPIRV-Tools/pull/905. This change will add that a well as some minor formatting changes requested in that same pull request.	2017-10-23 22:09:12 -04:00
GregF	e3a7209330	DeadBranchElim: Fix dead block elimination The previous algorithm would leave invalid code in the case of unreachable blocks pointing into a dead branch. It would leave the unreachable blocks branching to labels that no longer exist. The previous algorithm also left unreachable blocks in some cases (a loop following an orphaned merge block). This fix also addresses that. This code will soon be replaced with the coming CFG cleanup.	2017-10-23 22:04:17 -04:00
Steven Perron	5834719fc1	Add pass to remove dead variables at the module level. There does not seem to be any pass that remove global variables. I think we could use one. This pass will look specifically for global variables that are not referenced and are not exported. Any decoration associated with the variable will also be removed. However, this could cause types or constants to become unreferenced. They will not be removed. Another pass will have to be called to remove those.	2017-10-23 13:57:05 -04:00
David Neto	2436794736	Optimizer: OpModuleProcessed is in its own layout section This is a recent decision from the SPIR WG. The spec update has not yet been published. Khronos SPIR-V internal issue 199	2017-10-23 10:46:37 -04:00
David Neto	d819f513f6	Fix cfg_cleanup.cpp. My bad.	2017-10-20 16:51:20 -04:00
David Neto	e6f3416617	Remove coding redundancy in cfg_cleanup_pass.cpp	2017-10-20 16:05:38 -04:00
Andrey Tuganov	39e25fd8ab	Add validation pass for conversion instructions The pass checks correctness of operands of instruction in opcode range OpConvertFToU - OpBitset. Disabled invalid tests Disabled UConvert validation until Vulkan CTS can catch up. Add validate_conversion to Android.mk Also remove duplicate entry in CMakeLists.txt.	2017-10-20 13:51:24 -04:00
Steven Perron	bb7802b18c	Change BasicBlock to use InstructionList to hold instructions. This is the first step in replacing the std::vector of Instruction pointers to using and intrusive linked list. To this end, we created the InstructionList class. It inherites from the IntrusiveList class, but add the extra concept of ownership. An InstructionList owns the instruction that are in it. This is to be consistent with the current ownership rules where the vector owns the instruction that are in it. The other larger change is that the inst_ member of the BasicBlock class was changed to using the InstructionList class. Added test for the InsertBefore functions, and making sure that the InstructionList destructor will delete the elements that it contains. I've also add extra comments to explain ownership a little better.	2017-10-20 12:37:44 -04:00
Andrey Tuganov	ea9d1d02b7	Removed todos from validate_id.cpp Removed todos for validation of opcodes handles in other passes.	2017-10-19 19:51:31 -04:00
David Neto	863578a38d	DeadBranchElim: Slightly more defensive coding	2017-10-19 19:28:45 -04:00
David Neto	8ec62deb23	The reviewed cfg_cleanup optimize pass	2017-10-19 15:28:09 -04:00
Diego Novillo	c75704ec08	CFG cleanup pass - Remove unreachable blocks. - Adds a new pass CFGCleanupPass. This serves as an umbrella pass to remove unnecessary cruft from a CFG. - Currently, the only cleanup operation done is the removal of unreachable basic blocks. - Adds unit tests. - Adds a flag to spirvopt to execute the pass (--cfg-cleanup).	2017-10-19 15:16:29 -04:00
Diego Novillo	332a1f1422	Re-factor generic constant folding code out of FoldSpecConstantOpAndCompositePass There are no functional changes in this patch. The generic folding routines in FoldSpecConstantOpAndCompositePass are now inside opt/fold.{cpp,h}. This code will be used by the upcoming constant propagation pass. In time, we'll add more expression folding and simplification into these two files.	2017-10-17 19:41:37 -04:00
GregF	1a9061a2be	ADCE: Treat privates like locals in entry point with no calls This is needed for ongoing legalization of HLSL. It allows removal of accesses to textures/buffers that are not used.	2017-10-13 15:39:14 -04:00
GregF	1e7994c085	Opt: Move *NextId functionality into MemPass	2017-10-13 15:22:19 -04:00
Andrey Tuganov	8de8dd8c8c	Reenable validate type unique pass Vulkan CTS patch fixing the instances of non-unique type declaration in autogenerated code has recently been submitted.	2017-10-12 15:46:06 -04:00
Andrey Tuganov	2401fc0a72	Refactored MARK-V API - switched from C to C++ - moved MARK-V model creation from backend to frontend - The same MARK-V model object can be used to encode/decode multiple files - Added MARK-V model factory (currently only one option) - Added --validate option to spirv-markv (run validation while encoding/decoding)	2017-10-12 15:40:40 -04:00
Andrey Tuganov	b54997e6eb	Validator checks OpReturn called from void func Added check into validate_cfg which checks that OpReturn is not called from functions which are supposed to return a value.	2017-10-12 15:32:32 -04:00
Steven Perron	720beb161a	Generic intrusive linked list class. This commit is the initial implementation of the intrusive linked list class. It includes the implementation in the header files, and unit test. The iterators are circular: incrementing end() gives begin() and decrementing begin() gives end(). Also made it valid to decrement end(). Expliticly defines move constructor and move assignment - Visual Studio 2013 does not implicitly generate the move constructor or move assignments. So they need to be explicit, otherwise it will try to use the copy constructor, which we explicitly deleted. - Can't use "= default" either. Seems like VS2013 does not support explicitly using the default move constructors and move assignments, so I wrote them out.	2017-10-12 12:40:18 -04:00
GregF	63064bd9eb	DeadBranchElim: Add dead case elimination Expands dead branch elimination to eliminate dead switch cases. It also changes dbe to eliminate orphaned merge blocks and recursively eliminate any blocks thereby orphaned.	2017-10-12 11:44:05 -04:00
Diego Novillo	c90d7305e7	Add -O, -Os and -Oconfig flags. These flags are expanded to a series of spirv-opt flags with the following semantics: -O: expands to passes that attempt to improve the performance of the generated code. -Os: expands to passes that attempt to reduce the size of the generated code. -Oconfig=<file> expands to the sequence of passes determined by the flags specified in the user-provided file.	2017-10-10 12:14:09 -04:00
Pierre Moreau	86627f7b3f	Implement Linker (module combiner) Add extra iterators for ir::Module's sections Add extra getters to ir::Function Add a const version of BasicBlock::GetLabelInst() Use the max of all inputs' version as version Split debug in debug1 and debug2 - Debug1 instructions have to be placed before debug2 instructions. Error out if different addressing or memory models are found Exit early if no binaries were given Error out if entry points are redeclared Implement copy ctors for Function and BasicBlock - Visual Studio ends up generating copy constructors that call deleted functions while compiling the linker code, while GCC and clang do not. So explicitly write those functions to avoid Visual Studio messing up. Move removing duplicate capabilities to its own pass Add functions running on all IDs present in an instruction Remove duplicate SpvOpExtInstImport Give default options value for link functions Remove linkage capability if not making a library Check types before allowing to link Detect if two types/variables/functions have different decorations Remove decorations of imported variables/functions and their types Add a DecorationManager Add a method for removing all decorations of id Add methods for removing operands from instructions Error out if one of the modules has a non-zero schema Update README.md to talk about the linker Do not freak out if an imported built-in variable has no export	2017-10-06 18:33:53 -04:00
Andrew Woloszyn	d7f199b5d4	Hack around bug in gcc-4.8.1 templates. This keeps the previous behavior for other compilers that will throw warnings on a negative shift operation, but works around the internal compiler error in GCC.	2017-10-06 10:26:17 -04:00
GregF	da04f5640e	AggressiveDCE: Fix to not treat parameter memory refs as local This fixes a bug that incorrectly deletes stores to parameters, which can be used to return values from functions.	2017-10-05 10:59:45 -04:00
Pierre Moreau	c87e9671ab	Compact-ids pass should update the header ID bound	2017-10-03 11:24:28 -04:00
David Neto	169266e9b8	DiagnosticStream move ctor moves output duties to new object - Take over contents of the expiring message stream - Prevent the expiring object from emitting anything during destruction	2017-10-03 11:23:54 -04:00
David Neto	17a843c6b0	Cache end iterators for speed Helps scaling of DefUseManager on modules with many thousands of instructions.	2017-09-29 16:13:55 -04:00
jcaraban	6526c42603	No use to check OpBitCount result width	2017-09-29 09:14:02 +03:00
David Neto	77feb8dd03	Compact-ids pass should update instruction's result_id member Also update the result type field. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/827	2017-09-27 08:31:05 -04:00
Andrey Tuganov	64d5e5214f	Add bitwise operations validator pass The pass checks correctness of operand types of all bitwise instructions (opcode range from SpvOpShiftRightLogical to SpvOpBitCount).	2017-09-26 14:22:37 -04:00
Andrey Tuganov	dcf42433a6	Add remaining opcodes to arithmetics validation Add validation rules for: - OpIAddCarry - OpISubBorrow - OpUMulExtended - OpSMulExtended Includes some refactoring of old code.	2017-09-26 11:47:34 -04:00
Steven Perron	e43c91046b	Create the dead function elimination pass Creates a pass called eliminate dead functions that looks for functions that could never be called, and deletes them from the module. To support this change a new function was added to the Pass class to traverse the call trees from diffent starting points. Includes a test to ensure that annotations are removed when deleting a dead function. They were not, so fixed that up as well. Did some cleanup of the assembly for the test in pass_test.cpp. Trying to make them smaller and easier to read.	2017-09-26 11:18:06 -04:00
Andrey Tuganov	976e4218d5	Detach MARK-V from the validator MARK-V codec was previously dependent on the validation state. Now it doesn't need the validator to function, but can still optionally create it and validate every instruction once it's decoded.	2017-09-26 11:10:23 -04:00
Lei Zhang	16981f87fe	Avoid using global static variables Previously we have several grammar tables defined as global static variables and these grammar table entries contains non-POD struct fields (CapabilitySet/ExtensionSet). The initialization of these non-POD struct fields may require calling operator new. If used as a library and the caller defines its own operator new, things can screw up. This pull request changes all global static variables into function static variables, which is lazy evaluated in a thread safe way as guaranteed by C++11.	2017-09-26 10:59:15 -04:00
Andrey Tuganov	c25b5bea35	Add SPIRV_SPIRV_COMPRESSION option to cmake The option is off by default. cmake -DSPIRV_BUILD_COMPRESSION=ON .. enables the compression lib, executable, and test build. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/834	2017-09-25 14:37:08 -04:00
Andrey Tuganov	3f5e1a91ae	Validator: fix logicals pass for OpSelect pointers OpSelect works with pointers also when capability VariablePointersStorageBuffer is declared (before worked only with capability VariablePointers).	2017-09-21 16:12:14 -04:00
David Neto	33b879c105	elim-multi-store: only patch loop header phis that we created There can already be OpPhi instructions in a loop header that are unrelated to the optimization. We should not be patching those. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/826	2017-09-21 10:01:30 -04:00
Andrey Tuganov	cf85ad1429	Add validate logicals pass to the validator New pass checks operands of all instructions listed under 3.32.15. Relational and Logical Instructions	2017-09-20 10:37:12 -04:00
Andrey Tuganov	4e3cc2f57f	Refactored validate_aritmetics.cpp Improved error messages and readability.	2017-09-20 10:30:54 -04:00
Andrey Tuganov	9b14dd0cb4	Updated markv_autogen - now includes a table of all descriptors with coding scheme (improves performance by 5% by allowing to avoid creation of move-to-front sequences which will never be used) - increased the size of markv_autogen.inc, clang doesn't seem to have the long compilation time problem now (probably was inadvertently fixed by using Huffman codec serialization)	2017-09-20 10:23:22 -04:00
Greg Fischer	8be28f7524	ElimLocalMultiStore: Reset structured successors for each function	2017-09-19 13:47:28 -06:00
Steven Perron	e4c7d8e748	Add strength reduction; for now replace multiply by power of 2 Create a new optimization pass, strength reduction, which will replace integer multiplication by a constant power of 2 with an equivalent bit shift. More changes could be added later. - Does not duplicate constants - Adds vector \|Concat\| utility function to a common test header.	2017-09-18 17:01:36 -04:00
GregF	7be791aaaa	ExtractInsert: Handle rudimentary CompositeConstruct and ConstantComposite This optimizes a single index extract whose composite value terminates with a CompositeConstruct (or ConstantComposite) by evaluating to the correct component. This was needed for opaque legalization. This highlights the need/opportunity to improve this optimization to deal with more complex composite expressions including currently handled ops plus Null ops and special vector composition. A TODO has been added.	2017-09-15 20:33:53 -04:00
Andrey Tuganov	c6dfc11880	Add new checks to validate arithmetics pass New operations: - OpDot - OpVectorTimesScalar - OpMatrixTimesScalar - OpVectorTimesMatrix - OpMatrixTimesVector - OpMatrixTimesMatrix - OpOuterProduct	2017-09-08 11:08:41 -04:00
David Neto	c843ef8ab5	validator: OpModuleProcessed allowed in layout section 7c Recent spec fix from SPIR Working group: Allow OpModuleProcessed after debug names, but before any annotation instructions.	2017-09-07 17:45:51 -04:00
Andrey Tuganov	b36acbec0e	Update MARK-V to version 1.01 Includes: - Multi-sequence move-to-front - Coding by id descriptor - Statistical coding of non-id words - Joint coding of opcode and num_operands Removed explicit form Huffman codec constructor - The standard use case for it is to be constructed from initializer list. Using serialization for Huffman codecs	2017-09-06 16:03:16 -04:00
David Neto	25ddfec08e	Inliner: Fix LoopMerge when inline into loop header of multi block loop This adapts the fix for the single-block loop. Split the loop like before. But when we move the OpLoopMerge back to the loop header, redirect the continue target only when the original loop was a single block loop. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/800	2017-09-05 19:46:24 -04:00
Andrey Tuganov	82df4bbd68	Add validation pass for arithmetic operations The pass checks if arithmetic operations (such as OpFMul) receive correct operands.	2017-09-05 12:21:53 -04:00
Andrey Tuganov	32cf85dd5a	Fix mingw build (source/print.cpp) source/print.cpp doesn't compile due to integer conversion. Tested by @dneto0 on a Windows machine.	2017-09-01 16:07:18 -04:00
David Neto	860c4197b0	Inliner: Remap callee entry block id to single-trip loop header Otherwise cloned phis can be invalid. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/790	2017-09-01 15:56:14 -04:00
David Neto	efff5fabfa	Inline: Fix single-block loop caller cases If the caller block is a single-block loop and inlining will replace the caller block by several blocks, then: - The original OpLoopMerge instruction will end up in the last such block. That's the wrong place to put it. - Move it back to the end of the first block. - Update its Continue Target ID to point to the last block We also have to take care of cases where the inlined code begins with a structured header block. In this case we need to ensure the restored OpLoopMerge does not appear in the same block as the merge instruction from the callee's first block. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/787	2017-09-01 15:47:17 -04:00
David Neto	cff2cd3343	BasicBlock: add ctail, GetMergeInst, GetLoopMergeInst	2017-09-01 11:01:36 -04:00
Andrey Tuganov	725284c2ef	Extension allows multiple same OpTypePointer types SPV_KHR_variable_pointers allows OpTypePointer to declare multiple pointer identical types. https://github.com/KhronosGroup/SPIRV-Tools/issues/781	2017-09-01 10:14:15 -04:00
GregF	7c3de19ce7	DeadBranchElim: Fix dead block detection to ignore backedges - DeadBranchElim: Make sure to mark orphan'd merge blocks and continue targets as live. - Add test with loop in dead branch - Add test that orphan'd merge block is handled. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/776	2017-08-30 13:37:46 -04:00
GregF	a699d1ade7	Inline: Fix remapping of non-label forward references in callee phi	2017-08-29 18:35:05 -06:00
Andrey Tuganov	d41a52415a	Fix encode zero bits on word boundary bug Bit stream writer was manifesting incorrect behaviour when the following two conditions were met: - writer was on 64-bit word boundary - WriteBits was invoked with num_bits=0 (can happen when a Huffman codec has only one value) The bug was causing very rare sporadic corruption which was detected by tests after a random experimental change in MARK-V model.	2017-08-28 13:36:39 -04:00
David Neto	63e1e348b0	Show result id for CompositeInsert validation failure	2017-08-25 15:13:31 -04:00
David Neto	0167758727	Windows: Increase intensity of blue text	2017-08-24 10:40:17 -04:00
Lukas Hermanns	4fe8e389a7	Fix: background color was erroneously reset on Win32 platform. Fix: background color was erroneously reset on Win32 platform.	2017-08-24 10:40:17 -04:00
GregF	429ca05b3f	Opt: Create InlineOpaquePass Only inline calls to functions with opaque params or return TODO: Handle parameter type or return type where the opqaue type is buried within an array.	2017-08-18 18:04:30 -04:00
GregF	c8c86a0d36	Opt: Have "size" passes process full entry point call tree. Includes code to deal correctly with OpFunctionParameter. This is needed by opaque propagation which may not exhaustively inline entry point functions. Adds ProcessEntryPointCallTree: a method to do work on the functions in the entry point call trees in a deterministic order.	2017-08-18 10:16:01 -04:00
Andrey Tuganov	17d941af4f	Huffman codec can serialize to text Refactored the Huffman codec implementation and added ability to serialize to C++-like text format. This would reduce the time-complexity if loading hard-coded codecs.	2017-08-15 23:57:21 -04:00
Andrey Tuganov	78cf86150e	Add id descriptor feature to SPIR-V Id descriptors are computed as a recursive hash of all instructions used to define an id. Descriptors are invarint of actual id values and the similar code in different files would produce the same descriptors. Multiple ids can have the same descriptor. For example %1 = OpConstant %u32 1 %2 = OpConstant %u32 1 would produce two ids with the same descriptor. But %3 = OpConstant %s32 1 %4 = OpConstant %u32 2 would have descriptors different from %1 and %2. Descriptors will be used as handles of move-to-front sequences in SPIR-V compression.	2017-08-10 18:44:52 -04:00
GregF	b0310a4156	ADCE: Add support for function calls ADCE will now generate correct code in the presence of function calls. This is needed for opaque type optimization needed by glslang. Currently all function calls are marked as live. TODO: mark calls live only if they write a non-local.	2017-08-10 17:30:05 -04:00
David Neto	2a1014be9c	Inliner: callee can have early return that isn't multi-return Avoid generating an invalid OpLabel. Create the continue target for the single-trip loop only if you actually created the header for the single-trip loop. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/755	2017-08-10 11:43:44 -04:00
GregF	f0fe601dc8	AccessChainConvert: Add HasOnlySupportedRefs() This avoids conversion on variables which will not ultimately be optimized. Also removed an obsolete restriction from FindTargetVars(). Also added decorates to supported refs (eg. RelaxedPrecision). Also fixed name to IsNonTypeDecorate().	2017-08-04 18:11:44 -04:00
GregF	e28bd39997	Inline: Split out InlineExhaustivePass from InlinePass	2017-08-04 17:56:46 -04:00
GregF	d9a450121e	Mem2Reg: Allow Image and Sampler types as base target types.	2017-08-04 17:52:32 -04:00
GregF	f4b29f3bf7	Add CommonUniformElim pass - UniformElim: Only process reachable blocks - UniformElim: Don't reuse loads of samplers and images across blocks. Added a second phase which only reuses loads within a block for samplers and images. - UniformElim: Upgrade CopyObject skipping in GetPtr - UniformElim: Add extensions whitelist Currently disallowing SPV_KHR_variable_pointers because it doesn't handle extended pointer forms. - UniformElim: Do not process shaders with GroupDecorate - UniformElim: Bail on shaders with non-32-bit ints. - UniformElim: Document support for only single index and add TODO.	2017-08-03 11:34:58 -04:00
GregF	c1b46eedbd	Add MemPass, move all shared functions to it.	2017-08-02 14:24:02 -04:00
Andrey Tuganov	30bee67439	Add multi-sequence move-to-front implementation Add MultiMoveToFront class which supports multiple move-to-front sequences and allows to promote value in all sequences at once. Added caching for last accessed sequence handle and last accessed value in each sequence.	2017-08-02 14:07:24 -04:00
Andrey Tuganov	55b73a0365	Added C++ code generation to spirv-stats The tool can now generate C++ code returning some of the historgrams and Huffman codecs generated from those historgrams.	2017-08-01 15:41:42 -04:00
GregF	7954740d54	Opt: Delete names and decorations of dead instructions	2017-07-26 18:36:41 -04:00
Lei Zhang	9f6efc76c8	Opt: HasOnlySupportedRefs should consider OpCopyObject This fixes test failure after merging the previous pull request.	2017-07-25 23:22:09 -04:00
Lei Zhang	4a539d77ef	Revert "Revert "Opt: LocalBlockElim: Add HasOnlySupportedRefs"" This reverts commit `df96e243c6`.	2017-07-25 23:22:09 -04:00
GregF	1182415581	Add extension whitelists to size-reduction passes. Currently only SPV_KHR_variable_pointers is disallowed in passes which do pointer analysis. Positive and negative tests of the general extensions mechanism were added to aggressive_dce but cover all passes.	2017-07-25 19:14:02 -04:00
Lei Zhang	df96e243c6	Revert "Opt: LocalBlockElim: Add HasOnlySupportedRefs" This reverts commit `2d0f7fbc11`.	2017-07-22 10:48:56 -04:00
greg-lunarg	2d0f7fbc11	Opt: LocalBlockElim: Add HasOnlySupportedRefs Verifies that targeted variables have only access chain and direct loads and stores as references.	2017-07-22 10:32:19 -04:00
GregF	adb237f3bd	Fix handling of CopyObject in GetPtr and its call sites	2017-07-21 18:08:01 -04:00
Lenny Komow	e9e4393b1c	Fix Visual Studio size_t cast compiler warning Visual Studio was complaining about possible loss of data on 64-bit builds, due to an implicit cast from size_t to int. This changes the data to use an int with no cast.	2017-07-13 13:02:43 -06:00
Greg Fischer	fe24e0316f	LocalMultiStore: Always put varId for backedge on loop phi function. And always patch the backedge operand when patching phi functions. This approach is more correct and cleaner. The previous code was generating incorrect phis when the backedge block had no predecessors.	2017-07-12 16:42:07 -04:00
GregF	e2544ddc90	DeadBranchElim: Improve algorithm to only remove blocks with no predecessors Must be careful not to remove blocks pointed at by unreachable blocks	2017-07-12 15:58:42 -04:00
David Neto	06d4fd52c2	Minor code review feedback on AggressiveDCE	2017-07-10 11:45:59 -04:00
GregF	9de4e69856	Add AggressiveDCEPass Create aggressive dead code elimination pass This pass eliminates unused code from functions. In addition, it detects and eliminates code which may have spurious uses but which do not contribute to the output of the function. The most common cause of such code sequences is summations in loops whose result is no longer used due to dead code elimination. This optimization has additional compile time cost over standard dead code elimination. This pass only processes entry point functions. It also only processes shaders with logical addressing. It currently will not process functions with function calls. It currently only supports the GLSL.std.450 extended instruction set. It currently does not support any extensions. This pass will be made more effective by first running passes that remove dead control flow and inlines function calls. This pass can be especially useful after running Local Access Chain Conversion, which tends to cause cycles of dead code to be left after Store/Load elimination passes are completed. These cycles cannot be eliminated with standard dead code elimination. Additionally: This transform uses a whitelist of instructions that it knows do have side effects, (a.k.a. combinators). It assumes other instructions have side effects: it will not remove them, and assumes they have side effects via their ID operands.	2017-07-10 11:30:25 -04:00
GregF	cc8bad3a5b	Add LocalMultiStoreElim pass A SSA local variable load/store elimination pass. For every entry point function, eliminate all loads and stores of function scope variables only referenced with non-access-chain loads and stores. Eliminate the variables as well. The presence of access chain references and function calls can inhibit the above optimization. Only shader modules with logical addressing are currently processed. Currently modules with any extensions enabled are not processed. This is left for future work. This pass is most effective if preceeded by Inlining and LocalAccessChainConvert. LocalSingleStoreElim and LocalSingleBlockElim will reduce the work that this pass has to do.	2017-07-07 17:54:21 -04:00
GregF	52e247f221	DeadBranchElim: Add DeadBranchElimPass	2017-07-07 15:16:25 -04:00
David Neto	35a0695844	Include memory and semantics IDs when iterating over inbound IDs Fixes Instruction::ForEachInId so it covers SPV_OPERAND_TYPE_MEMORY_SEMANTICS_ID and SPV_OPERAND_TYPE_SCOPE_ID. Future proof a bit by using the common spvIsIdType routine. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/697	2017-07-05 10:36:57 -04:00
Andrey Tuganov	abc6f5a672	MARK-V decoder supports extended instructions	2017-07-04 16:31:19 -04:00
d3x0r	fd70a1d7a0	Define variable to skip installation If this is used as a static library in another project, this does not need to be installed, and otherwise will just clutter the application's install. To use, define SKIP_SPIRV_TOOLS_INSTALL which internally defines ENABLE_SPIRV_TOOLS_INSTALL to control installation. Also include GNUInstallDirs to get standard output 'lib' directory which is sometimes 'lib64' and not 'lib'	2017-07-04 12:24:44 -04:00

... 19 20 21 22 23 ...

2572 Commits