SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-12-25 17:21:06 +00:00

Author	SHA1	Message	Date
alan-baker	3726b500b1	Treat access chain indexes as signed in SROA (#2776 ) Fixes #2768 * In scalar replacement, interpret access chain indexes as signed counts * Use Constant::GetSignExtendedValue and Constant::GetZeroExtendedValue where appropriate * new tests	2019-07-31 15:39:33 -04:00
David Neto	31590104ec	Add pass to inject code for robust-buffer-access semantics (#2771 ) spirv-opt: Add --graphics-robust-access Clamps access chain indices so they are always in bounds. Assumes: - Logical addressing mode - No runtime-array-descriptor-indexing - No variable pointers Adds stub code for clamping coordinate and samples for OpImageTexelPointer. Adds SinglePassRunAndFail optimizer test fixture. Android.mk: add source/opt/graphics_robust_access_pass.cpp Adds Constant::GetSignExtendedValue, Constant::GetZeroExtendedValue	2019-07-30 19:52:46 -04:00
Ryan Harrison	4a28259cc8	Update OpMemoryBarriers rules for WebGPU (#2775 ) Part of #2724	2019-07-30 14:50:55 -04:00
David Neto	7621034aae	Add opt test fixture method SinglePassRunAndFail (#2770 ) Checks for failure status code and matches against the expected error message.	2019-07-30 10:38:46 -04:00
Diego Novillo	49797609b7	Protect against out-of-bounds references when folding OpCompositeExtract (#2774 ) This fixes #2608. The original test case had an out-of-bounds reference that ended up folding into OpCompositeExtract that was indexing right outside the constant composite. The returned constant would then cause a segfault during constant propagation.	2019-07-29 13:27:40 -07:00
alan-baker	7fd2365b06	Don't move debug or decorations when folding (#2772 ) Fixes #2764 * Don't replace all uses when simplifying instructions, instead only update non-debug, non-decoration uses * added a test * Add a new version of RAUW that takes a predicate to decide whether to replace the use or not * used in simplification pass	2019-07-29 16:20:43 -04:00
Ryan Harrison	7bafeda284	Update OpControlBarriers rules for WebGPU (#2769 ) * Update OpControlBarriers rules for WebGPU Part of #2724	2019-07-29 12:53:27 -04:00
Diego Novillo	9559cdbdf0	Fix #2609 - Handle out-of-bounds scalar replacements. (#2767 ) * Fix #2609 - Handle out-of-bounds scalar replacements. When SROA tries to do a replacement for an OpAccessChain that is exactly one element out of bounds, the code was trying to access its internal array of replacements and segfaulting. This protects the code from doing this, and it additionally fixes the way SROA works by not returning failure when it refuses to do a replacement. Instead of failing the optimization pass, SROA will now simply refuse to do the replacement and keep going. Additionally, this patch fixes the SROA logic to now return a proper status so we can correctly state that the pass made no changes to the IR if it only found invalid references.	2019-07-26 12:33:40 -04:00
Alastair Donaldson	f54b8653dd	Limit fuzzer tests so that they take less time to run (#2763 ) The recently added fuzzer_replayer and fuzzer_shrinker tests were rather heavyweight and were leading to CI timeouts. This change reduces the runtime of those tests by having them do fewer iterations.	2019-07-25 13:09:49 -04:00
Steven Perron	bb0e2f65bb	Fix check for unreachable blocks in merge-return (#2762 ) Merge return expects unreachable merge block to look a certain way, and unreachable continue blocks to look a certain way. What if an unreachable block is both a merge and a continue? The continue is suppose to take precedent, but merge-return implements it with the merge taking precedent. This change flips that around. Fixes #2746	2019-07-25 09:34:18 -04:00
Alastair Donaldson	1a89ac8b28	Transformation and fuzzer pass to add dead continues (#2758 ) Similar to the existing 'add dead breaks' pass, this adds a pass to add dead continues to blocks in loops where such a transformation is viable. Various functionality common to this new pass and 'add dead breaks' has been factored into 'fuzzer_util', and some small improvements to 'add dead breaks' that were identified while reviewing that code again have been applied. Fixes #2719.	2019-07-25 13:50:33 +01:00
Ryan Harrison	65f49dfc39	Remove unneeded future imports (#2739 ) Also, adds explicitly setting python executable in the NDK build script, rewrites some Python2-isms to 3isms, and formats some code. Fixes #2738	2019-07-24 15:29:38 -04:00
Steven Perron	c7fcb8c3b9	Process OpDecorateId in ADCE (#2761 ) * Process OpDecorateId in ADCE When there is an OpDecorateId instruction that is live, the ids that is references must be kept live. This change adds them to the worklist. I've also updated a validator check to allow OpDecorateId to be able to apply to decoration groups. Fixes #1759. * Remove dead code.	2019-07-24 14:43:49 -04:00
Steven Perron	fb83b6fbb5	Record correct dominators in merge return (#2760 ) In merge return, we need to know the original dominator for a block in order to traverse code from the original dominator to the new dominator and add appropriate Phi nodes. The current code gets this wrong because the dominator tree is build as needed. The first time we get the immediate dominator for a function we just built the dominator tree and it takes into account that a block has been split. The second time it does not. This inconsistency needs to be fixed. We do that by recording the original dominator for all blocks at the start of the pass. If we were to record just the basic block, that could change if the block is split. We want to traverse the code in the body of the original dominator, whatever block it ends up in. To make this easy to track, we not save the terminator instruction to represent the original dominator. Fixes #2745	2019-07-24 13:56:54 -04:00
Steven Perron	c9190a54da	SSA rewriter: Don't use trivial phis (#2757 ) When a phi candidate is marked as trivial, we are suppose to update all of its uses to the reference the value that it is being folded to. However, the code updates the uses misses `defs_at_block_`. So at a later time, the id for the trivial phi can reemerge. Fixes #2744	2019-07-23 17:59:30 -04:00
alan-baker	aea4e6b1b9	Fix block depth rule priority (#2755 ) Fixes #2743 * Continue depth calculation should take precedence over merge calculation	2019-07-23 13:57:44 -04:00
alan-baker	a94ddc267c	Case validation with repeated labels (#2689 ) Fixes #2686 * Update validation to handle the default case being mentioned multiple times * new tests	2019-07-23 11:23:32 -04:00
greg-lunarg	3855447d93	Bindless Instrument: Make init check depend solely on input_init_enabled (#2753 ) * Bindless Instrument: Make init check depend solely on input_init_enabled Previously was dependent on presense of descriptor_indexing extension in SPIR-V, but this missed some cases. Tests updated to refect this new policy. * Fix format.	2019-07-22 13:51:39 -04:00
Kévin Petit	11516c0b9a	Validate storage class OpenCL environment rules for atomics (#2750 ) This change refactors all storage class validation for atomics to reflect the similar refactoring in the specification. It is currently not possible to write a test for the check rejecting Generic in an OpenCL 1.2 environment as the required GenericPointer capability isn't allowed there. I've decided to keep the check nonetheless to guard against the capability becoming available without the rules for atomics being updated. The ID changes in existing tests aren't ideal but introducing names drags in a substantial refactoring of this file. Contributes to #2595. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-07-22 08:38:42 -04:00
Jason Macnak	bac82f49aa	Allow LOD ops in compute shaders with derivative group execution modes (#2752 ) Also update existing derivative check to be based on the execution mode instead of just the extension being present. More info about extension: - https://github.com/KhronosGroup/SPIRV-Registry/blob/master/extensions/NV/SPV_NV_compute_shader_derivatives.asciidoc	2019-07-22 08:37:44 -04:00
Steven Perron	aa9e8f5380	Revert "Do not inline OpKill Instructions (#2713 )" (#2749 ) This reverts commit `fe7cc9c612`.	2019-07-17 14:59:05 -04:00
Jeff Bolz	58e2ec25ba	For Vulkan, disallow structures containing opaque types (#2546 )	2019-07-16 16:16:19 -04:00
Steven Perron	230c9e4371	Fix bug in merge return (#2734 ) * Fix bug in merge return The merge return pass seems to assume that the only new edges in the cfg are from return block to merge blocks. However, it is possible that a merge block branches to a merge block when it did not before. This change add a new variable to track all of the new edges. It also renames some other variables and cleans us the code to make it a bit easier to read. Fixes #2702.	2019-07-16 09:11:22 -04:00
Jason Macnak	1fedf72e50	Allow ray tracing shaders in inst bindle check pass. (#2733 ) Adds the ray tracing stages (ray gen, intersection, any hit, closest hit, miss, and callable) to the allowed stages in pass instrumentation and add debug records for these stages to output the global launch id. More information for ray tracing shaders: - https://github.com/KhronosGroup/GLSL/blob/master/extensions/nv/GLSL_NV_ray_tracing.txt	2019-07-15 16:24:42 -04:00
Ryan Harrison	032adc4d7e	Correctly implement WebGPU related flag exclusions (#2737 ) Fixes #2736	2019-07-12 14:14:46 -04:00
greg-lunarg	92c41ff1e7	Remove Common Uniform Elimination Pass (#2731 ) Remove Common Uniform Elimination Pass Fixes #2520.	2019-07-12 11:02:10 -04:00
Ryan Harrison	55adf4cf70	Update execution scope rules for WebGPU (#2730 ) Fixes #2722	2019-07-11 14:37:36 -04:00
alan-baker	1a2de48a12	Extra small storage validation (#2732 ) Fixes #2729 * Check acceptable uses of small type generators	2019-07-11 13:05:14 -04:00
Jeff Bolz	327963765b	Add validation for SPV_EXT_demote_to_helper_invocation (#2707 )	2019-07-11 10:33:22 -04:00
Steven Perron	5ce8cf781f	Change the order branches are simplified in dead branch elim (#2728 ) Dead branch elimination needs to know about the constructs that a block is contained it when determining what to do with its merge instruction. We currently fold branches in block as we see them, which is parent constructs before their children. This causes the struct cfg analysis to crash because it tries to get the parent construct for a block after the parent has been folded. This can be fixed by folding the branch of the children before the parents. Fixes #2667.	2019-07-10 14:59:44 -04:00
Thomas Roughton	cd153db8ed	Add —preserve-bindings and —preserve-spec-constants (#2693 ) Add optimizer options to for preservation of spec constants and variable with binding decorations. They are to be preserved even if they are unused.	2019-07-10 14:12:19 -04:00
Steven Perron	86e45efe15	Handle decorations better in some optimizations (#2716 ) There are a couple spots where we are not looking at decorations when we should. 1. Value numbering is suppose to assign a different value number to ids if they have different decorations. However that is not being done for OpCopyObject and OpPhi. 1. Instruction simplification is propagating OpCopyObject instruction without checking for decorations. It should only do that if no decorations are being lost. Add a new function to the decoration manager to check if the decorations of one id are a subset of the decorations of another. Fixes #2715.	2019-07-10 11:37:16 -04:00
Ryan Harrison	3a252a267b	Update memory scope rules for WebGPU (#2725 ) Fixes #2721	2019-07-10 10:34:50 -04:00
alan-baker	0c4feb643b	Remove extra semis (#2717 ) * Remove extra semi-colons * Update re2 dep	2019-07-08 15:07:36 -04:00
alan-baker	456cc598af	Validate usage of 8- and 16-bit types with only storage capabilities (#2704 ) Fixes #2669 * Check capabilities when validating variables * validate load and store types * Constant check * Don't checks pointers for stores, constants and loads * Validate composite instructions * Validate conversions for 8- and 16-bit limited types * Unified tests and expanded them * Disallow OpCopyMemory * new tests and update old tests	2019-07-08 14:10:13 -04:00
Alastair Donaldson	b8ab80843f	Shrinker for spirv-fuzz (#2708 ) Adds to spirv-fuzz the option to shrink a sequence of transformations that lead to an interesting binary to be generated, to find a smaller sub-sequence of transformations that still lead to an interesting (but hopefully simpler) binary being generated. The notion of what counts as "interesting" comes from a user-provided script, the "interestingness function", similar to the way the spirv-reduce tool works. The shrinking process will give up after a maximum number of steps, which can be configured on the command line. Tests for the combination of fuzzing and shrinking are included, using a variety of interestingness functions.	2019-07-07 08:55:30 +01:00
Steven Perron	37e8f79946	Perform merge return with single return in loop. (#2714 ) Inlining does not inline functions that have a single return that is in a loop. This is because the return cannot be replaced by a branch outside of the loop easily. Merge return knows how to rewrite the function so the return is replaced by a branch. Fixes #2038.	2019-07-04 14:14:49 -04:00
Steven Perron	fe7cc9c612	Do not inline OpKill Instructions (#2713 ) It is illegal to inline an OpKill instruction into a continue construct because the continue header will no longer dominate the backedge. This commit adds a check for this, and does not inline. If we still want to be able to inline a function that contains an OpKill, we can add a new pass that will wrap OpKill instructions into its own function with just the single instruction. I do not believe that this is a common case right now, so I will not do that yet. Fixes #2433.	2019-07-04 12:08:23 -04:00
Jason Macnak	e6e3e2ccc6	Update type for loaded builtin GlobalInvocationID in pass instrumentation (#2705 ) When working on descriptor indexing validation for compute shaders, the gl_GlobalInvocationID builtin was being loaded as uint which would cause compute shaders instrumented by the bindless check pass to have: %83 = OpLoad %uint %gl_GlobalInvocationID %84 = OpCompositeExtract %uint %83 0 %85 = OpCompositeExtract %uint %83 1 %86 = OpCompositeExtract %uint %83 2 which results in validation failures: error: line 127: Reached non-composite type while indexes still remain to be traversed. %84 = OpCompositeExtract %uint %83 0 for trying to extract a uint from a uint.	2019-06-28 09:46:16 -04:00
Alastair Donaldson	efde682369	Disallow movement of unreachable blocks. (#2700 ) Fixes #2695. Allowing unreachable blocks to be moved can lead to an unreachable block A getting placed after an unreachable successor B, which is a problem if B uses ids that A generates.	2019-06-26 15:32:25 +01:00
Alastair Donaldson	dfcb5a1e10	Refactor fuzzer transformations (#2694 ) Introduced abstract class for transformations, and refactored all transformations to inherit from this abstract class.	2019-06-25 20:49:46 +01:00
Józef Kucia	888aeef8a9	Fix Component decoration validation for arrays (#2697 )	2019-06-25 13:28:16 -04:00
Józef Kucia	7c294608ca	Basic validation for Component decorations (#2679 ) * Add basic validation for Component decoration * Add validator tests for Component decoration	2019-06-20 18:16:12 -04:00
alan-baker	2b84d25f10	Fix store to uniform Vulkan check (#2688 ) * Wrong operands were used for pointer and array types * added tests to catch the wierd number corner	2019-06-20 14:22:41 -04:00
Alastair Donaldson	51b0d5ce50	Represent uniform facts via descriptor set and binding. (#2681 ) * Represent uniform facts via descriptor set and binding. Previously uniform facts were expressed with resepect to the id of a uniform variable. Describing them with respect to a descriptor set and binding is more convenient from the point of view of expressing facts about a shader without requiring analysis of its SPIR-V. * Fix equality testing for uniform buffer element descriptors. The equality test now checks that the lengths of the index vectors match. Added a test that exposes the previous omission.	2019-06-19 20:45:14 +01:00
Alastair Donaldson	001e823b65	Add fuzzer pass to obfuscate constants. (#2671 ) Adds a new transformation that can replace a constant with a uniform known to have the same value, and adds a fuzzer pass that (a) replaces a boolean with a comparison of literals (e.g. replacing "true" with "42 > 24"), and then (b) obfuscates the literals appearing in this comparison by replacing them with identically-valued uniforms, if available. The fuzzer_replayer test file has also been updated to allow initial facts to be provided, and to do error checking of the status results returned by the fuzzer and replayer components.	2019-06-18 18:41:08 +01:00
alan-baker	2090d7a2d2	Handle volatile memory semantics in upgrade (#2674 ) * If an atomic is decorated with volatile add the volatile bit to its memory semantics	2019-06-17 16:01:37 -04:00
alan-baker	3d5fb7b908	Validate Volatile memory semantics bit (#2672 ) * Can only be used with Vulkan memory model * Can only be used with atomics * Bit setting must match for compare exchange opcodes * Updated memory semantics checks to allow constant instructions generally with CooperativeMatrixNV	2019-06-17 13:35:40 -04:00
alan-baker	400dbde0ba	Disallow stores to UBOs (#2651 ) Fixes #2638 * Adds a check that errors out if there is a store to a UBO in the Vulkan environment * tests * Function to trace pointers	2019-06-17 13:13:07 -04:00
alan-baker	59983a6010	Validate variable initializer type (#2668 ) Fixes #249 * The pointed to type of Result Type must match the initializer type * Had to update some opt tests to be valid	2019-06-15 00:34:18 -04:00

1 2 3 4 5 ...

1373 Commits