SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-12-24 16:51:06 +00:00

Author	SHA1	Message	Date
Alastair Donaldson	8336d1925f	Extend reducer to remove relaxed precision decorations (#2797 ) Adds a reduction pass that removes OpDecorate and OpMemberDecorate instructions that annotate instructions and members with RelaxedPrecision. As well as being useful in its own right, removing such references allows other passes to remove further instructions.	2019-08-22 23:33:09 +01:00
Steven Perron	b00ef0d26e	Handle Id overflow in private-to-local (#2807 ) We need to handle id overflow in the private to local pass. Fixes https://crbug.com/962295	2019-08-22 09:14:48 -04:00
Steven Perron	aef8f92b2b	Even more id overflow in sroa (#2806 ) Now we need to handle id overflow when we overflow while replacing uses of the variable. While looking at this code, I noticed an error in the way we handle access chains that cannot be replaced because of overflow. Name it will make some change, and then give up by returning SuccessWithoutChange. But it was changed. This is fixed up by returning Failure if we notice the error at the time of rewriting the users. This is for both id overflow or out-of-bounds accesses. Code is added to "CheckUses" to remove variables that have out-of-bounds accesses from the candidate list, so we don't even try to rewrite its uses. Fixes https://crbug.com/995032	2019-08-21 13:12:42 -04:00
Steven Perron	c5d1dab99e	Add name for variables in desc sroa (#2805 ) Fixes #2802.	2019-08-21 10:55:02 -04:00
Steven Perron	bc62722b80	Handle overflow in wrap-opkill (#2801 ) Fixes https://crbug/994203	2019-08-18 19:00:18 -04:00
Steven Perron	9cd07272a6	More handle overflow in sroa (#2800 ) If we run out of ids when creating a new variable, sroa does not recognize the error, and continues doing work. This leads to segmentation faults. Fixes https://crbug/969655	2019-08-16 13:15:17 -04:00
greg-lunarg	06407250a1	Instrument: Add support for Buffer Device Address extension (#2792 )	2019-08-16 09:18:34 -04:00
Toomas Remmelg	7b4e5bd5ec	Update remquo validation to match the OpenCL Extended Instruction Set Specification (#2791 )	2019-08-15 09:38:37 -04:00
alan-baker	bbd80462f5	Fix validation of constant matrices (#2794 ) Fixes #2793 * Don't special case matrix validation compared to other composites * just check the constituents are constants or undefs * later checking validates the column type * new test	2019-08-14 11:26:41 -04:00
Steven Perron	60043edfa1	Replace OpKill With function call. (#2790 ) We are no able to inline OpKill instructions into a continue construct. See #2433. However, we have to be able to inline to correctly do legalization. This commit creates a pass that will wrap OpKill instructions into a function of its own. That way we are able to inline the rest of the code. The follow up to this will be to not inline any function that contains an OpKill. Fixes #2726	2019-08-14 09:27:12 -04:00
Steven Perron	f701237f2d	Remove useless semi-colons (#2789 ) Later versions of clang seem to pick up more useless semi-colons. I've removed them.	2019-08-12 08:52:39 -04:00
greg-lunarg	95386f9e45	Instrument: Fix version 2 output record write for tess eval shaders. (#2782 ) Fix output record write for tess eval shaders. Also change command line for bindless instrumentation to use use output record version 2.	2019-08-09 08:22:41 -04:00
Steven Perron	4b64beb1ae	Add descriptor array scalar replacement (#2742 ) Creates a pass that will replace a descriptor array with individual variables. See #2740 for details. Fixes #2740.	2019-08-08 10:53:19 -04:00
greg-lunarg	29af42df12	Add SPV_EXT_physical_storage_buffer to opt whitelists (#2779 ) This also fixes ADCE to not remove possibly needed OpTypeForwardPointer. The bug, its fix and the corresponding test have a circular dependency with the extension, so they are packaged together.	2019-08-08 09:45:59 -04:00
Steven Perron	b029d3697e	Handle RelaxedPrecision in SROA (#2788 ) If a member of a struct has a relaxed precision, sroa will not split the struct. This means we do not get all cases. This commit handles these cases. The other part is that the decoration needs to be passed on to the new variables. Fixes #2786	2019-08-07 12:17:26 -04:00
Alastair Donaldson	698b56a8f0	Add 'copy object' transformation (#2766 ) This transformation can introduce an instruction that uses OpCopyObject to make a copy of some other result id. This change introduces the transformation, but does not yet introduce a fuzzer pass to actually apply it.	2019-08-05 18:00:13 +01:00
Ryan Harrison	5ada98d0bb	Update WebGPU validation rules of OpAtomic*s (#2777 ) Fixes #2723	2019-07-31 17:15:47 -04:00
alan-baker	3726b500b1	Treat access chain indexes as signed in SROA (#2776 ) Fixes #2768 * In scalar replacement, interpret access chain indexes as signed counts * Use Constant::GetSignExtendedValue and Constant::GetZeroExtendedValue where appropriate * new tests	2019-07-31 15:39:33 -04:00
David Neto	31590104ec	Add pass to inject code for robust-buffer-access semantics (#2771 ) spirv-opt: Add --graphics-robust-access Clamps access chain indices so they are always in bounds. Assumes: - Logical addressing mode - No runtime-array-descriptor-indexing - No variable pointers Adds stub code for clamping coordinate and samples for OpImageTexelPointer. Adds SinglePassRunAndFail optimizer test fixture. Android.mk: add source/opt/graphics_robust_access_pass.cpp Adds Constant::GetSignExtendedValue, Constant::GetZeroExtendedValue	2019-07-30 19:52:46 -04:00
Ryan Harrison	4a28259cc8	Update OpMemoryBarriers rules for WebGPU (#2775 ) Part of #2724	2019-07-30 14:50:55 -04:00
David Neto	7621034aae	Add opt test fixture method SinglePassRunAndFail (#2770 ) Checks for failure status code and matches against the expected error message.	2019-07-30 10:38:46 -04:00
Diego Novillo	49797609b7	Protect against out-of-bounds references when folding OpCompositeExtract (#2774 ) This fixes #2608. The original test case had an out-of-bounds reference that ended up folding into OpCompositeExtract that was indexing right outside the constant composite. The returned constant would then cause a segfault during constant propagation.	2019-07-29 13:27:40 -07:00
alan-baker	7fd2365b06	Don't move debug or decorations when folding (#2772 ) Fixes #2764 * Don't replace all uses when simplifying instructions, instead only update non-debug, non-decoration uses * added a test * Add a new version of RAUW that takes a predicate to decide whether to replace the use or not * used in simplification pass	2019-07-29 16:20:43 -04:00
Ryan Harrison	7bafeda284	Update OpControlBarriers rules for WebGPU (#2769 ) * Update OpControlBarriers rules for WebGPU Part of #2724	2019-07-29 12:53:27 -04:00
Diego Novillo	9559cdbdf0	Fix #2609 - Handle out-of-bounds scalar replacements. (#2767 ) * Fix #2609 - Handle out-of-bounds scalar replacements. When SROA tries to do a replacement for an OpAccessChain that is exactly one element out of bounds, the code was trying to access its internal array of replacements and segfaulting. This protects the code from doing this, and it additionally fixes the way SROA works by not returning failure when it refuses to do a replacement. Instead of failing the optimization pass, SROA will now simply refuse to do the replacement and keep going. Additionally, this patch fixes the SROA logic to now return a proper status so we can correctly state that the pass made no changes to the IR if it only found invalid references.	2019-07-26 12:33:40 -04:00
Alastair Donaldson	f54b8653dd	Limit fuzzer tests so that they take less time to run (#2763 ) The recently added fuzzer_replayer and fuzzer_shrinker tests were rather heavyweight and were leading to CI timeouts. This change reduces the runtime of those tests by having them do fewer iterations.	2019-07-25 13:09:49 -04:00
Steven Perron	bb0e2f65bb	Fix check for unreachable blocks in merge-return (#2762 ) Merge return expects unreachable merge block to look a certain way, and unreachable continue blocks to look a certain way. What if an unreachable block is both a merge and a continue? The continue is suppose to take precedent, but merge-return implements it with the merge taking precedent. This change flips that around. Fixes #2746	2019-07-25 09:34:18 -04:00
Alastair Donaldson	1a89ac8b28	Transformation and fuzzer pass to add dead continues (#2758 ) Similar to the existing 'add dead breaks' pass, this adds a pass to add dead continues to blocks in loops where such a transformation is viable. Various functionality common to this new pass and 'add dead breaks' has been factored into 'fuzzer_util', and some small improvements to 'add dead breaks' that were identified while reviewing that code again have been applied. Fixes #2719.	2019-07-25 13:50:33 +01:00
Ryan Harrison	65f49dfc39	Remove unneeded future imports (#2739 ) Also, adds explicitly setting python executable in the NDK build script, rewrites some Python2-isms to 3isms, and formats some code. Fixes #2738	2019-07-24 15:29:38 -04:00
Steven Perron	c7fcb8c3b9	Process OpDecorateId in ADCE (#2761 ) * Process OpDecorateId in ADCE When there is an OpDecorateId instruction that is live, the ids that is references must be kept live. This change adds them to the worklist. I've also updated a validator check to allow OpDecorateId to be able to apply to decoration groups. Fixes #1759. * Remove dead code.	2019-07-24 14:43:49 -04:00
Steven Perron	fb83b6fbb5	Record correct dominators in merge return (#2760 ) In merge return, we need to know the original dominator for a block in order to traverse code from the original dominator to the new dominator and add appropriate Phi nodes. The current code gets this wrong because the dominator tree is build as needed. The first time we get the immediate dominator for a function we just built the dominator tree and it takes into account that a block has been split. The second time it does not. This inconsistency needs to be fixed. We do that by recording the original dominator for all blocks at the start of the pass. If we were to record just the basic block, that could change if the block is split. We want to traverse the code in the body of the original dominator, whatever block it ends up in. To make this easy to track, we not save the terminator instruction to represent the original dominator. Fixes #2745	2019-07-24 13:56:54 -04:00
Steven Perron	c9190a54da	SSA rewriter: Don't use trivial phis (#2757 ) When a phi candidate is marked as trivial, we are suppose to update all of its uses to the reference the value that it is being folded to. However, the code updates the uses misses `defs_at_block_`. So at a later time, the id for the trivial phi can reemerge. Fixes #2744	2019-07-23 17:59:30 -04:00
alan-baker	aea4e6b1b9	Fix block depth rule priority (#2755 ) Fixes #2743 * Continue depth calculation should take precedence over merge calculation	2019-07-23 13:57:44 -04:00
alan-baker	a94ddc267c	Case validation with repeated labels (#2689 ) Fixes #2686 * Update validation to handle the default case being mentioned multiple times * new tests	2019-07-23 11:23:32 -04:00
greg-lunarg	3855447d93	Bindless Instrument: Make init check depend solely on input_init_enabled (#2753 ) * Bindless Instrument: Make init check depend solely on input_init_enabled Previously was dependent on presense of descriptor_indexing extension in SPIR-V, but this missed some cases. Tests updated to refect this new policy. * Fix format.	2019-07-22 13:51:39 -04:00
Kévin Petit	11516c0b9a	Validate storage class OpenCL environment rules for atomics (#2750 ) This change refactors all storage class validation for atomics to reflect the similar refactoring in the specification. It is currently not possible to write a test for the check rejecting Generic in an OpenCL 1.2 environment as the required GenericPointer capability isn't allowed there. I've decided to keep the check nonetheless to guard against the capability becoming available without the rules for atomics being updated. The ID changes in existing tests aren't ideal but introducing names drags in a substantial refactoring of this file. Contributes to #2595. Signed-off-by: Kevin Petit <kevin.petit@arm.com>	2019-07-22 08:38:42 -04:00
Jason Macnak	bac82f49aa	Allow LOD ops in compute shaders with derivative group execution modes (#2752 ) Also update existing derivative check to be based on the execution mode instead of just the extension being present. More info about extension: - https://github.com/KhronosGroup/SPIRV-Registry/blob/master/extensions/NV/SPV_NV_compute_shader_derivatives.asciidoc	2019-07-22 08:37:44 -04:00
Steven Perron	aa9e8f5380	Revert "Do not inline OpKill Instructions (#2713 )" (#2749 ) This reverts commit `fe7cc9c612`.	2019-07-17 14:59:05 -04:00
Jeff Bolz	58e2ec25ba	For Vulkan, disallow structures containing opaque types (#2546 )	2019-07-16 16:16:19 -04:00
Steven Perron	230c9e4371	Fix bug in merge return (#2734 ) * Fix bug in merge return The merge return pass seems to assume that the only new edges in the cfg are from return block to merge blocks. However, it is possible that a merge block branches to a merge block when it did not before. This change add a new variable to track all of the new edges. It also renames some other variables and cleans us the code to make it a bit easier to read. Fixes #2702.	2019-07-16 09:11:22 -04:00
Jason Macnak	1fedf72e50	Allow ray tracing shaders in inst bindle check pass. (#2733 ) Adds the ray tracing stages (ray gen, intersection, any hit, closest hit, miss, and callable) to the allowed stages in pass instrumentation and add debug records for these stages to output the global launch id. More information for ray tracing shaders: - https://github.com/KhronosGroup/GLSL/blob/master/extensions/nv/GLSL_NV_ray_tracing.txt	2019-07-15 16:24:42 -04:00
Ryan Harrison	032adc4d7e	Correctly implement WebGPU related flag exclusions (#2737 ) Fixes #2736	2019-07-12 14:14:46 -04:00
greg-lunarg	92c41ff1e7	Remove Common Uniform Elimination Pass (#2731 ) Remove Common Uniform Elimination Pass Fixes #2520.	2019-07-12 11:02:10 -04:00
Ryan Harrison	55adf4cf70	Update execution scope rules for WebGPU (#2730 ) Fixes #2722	2019-07-11 14:37:36 -04:00
alan-baker	1a2de48a12	Extra small storage validation (#2732 ) Fixes #2729 * Check acceptable uses of small type generators	2019-07-11 13:05:14 -04:00
Jeff Bolz	327963765b	Add validation for SPV_EXT_demote_to_helper_invocation (#2707 )	2019-07-11 10:33:22 -04:00
Steven Perron	5ce8cf781f	Change the order branches are simplified in dead branch elim (#2728 ) Dead branch elimination needs to know about the constructs that a block is contained it when determining what to do with its merge instruction. We currently fold branches in block as we see them, which is parent constructs before their children. This causes the struct cfg analysis to crash because it tries to get the parent construct for a block after the parent has been folded. This can be fixed by folding the branch of the children before the parents. Fixes #2667.	2019-07-10 14:59:44 -04:00
Thomas Roughton	cd153db8ed	Add —preserve-bindings and —preserve-spec-constants (#2693 ) Add optimizer options to for preservation of spec constants and variable with binding decorations. They are to be preserved even if they are unused.	2019-07-10 14:12:19 -04:00
Steven Perron	86e45efe15	Handle decorations better in some optimizations (#2716 ) There are a couple spots where we are not looking at decorations when we should. 1. Value numbering is suppose to assign a different value number to ids if they have different decorations. However that is not being done for OpCopyObject and OpPhi. 1. Instruction simplification is propagating OpCopyObject instruction without checking for decorations. It should only do that if no decorations are being lost. Add a new function to the decoration manager to check if the decorations of one id are a subset of the decorations of another. Fixes #2715.	2019-07-10 11:37:16 -04:00
Ryan Harrison	3a252a267b	Update memory scope rules for WebGPU (#2725 ) Fixes #2721	2019-07-10 10:34:50 -04:00

1 2 3 4 5 ...

1390 Commits