SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-11-24 20:40:13 +00:00

Author	SHA1	Message	Date
Steven Perron	6a9be627c7	Keep NOPs when comparing with original binary (#2931 ) We have a check that ensures that the optimizer did not change the binary when it says that it did not. However, when the module is converted back to a binary, we made a decision to remove OpNop instructions. This means that any spv file that originally contains a NOP will fail this check. To get around this, we convert the module to a second binary that keeps the OpNop instructions. That binary is compared against the original. Fixes https://crbug.com/1010191	2019-10-18 09:53:29 -04:00
Jakub Kuderski	e3da3143b2	Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980 )	2019-10-17 13:53:34 -04:00
Jakub Kuderski	e99b918221	Support constant-folding UConvert and SConvert (#2960 )	2019-10-16 16:29:55 -04:00
alan-baker	2276e59788	Validate that selections are structured (#2962 ) * Validate that selections are structured WIP * new checks that switch and conditional branch are preceded by a selection merge where necessary * Don't consider unreachable blocks * Add some tests * Changed how labels are marked as seen * Moved check to more appropriate place * Labels are now marked as seen when they are encountered in a terminator instead of when the block is checked * more tests * more tests * Method comment * new test for a bad case	2019-10-11 17:01:30 -04:00
Steven Perron	32f76efa6c	Link cfg and dominator analysis in the context (#2946 ) Fixes #2889	2019-10-08 10:16:18 -04:00
Jeremy Hayes	3c7ff8d4f0	Enable OpTypeCooperativeMatrix specialization (#2927 )	2019-10-07 09:52:48 -04:00
Steven Perron	c18c9ff6bc	Handle OpKill better (#2933 ) We want to handle OpKill better. The wrap-opkill pass causes lots of extra code to be generated, even when it is not needed to avoid the main problem: OpKill cannot be found directly in a continue construct. This change will be more selective about the functions in which the OpKill is wrapped and about the functions that inlining avoids. Fixes #2912	2019-10-04 13:05:32 -04:00
greg-lunarg	ad3d23f478	Generate null pointer by converting uint64 zero to pointer. (#2935 ) Fixes #2929.	2019-10-04 12:26:38 -04:00
alan-baker	9d7428b052	Validate physical storage buffer restrictions (#2930 ) * Physical storage buffer cannot be used with OpConstantNull, OpPtrEqual, OpPtrNotEqual or OpPtrDiff * new tests * see also #2929	2019-10-02 21:12:57 -04:00
Steven Perron	9eb1c9a4c4	Add continue construct analysis to struct cfg analysis (#2922 ) * Add continue construct analysis to struct cfg analysis Add the ability to identify which blocks are in the continue construct for a loop, and to get functions that are called from those blocks, directly or indirectly. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/2912.	2019-10-01 10:27:09 -04:00
Steven Perron	85c67b5e08	Record trailing line dbg instructions (#2926 ) There is nothing in the spir-v spec that says the last instructions in a module cannot be OpLine or OpNoLine. However, the code that parses the module will simply drop these instructions. We add code that will preserve these instructions. Strip-debug-info is updated to remove these instructions. Fixes https://crbug.com/1000689.	2019-09-27 16:03:45 -04:00
Ryan Harrison	4075b921f9	Add removing references to debug instructions when removing them (#2923 ) Fixes #2921	2019-09-27 13:23:06 -05:00
Steven Perron	2a11f365bc	Handle id overflow in wrap-opkill (#2916 ) New code in wrap-opkill does not handle id overflow correctly. We fix that up. Fixes https://crbug.com/1007144	2019-09-25 17:42:58 -04:00
Steven Perron	55ea57a785	Handle extract with no indexes (#2910 ) * Handle extract with no indexes It is possible that OpCompositeExtract instructions will not have any indexes. This is not handled well by scalar replacement and instruction folding. Fixes https://crbug.com/1006435 * Fix typo.	2019-09-24 16:19:31 -04:00
Steven Perron	6f26d9ad81	Handle id overflow in convert local access chains (#2908 ) Fixes https://crbug.com/1004453	2019-09-24 14:04:54 -04:00
Steven Perron	6b07212659	Use OpReturn* in wrap-opkill (#2886 ) * Use OpReturn* in wrap-opkill The wrap-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-20 10:32:27 -04:00
Steven Perron	61edde52a0	Revert "Use OpReturn* in wrap-opkill" This reverts commit `87f0fa432f`.	2019-09-19 22:39:56 -04:00
Steven Perron	87f0fa432f	Use OpReturn* in wrap-opkill The wrap-opkill pass is generating incorrect code. It is placing an OpUnreachable at the end of a basic block, when the block can be reached. We can't reach the end of the block, but we can reach the end. Instead we will add a return instruction. Fixes #2875.	2019-09-19 22:34:57 -04:00
Steven Perron	248c80b049	Handle OpConstantNull in copy-prop-arrays. (#2870 ) Many of the places in copy propagate arrays assume that an integer constant will be defined by an OpConstant instruction. That is not always true. We fix these spots by allowing for an OpConstantNull.	2019-09-19 10:24:00 -04:00
alan-baker	5a48c0da15	SPIRV-Tools support for SPIR-V 1.5 (#2865 ) * Ensure same enum values have consistent extension lists * val: fix checking of capabilities The operand for an OpCapability should only be checked for the extension or core version. The InstructionPass registers a capability, and all its implied sub-capabilities before actually checking the operand to an OpCapability. * Add basic support for SPIR-V 1.5 - Adds SPV_ENV_UNIVERSAL_1_5 - Command line tools default to spv1.5 environment - SPIR-V 1.5 incorporates several extensions. Now the disassembler prefers outputting the non-EXT or non-KHR names. This requires updates to many tests, to make strings match again. - Command line tests: Expect SPIR-V 1.5 by default * Test validation of SPIR-V 1.5 incorporated extensions Starting with 1.5, incorporated features no longer require the associated OpExtension instruction.	2019-09-13 14:59:02 -04:00
Steven Perron	c7a39bc40f	Don't inline function containing OpKill (#2842 ) If an OpKill instruction is inlined into a continue construct, then the spir-v is no longer valid. To avoid this issue, we do not inline a function containing an OpKill at all. This method was chosen because it is difficult to keep track of whether or not you are in a continue construct while changing the function that is being inlined into. This will work well with wrap OpKill because everything will still be inlined except for the OpKill instruction itself. Fixes #2554 Fixes #2433 This reverts commit `aa9e8f5380`.	2019-09-11 13:26:55 -04:00
Steven Perron	4f9256db35	Handle id overflow in wrap op kill. (#2851 ) Fixes https://crbug.com/997729	2019-09-11 13:26:42 -04:00
David Neto	9f188e3374	Assembler: Can't set an ID in instruction without result ID (#2852 ) Fix tests that violated this rule. Fixes #2257	2019-09-11 13:15:25 -04:00
Steven Perron	35c9518c4e	Handle id overflow in the ssa rewriter. (#2845 ) * Handle id overflow in the ssa rewriter. Remove LocalSSAElim pass at the same time. It does the same thing as the SSARewrite pass. They even share almost all of the same code. Fixes crbug.com/997246	2019-09-10 09:38:23 -04:00
Steven Perron	7f7236f1eb	Handle id overflow in the constant manager. (#2844 ) Fixes crbug.com/997246	2019-09-09 15:12:26 -04:00
Steven Perron	76261e2a7d	Replace CubeFaceCoord and CubeFaceIndexAMD (#2840 ) Part of #2814.	2019-09-06 17:11:37 -04:00
Steven Perron	b218ad1994	Fold Min, Max, and Clamp instructions. (#2836 ) Fixes #2830.	2019-09-05 13:30:03 -04:00
Steven Perron	a41520eaa4	Replace uses of SPV_AMD_shader_trinary_minmax extension (#2835 ) Part of #2814	2019-09-05 09:29:04 -04:00
Ryan Harrison	19b256616d	For WebGPU<->Vulkan optimization, set correct execution environment (#2834 ) Fixes #2833	2019-09-04 13:08:58 -04:00
greg-lunarg	d11725b1d4	Add --relax-float-ops and --convert-relaxed-to-half (#2808 ) The first pass applies the RelaxedPrecision decoration to all executable instructions with float32 based type results. The second pass converts all executable instructions with RelaxedPrecision result to the equivalent float16 type, inserting converts where necessary.	2019-09-03 13:22:13 -04:00
Steven Perron	b54d950298	Fold Fmix should accept vector operands. (#2826 ) Fixes #2819	2019-09-03 09:17:18 -04:00
Steven Perron	d67130caca	Replace SwizzleInvocationsAMD extended instruction. (#2823 ) Part of #2814	2019-08-30 14:07:24 -04:00
Steven Perron	ad71c057c7	Replace SwizzleInvocationsMaskedAMD extended instruction. (#2822 ) Part of #2814	2019-08-30 10:48:42 -04:00
Steven Perron	35d98be3bc	Amd ext to khr (#2811 ) Add the first steps to removing the AMD extension VK_AMD_shader_ballot. Splitting up to make the PRs smaller. Adding utilities to add capabilities and change the version of the module. Replaces the instructions: OpGroupIAddNonUniformAMD = 5000 OpGroupFAddNonUniformAMD = 5001 OpGroupFMinNonUniformAMD = 5002 OpGroupUMinNonUniformAMD = 5003 OpGroupSMinNonUniformAMD = 5004 OpGroupFMaxNonUniformAMD = 5005 OpGroupUMaxNonUniformAMD = 5006 OpGroupSMaxNonUniformAMD = 5007 and extended instructions WriteInvocationAMD = 3 MbcntAMD = 4 Part of #2814	2019-08-29 12:48:17 -04:00
Steven Perron	73422a0a5e	Check feature mgr in context consistency check (#2818 ) We add a check that the feature manager is correct after each pass. This resulted in a couple of failing test cases. Those are fixed. Part of #2814	2019-08-28 11:49:16 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not test what they were supposed to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
Steven Perron	b00ef0d26e	Handle Id overflow in private-to-local (#2807 ) We need to handle id overflow in the private to local pass. Fixes https://crbug.com/962295	2019-08-22 09:14:48 -04:00
Steven Perron	aef8f92b2b	Even more id overflow in sroa (#2806 ) Now we need to handle id overflow when we overflow while replacing uses of the variable. While looking at this code, I noticed an error in the way we handle access chains that cannot be replaced because of overflow. Namely, it will make some changes and then give up by returning SuccessWithoutChange, even though the module was changed. This is fixed up by returning Failure if we notice the error at the time of rewriting the users. This applies to both id overflow and out-of-bounds accesses. Code is added to "CheckUses" to remove variables that have out-of-bounds accesses from the candidate list, so we don't even try to rewrite their uses. Fixes https://crbug.com/995032	2019-08-21 13:12:42 -04:00
Steven Perron	c5d1dab99e	Add name for variables in desc sroa (#2805 ) Fixes #2802.	2019-08-21 10:55:02 -04:00
Steven Perron	bc62722b80	Handle overflow in wrap-opkill (#2801 ) Fixes https://crbug/994203	2019-08-18 19:00:18 -04:00
Steven Perron	9cd07272a6	More handle overflow in sroa (#2800 ) If we run out of ids when creating a new variable, sroa does not recognize the error, and continues doing work. This leads to segmentation faults. Fixes https://crbug/969655	2019-08-16 13:15:17 -04:00
greg-lunarg	06407250a1	Instrument: Add support for Buffer Device Address extension (#2792 )	2019-08-16 09:18:34 -04:00
Steven Perron	60043edfa1	Replace OpKill With function call. (#2790 ) We are not able to inline OpKill instructions into a continue construct. See #2433. However, we have to be able to inline to correctly do legalization. This commit creates a pass that will wrap OpKill instructions into a function of its own. That way we are able to inline the rest of the code. The follow up to this will be to not inline any function that contains an OpKill. Fixes #2726	2019-08-14 09:27:12 -04:00
Steven Perron	f701237f2d	Remove useless semi-colons (#2789 ) Later versions of clang seem to pick up more useless semi-colons. I've removed them.	2019-08-12 08:52:39 -04:00
greg-lunarg	95386f9e45	Instrument: Fix version 2 output record write for tess eval shaders. (#2782 ) Fix output record write for tess eval shaders. Also change command line for bindless instrumentation to use output record version 2.	2019-08-09 08:22:41 -04:00
Steven Perron	4b64beb1ae	Add descriptor array scalar replacement (#2742 ) Creates a pass that will replace a descriptor array with individual variables. See #2740 for details. Fixes #2740.	2019-08-08 10:53:19 -04:00
greg-lunarg	29af42df12	Add SPV_EXT_physical_storage_buffer to opt whitelists (#2779 ) This also fixes ADCE to not remove possibly needed OpTypeForwardPointer. The bug, its fix and the corresponding test have a circular dependency with the extension, so they are packaged together.	2019-08-08 09:45:59 -04:00
Steven Perron	b029d3697e	Handle RelaxedPrecision in SROA (#2788 ) If a member of a struct has a relaxed precision, sroa will not split the struct. This means we do not get all cases. This commit handles these cases. The other part is that the decoration needs to be passed on to the new variables. Fixes #2786	2019-08-07 12:17:26 -04:00
alan-baker	3726b500b1	Treat access chain indexes as signed in SROA (#2776 ) Fixes #2768 * In scalar replacement, interpret access chain indexes as signed counts * Use Constant::GetSignExtendedValue and Constant::GetZeroExtendedValue where appropriate * new tests	2019-07-31 15:39:33 -04:00
David Neto	31590104ec	Add pass to inject code for robust-buffer-access semantics (#2771 ) spirv-opt: Add --graphics-robust-access Clamps access chain indices so they are always in bounds. Assumes: - Logical addressing mode - No runtime-array-descriptor-indexing - No variable pointers Adds stub code for clamping coordinate and samples for OpImageTexelPointer. Adds SinglePassRunAndFail optimizer test fixture. Android.mk: add source/opt/graphics_robust_access_pass.cpp Adds Constant::GetSignExtendedValue, Constant::GetZeroExtendedValue	2019-07-30 19:52:46 -04:00

524 Commits