SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-11-29 22:41:03 +00:00

Author	SHA1	Message	Date
Steven Perron	5f4284aa78	Add limit for scalar replacment when fuzzing (#4843 ) The fuzzer cretes code with very large array, and scalar replacement times out. Adding a limit on the size of the composites that will be split when fuzzing. Fixes https://crbug.com/oss-fuzz/48630	2022-07-05 20:12:58 -04:00
Steven Perron	92fe420c8a	Reduce load size does not work for array with spec const size (#4845 ) Arrays do not have to have a size that is known at compile time. It could be a spec constant. In these cases, treat the array as if it is arbitrarily long. This commit will treat it like it is an array of size UINT32_MAX. Fixes https://crbug.com/oss-fuzz/47397.	2022-07-05 16:16:50 -04:00
Steven Perron	d5a3bfcf2f	Avoid undefined behaviour when getting debug opcode (#4842 ) If the `instruction` operand in an extended instruction instruction is too large, it causes undefined behaviour when that value is cast to the enum for the corresponding set. This is done with the NonSemanticDebug100 instruction set. We need to avoid the undefined behaviour. Fixes #4727	2022-07-05 14:14:29 -04:00
Steven Perron	32622ba7c6	DCE: clean up the cfg for all functions that were processed. (#4840 ) Which functions are processed is determined by which ones are on the call tree from the entry points before dead code is removed. So it is possible that a function is process because it is called from an entry point, but the CFG is not cleaned up because the call to the function was removed. The fix is to process and cleanup every function in the module. Since all of the dead functions would have already been removed in an earlier step of DCE, it should not make a different in compile time. Fixes #4731	2022-07-05 12:23:32 -04:00
alan-baker	286e9c1187	Use structural dominance to validate cfg (#4832 ) * Structural dominance introduced in SPIR-V 1.6 rev2 * Changes the structured cfg validation to use structural dominance * structural dominance is based on a cfg where merge and continue declarations are counted as graph edges * Basic blocks now track structural predecessors and structural successors * Add validation for entry into a loop * Fixed an issue with inlining a single block loop * The continue target needs to be moved to the latch block * Simplify the calculation of structured exits * no longer requires block depth * Update many invalid tests	2022-06-29 23:32:20 -04:00
Steven Perron	66d88508dd	Build struct order only for the section needed when unrolling. (#4830 ) We currently build the structured order for all nodes reachable from the loop header when unrolling a loop. However, unrolling only needs the nodes in the loop and possibly the merge node. To avoid needless computation, I have implemented a search that will stop at the merge node. Fixes #4827	2022-06-29 09:53:26 -04:00
Alastair Donaldson	f2dfa53ae5	Avoid unrolling large loops while fuzzing (#4835 ) Uses a preprocessor macro to bail out of unrolling loops with large iteration counts during fuzzing, to reduce the number of timeouts/memouts that arise. Related issue: #4728.	2022-06-29 09:12:09 -04:00
Steven Perron	37d2396cab	Fix SplitLoopHeader to handle single block loop (#4829 ) The code in `CFG::SplitLoopHeader` assumes the loop header is not the latch. This leads to it not being able to find the latch block. This has been fixed, and a test added. Fixes #4527	2022-06-24 12:33:45 -04:00
Steven Perron	3c9fd7577f	Avoid if-conversion if both predecessors are the same (#4826 ) If the predecessor blocks are the same, then there is only 1 value for the OpPhi. The simplition pass will simplify it, and it causes problems for if-conversion. In these cases, if-conversion can just punt. Fixes #3554.	2022-06-24 15:28:06 +00:00
PENGUINLIONG	c4ed5157dc	Fixed crash unrolling loops with residual iterations (#4820 )	2022-06-23 16:01:44 -04:00
Steven Perron	845d98d468	Do not check if the binary changed if encoding is different (#4824 ) There is an assert that verifies that the binary did not change when the optimizer said that it did not. However, if the input binary is in big endian format, the optimizer will encode the optimized binary in little endian. This causes the assert to fail. Since we do not believe that anybody cares about a big endien formate, we will disable the assert in that case. Fixes #4722	2022-06-21 19:58:21 +00:00
Steven Perron	4f321f862a	Avoid undefined divide-by-0 (#4821 ) The Reciprocal function expects a divide-by-0 to return nan,and then Reciprocal will return 0. Since the divide-by-0 is actually undefined, we will identify this case early, and return 0. No new tests are needed because we already tests folding divide-by-0. Fixes #4715	2022-06-21 11:37:34 -04:00
manas-kulkarni	fbcb6cf4c8	Ability to fold Constant Vector times Matrix and Matrix times vector instructions (#4818 )	2022-06-16 13:54:12 -04:00
Steven Perron	76ebfb989f	Avoid replacing access chain with OOB access (#4819 ) An access chain could have a constant index that is an out of bounds access. This is valid spir-v, even if it can cause problems at runtime. However, it is not valid to have an OpCompositeExtract with an out of bounds access. This means we have to stop local-access-chain-convert from making that change. Fixes #4605	2022-06-14 13:06:38 -04:00
David Neto	8f7f5024f8	Simplify invocation of snprintf (#4815 )	2022-06-10 17:55:45 -04:00
strangewiz	fad68a7551	Fix usage of sprintf. (#4811 )	2022-06-08 19:49:46 +00:00
Nicolas Capens	130a05d2e3	Fold multiply and subtraction into FMA with negation (#4808 ) This change adds a folding rule which transforms x * y - a and a - x * y into FMA(x, y, -a) and FMA(-x, y, a), respectively. While the SPIR-V instruction count remains the same, target instruction sets typically feature FMA instruction variants that can negate an operand. Also this transformation may unlock further optimizations which eliminate the negation. (Google bug: b/226145988)	2022-05-31 12:03:56 -04:00
Steven Perron	088cb1a5c8	Add more folding for composite instructions (#4802 ) * Add move folding for composite instructions Fold chains of insert into construct If a chain of OpCompositeInsert instruction write to every element of a composite object, then we can replace it with an OpCompositeConstruct. Fold a construct fed by extracts to a single extract We already fold an OpCompositeConstruct when it is simlpy reconstructing an object that was decomposed by a series of OpCompositeExtract instructions. However, we do not do that if that object is an element of a larger object. I have updated the rule, so that if the original object is a an element of a larger object, then the OpCompositeConstruct is replaced with a single OpCompositeExtract from the larger object. Fixes #4371.	2022-05-26 10:29:02 -04:00
stu-s	c267127846	Add SPV_KHR_fragment_shader_barycentric support (#4805 ) * Add SPV_KHR_fragment_shader_barycentric support	2022-05-25 09:20:39 -04:00
Steven Perron	f74b85853c	Handle 64-bit integers in local access chain convert (#4798 ) * Handle 64-bit integers in local access chain convert The local access chain convert pass does on run on module that have 64-bit integers, even if they have nothing to to with access chains. This is very limiting because other passes rely on the access chains being removed. So this commit will add this functionality to the pass.	2022-05-10 17:02:14 +00:00
Daniele Vettorel	f7a6e3b9d5	Handle chains of OpAccessChain in replacing variable index access for flattened resources. (#4797 )	2022-05-10 11:41:43 -04:00
Jaebaek Seo	ad3514b732	spirv-opt: add pass for interface variable scalar replacement (#4779 ) Replace shader's stage variables whose types are array or matrix with scalars/vectors. For example, ``` Before: %foo = OpVariable %_ptr_Output__arr_v2float_uint_4 Output After: %foo = OpVariable %_ptr_Output_v2float Output %foo_0 = OpVariable %_ptr_Output_v2float Output %foo_1 = OpVariable %_ptr_Output_v2float Output %foo_2 = OpVariable %_ptr_Output_v2float Output ```	2022-05-09 14:04:52 -04:00
JiaoluAMD	c11ea09652	spirv-opt : Add FixFuncCallArgumentsPass (#4775 ) spirv validation require OpFunctionCall with memory object, usually this is non issue as all the functions are inlined. This pass deal with some case for DontInline function. accesschain input operand would be replaced new created variable	2022-05-06 10:39:26 -04:00
JiaoluAMD	2c7fb9707b	Handle dontinline function in spread-volatile-semantics (#4776 ) Handle function calls in spread-volatile-semantics	2022-05-04 10:52:58 -04:00
Steven Perron	1295dca8e2	Reapply "Add folding rule to generate Fma instructions (#4783 )" (#4789 ) This reverts commit `671f6e633f`. PR #4783 was reverted because it caused OpenCL CTS failures for clvk. The was in clspv, which was not adding the no contract decoration when it was required. This has been fixed in https://github.com/google/clspv/pull/845. We can now reapply #4783.	2022-05-03 10:20:23 -04:00
sindney	46492aa45a	spirv-opt: skips if_conversion when dontflatten is set (#4770 )	2022-04-28 19:26:02 +00:00
Daniele Vettorel	671f6e633f	Revert "Add folding rule to generate Fma instructions (#4783 )" (#4785 ) This reverts commit `2b2b0282af`.	2022-04-20 10:55:20 -04:00
Steven Perron	2b2b0282af	Add folding rule to generate Fma instructions (#4783 ) Adding Fma instruction can speed up the code. This was requested by swiftshader, so they do not have to do this analysis themselves. It can also help reduce the code size, and the work the ICD compilers have to do.	2022-04-19 11:25:07 -04:00
Steven Perron	92c17edde7	Don't try to unroll loop with step count 0. (#4769 ) These loop are infinite loop, so there is no reason to unroll the loop. Fixes #4711.	2022-04-11 10:21:15 -04:00
Jaebaek Seo	05745cc9d4	Handle shaders without execution model in spread-volatile-semantics (#4766 ) spread-volatile-semantics pass spreads Volatile semantics for builtin variables used by certain execution models based on VUID-StandaloneSpirv-VulkanMemoryModel-04678 and VUID-StandaloneSpirv-VulkanMemoryModel-04679 (See "Standalone SPIR-V Validation" section of Vulkan spec "Appendix A: Vulkan Environment for SPIR-V"). Therefore, shaders without execution model (e.g., used only for linkage) are not the target of the pass. This commit lets the pass just return SuccessWithoutChange in that case.	2022-03-25 17:54:46 +00:00
Nikita	a3fbc9331b	Support SPV_KHR_uniform_group_instructions (#4734 )	2022-03-25 08:32:50 -04:00
Greg Fischer	9d1b572884	spirv-opt: (WIP) Eliminate Dead Input Component Pass (#4720 ) This adds the --eliminate-dead-input-components pass which currently removes trailing unused components from input arrays. Fixes #4532	2022-03-22 20:50:52 -06:00
Daniel Thornburgh	3820c4f6e2	Qualify std::move. (#4741 ) Clang added -Wunqualified-std-cast-call in https://reviews.llvm.org/D119670, which warns on unqualified std::move and std::forward calls. This change qualifies these calls to allow the project to build on HEAD Clang -Werror.	2022-03-22 11:20:11 -04:00
Steven Perron	0741f42738	Reset the id bound on the module in compact ids (#4744 ) If the body of the module does not have any ids change, compact ids will not change the id bound. This can cause problems because the id bound could be much higher than the largest id in that is used. It should be reset any time it is not the larger id used + 1. Fixes #4604	2022-03-07 20:33:01 +00:00
Steven Perron	48a36c72e4	Better handling of 0xFFFFFFFF when folding vector shuffle (#4743 ) When folding a vector shuffle feeding a vector shuffle, we do not propagate an 0xFFFFFFFF, which has a special meaning, correctly. We adjust the value making it lose it meaning as an undefined value. Fixes #4581	2022-03-07 19:35:57 +00:00
Steven Perron	4fa1a6f9b4	Generalize assert in ccp (#4735 ) CCP does not want to fold an instruction unless it folds to a constant. There is an asser to check for this. The question if a spec constant counts as a constant. The constant folder considers a spec constant a constand, but CCP does not. I've fixed the assert in CCP to match what the folder does. It should not require any new changes to CCP.	2022-03-07 19:33:10 +00:00
Steven Perron	920156cf18	Add pass to remove DontInline function control (#4747 ) Swift shader needs a way to inline all functions, even those marked as DontInline. See https://github.com/KhronosGroup/SPIRV-Tools/pull/4471. This implements the suggestion I made in the PR. We add a pass that will remove the DontInline function control, so that the inlining passes will inline them. SwiftShader will still have to modify their code to add this pass before the other passes are run.	2022-03-07 12:45:17 -05:00
Steven Perron	0b8426346d	Don't rebuilt valid analyses. (#4733 ) The function `BuildInvalideAnalyses` will be rebuilt for every analysis that has been requested, but it is not necessary. It also can cause problems because if the CFG needs to be rebuilt, so do the dominator trees. This change will make the functionality match the description of the function.	2022-03-04 20:16:42 +00:00
pd-valve	d18d0d92e5	Optimize DefUseManager allocations (#4709 ) * Optimize DefUseManager allocations Saves around 30-35% of compilation time. For inst->use_ids, use a pool linked list instead of allocating vectors for every instruction. For inst->uses, use a "PooledLinkedList"' -- a linked list that has shared storage for all nodes. Neither re-use nodes, instead we do a bulk compaction operation when too much memory is being wasted (tuneable). Includes separate PooledLinkedList templated datastructure, a very special case construct, but split out to make the code a little easier to understand.	2022-02-15 19:17:30 -05:00
pd-valve	a123632ed9	Optimize Type::HashValue (#4707 ) Incrementally compute the hash instead of collecting words Avoids allocating temporary space in a std::vector and std::u32string, and making three passes over all the hashed data. Switch to using std::vector to prevent processing duplicates instead of std::unordered_set: avoids an allocation/deletion every call to ComputeHashValue, and ends up faster due to much better cache behaviour and smaller constant-factor when searching the (generally very small) list. In my test case, made Type::HashValue go from 7.5% of compilation time to .5%	2022-02-15 18:57:39 +00:00
sfricke-samsung	471428a04f	spirv-opt: Add OpExecutionModeId support (#4719 ) Needed for Vulkan1.3 adding LocalSizeId support	2022-02-14 14:33:29 +00:00
Natalie Chouinard	72e4475b41	Handle propagation of arrays with decorations (#4717 ) When copy propagating, OpDecorate instructions can be copied as is. For array flattening, they should be ignored.	2022-02-11 16:13:14 -05:00
pd-valve	940127a77d	avoid unnecessary reallocations in GetOperandConstants (#4708 ) reserve capacity since we know the size exactly	2022-02-10 18:31:24 +00:00
pd-valve	44923beb52	Optimize Instruction::Instruction (#4705 ) Avoid constructing temporary vector + copying operands multiple times. Add SmallVector(InputIt first, InputIt last), matching std::vector.	2022-02-10 18:31:07 +00:00
Steven Perron	5b371918b9	Have scalar replacement use undef instead of null (#4691 ) Scalar replacement generates a null when there value for a member will not be used. The null is used to make sure things are deterministic in case there is an error. However, some type cannot be null, so we will change that to use undef. To keep the code simpler we will always use the undef. Fixes #3996	2022-02-03 15:51:15 +00:00
Shahbaz Youssefi	7fa9e746ef	Introduce spirv-diff (#4611 ) spirv-diff is a new tool that produces diff-style output comparing two SPIR-V modules. The instructions between the src and dst modules are matched as best as the tool can, and output is produced (in src id-space) that shows which instructions are removed in src, added in dst or modified between them. The order of instructions are not retained. Matching instructions between two SPIR-V modules is not trivial, and thus a number of heuristics are applied in this tool. In particular, without debug information, it's hard to match functions as they can be reordered. As such, this tool is primarily useful to produce the diff of two SPIR-V modules derived from the same source. This tool can be useful in a number of scenarios: - Compare the SPIR-V before and after modifying a shader - Compare the SPIR-V produced from a shader before and after compiler codegen changes. - Compare the SPIR-V produced from a shader before and after some transformation or optimization. - Compare the SPIR-V produced from a shader with different compilers.	2022-02-02 10:33:18 -05:00
Steven Perron	b846f8f1dc	Complete handling of RayQueryKHR type (#4690 ) The handling of the RayQueryKHR type is not complete in the type manager. The tests were not picking this up. I've added a test to make sure that the `GenerateAllTypes` function actually does generate all of the types. Once it is added there other tests should pick up on the other parts that were missing.	2022-01-31 15:44:32 +00:00
luzpaz	65ecfd1093	Fix various source comment (doxygen) typos (#4680 ) Found via `codespell -q 3 -L fo,lod,parm	2022-01-26 15:13:08 -05:00
Jaebaek Seo	fb9a10cd48	spirv-opt: add pass to Spread Volatile semantics (#4667 ) Add a pass to spread Volatile semantics to variables with SMIDNV, WarpIDNV, SubgroupSize, SubgroupLocalInvocationId, SubgroupEqMask, SubgroupGeMask, SubgroupGtMask, SubgroupLeMask, or SubgroupLtMask BuiltIn decorations or OpLoad for them when the shader model is the ray generation, closest hit, miss, intersection, or callable shaders. This pass can be used for VUID-StandaloneSpirv-VulkanMemoryModel-04678 and VUID-StandaloneSpirv-VulkanMemoryModel-04679 (See "Standalone SPIR-V Validation" section of Vulkan spec "Appendix A: Vulkan Environment for SPIR-V"). Handle variables used by multiple entry points: 1. Update error check to make it working regardless of the order of entry points. 2. For a variable, if it is used by two entry points E1 and E2 and it needs the Volatile semantics for E1 while it does not for E2 - If VulkanMemoryModel capability is enabled, which means we have to set memory operation of load instructions for the variable, we update load instructions in E1, but do not update the ones in E2. - If VulkanMemoryModel capability is disabled, which means we have to add Volatile decoration for the variable, we report an error because E1 needs to add Volatile decoration for the variable while E2 does not. For the simplicity of the implementation, we assume that all functions other than entry point functions are inlined.	2022-01-25 13:14:36 -05:00
Pierre Moreau	42dc678913	Remove duplicated "the" from comments (#4666 )	2022-01-12 19:04:13 -05:00
Pierre Moreau	05c839ca01	Improvements to disassembly within PassManager (#4677 ) * PassManager: Print errors occurring during disassembly Otherwise one could be greeted by the following text when running spirv-opt withe the `--print-all` flag: ; IR before pass wrap-opkill ; IR before pass eliminate-dead-branches ; IR before pass merge-return With this commit, one will instead get: error: line 143: Invalid opcode: 400 warning: line 0: Disassembly failed before pass wrap-opkill error: line 143: Invalid opcode: 400 warning: line 0: Disassembly failed before pass eliminate-dead-branches error: line 143: Invalid opcode: 400 warning: line 0: Disassembly failed before pass merge-return * PassManager: Use the right target environment when disassembling Disassembly would fail if features from a newer version of SPIR-V than 1.2 were used.	2022-01-10 10:55:47 -05:00
Steven Perron	b9e0e13d19	Remove misleading comment. (#4676 ) The comment in `Array::GetExtraHashWords` is misleading because getting the hash words is split up between the generic `Type::GetHashWords` and the type specific `Type::GetExtraHashWords`. While `IsSameImpl` is self-contained. Removing the comment since it is misleading and no comment is really needed. Fixes #3248	2022-01-10 09:24:44 -05:00
Steven Perron	b7251d4fb7	reflect debug (#4662 ) The pass to remove the nonsemantic information and instructions is used for drivers or tools that may not support them. Debug information was only partially handle, which is causing a problem. We need to either fully remove debug information or not remove it all. Since I can see it being useful to keep the debug information even when the nonsemantic instructions are removed, I propose we do not remove debug info. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4269	2021-12-15 11:06:51 -05:00
Steven Perron	354a46a2a2	Rename strip reflect to strip nonsemantic (#4661 ) In https://github.com/KhronosGroup/SPIRV-Tools/pull/3110, the strip reflect pass was changed to also remove all explicitly nonsemantic instructions. This makes it so that the name of the pass no longer reflects what the pass actually does. This change renames the pass so that it reflects what the pass actaully does.	2021-12-15 09:55:30 -05:00
Dave Airlie	3156158878	optimizer: restore previous ABI. (#4653 ) The change in commit `4ac8e5e541` Author: Greg Fischer <greg@lunarg.com> Date: Wed Sep 15 12:38:34 2021 -0600 Add preserve_interface mode to aggressive_dead_code_elim (#4520) Broke the C++ ABI for spirv-tools shared libraries on Linux, for not a great reason. Restore the previous ABI.	2021-12-09 15:58:53 -05:00
Sebastien Alaiwan	a2260d3b1f	Fix compilation (#4656 ) * Delete public accessor, whose only user is one unit-test The underlying container type becomes invisible from the outside. * Fix compilation	2021-12-09 12:42:53 -05:00
Andrei Malashkin	6926c3d9a3	treat google user type as normal semantic google. It's a backport from diligent fork (#4632 )	2021-12-09 09:42:40 -05:00
Sebastien Alaiwan	b9e255b366	DefUseManager: rename comparison operators to 'CompareAndPrintDifferences' (#4624 ) This make sense, as those are actually debug functions and shouldn't be used in production code.	2021-12-09 09:41:42 -05:00
Sebastien Alaiwan	f37551d2b6	Use a struct (instead of tuple), with explicit member names. (#4621 ) * Cleanup includes. * Simplify assertion. * Use a struct with named members for 'UserEntry'	2021-12-09 09:40:29 -05:00
Sebastien Alaiwan	64328e94db	Avoid an extra map lookup (#4623 ) Speedup: 5% less time on a real-world compilation batch (nearly 10k parallel compilations) (Durations: Before: 156.5s, After: 147.4s)	2021-12-08 14:53:36 -05:00
Marius Hillenbrand	1ed847f438	Fix endianness of string literals (#4622 ) * Fix endianness of string literals To get correct and consistent encoding and decoding of string literals on big-endian platforms, use spvtools::utils::MakeString and MakeVector (or wrapper functions) consistently for handling string literals. - add variant of MakeVector that encodes a string literal into an existing vector of words - add variants of MakeString - add a wrapper spvDecodeLiteralStringOperand in source/ - fix wrapper Operand::AsString to use MakeString (source/opt) - remove Operand::AsCString as broken and unused - add a variant of GetOperandAs for string literals (source/val) ... and apply those wrappers throughout the code. Fixes #149 * Extend round trip test for StringLiterals to flip word order In the encoding/decoding roundtrip tests for string literals, include a case that flips byte order in words after encoding and then checks for successful decoding. That is, on a little-endian host flip to big-endian byte order and then decode, and vice versa. * BinaryParseTest.InstructionWithStringOperand: also flip byte order Test binary parsing of string operands both with the host's and with the reversed byte order.	2021-12-08 12:01:26 -05:00
Shahbaz Youssefi	b162ede0de	Use schema instead of reserved in header description (#4615 ) For consistency with SPIR-V disassembly which outputs Schema for this field.	2021-12-08 11:55:36 -05:00
Alastair Donaldson	f9bcc82ec7	Exit when ID overflow occurs in a fuzzing build (#4652 ) Currently if an ID overflow occurs, spirv-opt (and other users of IRContext) emits a warning and starts returning 0 when fresh ids are requested. This tends to lead to crashes - such as null pointer exceptions. When these arise during fuzzing they lead to auto-reported bugs. This change uses an ifdef guard to instead gracefully exit as soon as an ID overflow occurs when the build is a fuzzing build. Related issue: #4539.	2021-12-04 07:18:21 +00:00
Diego Novillo	c75a1a46f3	Fix https://github.com/KhronosGroup/SPIRV-Tools/issues/4462 (#4651 ) This prevents CCP from making constant -> constant transitions when evaluating instruction values. In this case, FClamp is evaluated twice. On the first evaluation, if computes FClamp(0.5, 0.5, -1) which returns -1. On the second evaluation, it computes FClamp(0.5, 0.5, VARYING) which returns 0.5. Both fold() computations are correct given the semantics of FClamp() but this causes a lateral transition in the constant lattice which was not being considered VARYING by CCP.	2021-12-02 10:40:28 -05:00
Natalie Chouinard	d0a827a9f3	Copy OpDecorateStrings in DescriptorScalarReplacementPass (#4649 ) Along with OpDecorate, also clone the OpDecorateString instructions for variables created in the descriptor scalar replacement pass. Fixes microsoft/DirectXShaderCompiler#3705	2021-11-29 02:11:22 -05:00
Steven Perron	8c155b364c	Manually fold floating point division by zero (#4637 ) See https://github.com/KhronosGroup/SPIRV-Tools/issues/4636 for details. Fixes #4636.	2021-11-24 14:13:58 -05:00
alan-baker	4b092d2ab8	Allow ADCE to remove dead inputs (#4629 ) * https://github.com/KhronosGroup/Vulkan-Docs/issues/666 clearly specified that interfaces do not require an input if there is an associated output * ADCE can now remove unused input variables (though they are kept if the preserve interfaces option is used)	2021-11-16 15:54:17 -05:00
alan-baker	21e3f681e2	Update SPIRV-Headers (#4628 ) * Fix compile * Fix test	2021-11-10 16:32:09 -05:00
Greg Fischer	352a411278	Fix handling of OpPhi in convert-relaxed-to-half (#4618 ) Fixes #4452	2021-11-09 10:36:50 -07:00
Jaebaek Seo	1589720e10	spirv-opt: create OpDecorate for OpMemberDecorate in desc-sroa (#4617 ) The scalar replacement of a resource array/struct variable must create OpDecorate for elements if OpMemberDecorate instructions decorate the elements.	2021-11-05 11:05:36 -04:00
Steven Perron	1082de6bb3	Handle overflowing id in merge return (#4606 ) If the ids overflow when creating an integer constant in the ir_builder, there will be a nullptr dereference. This is happening from inside merge return. We need to propagate the error up, and make sure it is handled appropriately.	2021-11-01 08:45:32 -04:00
Steven Perron	6c7885dbde	Change branch handling in ADCE to fix errors (#4596 ) Consider the new test case. The conditional branch in the continue block is never marked as live. However, `IsDead` will say it is not dead, so it does not get deleted. Because it was never marked as live, `%false` was not mark as live either, but it gets deleted. This results in invalid code. To fix this properly, we had to reconsider how branches are handle. We make the following changes: 1) Terminator instructions that are not branch or OpUnreachable must be kept, so they are marked as live when initializing the worklist. 2) Branches and OpUnreachable instructions are marked as live if a) the block does not have a merge instruction and another instruction in the block is marked as live, or b) the merge instruction in the same block is marked as live. 3) Any instruction that is not marked as live is removed. 4) If a terminator is to be removed, an OpUnreachable is added. This happens when the entire block is dead, and the block will be removed. The OpUnreachable is generated to make sure the block still has a terminator, and is valid. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4509.	2021-10-29 10:46:43 -04:00
Steven Perron	7c5b17d379	Update passes to handle function declarations (#4599 ) Spirv-opt has not had to handle module with function declarations. This lead many passes to assume that every function has a body. This is not always true. This commit will modify a number of passes to handle function declarations. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4443	2021-10-28 11:54:37 -04:00
Steven Perron	b2ba019bf6	Delete decorations before replaces uses in dead branch elim (#4598 ) If we do not delete the decoration before all ReplaceAllUses, the decorations will be transferred to the new id. This can cause problems. Fixes #4442.	2021-10-28 10:25:37 -04:00
Steven Perron	3291b6951e	Do not fold snegate feeding sdiv. (#4600 ) When the variable value is INT_MIN, we cannot fold the negate into the divide, so we have to turn off that folding rule. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4487.	2021-10-28 10:02:57 -04:00
Jaebaek Seo	d997c83b10	Add spirv-opt pass to replace descriptor accesses based on variable indices (#4574 ) This commit adds a spirv-opt pass to replace accesses to descriptor array based on variable indices with constant elements. Before: ``` %descriptor = OpVariable %_ptr_array_Image Uniform ... %ac = OpAccessChain %_ptr_Image %descriptor %variable_index (some image instructions using %ac) ``` After: ``` %descriptor = OpVariable %_ptr_array_Image Uniform ... OpSwitch %variable_index 0 %case0 1 %case1 ... ... %case0 = OpLabel %ac = OpAccessChain %_ptr_Image %descriptor %uint_0 ... %case1 = OpLabel %ac = OpAccessChain %_ptr_Image %descriptor %uint_1 ... (use OpPhi for value with concrete type) ```	2021-10-26 17:20:58 -04:00
Steven Perron	d78c1c4cd3	Make IsLocalVar in ADCE work at any time. (NFC) (#4595 ) Having IsLocalVar work only sometimes is something that could easily lead to an error. This change refactors the code so that the function can be called at any point. The current implementation was used because we did not want to do multiple searches to see if a function was an entry point or if it had a call. This was maintained by added a cache that will store of a given function is an entry point with no calls.	2021-10-26 13:24:29 -04:00
David Neto	7326b494d0	opt: set upper bits of spec constant according to spec (#4589 ) When setting default value for spec constants, for numeric bit types smaller than 32 bits, follow the SPIR-V rules for narrow literals: - signed integers are sign-extended - otherwise, upper bits are zero. Followup to #4588	2021-10-21 09:44:54 -04:00
Dave Airlie	f3fbd98ff5	opt/spec_constants: fix bit pattern width checks. (#4588 ) * test: add a test to show 8/16-bit * opt/spec_constants: fix bit pattern width checks. The input bit patterns are always at least 32-bits, so let the test pass for 8/16-bit values as well. This shouldn't have any effect on the 64-bit patterns I assume this was introduced for.	2021-10-20 11:53:24 -04:00
Greg Fischer	001604bd4a	Generate constants directly in CreateDebugInlinedAt (#4572 ) Do this if Constant or DefUse managers are invalid. Using the ConstantManager attempts to regenerate the DefUseManager which is not valid during inlining.	2021-10-19 18:27:16 -06:00
JiaoluAMD	387cae472e	Opt passes should apply to the exported functions (#4554 ) This is follow-up to the commit `bd3a271ce3`	2021-10-18 13:18:16 -04:00
Greg Fischer	3e6a85303d	Mark DebugInfoNone as live in ADCE when DebugInfo present (#4568 ) Otherwise KillInst() tries to generate it when the module is inconsistent.	2021-10-14 13:29:54 -04:00
David Neto	e6e77dbdfa	Enable OpConstFunctionPointerINTEL outside function (#4576 ) According to spec this opcode is a constant instruction - that's it can appear outside of function bodies. Co-authored-by: DmitryBushev <dmitry.bushev@intel.com>	2021-10-14 12:21:11 -04:00
Greg Fischer	6dd73728e9	Fix merge-block assertions with debugInfo (#4563 ) Fixes DefUse assertions and invalid DebugScope instruction between OpLoopMerge and OpBranch for included test shader.	2021-10-13 11:42:40 -06:00
David Neto	b46995741b	Avoid bugprone-move-forwarding-reference warning in Clang (#4560 ) Use std::forward<T> instead of std::move, on an argument with rvalue-reference of template-deduced type. See https://clang.llvm.org/extra/clang-tidy/checks/bugprone-move-forwarding-reference.html Bug: crbug.com/1134310	2021-10-06 16:50:16 -04:00
Greg Fischer	63a3912326	Fix ConstantManager to not run AnalyzeInstDefUse if DefUse not valid (#4557 ) This fixes inlining which has to create constant for DebugInlinedAt for NonSemantic.Shader.DebugInfo. Also adds regression tests.	2021-10-05 14:55:06 -04:00
Steven Perron	eeb973f502	More ADCE refactoring (#4548 ) Split the code that processes the work list into multiple functions. Move the code to remove the dead instructions in a function to its own function.	2021-10-04 08:33:10 -04:00
Lukas Hermanns	24476c2e32	spirv-opt: Don't eliminate dead members from StructuredBuffer (#4553 ) * Don't eliminate dead members from StructuredBuffer as layout(offset) qualifiers cannot be applied to structure fields. * Traverse arrays when marking structs as fully used. Co-authored-by: Steven Perron <stevenperron@google.com>	2021-10-01 08:31:40 -04:00
Steven Perron	c3adcb034f	Adce refactor (NFC) (#4547 ) * Have ADCE use cfg struct analysis (NFC) ADCE has a lot of code and variables to keep track of information that is easily obtains using the Struct cfg analysis. Most of this change is to refactor the code to have small functions to get the information from the struct cfg analysis. A few other changes small refactoring changes are done. * Factor out work list initialization in ADCE (NFC) We move the code that will initially populate the work list into its own function. We also simplify the code by making use of the struct cfg analysis. That way we can reduce the number of tables used to track information as we traverse the CFG.	2021-09-24 13:21:45 -04:00
Greg Fischer	19dc86c48c	Handle NonSemantic.Shader Debug[No]Line (#4530 ) Debug[No]Line are tracked and optimized using the same mechanism that tracks and optimizes Op[No]Line. Also: - Fix missing DebugScope at top of block. - Allow scalar replacement of access chain in DebugDeclare	2021-09-24 10:56:08 -04:00
Greg Fischer	f125452cf8	Fix inst_buff_addr_check to handle struct loads (#4489 )	2021-09-23 12:59:38 -04:00
Steven Perron	59f51bb4f8	Fix extract with out-of-bounds index (#4529 ) * Fix extract with out-of-bounds index When folding a OpCompositeExtract that is fed by an OpCompositeConstruct, we handle and out of bounds index, but only in the case where the result of the OpCompostiteConstruct is a struct. This change refactors that folding rule and then improves it to handle an out-of-bounds access when the result of the OpCompositeConstruct is a vector.	2021-09-20 13:02:47 -04:00
Greg Fischer	1454c95d1b	spirv-opt: Switch from Vulkan.DebugInfo to Shader.DebugInfo (#4493 ) Includes: - Shift to use of spirv-header extinst.nonsemantic.shader grammar.json - Remove extinst.nonsemantic.vulkan.debuginfo.100.grammar.json - Enable all optimizations for Shader.DebugInfo Also fixes scalar replacement to only insert DebugValue after all OpVariables. This is not necessary for OpenCL.DebugInfo, but it is for Shader.DebugInfo. Likewise, fixes Private-to-Local to insert DebugDeclare after all OpVariables. Also fixes inlining to handle FunctionDefinition which can show up after first block if early return processing happens. Co-authored-by: baldurk <baldurk@baldurk.org>	2021-09-15 14:38:53 -04:00
Greg Fischer	4ac8e5e541	Add preserve_interface mode to aggressive_dead_code_elim (#4520 ) This mode is needed by GPU-assisted validation instrumentation which cannot change the shader entry point interface.	2021-09-15 14:38:34 -04:00
Alastair Donaldson	36ff135341	spirv-opt: Avoid integer overflow during constant folding (#4511 ) In SPIR-V, integers use 2s complement representation, so that signed integer overflow and underflow is well defined. However, the constant folder was causing overflow / underflow at the C++ level. This change avoids such overflows by performing constant folding for IAdd, ISub and IMul in the context of unsigned values, which works because signedness is irrelevant according to the SPIR-V semantics for these instructions. Fixes #4510.	2021-09-14 21:09:05 +00:00
Steven Perron	8865b20295	Handle out-of-bounds accesses in VDCE (#4518 ) It is possible that other optimization will propagate a value into an OpCompositeExtract or OpVectorShuffle instruction that is larger than the vector size. Vector DCE has to be able to handle it. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4513.	2021-09-13 09:57:44 -04:00
Jaebaek Seo	0c09258e07	Set threshold for reduce-load-size pass (#4499 ) Allow uses to set the threshold for spirv-opt reduce-load-size pass	2021-09-02 10:45:51 -04:00
Steven Perron	bd3a271ce3	Handle exported functions in ADCE (#4495 ) ADCE does not handle exported functions. This was an explicit decision because we did not believe that the linkage attribute could be used in shaders, but it can now. This change has been made. While fixing this error, I noticed that the OpName for labels is sometimes removed because the label instructions are not marked explicitly marked as live. This has able been fixed.	2021-08-31 12:39:46 -04:00
Jaebaek Seo	57e1d8ebe3	Add spirv-opt convert-to-sampled-image pass (#4340 ) convert-to-sampled-image pass converts images and/or samplers with given pairs of descriptor set and binding to sampled image. If a pair of an image and a sampler have the same pair of descriptor set and binding that is one of the given pairs, they will be converted to a sampled image. In addition, if only an image has the descriptor set and binding that is one of the given pairs, it will be converted to a sampled image as well. For example, when we have %a = OpLoad %type_2d_image %texture %b = OpLoad %type_sampler %sampler %combined = OpSampledImage %type_sampled_image %a %b %value = OpImageSampleExplicitLod %v4float %combined ... 1. If %texture and %sampler have the same descriptor set and binding %combine_texture_and_sampler = OpVaraible %ptr_type_sampled_image_Uniform ... %combined = OpLoad %type_sampled_image %combine_texture_and_sampler %value = OpImageSampleExplicitLod %v4float %combined ... 2. If %texture and %sampler have different pairs of descriptor set and binding %a = OpLoad %type_sampled_image %texture %extracted_image = OpImage %type_2d_image %a %b = OpLoad %type_sampler %sampler %combined = OpSampledImage %type_sampled_image %extracted_image %b %value = OpImageSampleExplicitLod %v4float %combined ...	2021-08-18 08:30:48 -04:00
Nicolas Capens	2c829c4155	Fix early-out for Clamp constant folding (#4461 ) Only the first two operands were tested for constness, missing the third one. Since the FoldFPBinaryOp() at the end of FoldClamp1() returns null when not both of its operands are constant, this doesn't change any behavior, but it avoids some needless work. Also the comment for FoldClamp2() was fixed.	2021-08-16 14:11:38 -04:00

1 2 3 4 5 ...

1007 Commits