SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-10-20 12:00:05 +00:00

Author	SHA1	Message	Date
Greg Fischer	c8e1588cfa	Add passes to eliminate dead output stores (#4970 ) This adds two passes to accomplish this: one pass to analyze a shader to determine the input slots that are live. The second pass is run on the preceding shader to eliminate any stores to output slots that are not consumed by the following shader. These passes support vert, tesc, tese, geom, and frag shaders. These passes are currently only available through the API. These passes together with dead code elimination, and elimination of dead input and output components and variables (WIP), will allow users to do dead code elimination across shader boundaries.	2022-11-02 11:23:25 -06:00
Jaebaek Seo	ad3514b732	spirv-opt: add pass for interface variable scalar replacement (#4779 ) Replace shader's stage variables whose types are array or matrix with scalars/vectors. For example, ``` Before: %foo = OpVariable %_ptr_Output__arr_v2float_uint_4 Output After: %foo = OpVariable %_ptr_Output_v2float Output %foo_0 = OpVariable %_ptr_Output_v2float Output %foo_1 = OpVariable %_ptr_Output_v2float Output %foo_2 = OpVariable %_ptr_Output_v2float Output ```	2022-05-09 14:04:52 -04:00
JiaoluAMD	2c7fb9707b	Handle dontinline function in spread-volatile-semantics (#4776 ) Handle function calls in spread-volatile-semantics	2022-05-04 10:52:58 -04:00
Steven Perron	0b8426346d	Don't rebuilt valid analyses. (#4733 ) The function `BuildInvalideAnalyses` will be rebuilt for every analysis that has been requested, but it is not necessary. It also can cause problems because if the CFG needs to be rebuilt, so do the dominator trees. This change will make the functionality match the description of the function.	2022-03-04 20:16:42 +00:00
luzpaz	65ecfd1093	Fix various source comment (doxygen) typos (#4680 ) Found via `codespell -q 3 -L fo,lod,parm	2022-01-26 15:13:08 -05:00
Marius Hillenbrand	1ed847f438	Fix endianness of string literals (#4622 ) * Fix endianness of string literals To get correct and consistent encoding and decoding of string literals on big-endian platforms, use spvtools::utils::MakeString and MakeVector (or wrapper functions) consistently for handling string literals. - add variant of MakeVector that encodes a string literal into an existing vector of words - add variants of MakeString - add a wrapper spvDecodeLiteralStringOperand in source/ - fix wrapper Operand::AsString to use MakeString (source/opt) - remove Operand::AsCString as broken and unused - add a variant of GetOperandAs for string literals (source/val) ... and apply those wrappers throughout the code. Fixes #149 * Extend round trip test for StringLiterals to flip word order In the encoding/decoding roundtrip tests for string literals, include a case that flips byte order in words after encoding and then checks for successful decoding. That is, on a little-endian host flip to big-endian byte order and then decode, and vice versa. * BinaryParseTest.InstructionWithStringOperand: also flip byte order Test binary parsing of string operands both with the host's and with the reversed byte order.	2021-12-08 12:01:26 -05:00
Alastair Donaldson	f9bcc82ec7	Exit when ID overflow occurs in a fuzzing build (#4652 ) Currently if an ID overflow occurs, spirv-opt (and other users of IRContext) emits a warning and starts returning 0 when fresh ids are requested. This tends to lead to crashes - such as null pointer exceptions. When these arise during fuzzing they lead to auto-reported bugs. This change uses an ifdef guard to instead gracefully exit as soon as an ID overflow occurs when the build is a fuzzing build. Related issue: #4539.	2021-12-04 07:18:21 +00:00
Greg Fischer	19dc86c48c	Handle NonSemantic.Shader Debug[No]Line (#4530 ) Debug[No]Line are tracked and optimized using the same mechanism that tracks and optimizes Op[No]Line. Also: - Fix missing DebugScope at top of block. - Allow scalar replacement of access chain in DebugDeclare	2021-09-24 10:56:08 -04:00
Greg Fischer	d9f8925785	spirv-opt: Where possible make code agnostic of opencl/vulkan debuginfo (#4385 ) Co-authored-by: baldurk <baldurk@baldurk.org>	2021-07-21 12:04:38 -04:00
Alastair Donaldson	4fcdc58946	Add IsReachable function to IRContext (#4323 ) There was a lot of code in the codebase that would get the dominator analysis for a function and then use it to check whether a block is reachable. In the fuzzer, a utility method had been introduced to make this more concise, but it was not being used consistently. This change moves the utility method to IRContext, so that it can be used throughout the codebase, and refactors all existing checks for block reachability to use the utility method.	2021-06-28 20:00:14 +01:00
Greg Fischer	18d45142e7	Fix crash when optimizing shaders with DebugPrintf (#4280 ) Fixes #4219	2021-05-13 13:19:56 -04:00
Jaebaek Seo	8a0ebd40f8	Correctly replace debug lexical scope of instruction (#3718 ) When we update OpenCL.DebugInfo.100 lexical scopes e.g., DebugFunction, we have to replace DebugScope of each instruction that uses the lexical scope correctly.	2020-08-31 10:05:38 -04:00
alan-baker	b4c4da3e76	Improve non-semantic instruction handling in the optimizer (#3693 ) * No longer blindly add global non-semantic info instructions to global types and values * functions now have a list of non-semantic instructions that succeed them in the global scope * global non-semantic instructions go in global types and values if they appear before any function, otherwise they are attached to the immediate function predecessor in the module * changed ADCE to use the function removal utility * Modified EliminateFunction to have special handling for non-semantic instructions in the global scope * non-semantic instructions are moved to an earlier function (or full global set) if the function they are attached to is eliminated * Added IRContext::KillNonSemanticInfo to remove the tree of non-semantic instructions that use an instruction * this is used in function elimination * There is still significant work in the optimizer to handle non-semantic instructions fully in the optimizer	2020-08-13 14:54:14 -04:00
Jaebaek Seo	b78f4b1518	Remove DebugDeclare only for target variables in ssa-rewrite (#3511 ) For each local variable, ssa-rewrite should remove its DebugDeclare if and only if it is replaced by any number of DebugValues for store and phi instructions. For example, when we have two variables `a` whose DebugDeclare will be replaced to DebugValues by ssa-rewrite pass and `b` whose DebugDeclare will not be replaced, we have to remove only DebugDeclare for `a`, not `b`.	2020-07-31 10:00:30 -04:00
Jaebaek Seo	d4b9f576eb	[spirv-opt] debug info preservation in ssa-rewrite (#3356 ) Add OpenCL.DebugInfo.100 `DebugValue` instructions for store and phi instructions of local variables to provide the debugger with the updated values of local variables correctly.	2020-06-19 14:57:43 -04:00
Ehsan	2a1b8c0622	Updated desc_sroa to support flattening structures (#3448 ) Not all structures should be flattened. Code patterns used by DXC are used to create checks for which structures should be flattened.	2020-06-19 14:35:18 -04:00
Jaebaek Seo	42268740c9	Add debug information analysis (#3305 ) We need an analysis for OpenCL.DebugInfo.100 extension instructions such as a map between function id and its DebugFunction. This commit add an analysis for it.	2020-04-27 15:18:55 -04:00
Jaebaek Seo	000040e707	Preserve debug info in eliminate-dead-functions (#3251 ) * Preserve debug info in eliminate-dead-functions The elimination of dead functions makes OpFunction operand of DebugFunction invalid. This commit replaces the operand with DebugInfoNone.	2020-04-13 09:29:36 -04:00
Jaebaek Seo	dd37d73c5e	Handle conflict between debug info and existing validation rule (#3104 ) * Allow OpExtInst for DebugInfo between secion 9 and 10 Fixes #3086 * Handle spirv-opt errors on DebugInfo Ext * Add IR Loader test * Fix ir loader bug * Handle DebugFunction/DebugTypeMember forward reference * Add test cases (forward reference to function) * Support old DebugInfo extension * Validate local debug info out of function	2020-01-23 17:04:30 -05:00
Steven Perron	9eb1c9a4c4	Add continue construct analysis to struct cfg analysis (#2922 ) * Add continue construct analysis to struct cfg analysis Add the ability to identify which blocks are in the continue construct for a loop, and to get functions that are called from those blocks, directly or indirectly. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/2912.	2019-10-01 10:27:09 -04:00
Ryan Harrison	4075b921f9	Add removing references to debug instructions when removing them (#2923 ) Fixes #2921	2019-09-27 13:23:06 -05:00
Steven Perron	a41520eaa4	Replace uses of SPV_AMD_shader_trinary_minmax extension (#2835 ) Part of #2814	2019-09-05 09:29:04 -04:00
Steven Perron	35d98be3bc	Amd ext to khr (#2811 ) Add the first steps to removing the AMD extension VK_AMD_shader_ballot. Splitting up to make the PRs smaller. Adding utilities to add capabilities and change the version of the module. Replaces the instructions: OpGroupIAddNonUniformAMD = 5000 OpGroupFAddNonUniformAMD = 5001 OpGroupFMinNonUniformAMD = 5002 OpGroupUMinNonUniformAMD = 5003 OpGroupSMinNonUniformAMD = 5004 OpGroupFMaxNonUniformAMD = 5005 OpGroupUMaxNonUniformAMD = 5006 OpGroupSMaxNonUniformAMD = 5007 and extentend instructions WriteInvocationAMD = 3 MbcntAMD = 4 Part of #2814	2019-08-29 12:48:17 -04:00
Steven Perron	73422a0a5e	Check feature mgr in context consistency check (#2818 ) We add a check that the feature manager is correcter after each pass. This resulted in a couple failing tests cases. Those are fixed. Part of #2814	2019-08-28 11:49:16 -04:00
Steven Perron	4b64beb1ae	Add descriptor array scalar replacement (#2742 ) Creates a pass that will replace a descriptor array with individual variables. See #2740 for details. Fixes #2740.	2019-08-08 10:53:19 -04:00
alan-baker	7fd2365b06	Don't move debug or decorations when folding (#2772 ) Fixes #2764 * Don't replace all uses when simplifying instructions, instead only update non-debug, non-decoration uses * added a test * Add a new version of RAUW that takes a predicate to decide whether to replace the use or not * used in simplification pass	2019-07-29 16:20:43 -04:00
Thomas Roughton	cd153db8ed	Add —preserve-bindings and —preserve-spec-constants (#2693 ) Add optimizer options to for preservation of spec constants and variable with binding decorations. They are to be preserved even if they are unused.	2019-07-10 14:12:19 -04:00
greg-lunarg	3d62cb8148	Instrument: Add version 2 of record formats (#2630 ) New version has additional word in stage-specific section. Also some changes in content for tesselation and compute shaders. Either version can be invoked at pass creation. This is done to ease integration and updating of validation layers. Version 1 is deprecated and eventually will go away. Also sneaking in fix to version 1 compute shaders.	2019-05-29 15:08:21 -04:00
Steven Perron	84503583c6	Handle id overflow in sroa better. (#2582 ) There is a case where sroa is not handling id overflow gracefully. It is handled and an error message is output when the ids overflow. Fixes https://crbug.com/961030.	2019-05-15 09:29:28 -04:00
Steven Perron	c2013e248b	Make the constant and type manager analyses. (#2250 ) Currently it is impossible to invalidate the constnat and type manager. However, the compact ids pass changes the ids for the types and constants, which makes them invalid. This change will make them analyses that have to been explicitly marked as preserved by passes. This will allow compact ids to invalidate them. Fixes #2220.	2018-12-20 18:00:05 +00:00
Steven Perron	2e4563d94f	Document in the context what happens with id overflow. (#2159 ) Added documentation to the ir context to indicates that TakeNextId() returns 0 when the max id is reached. TODOs were added to each call sight so that we know where we have to start to handle this case. Handle id overflow in \|SplitLoopHeader\|. Handle id overflow in \|GetOrCreatePreHeaderBlock\|. Handle failure to create preheader in LICM. Part of https://github.com/KhronosGroup/SPIRV-Tools/issues/1841.	2018-12-06 09:07:00 -05:00
Steven Perron	2d2a512691	Don't inline recursive functions. (#2130 ) * Move ProcessFunction* function from pass to the context. There are a few functions that are used to traverse the call tree. They currently live in the Pass class, but they have nothing to do with a pass, and may be needed outside of a pass. They would be better in the ir context, or in a specific call tree class if we ever have a need for it. * Don't inline recursive functions. Inlining does not check if a function is recursive or not. This has been fine as long as the shader was a Vulkan shader, which forbid recursive functions. However, not all shaders are vulkan, so either we limit inlining to Vulkan shaders or we teach it to look for recursive functions. I prefer to keep the passes as general as is reasonable. The change does not require much new code in inlining and gives a reason to refactor some other code. The changes are to add a member function to the Function class that checks if that function is recursive or not. Then this is used in inlining to not inlining a function call if it calls a recursive function. * Add id to function analysis There are a few places that build a map from ids to Function whose result is that id. I decided to add an analysis to the context for this to reduce that code, and simplify some of the functions. * Add missing file.	2018-11-29 14:24:58 -05:00
greg-lunarg	1e9fc1aac1	Add base and core bindless validation instrumentation classes (#2014 ) * Add base and core bindless validation instrumentation classes * Fix formatting. * Few more formatting fixes * Fix build failure * More build fixes * Need to call non-const functions in order. Specifically, these are functions which call TakeNextId(). These need to be called in a specific order to guarantee that tests which do exact compares will work across all platforms. c++ pretty much does not guarantee order of evaluation of operands, so any such functions need to be called separately in individual statements to guarantee order. * More ordering. * And more ordering. * And more formatting. * Attempt to fix NDK build * Another attempt to address NDK build problem. * One more attempt at NDK build failure * Add instrument.hpp to BUILD.gn * Some name improvement in instrument.hpp * Change all types in instrument.hpp to int. * Improve documentation in instrument.hpp * Format fixes * Comment clean up in instrument.hpp * imageInst -> image_inst * Fix GetLabel() issue.	2018-11-08 13:54:54 -05:00
greg-lunarg	6721478ef1	Don't assume one return means function can be inlined. (#2018 ) (#2025 ) If there is only 1 return and it is in a loop, then the function cannot be inlined. Fix condition when inlined code needs one-trip loop wrapper. The dummy loop is needed when there is a return inside a selection construct. Even if there is only 1 return.	2018-11-08 09:11:20 -05:00
Steven Perron	82663f34c9	Check for unreachable blocks in merge-return. (#1966 ) Merge return assumes that the only unreachable blocks are those needed to keep the structured cfg valid. Even those must be essentially empty blocks. If this is not the case, we get unpredictable behaviour. This commit add a check in merge return, and emits an error if it is not the case. Added a pass of dead branch elimination before merge return in both the performance and size passes. It is a precondition of merge return. Fixes #1962.	2018-10-10 15:18:15 -04:00
Steven Perron	75c1bf2843	Add option for the max id bound. (#1870 ) * Create a new entry point for the optimizer Creates a new struct to hold the options for the optimizer, and creates an entry point that take the optimizer options as a parameter. The old entry point that takes validator options are now deprecated. The validator options will be one of the optimizer options. Part of the optimizer options will also be the upper bound on the id bound. * Add a command line option to set the max value for the id bound. The default is 0x3FFFFF. * Modify `TakeNextIdBound` to return 0 when the limit is reached.	2018-09-10 11:49:41 -04:00
dan sinclair	1963a2dbda	Use MakeUnique. (#1837 ) This CL replaces instances of reset(new ..) with MakeUnique.	2018-08-14 15:01:50 -04:00
dan sinclair	9991d661f8	Fix readbility/braces warnings (#1804 )	2018-08-07 09:09:47 -04:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
dan sinclair	58a6876cee	Rewrite include guards (#1793 ) This CL rewrites the include guards to make PRESUBMIT.py include guard check happy.	2018-08-03 08:05:33 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	4cc6cd184a	Pass the IRContext into the folding rules. (#1709 ) This CL updates the folding rules to receive the IRContext as a paramter instead of retrieving off of the Instruction. Issue #1703	2018-07-12 09:12:23 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
dan sinclair	3dad1cda11	Change libspirv to spvtools namespace (#1678 ) This CL changes all of the libspirv namespace code to spvtools to match the rest of the code base.	2018-07-07 09:38:00 -04:00
Steven Perron	a45d4cac61	Move folding routines into a class The folding routines are currently global functions. They also rely on data in an std::map that holds the folding rules for each opcode. This causes that map to not have a clear owner, and therefore never gets deleted. There has been a request to delete this map. To implement this, we will create a InstructionFolder class that owns the maps. The IRContext will own the InstructionFolder instance. Then the global functions will become public memeber functions of the InstructionFolder. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1659.	2018-07-05 17:52:43 -04:00
Steven Perron	9ecbcf5fc8	Make sure the constant folder get the correct type. There are a few locations where we need to handle duplicate types. We cannot merge them because they may be needed for reflection. When this happens we need do some extra lookups in the type manager. The specific fixes are: 1) When generating a constant through `GetDefiningInstruction` accept and use an id for the desired type of the constant. This will make sure you get the type that is needed. 2) In Private-to-local, make sure we to update the def-use chains when a new pointer type is created. 3) In the type manager, make sure that `FindPointerToType` returns a pointer that points to the given type and not a duplicate type. 4) In scalar replacment, make sure the null constants that are created are the correct type.	2018-07-05 14:34:30 -04:00
Steven Perron	7d01643132	Allow hoisting code in if-conversion. When doing if-conversion, we do not currently move code out of the side nodes. The reason for this is that it can increase the number of instructions that get executed because both side nods will have to be executed now. In this commit, we add code to move an instruction, and all of the instructions it depends on, out of a side node and into the header of the selection construct. However to keep the cost down, we only do it when the two values in the OpPhi node compute the same value. This way we have to move only one of the instructions and the other becomes unused most of the time. So no real extra cost. Makes the value number table an alalysis in the ir context. Added more opcodes to list of code motion safe opcodes. Fixes #1526.	2018-05-04 12:56:29 -04:00
Victor Lomuller	efc5061929	Dominator analysis interface clean. Remove the CFG requirement when querying a dominator/post-dominator from an IRContext. Updated all uses of the function and tests.	2018-04-20 15:41:59 -04:00
Victor Lomuller	0ec08c28c1	Add register liveness analysis. For each function, the analysis determine which SSA registers are live at the beginning of each basic block and which one are killed at the end of the basic block. It also includes utilities to simulate the register pressure for loop fusion and fission. The implementation is based on the paper "A non-iterative data-flow algorithm for computing liveness sets in strict ssa programs" from Boissinot et al.	2018-04-20 09:45:15 -04:00
Stephen McGroarty	ad7e4b8401	Initial patch for scalar evolution analysis This patch adds support for the analysis of scalars in loops. It works by traversing the defuse chain to build a DAG of scalar operations and then simplifies the DAG by folding constants and grouping like terms. It represents induction variables as recurrent expressions with respect to a given loop and can simplify DAGs containing recurrent expression by rewritting the entire DAG to be a recurrent expression with respect to the same loop.	2018-03-28 16:34:23 -04:00

1 2

76 Commits