SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-10-22 13:00:06 +00:00

Author	SHA1	Message	Date
pd-valve	44923beb52	Optimize Instruction::Instruction (#4705 ) Avoid constructing temporary vector + copying operands multiple times. Add SmallVector(InputIt first, InputIt last), matching std::vector.	2022-02-10 18:31:07 +00:00
Lukas Hermanns	24476c2e32	spirv-opt: Don't eliminate dead members from StructuredBuffer (#4553 ) * Don't eliminate dead members from StructuredBuffer as layout(offset) qualifiers cannot be applied to structure fields. * Traverse arrays when marking structs as fully used. Co-authored-by: Steven Perron <stevenperron@google.com>	2021-10-01 08:31:40 -04:00
Greg Fischer	19dc86c48c	Handle NonSemantic.Shader Debug[No]Line (#4530 ) Debug[No]Line are tracked and optimized using the same mechanism that tracks and optimizes Op[No]Line. Also: - Fix missing DebugScope at top of block. - Allow scalar replacement of access chain in DebugDeclare	2021-09-24 10:56:08 -04:00
Greg Fischer	1454c95d1b	spirv-opt: Switch from Vulkan.DebugInfo to Shader.DebugInfo (#4493 ) Includes: - Shift to use of spirv-header extinst.nonsemantic.shader grammar.json - Remove extinst.nonsemantic.vulkan.debuginfo.100.grammar.json - Enable all optimizations for Shader.DebugInfo Also fixes scalar replacement to only insert DebugValue after all OpVariables. This is not necessary for OpenCL.DebugInfo, but it is for Shader.DebugInfo. Likewise, fixes Private-to-Local to insert DebugDeclare after all OpVariables. Also fixes inlining to handle FunctionDefinition which can show up after first block if early return processing happens. Co-authored-by: baldurk <baldurk@baldurk.org>	2021-09-15 14:38:53 -04:00
Vasyl Teliman	63238d4f2a	Initialize context in `opt::Instruction`'s move constructor (#4397 ) Fixes #4396.	2021-07-23 10:09:51 +01:00
Greg Fischer	d9f8925785	spirv-opt: Where possible make code agnostic of opencl/vulkan debuginfo (#4385 ) Co-authored-by: baldurk <baldurk@baldurk.org>	2021-07-21 12:04:38 -04:00
Greg Fischer	8966cc2b27	Add common enum for debug info instructions from either opencl or vulkan (#4377 ) Co-authored-by: baldurk <baldurk@baldurk.org>	2021-07-16 16:28:14 -04:00
Jaebaek Seo	df4198e50e	Add DebugValue for DebugDecl invisible to value assignment (#3973 ) For some cases, we have DebugDecl invisible to a value assignment, but the value assignment information is important i.e., debugger cannot inspect the variable without the information. For example, a parameter of an inlined function must have its value assignment i.e., argument passing out of its function scope. If we simply remove DebugDecl because it is invisible to the argument passing, we cannot inspec the variable. This PR - Adds DebugValue for DebugDecl invisible to a value assignment. We use the value of the variable in the basic block that contains DebugDecl, which is found by ssa-rewrite. If the value instruction does not dominate DebugDecl, we use the value of the variable in the immediate dominator of the basic block. - Checks the visibility of DebugDecl for Phi value assignment based on the all value operands of the Phi. Since Phi just references multiple values from multiple basic blocks, scopes of value operands must be regarded as the scope of the Phi.	2020-10-27 15:10:08 -04:00
Jaebaek Seo	8a0ebd40f8	Correctly replace debug lexical scope of instruction (#3718 ) When we update OpenCL.DebugInfo.100 lexical scopes e.g., DebugFunction, we have to replace DebugScope of each instruction that uses the lexical scope correctly.	2020-08-31 10:05:38 -04:00
greg-lunarg	2205254cfb	Fix DebugNoScope to not output InlinedAt operand. (#3748 )	2020-08-25 23:27:10 -04:00
alan-baker	b4c4da3e76	Improve non-semantic instruction handling in the optimizer (#3693 ) * No longer blindly add global non-semantic info instructions to global types and values * functions now have a list of non-semantic instructions that succeed them in the global scope * global non-semantic instructions go in global types and values if they appear before any function, otherwise they are attached to the immediate function predecessor in the module * changed ADCE to use the function removal utility * Modified EliminateFunction to have special handling for non-semantic instructions in the global scope * non-semantic instructions are moved to an earlier function (or full global set) if the function they are attached to is eliminated * Added IRContext::KillNonSemanticInfo to remove the tree of non-semantic instructions that use an instruction * this is used in function elimination * There is still significant work in the optimizer to handle non-semantic instructions fully in the optimizer	2020-08-13 14:54:14 -04:00
André Perez	3f33a9aa55	spirv-opt: Improve the code of the Instruction class (#3610 )	2020-08-05 15:28:05 -04:00
Diego Novillo	4dbe18b0c8	Reject folding comparisons with unfoldable types. (#3370 ) Reject folding comparisons with unfoldable types. Fixes #3343 When CCP is evaluating an instruction, it was trying to fold a comparison with 64 bit integers. This was causing a fold failure later since the folder still cannot deal with 64 bit integers.	2020-05-21 12:58:08 -04:00
André Perez	a6b0e132ec	Add adjust branch weights transformation (#3336 ) In this PR, the classes that represent the adjust branch weights transformation and fuzzer pass were implemented. This transformation adjusts the branch weights of a OpBranchConditional instruction.	2020-05-14 11:38:34 +01:00
Alastair Donaldson	49842b88ee	Generalize IsReadOnlyVariable() to apply to pointers (#3325 ) Generalizes the IsReadOnlyVariable() method, and related methods, so that they can be used to ask whether pointer result ids are read-only. Fixes #3324.	2020-04-30 22:47:20 +01:00
Steven Perron	7d65bce0bb	Sampled images as read-only storage (#3295 ) There are some cases where a variable that is declared as a sampled image could be read only. That is when the image type has sampled == 1. Fixes #3288	2020-04-14 12:58:05 -04:00
Jaebaek Seo	000040e707	Preserve debug info in eliminate-dead-functions (#3251 ) * Preserve debug info in eliminate-dead-functions The elimination of dead functions makes OpFunction operand of DebugFunction invalid. This commit replaces the operand with DebugInfoNone.	2020-04-13 09:29:36 -04:00
alan-baker	022da4d0e0	Fix identification of Vulkan images and buffers (#3253 ) Fixes #3252 * Image and buffer queries did not account for optional level of arrayness on the variable * new tests	2020-03-25 17:38:24 -04:00
Jaebaek Seo	1c8bda3721	Add data structure for DebugScope, DebugDeclare in spirv-opt (#3183 ) When DebugScope is given in SPIR-V, each instruction following the DebugScope is from the lexical scope pointed by the DebugScope in the high level language. We add DebugScope struction to keep the scope information in Instruction class. When ir_loader loads DebugScope/DebugNoScope, it keeps the scope information in \|last_dbg_scope_\| and lets following instructions have that scope information. In terms of DebugDeclare/DebugValue, if it is in a function body but outside of a basic block, we keep it in \|debug_insts_in_header_\| of Function class. If it is in a basic block, we keep it as a normal instruction i.e., in a instruction list of BasicBlock.	2020-03-23 11:01:18 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not tests what they were suppose to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
David Neto	76b75c40a1	Document opt::Instruction::InsertBefore methods (#2751 )	2019-07-18 11:37:28 -04:00
alan-baker	87c4ef8a9c	Do not fold floating point if float controls used (#2569 ) Fixes #2558 * Mark floating point instructions as non-foldable if any SPV_KHR_float_controls capabilities are present * tests	2019-05-10 11:03:22 -04:00
Steven Perron	12e4a7b649	Handle variable pointer in some optimizations (#2490 ) * Check var pointer capability in ADCE. * Check var ptr capability for common uniform. * Check var ptr capability in access chain convert. Since we want this pass to run even if there are variable pointer on storage buffers, we had to remove asserts that assumed there were no variable pointers. The functions with the asserts will now work, it becomes the responsibility of the callers to deal with the output as appropriate. * Single block elimination and variable pointers. It seems like the code in local single block elimination is able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. * Single store elimination and variable pointers. It seems like the code in local single stroe elimination is able to handle cases with variable pointers already. This is because the function `FindSingleStoreAndCheckUses` ensures that variables that feed a variable pointer are not candidates. * SSA rewriter and variable pointers. It seems like the code in the two passes that call the SSA rewriter are able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. Fixes #2458.	2019-04-03 12:47:51 -04:00
Greg Fischer	d4a10590b7	Fix Instruction::IsFloatingPointFoldingAllowed() Was looking for decorations based on opcode. Should use result_id.	2018-11-14 15:25:51 -07:00
Steven Perron	ec5574a9c6	Instruction::GetBaseAddress to handle OpPtrAccessChain (#2050 ) That function currently only handled OpPtrAccessChain if it was in the middle of the chain, but not at the start. Fixing that up. Fixes crbug.com/905271.	2018-11-14 12:42:25 -05:00
greg-lunarg	e545564887	Consider atomics that load when analyzing live stores in ADCE (#1956 ) (#1958 ) Consider atomics that load when analyzing live stores in ADCE. Previously it asserted that the base of an OpImageTexelPointer should be an image. It is actually a pointer to an image, so IsValidBasePointer should suffice.	2018-10-12 08:46:35 -04:00
Diego Novillo	4a4632264e	Add IR dumping functions to use during debugging. When using lldb and/or gdb I frequently get odd std::string failures when using the IR printing instructions we have now. This adds the methods Instruction::Dump(), BasicBlock::Dump() and Function::Dump() to emit the output of the pretty print to stderr. With this I can now reliably print IR from gdb and lldb sessions.	2018-09-14 14:28:34 -04:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
dan sinclair	effafedcee	Replace opt::Instruction type and result cache with flags. (#1718 ) Currentlty opt::Instruction class holds a cache of the result_id and type_id for the instruction. That cache needs to be updated if the underlying operand values are changes. This CL changes the cache to being a flag if there is a type or result id for the instruction. We then retrieve the value if needed from the operands.	2018-07-20 11:09:30 -04:00
Alan Baker	3c19651733	Add variable pointer support to IsValidBasePointer Fixes #1729 * Adds supported opcodes to IsValidBasePointer() enable by VariablePointers and VariablePointersStorageBuffer capabilities * Added tests	2018-07-19 14:43:59 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
Steven Perron	a45d4cac61	Move folding routines into a class The folding routines are currently global functions. They also rely on data in an std::map that holds the folding rules for each opcode. This causes that map to not have a clear owner, and therefore never gets deleted. There has been a request to delete this map. To implement this, we will create a InstructionFolder class that owns the maps. The IRContext will own the InstructionFolder instance. Then the global functions will become public memeber functions of the InstructionFolder. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1659.	2018-07-05 17:52:43 -04:00
Steven Perron	1f7b1f1bf7	Small vector optimization for operands. We replace the std::vector in the Operand class by a new class that does a small size optimization. This helps improve compile time on Windows. Tested on three sets of shaders. Trying various values for the small vector. The optimal value for the operand class was 2. However, for the Instruction class, using an std::vector was optimal. Size of "0" means that an std::vector was used. Instruction size 0 4 8 Operand Size 0 489 544 684 1 593 487 2 469 570 4 473 8 505 This is a single thread run of ~120 shaders. For the multithreaded run the results were the similar. The basline time was ~62sec. The optimal configuration was an 2 for the OperandData and an std::vector for the OperandList with a compile time of ~38sec. Similar expiriments were done with other sets of shaders. The compile time still improved, but not as much. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1609.	2018-06-12 13:41:08 -04:00
Steven Perron	0856997df6	Allow ADCE to remove more instructions. At this time, DCE will only remove an instruction if it is a combinator. However, there are certain non-combinator instructions that can be safely removed if their results are not used. The derivative instructions are on example. We are also missing some instructions from the list of combinators those are added as the same time.	2018-05-05 09:15:28 -04:00
Steven Perron	7d01643132	Allow hoisting code in if-conversion. When doing if-conversion, we do not currently move code out of the side nodes. The reason for this is that it can increase the number of instructions that get executed because both side nods will have to be executed now. In this commit, we add code to move an instruction, and all of the instructions it depends on, out of a side node and into the header of the selection construct. However to keep the cost down, we only do it when the two values in the OpPhi node compute the same value. This way we have to move only one of the instructions and the other becomes unused most of the time. So no real extra cost. Makes the value number table an alalysis in the ir context. Added more opcodes to list of code motion safe opcodes. Fixes #1526.	2018-05-04 12:56:29 -04:00
Stephen McGroarty	9a5dd6fe88	Support loop fission. Adds support for spliting loops whose register pressure exceeds a user provided level. This pass will split a loop into two or more loops given that the loop is a top level loop and that spliting the loop is legal. Control flow is left intact for dead code elimination to remove. This pass is enabled with the --loop-fission flag to spirv-opt.	2018-05-01 15:15:10 -04:00
Steven Perron	9ba0879ddf	Improve Vector DCE Track live scalars in VDCE as if they were single element vectors. Handle the extended instructions for GLSL in VDCE. Handle composite construct instructions in VDCE.	2018-04-30 11:55:50 -04:00
Steven Perron	a00a0a09ae	Revert "Improvements to vector dce." This reverts commit `2813722993`. A regression was found. Undoing the change until it is fixed.	2018-04-27 10:33:19 -04:00
Steven Perron	2813722993	Improvements to vector dce. Track live scalars in VDCE as if they were single element vectors. Handle the extended instructions for GLSL in VDCE. Handle composite construct instructions in VDCE. Fixes #1511.	2018-04-26 11:07:48 -04:00
Steven Perron	7c5d49bf2a	Teach ADCE about OpImageTexelPointer Currently OpImageTexelPointer operations are treat like a use of the pointer, but it does not look for the memory being referenced to make sure stores are not removed. This change teaches it so identify the memory being accessed, and treats it as if that memory is loaded. Fixes to #1445.	2018-04-04 13:45:29 -04:00
Arseny Kapoulkine	309be423cc	Add folding for redundant add/sub/mul/div/mix operations This change implements instruction folding for arithmetic operations that are redundant, specifically: x + 0 = 0 + x = x x - 0 = x 0 - x = -x x * 0 = 0 * x = 0 x * 1 = 1 * x = x 0 / x = 0 x / 1 = x mix(a, b, 0) = a mix(a, b, 1) = b Cache ExtInst import id in feature manager This allows us to avoid string lookups during optimization; for now we just cache GLSL std450 import id but I can imagine caching more sets as they become utilized by the optimizer. Add tests for add/sub/mul/div/mix folding The tests cover scalar float/double cases, and some vector cases. Since most of the code for floating point folding is shared, the tests for vector folding are not as exhaustive as scalar. To test sub->negate folding I had to implement a custom fixture.	2018-02-20 18:29:27 -05:00
Steven Perron	3756b387f3	Get CCP to use the constant floating point rules. Fixes #1311	2018-02-16 13:49:47 -05:00
Alexander Johnston	84ccd0b9ae	Loop invariant code motion initial implementation	2018-02-08 22:55:47 -05:00
Alan Baker	672494da13	Adding ostream operators for IR structures * Added for Instruction, BasicBlock, Function and Module * Uses new disassembly functionality that can disassemble individual instructions * For debug use only (no caching is done) * Each output converts module to binary, parses and outputs an individual instruction * Added a test for whole module output * Disabling Microsoft checked iterator warnings * Updated check_copyright.py to accept 2018	2018-01-12 11:19:58 -05:00
Steven Perron	1ebd860daa	Add generic folding function and use in CCP The current folding routines have a very cumbersome interface, make them harder to use, and not a obvious how to extend. This change is to create a new interface for the folding routines, and show how it can be used by calling it from CCP. This does not make a significant change to the behaviour of CCP. In general it should produce the same code as before; however it is possible that an instruction that takes 32-bit integers as inputs and the result is not a 32-bit integer or bool will not be folded as before.	2018-01-10 13:17:25 -05:00
Steven Perron	ccb921dd2b	Allow getting the base pointer of an image load/store. In value numbering, we treat loads and stores of images, ie OpImageLoad, as a memory operation where it is interested in the "base address" of the instruction. In those cases, it is an image instruction. The problem is that `Instruction::GetBaseAddress()` does not account for the image instructions, so the assert at the end to make sure it found a valid base address for its addressing mode fails. The solution is to look at the load/store instruction to determine how the assertion should be done. Fixes #1160.	2018-01-05 13:26:10 -05:00
Diego Novillo	4ba9dcc8a0	Implement SSA CCP (SSA Conditional Constant Propagation). This implements the conditional constant propagation pass proposed in Constant propagation with conditional branches, Wegman and Zadeck, ACM TOPLAS 13(2):181-210. The main logic resides in CCPPass::VisitInstruction. Instruction that may produce a constant value are evaluated with the constant folder. If they produce a new constant, the instruction is considered interesting. Otherwise, it's considered varying (for unfoldable instructions) or just not interesting (when not enough operands have a constant value). The other main piece of logic is in CCPPass::VisitBranch. This evaluates the selector of the branch. When it's found to be a known value, it computes the destination basic block and sets it. This tells the propagator which branches to follow. The patch required extensions to the constant manager as well. Instead of hashing the Constant pointers, this patch changes the constant pool to hash the contents of the Constant. This allows the lookups to be done using the actual values of the Constant, preventing duplicate definitions.	2017-12-21 14:29:45 -05:00
Steven Perron	756b277fb8	Store all enabled capabilities in the feature manger. In order to keep track of all of the implicit capabilities as well as the explicit ones, we will add them all to the feature manager. That is the object that needs to be queried when checking if a capability is enabled. The name of the "HasCapability" function in the module was changed to make it more obvious that it does not check for implied capabilities. Keep an spv_context and AssemblyGrammar in IRContext	2017-12-21 11:14:53 -05:00
Steven Perron	79a00649b4	Allow pointers to pointers in logical addressing mode. A few optimizations are updates to handle code that is suppose to be using the logical addressing mode, but still has variables that contain pointers as long as the pointer are to opaque objects. This is called "relaxed logical addressing". \|Instruction::GetBaseAddress\| will check that pointers that are use meet the relaxed logical addressing rules. Optimization that now handle relaxed logical addressing instead of logical addressing are: - aggressive dead-code elimination - local access chain convert - local store elimination passes.	2017-12-19 14:29:14 -05:00

1 2

64 Commits