SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2025-01-05 14:31:05 +00:00

Author	SHA1	Message	Date
Alastair Donaldson	3b13040cf9	New spirv-reduce reduction pass: operand to dominating id. (#2099 ) * Added a reduction pass to replace ids with ids of the same type that dominate them. * Introduce helper method for querying whether an operand type is an input id.	2018-11-26 17:06:21 -05:00
Daniel Koch	3b210d6a63	Add basic support for EXT_fragment_invocation_density (#2100 ) Whitelisting the extension in optimizations * copying what was done for NV_shading_rate	2018-11-23 10:21:19 -05:00
dan sinclair	15fdcf94d7	Add missing override to ProcessLinesPass	2018-11-19 19:24:48 -05:00
greg-lunarg	c37388f1ad	Add passes to propagate and eliminate redundant line instructions (#2027 ). (#2039 ) These are bookend passes designed to help preserve line information across passes which delete, move and clone instructions. The propagation pass attaches a debug line instruction to every instruction based on SPIR-V line propagation rules. It should be performed before optimization. The redundant line elimination pass eliminates all line instructions which match the previous line instruction. This pass should be performed at the end of optimization to reduce physical SPIR-V file size. Fixes #2027.	2018-11-15 14:06:17 -05:00
Greg Fischer	d4a10590b7	Fix Instruction::IsFloatingPointFoldingAllowed() Was looking for decorations based on opcode. Should use result_id.	2018-11-14 15:25:51 -07:00
Steven Perron	dc9d155d62	Fix folding of volatile store. (#2048 ) When looking for the Volatile mask on a store, the instruction folder accesses an out-of-bounds element. We fix that up. Fixes crbug.com/903530.	2018-11-14 13:52:18 -05:00
Steven Perron	a6150a3fe7	Don't assert on void function parameters. (#2047 ) The type manager in spirv-opt currently asserts if a function parameter has type void. It is not exactly clear from the spec that this is disallowed, even if it probably will be disallowed. In either case, asserts should be used to verify assumptions that will actually make a difference to the code. As far as the optimizer is concerned, a void parameter does not matter. I don't see the point of the assert. I'll just remove it and let the validator decide whether to accept it or not. No test was added because it is not clear that it is legal, and should not force us to accept it in the future unless the spec make it clear that it is legal. Fixes crbug.com/903088.	2018-11-14 12:43:43 -05:00
Steven Perron	ec5574a9c6	Instruction::GetBaseAddress to handle OpPtrAccessChain (#2050 ) That function currently only handled OpPtrAccessChain if it was in the middle of the chain, but not at the start. Fixing that up. Fixes crbug.com/905271.	2018-11-14 12:42:25 -05:00
dan sinclair	f343a15764	Add missing overrides (#2041 )	2018-11-12 15:11:32 -05:00
greg-lunarg	1e9fc1aac1	Add base and core bindless validation instrumentation classes (#2014 ) * Add base and core bindless validation instrumentation classes * Fix formatting. * Few more formatting fixes * Fix build failure * More build fixes * Need to call non-const functions in order. Specifically, these are functions which call TakeNextId(). These need to be called in a specific order to guarantee that tests which do exact compares will work across all platforms. c++ pretty much does not guarantee order of evaluation of operands, so any such functions need to be called separately in individual statements to guarantee order. * More ordering. * And more ordering. * And more formatting. * Attempt to fix NDK build * Another attempt to address NDK build problem. * One more attempt at NDK build failure * Add instrument.hpp to BUILD.gn * Some name improvement in instrument.hpp * Change all types in instrument.hpp to int. * Improve documentation in instrument.hpp * Format fixes * Comment clean up in instrument.hpp * imageInst -> image_inst * Fix GetLabel() issue.	2018-11-08 13:54:54 -05:00
greg-lunarg	6721478ef1	Don't assume one return means function can be inlined. (#2018 ) (#2025 ) If there is only 1 return and it is in a loop, then the function cannot be inlined. Fix condition when inlined code needs one-trip loop wrapper. The dummy loop is needed when there is a return inside a selection construct. Even if there is only 1 return.	2018-11-08 09:11:20 -05:00
Jeff Bolz	c06a35b902	Rename PCH macro to spvtools_pch to avoid conflicts with other projects. Also add pch to test/opt. (#2034 )	2018-11-07 09:15:04 -05:00
Jeff Bolz	60fac96c6b	Enable precompiled headers for spirv-tools(-shared) and some unit tests (#2026 )	2018-11-06 09:26:23 -05:00
Steven Perron	f2cc71e5cb	Handle OpMemberDecorateStringGOOGLE in ACDE (#2029 ) Add missing case to the switch statement for the annotation instructions. See https://github.com/KhronosGroup/glslang/issues/1561.	2018-11-02 13:42:45 -04:00
Jeff Bolz	fb996dce75	Add /Zm flag as a workaround for VS2013 build (#2023 )	2018-10-31 07:59:43 -04:00
Steven Perron	6647884a13	Remove MemberDecorateStringGOOGLE during stript-refect. (#2021 ) The strip-reflect pass is not removing the reflection decorations that are decorating members. With this commit, they will now be removed. Fixes #2019.	2018-10-30 16:17:35 -04:00
alelenv	1c1e749f0b	Add support for nv-raytracing-final (#2010 ) Add support for nv-raytracing (non-experimental)	2018-10-25 14:07:46 -04:00
Steven Perron	18fe6d59e5	Fix dead branch elim infinite loop. (#2009 ) When looking for a break from a selection construct, we do not realize that a jump to the continue target of a loop containing the selection is a break. This causes and infinit loop, or possibly other failures. Fixes #2004.	2018-10-24 09:10:30 -04:00
Steven Perron	0ba35798c3	Fix dead branch elim infinite loop. (#1997 ) When looking for a break from a selection construct, we do not need to look inside nested constructs. However, if a loop header has an unconditional branch, then we enter the loop. Entering the loop causes an infinite loop because we keep going through the loop. The solution is to look for a merge block, if one exsits, even for block terminated by an OpBranch. Fixes #1979.	2018-10-22 13:59:20 -04:00
alan-baker	6e85d1a6fc	Fix restrictions in if conversion (#1998 ) Fixes #1991 * Improved identification of potential conditional branches * Pass changed to only work for shaders * added a test to catch the bug	2018-10-19 15:16:46 -04:00
Jeff Bolz	dd1e837e1c	Use per-configuration location for pch file (#1989 )	2018-10-19 14:58:26 -04:00
Steven Perron	715afb0cea	Add a nullptr check to array copy propagation. (#1987 ) We are missing a check for a nullptr that is causing things to fail. Added an extra test case, and fixed up others. This is the fix for https://github.com/Microsoft/DirectXShaderCompiler/issues/1598.	2018-10-19 12:53:40 -04:00
greg-lunarg	c4687889b7	Fix ADCE to treat OpUnreachable correctly during liveness analysis (#1984 ) ADCE liveness algorithm should treat OpUnreachable at least like other branch instructions. It was being treated as always live which was preventing useless structured constructs from being eliminated. OpUnreachable is generated by dead branch elimination which is now being required by merge return, so this fix should accompany that change.	2018-10-19 10:16:35 -04:00
Steven Perron	0e68bb3632	Only run merge-returnon reachable functions. (#1983 ) We currently run merge-return on all functions, but dead-branch-elimination only runs on function reachable from an entry point or exported function. Since dead-branch-elimination is needed for merge-return, they have to match. Fixes #1976.	2018-10-18 08:48:27 -04:00
greg-lunarg	ab45d69154	Fix ADCE liveness to include all enclosing control structures. (#1975 ) Was removing control structures which didn't have data dependency with enclosed live loop and otherwise did not contain live code. An example is a counting loop around a live loop. Fixes #1967.	2018-10-16 08:00:07 -04:00
Jeff Bolz	339d23275d	Enable precompiled headers for MSVC (#1969 )	2018-10-15 11:12:02 -04:00
greg-lunarg	e545564887	Consider atomics that load when analyzing live stores in ADCE (#1956 ) (#1958 ) Consider atomics that load when analyzing live stores in ADCE. Previously it asserted that the base of an OpImageTexelPointer should be an image. It is actually a pointer to an image, so IsValidBasePointer should suffice.	2018-10-12 08:46:35 -04:00
Steven Perron	82663f34c9	Check for unreachable blocks in merge-return. (#1966 ) Merge return assumes that the only unreachable blocks are those needed to keep the structured cfg valid. Even those must be essentially empty blocks. If this is not the case, we get unpredictable behaviour. This commit add a check in merge return, and emits an error if it is not the case. Added a pass of dead branch elimination before merge return in both the performance and size passes. It is a precondition of merge return. Fixes #1962.	2018-10-10 15:18:15 -04:00
Steven Perron	4e266f775a	Fold divisions by 0. (#1963 ) The current implementation in the folder when seeing a division by zero is to assert. In the release build, the compiler will attempt to compute the value, which causes its own problems. The solution I will go with is to fold the division, and just give it the value of 0. The same goes for remainder and mod operations. Fixes #1961.	2018-10-10 11:17:26 -04:00
Steven Perron	497958d899	Removing HLSLCounterBuffer decorations when not needed. (#1954 ) The HlslCounterBufferGOOGLE that was introduced changed the OpDecorateId so that is can now reference an id other than the target. If that other id is used only in the decoration, then the definition of the id will be removed because decoration do not count as real uses. However, if the target of the decoration is still live the decoration will not be removed. This leaves a reference to an id that is not defined. There are two solutions to consider. The first is that is the decoration is kept, then the definition of the id should be kept live. Implementing this change would be involved because the way ADCE handles decorations will have to be reimplemented. The other solution is to remove the decoration the id is otherwise dead. This works for this specific case. Also this is the more desirable behaviour in this case. The id will always be the id of a variable that belongs to a descriptor set. If that variable is not bound and we do not remove it, the driver will complain. I chose to implement the second solution. The first will be left to when a case for it comes up. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1885.	2018-10-05 08:23:09 -04:00
Alan Baker	3b5960174f	Don't scalarize spec constant sized arrays Fixes #1952 * Prevent scalarization of arrays that are sized by a specialization constant	2018-10-04 11:58:23 -04:00
Steven Perron	146eb3bdcf	Fix erroneous uses of the type manager in copy-prop-arrays. (#1942 ) There are a few spots where copy propagate arrays is trying to go from a Type to an id, but the type is not unique. When generating code this pass needs specific ids, otherwise we get type mismatches. However, the ambigous types means we can sometimes get the wrong type and generate invalid code. That code has been rewritten to not rely on the type manager, and just look at the instructions instead. I have opened https://github.com/KhronosGroup/SPIRV-Tools/issues/1939 to try to get a way to make this more robust.	2018-10-01 14:45:44 -04:00
Jeff Bolz	fe90a1d2dc	Enable /MP4 (parallel build across 4 cores for MSVC) for SPIRV-Tools/source[/opt] (#1930 )	2018-10-01 10:47:39 -04:00
Steven Perron	ddc705933d	Analyze uses for all instructions. (#1937 ) * Analyze uses for all instructions. The def-use manager needs to fill in the `inst_to_used_ids_` field for every instruction. This means we have to analyze the uses for every instruction, even if they do not have any uses. This mistake was not found earlier because there was a typo in the equality check for def-use managers. No new tests are needed. While looking into this I found redundant work in block merge. Cleaning that up at the same time. * Fix other transformations Aggressive dead code elimination did not update the OpGroupDecorate and the OpGroupMemberDecorate instructions properly when they are updated. That is fixed. Dead branch elimination did not analyze the OpUnreachable instructions that is would add. That is taken care of.	2018-09-28 14:39:06 -04:00
Steven Perron	32381e30ef	Handle decoration groups with no decorations. (#1921 ) In DecorationManager::RemoveDecorationsFrom, we do not remove the id from a decoration group if the group has no decorations. This causes problems because KillNamesAndDecorates is suppose to remove all references to the id, but in this case, there is still a reference. This is fixed by adding a special case. Also, there is the possibility of a double free because RemoveDecorationsFrom will delete the instructions defining \|id\| when \|id\| is a decoration group. Later, KillInst would later write to memory that has been deleted when trying to turn it into a Nop. To fix this, we will only remove the decorations that use \|id\| and not its definition in RemoveDecorationsFrom.	2018-09-28 14:16:04 -04:00
Steven Perron	80564a56ec	Keep analyses live in unrolling (#1929 ) Add code to keep the def-use manger and the inst-to-block mapping up-to-date. This means we do not have to rebuild them later. To make this work, we will have to have to find places to update the def-use manager. Updating the def-use manager is not straight forward because we are unrolling loops, and we have circular references. This forces one pass to register all of the definitions. A second one to analyze the uses. Also because there will be references to the new instructions in the old code, we want to register the definitions of the new instructions early, so we can update the uses of the older code as we go along. The inst-to-block mapping is not too difficult. It can be done as instructions are created. Fixes #1928.	2018-09-26 17:36:27 -04:00
Steven Perron	0e5fc7d75e	Allow 0 as argument to scalar replacement. (#1917 ) A limit of 0 for the scalar replacement options it used to indicate that there is no limit. The current implementation does not allow 0. This should be fixed.	2018-09-26 09:58:28 -04:00
Steven Perron	b85fb4a300	Get KillNameAndDecorates to handle group decorations. (#1919 ) It seems like the current implementation of KillNameAndDecorates does not handle group decorations correctly. The id being removed is not removed from the OpGroupDecorate instructions. Even worst, any decorations that apply to that group are removed. The solution is to use the function in the decoration manager that will remove the decorations and update the instructions instead of doing the work itself.	2018-09-25 12:57:44 -04:00
Chao Chen	6e2dab2ffd	Add support for Nvidia Turing extensions	2018-09-19 20:46:14 -04:00
Steven Perron	9fbcce4ca1	Add unrolling to the legalization passes (#1903 ) Adds unrolling to the legalization passes. After enabling unrolling I found a bug when there is a self-referencing phi node. That has been fixed. The test that checks for that the order of optimizations is correct also needed to be updated.	2018-09-19 16:40:09 -04:00
Steven Perron	7075c49923	Add dummy loop in merge-return. (#1896 ) The current implementation of merge return can create bad, but correct, code. When it is not in a loop construct, it will insert a lot of extra branch around code. The potentially large number of branches are bad. At the same time, it can separate code store to variables from its uses hiding the fact that the store dominates the load. This hurts the later analysis because the compiler thinks that multiple values can reach a load, when there is really only 1. This poorer analysis leads to missed optimizations. The solution is to create a dummy loop around the entire body of the function, then we can break from that loop with a single branch. Also only new merge nodes would be those at the end of loops meaning that most analysies will not be hurt. Remove dead code for cases that are no longer possible. It seems like some drivers expect there the be an OpSelectionMerge before conditional branches, even if they are not strictly needed. So we add them.	2018-09-18 08:52:47 -04:00
Steven Perron	5f599e700e	Fix infinite loop in dead-branch-elimination (#1891 ) * Create structed cfg analysis. There are lots of optimization that have to traverse the CFG in a structured order just because it wants to know which constructs a basic block in contained in. This adds extra complexity to these optimizations, for causes too much refactoring of older optimizations. To help with this problem, I have written an analysis that can give this information. * Identify branches breaking from loops. Dead branch elimination does a search for a conditional branch to the end of the current selection construct. This search assumes that the only way to leave the construct is through the merge node. But that is not true. The code can jump to the merge node of a loop that contains the construct. The search needs to take this into consideration.	2018-09-17 13:00:24 -04:00
Diego Novillo	4a4632264e	Add IR dumping functions to use during debugging. When using lldb and/or gdb I frequently get odd std::string failures when using the IR printing instructions we have now. This adds the methods Instruction::Dump(), BasicBlock::Dump() and Function::Dump() to emit the output of the pretty print to stderr. With this I can now reliably print IR from gdb and lldb sessions.	2018-09-14 14:28:34 -04:00
Steven Perron	6d5f1bc2e8	Allow merge blocks to merge two header blocks in some cases. (#1890 ) In merge blocks, we do not allow the merging of two blocks with merge instructions. This is because if the two block are merged only 1 of those instructions can exists. However, if the successor block is the merge block of the predecessor, then we can delete the merge instruction in the predecessor. In this case, we are able to merge the blocks.	2018-09-14 13:37:18 -04:00
Steven Perron	75c1bf2843	Add option for the max id bound. (#1870 ) * Create a new entry point for the optimizer Creates a new struct to hold the options for the optimizer, and creates an entry point that take the optimizer options as a parameter. The old entry point that takes validator options are now deprecated. The validator options will be one of the optimizer options. Part of the optimizer options will also be the upper bound on the id bound. * Add a command line option to set the max value for the id bound. The default is 0x3FFFFF. * Modify `TakeNextIdBound` to return 0 when the limit is reached.	2018-09-10 11:49:41 -04:00
Steven Perron	416b1ab4f3	Have the constant manager take ownership of constants. (#1866 ) * Have the constant manager take ownership of constants. Right now the owner of an object of type contant that is in the \|const_pool_\| of the constant manager is unclear. The constant manager does not delete them, there is no other reasonable owner. This causes memory leaks. This change fixes the memory leaks by having the constant manager take ownership of the constant that is stores in \|const_pool_\|. Other changes include interface changes to make it explicit that the constant manager takes ownership of the object when a constant is registered with the constant manager. Fixes #1865.	2018-08-27 09:53:47 -04:00
Steven Perron	47ee776a2c	Revert "Have the constant manager take ownership of constants." This reverts commit `b938b74bac`.	2018-08-24 15:12:49 -04:00
Steven Perron	b938b74bac	Have the constant manager take ownership of constants. Right now the owner of an object of type contant that is in the \|const_pool_\| of the constant manager is unclear. The constant manager does not delete them, there is no other reasonable owner. This causes memory leaks. This change fixes the memory leaks by having the constant manager take ownership of the constant that is stores in \|const_pool_\|. Other changes include interface changes to make it explicit that the constant manager takes ownership of the object when a constant is registered with the constant manager.	2018-08-24 15:08:12 -04:00
Steven Perron	d746681fe9	Copy decorations when creating new ids. (#1843 ) * Copy decorations when creating new ids. When creating a new value based on an old value, we need to copy the decorations to the new id. This change does this in 3 places: 1) The variable holding the return value of the function generated by merge return should get decorations from the function. 2) The results of the OpPhi instructions should get decorations from the variable they are replacing in the ssa writer. 3) In local access chain convert the intermediate struct (result of OpCompositeInsert) generated for the store replacement should get its decorations from the variable being stored to. Fixes #1787.	2018-08-24 11:55:39 -04:00
Steven Perron	b4d3618f77	Don't "break" from selection constructs. (#1862 ) If seems like at least 1 driver does not like a condition jump to the end of a selection construct. We are generating these in the merge return pass. This change stops merge return from generating this sequence. Part of #1861.	2018-08-23 14:38:25 -04:00

1 2 3 4 5 ...

506 Commits