SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-11-23 04:00:05 +00:00

Author	SHA1	Message	Date
Steven Perron	ca004da9f9	Add knowledge of cooperative matrices (#5720 ) * Add knowledge of cooperative matrices Some optimizations are not aware of cooperative matrices, and either do nothing or assert. This commits fixes that up. * Add int tests, and a handle a couple more cases. * Add float tests, and a handle a couple more cases. * Add NV coop matrix as well.	2024-06-26 08:00:29 -04:00
Nathan Gauër	ad11927e6c	opt: add SPV_EXT_mesh_shader to opt allowlist (#5551 ) Add this extension to the allowlist, allowing DCE and other optimizations on modules exposing this. Note: NV equivalent is already allowed.	2024-01-30 12:13:46 -05:00
Steven Perron	5bb595091b	Add ComputeDerivativeGroupNV capabilities to trim capabilities pass. (#5430 ) Add ComputeDerivativeGroupNV capabilities to trim capabilities pass. Add SPV_NV_compute_shader_derivatives to allow lists No tests needed for this. The code path is well tested. Just adding new data.	2023-10-16 19:03:33 +00:00
Steven Perron	d660bb55be	Add SPV_KHR_physical_storage_buffer to allowlists (#5402 ) Fixes #4896	2023-09-06 16:35:57 +00:00
Cassandra Beckley	d474a07088	Add SPV_EXT_fragment_shader_interlock to allow lists (#5393 )	2023-09-05 12:10:16 -04:00
Steven Perron	e68fe9be4e	Add SPV_EXT_shader_atomic_float_add to allow lists (#5348 ) Fixes #5346	2023-07-27 16:04:50 -07:00
Pankaj Mistry	82b1a87b21	Add SPV_NV_bindless_texture to spirv optimizations (#5231 )	2023-05-24 11:01:11 -04:00
Steven Perron	a0fcd06f8f	Add Vulkan memory model to allow lists (#5173 ) Fixes #5086	2023-03-28 16:57:45 -04:00
Spencer Fricke	fa69b09cff	spirv-opt: Remove unused includes and code (#5177 )	2023-03-28 12:40:30 -04:00
Nathan Gauër	1a7f71afb4	clean: constexpr-ify and unify anon namespace use (#4991 ) Constexpr guaranteed no runtime init in addition to const semantics. Moving all opt/ to constexpr. Moving all compile-unit statics to anonymous namespaces to uniformize the method used (anonymous namespace vs static has the same behavior here AFAIK). Signed-off-by: Nathan Gauër <brioche@google.com>	2022-11-17 19:02:50 +01:00
alan-baker	d35a78db57	Switch SPIRV-Tools to use spirv.hpp11 internally (#4981 ) Fixes #4960 * Switches to using enum classes with an underlying type to avoid undefined behaviour	2022-11-04 17:27:10 -04:00
Greg Fischer	11d0d16227	Cleanup code for `272e4b3d0` (#4934 ) Removed now unused DebugDeclare visibility logic for generating DebugValue. Also eliminated the phi sort introduced in `272e4b3`. This should have been removed in the first commit.	2022-09-20 15:27:23 -06:00
Greg Fischer	272e4b3d07	Fix missing and incorrect DebugValues (#4929 ) Specificially, fixes DebugValues coming out of eliminate-local-single-store and eliminate-local-multi-store AKA SSA rewrite.	2022-09-13 14:41:07 +00:00
stu-s	c267127846	Add SPV_KHR_fragment_shader_barycentric support (#4805 ) * Add SPV_KHR_fragment_shader_barycentric support	2022-05-25 09:20:39 -04:00
Nikita	a3fbc9331b	Support SPV_KHR_uniform_group_instructions (#4734 )	2022-03-25 08:32:50 -04:00
Marius Hillenbrand	1ed847f438	Fix endianness of string literals (#4622 ) * Fix endianness of string literals To get correct and consistent encoding and decoding of string literals on big-endian platforms, use spvtools::utils::MakeString and MakeVector (or wrapper functions) consistently for handling string literals. - add variant of MakeVector that encodes a string literal into an existing vector of words - add variants of MakeString - add a wrapper spvDecodeLiteralStringOperand in source/ - fix wrapper Operand::AsString to use MakeString (source/opt) - remove Operand::AsCString as broken and unused - add a variant of GetOperandAs for string literals (source/val) ... and apply those wrappers throughout the code. Fixes #149 * Extend round trip test for StringLiterals to flip word order In the encoding/decoding roundtrip tests for string literals, include a case that flips byte order in words after encoding and then checks for successful decoding. That is, on a little-endian host flip to big-endian byte order and then decode, and vice versa. * BinaryParseTest.InstructionWithStringOperand: also flip byte order Test binary parsing of string operands both with the host's and with the reversed byte order.	2021-12-08 12:01:26 -05:00
JiaoluAMD	387cae472e	Opt passes should apply to the exported functions (#4554 ) This is follow-up to the commit `bd3a271ce3`	2021-10-18 13:18:16 -04:00
Greg Fischer	1454c95d1b	spirv-opt: Switch from Vulkan.DebugInfo to Shader.DebugInfo (#4493 ) Includes: - Shift to use of spirv-header extinst.nonsemantic.shader grammar.json - Remove extinst.nonsemantic.vulkan.debuginfo.100.grammar.json - Enable all optimizations for Shader.DebugInfo Also fixes scalar replacement to only insert DebugValue after all OpVariables. This is not necessary for OpenCL.DebugInfo, but it is for Shader.DebugInfo. Likewise, fixes Private-to-Local to insert DebugDeclare after all OpVariables. Also fixes inlining to handle FunctionDefinition which can show up after first block if early return processing happens. Co-authored-by: baldurk <baldurk@baldurk.org>	2021-09-15 14:38:53 -04:00
Greg Fischer	d9f8925785	spirv-opt: Where possible make code agnostic of opencl/vulkan debuginfo (#4385 ) Co-authored-by: baldurk <baldurk@baldurk.org>	2021-07-21 12:04:38 -04:00
Jaebaek Seo	4baf3affe3	spirv-opt: support SPV_EXT_shader_image_int64 (#4379 )	2021-07-14 08:43:35 -04:00
Kévin Petit	e065c482c6	Initial support for SPV_KHR_integer_dot_product (#4327 ) * Initial support for SPV_KHR_integer_dot_product - Adds new operand types for packed-vector-format - Moves ray tracing enums to the end - PackedVectorFormat is a new optional operand type, so it requires special handling in grammar table generation. - Add SPV_KHR_integer_dot_product to optimizer whitelists. - Pass-through validation: valid cases pass validation Validation errors are not checked. - Update SPIRV-Headers Patch by David Neto <dneto@google.com> Rebase and minor tweaks by Kevin Petit <kevin.petit@arm.com> Signed-off-by: David Neto <dneto@google.com> Signed-off-by: Kevin Petit <kevin.petit@arm.com> Change-Id: Icb41741cb7f0f1063e5541ce25e5ba6c02266d2c * format fixes Change-Id: I35c82ec27bded3d1b62373fa6daec3ffd91105a3	2021-06-23 13:32:24 -04:00
alan-baker	4d22f58a81	Support SPV_KHR_subgroup_uniform_control_flow (#4318 ) * Support SPV_KHR_subgroup_uniform_control_flow Covers: - assembler - disassembler - validator - optimizer (add to whitelists) * fix copyright Co-authored-by: David Neto <dneto@google.com>	2021-06-15 10:07:42 -04:00
Jaebaek Seo	e8bd26e1f8	Set correct scope and line info for DebugValue (#4125 ) The existing spirv-opt `DebugInfoManager::AddDebugValueForDecl()` sets the scope and line info of the new added DebugValue using the scope and line of DebugDeclare. This is wrong because only a single DebugDeclare must exist under a scope while we have to add DebugValue for all the places where the variable's value is updated. Therefore, we have to set the scope and line of DebugValue based on the places of the variable updates. This bug makes https://github.com/google/amber/blob/main/tests/cases/debugger_hlsl_shadowed_vars.amber fail. This commit fixes the bug.	2021-01-28 12:57:35 -05:00
Jaebaek Seo	f686518cee	spirv-opt: properly preserve DebugValue indexes operand (#4022 ) spirv-opt has a bug that `DebugInfoManager::AddDebugValueWithIndex()` does not preserve `Indexes` operands of [DebugValue](https://www.khronos.org/registry/spir-v/specs/unified1/OpenCL.DebugInfo.100.html#DebugValue). It has to preserve all of those `Indexes` operands, but it preserves only the first index operand. This PR removes `DebugInfoManager::AddDebugValueWithIndex()` and lets the spirv-opt use `DebugInfoManager::AddDebugValueForDecl()`. `DebugInfoManager::AddDebugValueForDecl()` preserves the Indexes operand correctly.	2020-11-13 12:06:38 -05:00
Jaebaek Seo	c2b2b57885	Add DebugValue for invisible store in single_store_elim (#4002 ) The front-end language compiler would simply emit DebugDeclare for a variable when it is declared, which is effective through the variable's scope. Since DebugDeclare only maps an OpVariable to a local variable, the information can be removed when an optimization pass uses the loaded value of the variable. DebugValue can be used to specify the value of a variable. For each value update or phi instruction of a variable, we can add DebugValue to help debugger inspect the variable at any point of the program execution. For example, float a = 3; ... (complicated cfg) ... foo(a); // <-- variable inspection: debugger can find DebugValue of `float a` in the nearest dominant For the code with complicated CFG e.g., for-loop, if-statement, we need help of ssa-rewrite to analyze the effective value of each variable in each basic block. If the value update of the variable happens only once and it dominates all its uses, local-single-store-elim pass conducts the same value update with ssa-rewrite and we have to let it add DebugValue for the value assignment. One main issue is that we have to add DebugValue only when the value update of a variable is visible to DebugDeclare. For example, ``` { // scope1 %stack = OpVariable %ptr_int %int_3 { // scope2 DebugDeclare %foo %stack <-- local variable "foo" in high-level language source code is declared as OpVariable "%stack" // add DebugValue "foo = 3" ... Store %stack %int_7 <-- foo = 7, add DebugValue "foo = 7" ... // debugger can inspect the value of "foo" } Store %stack %int_11 <-- out of "scope2" i.e., scope of "foo". DO NOT add DebugValue "foo = 11" } ``` However, the initalization of a variable is an exception. For example, an argument passing of an inlined function must be done out of the function's scope, but we must add a DebugValue for it. ``` // in HLSL bar(float arg) { ... } ... float foo = 3; bar(foo); // in SPIR-V %arg = OpVariable OpStore %arg %foo <-- Argument passing. Out of "float arg" scope, but we must add DebugValue for "float arg" ... body of function bar(float arg) ... ``` This PR handles the except case in local-single-store-elim pass. It adds DebugValue for a store that is considered as an initialization. The same exception handling code for ssa-rewrite is done by this commit: `df4198e50e`.	2020-11-04 13:43:59 -05:00
Jaebaek Seo	df4198e50e	Add DebugValue for DebugDecl invisible to value assignment (#3973 ) For some cases, we have DebugDecl invisible to a value assignment, but the value assignment information is important i.e., debugger cannot inspect the variable without the information. For example, a parameter of an inlined function must have its value assignment i.e., argument passing out of its function scope. If we simply remove DebugDecl because it is invisible to the argument passing, we cannot inspec the variable. This PR - Adds DebugValue for DebugDecl invisible to a value assignment. We use the value of the variable in the basic block that contains DebugDecl, which is found by ssa-rewrite. If the value instruction does not dominate DebugDecl, we use the value of the variable in the immediate dominator of the basic block. - Checks the visibility of DebugDecl for Phi value assignment based on the all value operands of the Phi. Since Phi just references multiple values from multiple basic blocks, scopes of value operands must be regarded as the scope of the Phi.	2020-10-27 15:10:08 -04:00
Steven Perron	a187dd58a0	Allow SPV_KHR_8bit_storage extension. (#3780 )	2020-09-08 14:13:01 -04:00
Jaebaek Seo	ebaefda666	Debug info preservation in loop-unroll pass (#3548 ) When we copy the loop body to unroll it, we have to copy its instructions but DebugDeclare or DebugValue used for the declaration i.e., DebugValue with Deref must not be copied and only the first block can contain those instructions.	2020-07-30 12:18:06 -04:00
Jaebaek Seo	6a3eb679bd	Preserve debug info in scalar replacement pass (#3461 ) 1. Set the debug scope and line information for the new replacement instructions. 2. Replace DebugDeclare and DebugValue if their OpVariable or value operands are replaced by scalars. It uses 'Indexes' operand of DebugValue. For example, struct S { int a; int b;} S foo; // before scalar replacement int foo_a; // after scalar replacement int foo_b; DebugDeclare %dbg_foo %foo %null_expr // before DebugValue %dbg_foo %foo_a %Deref_expr 0 // after DebugValue %dbg_foo %foo_b %Deref_expr 1 // means Value(foo.members[1]) == Deref(%foo_b)	2020-07-27 13:02:25 -04:00
alan-baker	f3cec93665	Support SPV_KHR_terminate_invocation (#3568 ) Covers: - assembler - disassembler - validator - optimizer Co-authored-by: David Neto <dneto@google.com>	2020-07-22 11:45:02 -04:00
greg-lunarg	cf8c86a2d9	Preserve OpenCL.DebugInfo.100 through elim-local-single-store (#3498 ) This pass basically follows the same process as ssa-rewrite: it adds a DebugValue after each Store and removes the DebugDeclare or DebugValue Deref. It only does this if all instructions that are dependent on the Store are Loads and are replaced.	2020-07-10 15:17:14 -04:00
dan sinclair	52a5f074e9	Update access control lists. (#3433 ) This CL updates the access control lists used in SPIRV-Tools to the more descriptive allow/deny naming.	2020-06-15 13:20:40 -04:00
Daniel Koch	5a97e3a391	Add support for KHR_ray_{query,tracing} extensions (#3235 ) Update validator for SPV_KHR_ray_tracing. * Added handling for new enum types * Add SpvScopeShaderCallKHR as a valid scope * update spirv-headers Co-authored-by: alelenv <alele@nvidia.com> Co-authored-by: Torosdagli <ntorosda@amd.com> Co-authored-by: Tobias Hector <tobias.hector@amd.com> Co-authored-by: Steven Perron <stevenperron@google.com>	2020-03-17 15:30:19 -04:00
greg-lunarg	29af42df12	Add SPV_EXT_physical_storage_buffer to opt whitelists (#2779 ) This also fixes ADCE to not remove possibly needed OpTypeForwardPointer. The bug, its fix and the corresponding test have a circular dependency with the extension, so they are packaged together.	2019-08-08 09:45:59 -04:00
Steven Perron	12e4a7b649	Handle variable pointer in some optimizations (#2490 ) * Check var pointer capability in ADCE. * Check var ptr capability for common uniform. * Check var ptr capability in access chain convert. Since we want this pass to run even if there are variable pointer on storage buffers, we had to remove asserts that assumed there were no variable pointers. The functions with the asserts will now work, it becomes the responsibility of the callers to deal with the output as appropriate. * Single block elimination and variable pointers. It seems like the code in local single block elimination is able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. * Single store elimination and variable pointers. It seems like the code in local single stroe elimination is able to handle cases with variable pointers already. This is because the function `FindSingleStoreAndCheckUses` ensures that variables that feed a variable pointer are not candidates. * SSA rewriter and variable pointers. It seems like the code in the two passes that call the SSA rewriter are able to handle cases with variable pointers already. This is because the function `HasOnlySupportedRefs` ensures that variables that feed a variable pointer are not candidates. Fixes #2458.	2019-04-03 12:47:51 -04:00
Steven Perron	2d2a512691	Don't inline recursive functions. (#2130 ) * Move ProcessFunction* function from pass to the context. There are a few functions that are used to traverse the call tree. They currently live in the Pass class, but they have nothing to do with a pass, and may be needed outside of a pass. They would be better in the ir context, or in a specific call tree class if we ever have a need for it. * Don't inline recursive functions. Inlining does not check if a function is recursive or not. This has been fine as long as the shader was a Vulkan shader, which forbid recursive functions. However, not all shaders are vulkan, so either we limit inlining to Vulkan shaders or we teach it to look for recursive functions. I prefer to keep the passes as general as is reasonable. The change does not require much new code in inlining and gives a reason to refactor some other code. The changes are to add a member function to the Function class that checks if that function is recursive or not. Then this is used in inlining to not inlining a function call if it calls a recursive function. * Add id to function analysis There are a few places that build a map from ids to Function whose result is that id. I decided to add an analysis to the context for this to reduce that code, and simplify some of the functions. * Add missing file.	2018-11-29 14:24:58 -05:00
Daniel Koch	3b210d6a63	Add basic support for EXT_fragment_invocation_density (#2100 ) Whitelisting the extension in optimizations * copying what was done for NV_shading_rate	2018-11-23 10:21:19 -05:00
alelenv	1c1e749f0b	Add support for nv-raytracing-final (#2010 ) Add support for nv-raytracing (non-experimental)	2018-10-25 14:07:46 -04:00
Chao Chen	6e2dab2ffd	Add support for Nvidia Turing extensions	2018-09-19 20:46:14 -04:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This Cl cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
dan sinclair	a5a5ea0e2d	Remove using std::<foo> statements. (#1756 ) Many of the files have using std::<foo> statements in them, but then the use of <foo> will be inconsistently std::<foo> or <foo> scattered through the file. This CL removes all of the using statements and updates the code to have the required std:: prefix.	2018-08-01 14:58:12 -04:00
dan sinclair	c7da51a085	Cleanup extraneous namespace qualifies in source/opt. (#1716 ) This CL follows up on the opt namespacing CLs by removing the unnecessary opt:: and opt::analysis:: namespace prefixes.	2018-07-12 15:14:43 -04:00
dan sinclair	f96b7f1cb9	use Pass::Run to set the context on each pass. (#1708 ) Currently the IRContext is passed into the Pass::Process method. It is then up to the individual pass to store the context into the context_ variable. This CL changes the Run method to store the context before calling Process which no-longer receives the context as a parameter.	2018-07-12 09:08:45 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistenly be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir represenation.	2018-07-09 11:32:29 -04:00
Victor Lomuller	efc5061929	Dominator analysis interface clean. Remove the CFG requirement when querying a dominator/post-dominator from an IRContext. Updated all uses of the function and tests.	2018-04-20 15:41:59 -04:00
Steven Perron	c20a718e00	Rewrite local-single-store-elim to not create large data structures. The local-single-store-elim algorithm is not fundamentally bad. However, when there are a large number of variables, some of the maps that are used can become very large. These large data structures then take a very long time to be destroyed. I've seen cases around 40% if the time. I've rewritten that algorithm to not use as much memory. This give a significant improvement when running a large number of shader through DXC. I've also made a small change to local-single-block-elim to delete the loads that is has replaced. That way local-single-store-elim will not have to look at those. local-single-store-elim now does the same thing. The time for one set goes from 309s down to 126s. For another set, the time goes from 102s down to 88s.	2018-04-18 16:38:18 -04:00
David Neto	a91cbfbf75	Optimizer: update extension whitelists Add two new extensions: - SPV_NV_shader_subgroup_partitioned - SPV_EXT_descriptor_indexing	2018-04-06 15:56:20 -04:00
Eleni Maria Stea	045cc8f75b	Fixes compile errors generated with -Wpedantic This patch fixes the compile errors generated when the options SPIRV_WARN_EVERYTHING and SPIRV_WERROR (that force -Wpedantic) are set to cmake.	2018-03-22 09:40:11 -04:00
David Neto	2e3aec23ca	Add recent Google extensions to optimizer whitelists Optimizations should work in the presence of recent SPV_GOOGLE_decorate_string and SPV_GOOGLE_hlsl_functionality1 SPV_GOOGLE_decorate_string: - Adds operation OpDecorateStringGOOGLE to decorate an object with decorations having string operands. SPV_GOOGLE_hlsl_functionality1: - Adds HlslSemanticGOOGLE, used to decorate an interface variable with an HLSL semantic string. Optimizations already preserve those variables as required because they are interface variables (with uses), independent of whether they have HLSL decorations. - Adds HlslCounterBufferGOOGLE, used to associate a buffer with a counter variable. Fixes #1391	2018-03-15 11:16:20 -04:00
Rex Xu	314cfa29b2	Add missing SPV extension strings	2018-03-08 21:54:00 +08:00

1 2

78 Commits