SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2025-01-16 03:00:06 +00:00

Author	SHA1	Message	Date
Ryan Harrison	0125b28ed4	Add compact ids to WebGPU <-> Vulkan transformations (#2639 ) Fixes #2634	2019-05-29 12:58:37 -07:00
greg-lunarg	3d62cb8148	Instrument: Add version 2 of record formats (#2630 ) New version has additional word in stage-specific section. Also some changes in content for tessellation and compute shaders. Either version can be invoked at pass creation. This is done to ease integration and updating of validation layers. Version 1 is deprecated and eventually will go away. Also sneaking in fix to version 1 compute shaders.	2019-05-29 15:08:21 -04:00
Ryan Harrison	4557d08584	Add in individual flags for Vulkan <-> WebGPU passes (#2615 ) Adds flags and/or documentation for individual transformation passes that had been missed in previous patches. Fixes #2574	2019-05-22 10:06:53 -07:00
Ryan Harrison	f6d9a17843	Add pass to fix some invalid unreachable blocks for WebGPU (#2563 ) Attempts to split up unreachable blocks that are used both as a merge-block and a continue-target. Fixes #2429	2019-05-09 12:56:10 -04:00
Ryan Harrison	048dcd38ce	Implement WebGPU->Vulkan initializer conversion for 'Function' variables (#2513 ) WebGPU requires certain variables to be initialized, whereas there are known issues with using initializers in Vulkan. This PR is the first of three implementing a pass to decompose initialized variables into a variable declaration followed by a store. This has been broken up into multiple PRs, because there are 3 distinct cases that need to be handled, which require separate implementations. This first PR implements the basic infrastructure that is needed, and handling of Function storage class variables. Private and Output will be handled in future PRs. This is part of resolving #2388	2019-04-16 14:31:36 -04:00
Ryan Harrison	102e430a88	Add pass to legalize OpVectorShuffle for WebGPU (#2509 ) In WebGPU, the component operand 0xFFFFFFFF is forbidden, but in Vulkan it is used to indicate a value is undefined. When converting to WebGPU, 0xFFFFFFFF needs to be converted to a legal value, though the specific one does not matter, since it was used to indicate an undefined entry in the original code. Choosing to use 0, since the operands are required to be in [0, N-1], so 0 is guaranteed to always be valid. Fixes #2349	2019-04-12 12:14:23 -04:00
Steven Perron	7ce37d66a8	Fix use of Logf to avoid format security warning (#2498 ) When -Wformat-security is enabled, we are getting an error. I do not claim to fully understand when the warning is triggered or not, but this one can be avoided by calling "Log" instead of "Logf" because the formatting string is not needed.	2019-04-08 11:06:48 -04:00
Ryan Harrison	0cb2d4079e	Add WebGPU->Vulkan and Vulkan->WebGPU flags in spirv-opt (#2496 ) Renames the existing flag '--webgpu-mode' to '--vulkan-to-webgpu' for the Vulkan->WebGPU operation, and adds a new flag '--webgpu-to-vulkan' for the WebGPU->Vulkan operation. Currently '--webgpu-to-vulkan' doesn't have any passes associated with it yet, but further patches will implement them. Fixes #2495	2019-04-05 15:12:26 -04:00
JasperNV	9766b22b33	spirv-opt: Behave a bit better in the face of unknown instructions (#2487 ) * opt/ir_loader: Don't silently drop unknown instructions on the floor Currently, if spirv-opt sees an instruction it does not know, it will silently ignore it and move to the next one. This changes it to be an error, as dropping it on the floor is likely to generate invalid SPIR-V output. * opt/optimizer: Complain a bit louder for unexpected binary changes If a binary change happens despite a pass saying that the binaries should be identical, this is indicative of a bug in the pass itself. This does not change behavior for it to be an error, but simply emits a warning in this case.	2019-04-05 13:36:42 -04:00
Steven Perron	3a0bc9e724	Add fix storage class code. (#2434 ) This pass tries to fix validation error due to a mismatch of storage classes in instructions. There is no guarantee that all such error will be fixed, and it is possible that in fixing these errors, it could lead to other errors. Fixes #2430.	2019-04-05 13:12:08 -04:00
Ryan Harrison	01964e325f	Add pass to generate needed initializers for WebGPU (#2481 ) Fixes #2387	2019-04-03 11:44:09 -04:00
alan-baker	42e6f1aa62	Add option to validate after each pass (#2462 ) * New command-line option to opt: --validate-after-all * Pass manager will validate after each pass it runs	2019-03-26 14:38:59 -04:00
greg-lunarg	e1a76269b6	Bindless Validation: Descriptor Initialization Check (#2419 ) If SPV_EXT_descriptor_indexing is enabled, add check that for a descriptor-based reference, the descriptor is initialized. Initialization data is stored in the debug input buffer, added to the length information already there. This feature must be separately enabled on the pass creation routine. NOTE: Currently just supports image references; buffer references are still TODO.	2019-03-19 09:53:43 -04:00
Ryan Harrison	e545522146	Add --strip-atomic-counter-memory (#2413 ) Adds an optimization pass to remove usages of AtomicCounterMemory bit. This bit is ignored in Vulkan environments and outright forbidden in WebGPU ones. Fixes #2242	2019-03-14 13:34:33 -04:00
Steven Perron	1b0047f210	Add pass to remove dead members. (#2379 ) Add a pass that looks for members of structs whose values do not affect the output of the shader. Those members are then removed and just treated like padding in the struct.	2019-02-14 13:42:35 -05:00
Ryan Harrison	12b3d7e9d6	Add strip-debug to webgpu-mode passes (#2368 ) Fixes #2366	2019-02-08 14:26:17 -05:00
greg-lunarg	cf21146137	Expand bindless bounds checking to runtime-sized descriptor arrays (#2316 )	2019-02-07 14:00:36 -05:00
Ryan Harrison	0f4bf0720a	Add flatten-decorations flag to webgpu-mode flags (#2348 ) Fixes #2272	2019-02-05 14:07:53 -05:00
Steven Perron	9ab1c0ddd0	Remove code sinking for -O. (#2340 ) Community feedback says it is not generally beneficial, so we will remove it from the standard optimization set.	2019-01-28 11:50:50 -05:00
Steven Perron	dd4157dcee	Sink (#2284 ) Add code sinking pass. It will move OpLoad and OpAccessChain instructions as close as possible to their uses. Part of #1611.	2019-01-17 15:56:36 -05:00
Ryan Harrison	47c08a79c4	Implement initial --webgpu-mode flag (#2217 ) Fixes #2166	2018-12-18 15:10:34 -05:00
Ryan Harrison	e0292c269d	Add --target-env flag to spirv-opt (#2216 ) Fixes #2199	2018-12-17 16:54:23 -05:00
alan-baker	e510b1bac5	Update memory model (#1904 ) Upgrade to VulkanKHR memory model * Converts Logical GLSL450 memory model to Logical VulkanKHR * Adds extension and capability * Removes deprecated decorations and replaces them with appropriate flags on downstream instructions * Support for Workgroup upgrades * Support for copy memory * Adding support for image functions * Adding barrier upgrades and tests * Use QueueFamilyKHR scope instead of device	2018-11-30 14:15:51 -05:00
greg-lunarg	c37388f1ad	Add passes to propagate and eliminate redundant line instructions (#2027 ). (#2039 ) These are bookend passes designed to help preserve line information across passes which delete, move and clone instructions. The propagation pass attaches a debug line instruction to every instruction based on SPIR-V line propagation rules. It should be performed before optimization. The redundant line elimination pass eliminates all line instructions which match the previous line instruction. This pass should be performed at the end of optimization to reduce physical SPIR-V file size. Fixes #2027.	2018-11-15 14:06:17 -05:00
greg-lunarg	1e9fc1aac1	Add base and core bindless validation instrumentation classes (#2014 ) * Add base and core bindless validation instrumentation classes * Fix formatting. * Few more formatting fixes * Fix build failure * More build fixes * Need to call non-const functions in order. Specifically, these are functions which call TakeNextId(). These need to be called in a specific order to guarantee that tests which do exact compares will work across all platforms. c++ pretty much does not guarantee order of evaluation of operands, so any such functions need to be called separately in individual statements to guarantee order. * More ordering. * And more ordering. * And more formatting. * Attempt to fix NDK build * Another attempt to address NDK build problem. * One more attempt at NDK build failure * Add instrument.hpp to BUILD.gn * Some name improvement in instrument.hpp * Change all types in instrument.hpp to int. * Improve documentation in instrument.hpp * Format fixes * Comment clean up in instrument.hpp * imageInst -> image_inst * Fix GetLabel() issue.	2018-11-08 13:54:54 -05:00
Steven Perron	82663f34c9	Check for unreachable blocks in merge-return. (#1966 ) Merge return assumes that the only unreachable blocks are those needed to keep the structured cfg valid. Even those must be essentially empty blocks. If this is not the case, we get unpredictable behaviour. This commit adds a check in merge return, and emits an error if it is not the case. Added a pass of dead branch elimination before merge return in both the performance and size passes. It is a precondition of merge return. Fixes #1962.	2018-10-10 15:18:15 -04:00
Steven Perron	0e5fc7d75e	Allow 0 as argument to scalar replacement. (#1917 ) A limit of 0 for the scalar replacement options is used to indicate that there is no limit. The current implementation does not allow 0. This should be fixed.	2018-09-26 09:58:28 -04:00
Steven Perron	9fbcce4ca1	Add unrolling to the legalization passes (#1903 ) Adds unrolling to the legalization passes. After enabling unrolling I found a bug when there is a self-referencing phi node. That has been fixed. The test that checks that the order of optimizations is correct also needed to be updated.	2018-09-19 16:40:09 -04:00
Steven Perron	75c1bf2843	Add option for the max id bound. (#1870 ) * Create a new entry point for the optimizer Creates a new struct to hold the options for the optimizer, and creates an entry point that takes the optimizer options as a parameter. The old entry point that takes validator options is now deprecated. The validator options will be one of the optimizer options. Part of the optimizer options will also be the upper bound on the id bound. * Add a command line option to set the max value for the id bound. The default is 0x3FFFFF. * Modify `TakeNextIdBound` to return 0 when the limit is reached.	2018-09-10 11:49:41 -04:00
Diego Novillo	03000a3a38	Add testing framework for tools. This forks the testing harness from https://github.com/google/shaderc to allow testing CLI tools. New features needed for SPIRV-Tools include: 1- A new PlaceHolder subclass for spirv shaders. This place holder calls spirv-as to convert assembly input into SPIRV bytecode. This is required for most tools in SPIRV-Tools. 2- A minimal testing file for testing basic functionality of spirv-opt. Add tests for all flags in spirv-opt. 1. Adds tests to check that known flags match the names that each pass advertises. 2. Adds tests to check that -O, -Os and --legalize-hlsl schedule the expected passes. 3. Adds more functionality to Expect classes to support regular expression matching on stderr. 4. Add checks for integer arguments to optimization flags. 5. Fixes #1817 by modifying the parsing of integer arguments in flags that take them. 6. Fixes -Oconfig file parsing (#1778). It reads every line of the file into a string and then parses that string by tokenizing every group of characters between whitespaces (using the standard cin reading operator). This mimics shell command-line parsing, but it does not support quoting (and I'm not planning to).	2018-08-17 15:03:14 -04:00
dan sinclair	1553025f4c	Move make_unique to source/util. (#1836 ) This MakeUnique code is used in places other than source/opt so move it to source/util.	2018-08-14 12:44:54 -04:00
Steven Perron	bcb0b6935c	Reenable --skip-validation. (#1820 ) In previous changes, the option `--skip-validation` was disabled. This change is to reenable it.	2018-08-13 13:18:46 -04:00
Steven Perron	5c8b4f5a1c	Validate the input to Optimizer::Run (#1799 ) * Run the validator in the optimization fuzzers. The optimizer assumes that the input to the optimizer is valid. Since the fuzzers do not check that the input is valid before passing the spir-v to the optimizer, we are getting a few errors. The solution is to run the validator in the optimizer to validate the input. For the legalization passes, we need to add an extra option to the validator to accept certain types of variable pointers, even if the capability is not given. At the same time, we changed the option "--legalize-hlsl" to relax the validator in the same way instead of turning it off.	2018-08-08 11:16:19 -04:00
dan sinclair	eda2cfbe12	Cleanup includes. (#1795 ) This CL cleans up the include paths to be relative to the top level directory. Various include-what-you-use fixes have been added.	2018-08-03 15:06:09 -04:00
Alan Baker	755e5c9420	Transform to combine consecutive access chains * Combines OpAccessChain, OpInBoundsAccessChain, OpPtrAccessChain and OpInBoundsPtrAccessChain * New folding rule to fold add with 0 for integers * Converts to a bitcast if the result type does not match the operand type	2018-07-31 13:42:47 -04:00
Diego Novillo	99fe61e724	Add API to create passes out of a list of command-line flags. This re-implements the -Oconfig=<file> flag to use a new API that takes a list of command-line flags representing optimization passes. This moves the processing of flags that create new optimization passes out of spirv-opt and into the library API. Useful for other tools that want to incorporate a facility similar to -Oconfig. The main changes are: 1- Add a new public function Optimizer::RegisterPassesFromFlags. This takes a vector of strings. Each string is assumed to have the form '--pass_name[=pass_args]'. It creates and registers into the pass manager all the passes specified in the vector. Each pass is validated internally. Failure to create a pass instance causes the function to return false and a diagnostic is emitted to the registered message consumer. 2- Re-implements -Oconfig in spirv-opt to use the new API.	2018-07-27 15:10:08 -04:00
dan sinclair	e6b953361d	Move the ir namespace to opt. (#1680 ) This CL moves the files in opt/ to consistently be under the opt:: namespace. This frees up the ir:: namespace so it can be used to make a shared ir representation.	2018-07-09 11:32:29 -04:00
Steven Perron	101a9bcbb0	Add private to local to optimization and size passes. Many optimizations will run on function scope symbols only. When symbols are moved from private scope to function scope, then these optimizations can do more. I believe it is a good idea to run this pass with both -O and -Os. To get the most out of it, it should be run ASAP after inlining and something that removes all of the dead functions.	2018-07-04 21:26:09 -04:00
Steven Perron	465f2815cb	Revert change and stop running remove duplicates. Revert "Don't merge types of resources" This reverts commit `f393b0e480`, but leaves the tests that were added. Added new test. These tests are there so that, if someone tries the same change I made, they will see the test that they need to handle. Don't run remove duplicates in -O and -Os Remove duplicates was run to help reduce compile time when looking for types in the type manager. I've run compile time tests on three sets of shaders, and the compile time does not seem to change. It should be safe to remove it.	2018-06-29 14:09:44 -04:00
Steven Perron	fe2fbee294	Delete the insert-extract-elim pass. Replaces anything that creates an insert-extract-elim pass so that it creates a simplification pass instead. Then delete the implementation of the pass. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-06-01 10:13:39 -04:00
Steven Perron	745dd00af9	Fold FMix feeding Extract, and use the simplification pass. We add a new rule to the folding rules to fold an FMix feeding an extract when the alpha value for the element being extracted is either 0 or 1. In those cases, we can simply extract from one of the operands to the FMix. With that change the simplification pass completely subsumes the insert-extract elimination pass. So we remove the insert-extract elimination passes and replace them with calls to the simplification pass. In a follow up PR, we should delete the insert-extract elimination pass. Contributes to https://github.com/KhronosGroup/SPIRV-Tools/issues/1570.	2018-05-25 14:42:59 -04:00
Arseny Kapoulkine	f765d16bd9	Add external interface for creating a pass token Currently it's impossible for external code to register a pass because the only source file that can create pass tokens is optimizer.cpp. This makes it hard to add passes that can't be upstreamed since you can't run them from the usual pass sequence without reimplementing Optimizer. This change adds a PassToken constructor that takes unique_ptr to opt::Pass; if out-of-tree code implements opt::Pass it can register a custom pass without having to add it to SPIRV-Tools source code.	2018-05-25 09:19:43 -04:00
Steven Perron	a579e720a8	Remove the limit on struct size in SROA. Removes the limit on scalar replacement for the legalization passes. This is done by adding an option to the pass (and command line option) to set the limit on maximum size of the composite that scalar replacement is willing to divide. Fixes #1494.	2018-05-18 10:03:46 -04:00
Steven Perron	af430ec822	Add pass to fold a load feeding an extract. We have already disabled common uniform elimination because it created sequences that load an entire uniform object, then extract just a single element. This caused problems in some drivers, and is just generally slow because it loads more memory than needed. However, there are other ways to get into this situation, so I've added a pass that looks specifically for this pattern and removes it when only a portion of the load is used. Fixes #1547.	2018-05-14 15:40:34 -04:00
Toomas Remmelg	1dc2458060	Add a loop fusion pass. This pass will look for adjacent loops that are compatible and legal to be fused. Loops are compatible if: - they both have one induction variable - they have the same upper and lower bounds - same initial value - same condition - they have the same update step - they are adjacent - there are no break/continue in either of them Fusion is legal if: - fused loops do not have any dependencies with dependence distance greater than 0 that did not exist in the original loops. - there are no function calls in the loops (could have side-effects) - there are no barriers in the loops It will fuse all such loops as long as the number of registers used for the fused loop stays under the threshold defined by max_registers_per_loop.	2018-05-01 15:40:37 -04:00
Stephen McGroarty	9a5dd6fe88	Support loop fission. Adds support for splitting loops whose register pressure exceeds a user provided level. This pass will split a loop into two or more loops given that the loop is a top level loop and that splitting the loop is legal. Control flow is left intact for dead code elimination to remove. This pass is enabled with the --loop-fission flag to spirv-opt.	2018-05-01 15:15:10 -04:00
Steven Perron	ee8cd5c847	Add Dead insert elimination back in.	2018-04-24 10:10:30 -04:00
Steven Perron	2c0ce87210	Vector DCE (#1512 ) Introduce a pass that does a DCE type analysis for vector elements instead of the whole vector as a single element. It will then rewrite instructions that are not used with something else. For example, an instruction whose values are not used, even though it is referenced, is replaced with an OpUndef.	2018-04-23 11:13:07 -04:00
Victor Lomuller	10e5d7cf13	Add a loop peeling pass. For each loop in a function, the pass walks the loops from inner to outer most loop and tries to peel loops for which a certain number of iterations can be done before or after the loop. To limit code growth, peeling will not happen if the growth in code size goes above a configurable threshold.	2018-04-11 15:41:29 +01:00
Steven Perron	cbceeceab4	In copy-prop-arrays, identify copies via OpCompositeInsert When the original code copies an entire array or struct one element at a time, this turns into a series of OpCompositeInsert instructions followed by a store of the whole array. We currently miss opportunities in copy propagate arrays because we do not recognize this as a copy. This commit adds code to copy propagate arrays to identify this code pattern. Also updates the performance passes to run array copy propagation.	2018-03-29 09:39:55 -04:00
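
Note on commit 102e430a88 (legalize OpVectorShuffle for WebGPU): the rule it describes, replacing the component literal 0xFFFFFFFF with 0, can be illustrated with a small Python sketch. This is not the SPIRV-Tools implementation (which is written in C++ and operates on the optimizer's IR); the function name and the plain-list representation of the component operands are assumptions made for illustration only.

```python
# Illustrative sketch only; names and data layout are hypothetical and do
# not mirror the SPIRV-Tools C++ pass.

# In Vulkan-flavoured SPIR-V, this component literal marks an undefined lane.
UNDEFINED_COMPONENT = 0xFFFFFFFF


def legalize_shuffle_components(components):
    """Return a copy of the component list with undefined markers set to 0.

    Index 0 is always a valid selector because component literals must lie
    in [0, N-1], and the lane was undefined anyway, so any legal value works.
    """
    return [0 if c == UNDEFINED_COMPONENT else c for c in components]


if __name__ == "__main__":
    original = [1, UNDEFINED_COMPONENT, 3, UNDEFINED_COMPONENT]
    print(legalize_shuffle_components(original))
```

Defined component indices pass through unchanged; only the undefined markers are rewritten.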


116 Commits