SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-11-28 22:21:03 +00:00

Author	SHA1	Message	Date
Diego Novillo	4ba9dcc8a0	Implement SSA CCP (SSA Conditional Constant Propagation). This implements the conditional constant propagation pass proposed in Constant propagation with conditional branches, Wegman and Zadeck, ACM TOPLAS 13(2):181-210. The main logic resides in CCPPass::VisitInstruction. Instruction that may produce a constant value are evaluated with the constant folder. If they produce a new constant, the instruction is considered interesting. Otherwise, it's considered varying (for unfoldable instructions) or just not interesting (when not enough operands have a constant value). The other main piece of logic is in CCPPass::VisitBranch. This evaluates the selector of the branch. When it's found to be a known value, it computes the destination basic block and sets it. This tells the propagator which branches to follow. The patch required extensions to the constant manager as well. Instead of hashing the Constant pointers, this patch changes the constant pool to hash the contents of the Constant. This allows the lookups to be done using the actual values of the Constant, preventing duplicate definitions.	2017-12-21 14:29:45 -05:00
Steven Perron	756b277fb8	Store all enabled capabilities in the feature manger. In order to keep track of all of the implicit capabilities as well as the explicit ones, we will add them all to the feature manager. That is the object that needs to be queried when checking if a capability is enabled. The name of the "HasCapability" function in the module was changed to make it more obvious that it does not check for implied capabilities. Keep an spv_context and AssemblyGrammar in IRContext	2017-12-21 11:14:53 -05:00
Alan Baker	1ab8ad654a	Fixing bugs in type manager memory management * changed the way duplicate types are removed to stop copying instructions * Reworked RemoveDuplicatesPass::AreTypesSame to use type manager and type equality * Reworked TypeManager memory management to store a pool of unique pointers of types * removed unique pointers from id map * fixed instances where free'd memory could be accessed	2017-12-21 08:59:06 -05:00
Steven Perron	7505d24225	Update the legalization passes. Changes the set of optimizations done for legalization. While doing this, I added documentation to explain why we want each optimization. A new option "--legalize-hlsl" is added so the legalization passes can be easily run from the command line. The legalize option implies skip-validation.	2017-12-20 17:56:03 -05:00
Pierre Moreau	424f744db1	Opt: Fix implementation and comment of AreDecorationsTheSame Target should not be ignored when comparing decorations in RemoveDuplicates Opt: Remove unused code in RemoveDuplicateDecorations	2017-12-19 15:36:47 -05:00
Steven Perron	79a00649b4	Allow pointers to pointers in logical addressing mode. A few optimizations are updates to handle code that is suppose to be using the logical addressing mode, but still has variables that contain pointers as long as the pointer are to opaque objects. This is called "relaxed logical addressing". \|Instruction::GetBaseAddress\| will check that pointers that are use meet the relaxed logical addressing rules. Optimization that now handle relaxed logical addressing instead of logical addressing are: - aggressive dead-code elimination - local access chain convert - local store elimination passes.	2017-12-19 14:29:14 -05:00
Steven Perron	b86eb6842b	Convert private variables to function scope. When a private variable is used in a single function, it can be converted to a function scope variable in that function. This adds a pass that does that. The pass can be enabled using the option `--private-to-local`. This transformation allows other transformations to act on these variables. Also moved `FindPointerToType` from the inline class to the type manager.	2017-12-19 14:21:04 -05:00
David Neto	8135dd6375	More validation on primitive instructions - Test validation success for OpEmitVertex OpEndPrimitive - Test missing capabilities for primitive instructions - Primitive instructions require Geometry execution model	2017-12-19 13:26:07 -05:00
Jesus Carabano	4dbcef62ee	validate & test of literal's upper bits Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/660	2017-12-19 13:19:56 -05:00
Pierre Moreau	f35963588b	Opt: Remove commented out duplicated type_id function This code was wrongly added by #693.	2017-12-18 17:29:21 -05:00
Jeremy Hayes	0d8ea48652	Fix comment in primitives validation Also refactor type query for efficiency.	2017-12-18 17:27:06 -05:00
Andrey Tuganov	dbc3a662c6	Image Operand Sample allows sparse image opcodes @ehsannas had filed an issue against SPIR-V spec, concerning Image Operands section (3.14): Sample A following operand is the sample number of the sample to use. Only valid with OpImageFetch, OpImageRead, and OpImageWrite. Relaxing the check to allow OpImageSparseRead and OpImageSparseFetch to fix failing tests.	2017-12-18 11:21:38 -05:00
David Neto	0dbe184d32	Remove concept of FIRST_CONCRETE_* operand types	2017-12-18 09:48:51 -05:00
Alan Baker	616908503d	Improving the usability of the type manager. The type manager hashes types. This allows the lookup of type declaration ids from arbitrarily constructed types. Users should be cautious when dealing with non-unique types (structs and potentially pointers) to get the exact id if necessary. * Changed the spec composite constant folder to handle ambiguous composites * Added functionality to create necessary instructions for a type * Added ability to remove ids from the type manager	2017-12-18 08:20:56 -05:00
GregF	0f80406315	ADCE: Only mark true breaks and continues of live loops This fixes issue #1075 - Mark continue when conditional branch with merge block. Only mark if merge block is not continue block. - Handle conditional branch break with preceding merge	2017-12-15 11:53:57 -05:00
Jeremy Hayes	cdfbf26c13	Add primitive instruction validation pass	2017-12-15 09:53:29 -05:00
Andrey Tuganov	af7d5799a5	Refactor include of latest spir-v header versions	2017-12-14 11:18:20 -05:00
Andrey Tuganov	532b327d4d	Add validation rules for atomic instructions Validates all OpAtomicXXX instructions.	2017-12-13 18:29:38 -05:00
Diego Novillo	853a3d6c31	Fix uninitialized warning at -Os.	2017-12-12 15:46:09 -05:00
Greg Fischer	22faa2b083	ADCE: Empty Loop Elimination This entirely eliminates loops which do not contain live code.	2017-12-12 13:53:15 -05:00
Steven Perron	07ce16d1e7	Set the parent for basic blocks during inlining. Inlining is not setting the parent (function) for each basic block. This can cause problems for later optimizations. The solution is to set the parent for each new block just before it is linked into the function.	2017-12-12 13:39:08 -05:00
Andrey Tuganov	c520d43649	Add validator checks for sparse image opcodes	2017-12-12 12:04:23 -05:00
Pierre Moreau	12447d8465	Support OpenCL 1.2 and 2.0 target environments include: Add target environment enums for OpenCL 1.2 and 2.0 Validator: Validate OpenCL capabilities Update validate capabilities to handle embedded profiles Add test for OpenCL capabilities validation Update messages to mention the OpenCL profile used Re-format val_capability_test.cpp	2017-12-12 11:35:39 -05:00
Andrey Tuganov	dbd8d0e7b8	Reenable OpCopyObject validation rules Vulkan CTS fix has been submitted.	2017-12-11 12:33:11 -05:00
Alan Baker	867451f49e	Add scalar replacement Adds a scalar replacement pass. The pass considers all function scope variables of composite type. If there are accesses to individual elements (and it is legal) the pass replaces the variable with a variable for each composite element and updates all the uses. Added the pass to -O Added NumUses and NumUsers to DefUseManager Added some helper methods for the inst to block mapping in context Added some helper methods for specific constant types No longer generate duplicate pointer types. * Now searches for an existing pointer of the appropriate type instead of failing validation * Fixed spec constant extracts * Addressed changes for review * Changed RunSinglePassAndMatch to be able to run validation * current users do not enable it Added handling of acceptable decorations. * Decorations are also transfered where appropriate Refactored extension checking into FeatureManager * Context now owns a feature manager * consciously NOT an analysis * added some test * fixed some minor issues related to decorates * added some decorate related tests for scalar replacement	2017-12-11 10:51:13 -05:00
GregF	78c025abe9	MultiStore: Support OpVariable Initialization Treat an OpVariable with initialization as if it was an OpStore. With PR #1073, this completes work for issue #1017.	2017-12-11 10:37:14 -05:00
GregF	c6fdf68c2f	SingleStore: Support OpVariable Initialization Treat an OpVariable with initialization as if it was an OpStore. This fixes issue #1017.	2017-12-08 16:02:14 -05:00
Diego Novillo	241dcacc04	Add a new constant manager class. This patch adds a new constant manager class to interface with analysis::Constant. The new constant manager lives in ir::IRContext together with the type manager (analysis::TypeManager). The new analysis::ConstantManager is used by the spec constant folder and the constant propagator (in progress). Another cleanup introduced by this patch removes the ID management from the fold spec constant pass, and ir::IRContext and moves it to ir::Module. SSA IDs were maintained by IRContext and Module. That's pointless and leads to mismatch IDs. Fixed by moving all the bookkeeping to ir::Module.	2017-12-08 14:14:55 -05:00
Steven Perron	5d602abd66	Add global redundancy elimination Adds a pass that looks for redundant instruction in a function, and removes them. The algorithm is a hash table based value numbering algorithm that traverses the dominator tree. This pass removes completely redundant instructions, not partially redundant ones.	2017-12-07 18:35:38 -05:00
Steven Perron	851e1ad985	Kill names and decoration in inlining. Currently when inlining a call, the name and decorations for the result of the call is not deleted. This should be changed. Added a test for this as well. This fixes issue #622.	2017-12-07 12:20:45 -05:00
Victor Lomuller	731d1899b1	Add depth first iterator for trees - Add generic depth first iterator - Update the dominator tree to use this iterator instead of "randomly" iterate over the nodes	2017-12-07 10:07:56 -05:00
Diego Novillo	0c2396d20f	Revert extraneous changes from commit `8ec62deb2`. Commit `8ec62deb2` merged the code from PR #810, but it also re-introduces code that had been removed in #885. This patch removes the (now superfluous code).	2017-12-06 16:04:47 -05:00
Stephen McGroarty	8ba68fa9b9	Dominator Tree Analysis (#3 ) Support for dominator and post dominator analysis on ir::Functions. This patch contains a DominatorTree class for building the tree and DominatorAnalysis and DominatorAnalysisPass classes for interfacing and caching the built trees.	2017-12-05 22:59:43 -05:00
Andrey Tuganov	94e3e7b8ef	Add composite instruction validation pass Validates instructions in the opcode range from OpVectorExtractDynamic to OpTranspose.	2017-12-05 10:15:51 -05:00
Andrey Tuganov	bf184310b2	Fix some of the known issues in image validation Applied some of the spec clarifications made in conversation with @johnkslang.	2017-12-04 18:57:34 -05:00
Steven Perron	fd3a22042b	DCEInst kill the same instruction twice. In DCEInst, it is possible that the same instruction ends up in the queue multiple times, if the same id is used multiple times in the same instruction. The solution is to keep the ids in a set, to ensure no duplication in the list.	2017-12-04 18:15:35 -05:00
Diego Novillo	e9ecc0cbfd	Remove cfg_ field from SSAPropagator class - NFC. When I moved the CFG into IRContext (https://github.com/KhronosGroup/SPIRV-Tools/pull/1019), I forgot to update SSAPropagator to stop requiring one. Fixed with this patch.	2017-12-04 15:28:21 -05:00
Steven Perron	65046eca7c	Change IRContext::KillInst to delete instructions. The current method of removing an instruction is to call ToNop. The problem with this is that it leaves around an instruction that later passes will look at. We should just delete the instruction. In MemPass there is a utility routine called DCEInst. It can delete essentially any instruction, which can invalidate pointers now that they are actually deleted. The interface was changed to add a call back that can be used to update any local data structures that contain ir::Intruction*.	2017-12-04 11:07:45 -05:00
Steven Perron	b35b52f97b	Compute value number when the value table is constructed. Computing the value numbers on demand, as we do now, can lead to different results depending on the order in which the users asks for the value numbers. To make things more stable, we compute them ahead of time.	2017-12-04 11:02:04 -05:00
Daan Wendelen	b98254b282	Fixed typo that leaked to the binary The typo was found by lintian when I was packaging glslang	2017-12-03 20:42:14 -05:00
Lei Zhang	0dd4ee27b1	Fix Dref type check in validator Dref should be of 32-bit scalar floating type. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1012	2017-12-01 10:17:45 -05:00
Pierre Moreau	69043963e4	Opt: Remove unused lambda captures Those are reported as errors by clang 5.0.0, due to the flags -Werror and -Wunused-lambda-capture.	2017-12-01 09:54:37 -05:00
Lei Zhang	137953538a	Support outputting ANSI color escape sequences in library Previously we required _PRINT to enable _COLOR, which forbids outputting colored disassembly into a string in library. This commit will allow library users to request enabling ANSI color escape sequences.	2017-12-01 09:03:35 -05:00
David Neto	188cd3780d	Erase decorations removed from internal collections Fixes Android arm-64-v8a build with NDK r14. That's because we no longer ignore the result of the std::remove.	2017-11-30 11:35:02 -05:00
David Neto	3c2e4c7d99	Fix validation of group ops in SPV_AMD_shader_ballot This needs custom code since the rules from the extension are not encoded in the grammar. Changes are: - The new group instructions don't require Group capability when the extension is declared. - The Reduce, InclusiveScan, ExclusiveScan normally require the Kernel capability, but don't when the extension is declared. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/991	2017-11-30 10:26:04 -05:00
Diego Novillo	8cfa0c40e0	Fix #1034 - Give Edge::operator<() weak ordering semantics. This should fix #1034. It changes the predicate on operator< to use label IDs from each block and compares them as std:pair to define a weak ordering for std::set.	2017-11-29 17:29:17 -05:00
Andrey Tuganov	e1ceff9f54	Validate OpTypeImage and OpTypeSampleImage Added new validation rules to the validate image pass.	2017-11-29 13:21:04 -05:00
GregF	8dd3d93cf6	AggressiveDCE: Add merge and continue branches for live loop. This ensures that an if-break is not eliminated from a loop. This fixes issue #989	2017-11-29 09:56:21 -05:00
Diego Novillo	9f20799fb4	Convert the CFG to an on-demand analysis - NFC. This fixes some TODOs by moving the CFG into the IRContext as an analysis.	2017-11-28 13:25:41 -05:00
Diego Novillo	74327845aa	Generic value propagation engine. This class implements a generic value propagation algorithm based on the conditional constant propagation algorithm proposed in Constant propagation with conditional branches, Wegman and Zadeck, ACM TOPLAS 13(2):181-210. The implementation is based on A Propagation Engine for GCC Diego Novillo, GCC Summit 2005 http://ols.fedoraproject.org/GCC/Reprints-2005/novillo-Reprint.pdf The purpose of this implementation is to act as a common framework for any transformation that needs to propagate values from statements producing new values to statements using those values.	2017-11-27 23:32:06 -05:00
Diego Novillo	491b112fd2	Fix windows build. This fixes the lack of uint32_t definition in source/val/decoration.h.	2017-11-27 14:40:03 -05:00
Diego Novillo	83228137e1	Re-format source tree - NFC. Re-formatted the source tree with the command: $ /usr/bin/clang-format -style=file -i \ $(find include source tools test utils -name '.cpp' -or -name '.h') This required a fix to source/val/decoration.h. It was not including spirv.h, which broke builds when the #include headers were re-ordered by clang-format.	2017-11-27 14:31:49 -05:00
Andrey Tuganov	d8b2013ecf	Derivative opcodes require Fragment exec model Added validator check that all derivative opcodes require Fragment execution model.	2017-11-27 12:05:25 -05:00
Andrey Tuganov	c170afd93b	Relaxed OpImageWrite texel type check	2017-11-24 14:31:08 -05:00
Andrey Tuganov	f84f266977	Relaxed OpImageRead validation rules Removed the check that result type of OpImageRead should be a vector4. Will reenable/adapt once the spec is clarified on what the right dimension should be.	2017-11-24 10:12:24 -05:00
Alan Baker	0cae89e79e	Notify the context of instructions that are being erased. Fixes use-after-free error in RemoveDuplicatesPass Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/1004	2017-11-23 23:43:25 -05:00
Andrey Tuganov	3e08a3f718	Add validation checks for Execution Model Currently checks that these instructions are called from entry points with Fragment execution model. OpImageImplicit* OpImageQueryLod OpKill	2017-11-23 23:38:03 -05:00
David Neto	d9129f00a5	Test for pollution of the global namespace Works on Linux only for now. That's a good start. Move ValidateBinaryUsingContextAndValidationState into anonymous namespace in source/validate.cpp.	2017-11-23 21:27:21 -05:00
Steven Perron	0b1cb27f83	Remove derivative instructions from the list of combinators. These instructions compute their value based the value of the immediate neighbours of the current fragment. This means the result is not defined purely by the operands of the instruction.	2017-11-23 18:37:43 -05:00
Lei Zhang	aec60b8158	Add RegisterLegalizationPasses() into the interface Add note to mention the use scenario. The original list came from Glslang.	2017-11-23 17:26:44 -05:00
Alan Baker	746bfd210a	Adding new def -> use mapping container Replaced representation of uses * Changed uses from unordered_map<uint32_t, UseList> to set<pairInstruction, Instruction>> * Replaced GetUses with ForEachUser and ForEachUse functions * updated passes to use new functions * partially updated tests * lots of cleanup still todo Adding an unique id to Instruction generated by IRContext Each instruction is given an unique id that can be used for ordering purposes. The ids are generated via the IRContext. Major changes: * Instructions now contain a uint32_t for unique id and a cached context pointer * Most constructors have been modified to take a context as input * unfortunately I cannot remove the default and copy constructors, but developers should avoid these * Added accessors to parents of basic block and function * Removed the copy constructors for BasicBlock and Function and replaced them with Clone functions * Reworked BuildModule to return an IRContext owning the built module * Since all instructions require a context, the context now becomes the basic unit for IR * Added a constructor to context to create an owned module internally * Replaced uses of Instruction's copy constructor with Clone whereever I found them * Reworked the linker functionality to perform clones into a different context instead of moves * Updated many tests to be consistent with the above changes * Still need to add new tests to cover added functionality * Added comparison operators to Instruction Adding tests for Instruction, IRContext and IR loading Fixed some header comments for BuildModule Fixes to get tests passing again * Reordered two linker steps to avoid use/def problems * Fixed def/use manager uses in merge return pass * Added early return for GetAnnotations * Changed uses of Instruction::ToNop in passes to IRContext::KillInst Simplifying the uses for some contexts in passes	2017-11-23 16:40:02 -05:00
Lei Zhang	b02c9a5802	Allow derived access chain without uses in access chain conversion	2017-11-23 16:00:28 -05:00
Andrey Tuganov	ab892f7bd6	Add derivatives validation pass Checks operands of instructions in opcode range from OpDPdx to OpFwidthCoarse.	2017-11-23 14:17:10 -05:00
David Neto	c2999273d9	Move SetContextMessageConsumer into libspirv namespace Avoid polluting the global namespace.	2017-11-23 13:56:12 -05:00
Steven Perron	28c415500d	Create a local value numbering pass Creates a pass that removes redundant instructions within the same basic block. This will be implemented using a hash based value numbering algorithm. Added a number of functions that check for the Vulkan descriptor types. These are used to determine if we are variables are read-only or not. Implemented a function to check if loads and variables are read-only. Implemented kernel specific and shader specific versions. A big change is that the Combinator analysis in ADCE is factored out into the IRContext as an analysis. This was done because it is being reused in the value number table.	2017-11-23 11:45:09 -05:00
Andrey Tuganov	f407ae2b50	Validator pass for image instructions Includes validation rules for OpImageXXX and ImageOperand. Doesn't include OpTypeImage and OpImageSparseXXX. Disabled an invalid test.	2017-11-22 14:34:15 -05:00
GregF	e28edd458b	Optimize loads/stores on nested structs Also fix LocalAccessChainConvert test: nested structs now convert Add InsertExtractElim test for nested struct	2017-11-21 17:56:03 -05:00
Andrey Tuganov	b14291581f	Fix move semantics in iterator make_range	2017-11-21 17:36:15 -05:00
Andrey Tuganov	250a235a8d	Add new compression algorithm and models Add new "short descriptor" algorithm to MARK-V codec. Add three shader compression models: lite - fast, poor compression mid - balanced max - best compression	2017-11-21 17:32:58 -05:00
Alan Baker	a771713e42	Adding an unique id to Instruction generated by IRContext Each instruction is given an unique id that can be used for ordering purposes. The ids are generated via the IRContext. Major changes: * Instructions now contain a uint32_t for unique id and a cached context pointer * Most constructors have been modified to take a context as input * unfortunately I cannot remove the default and copy constructors, but developers should avoid these * Added accessors to parents of basic block and function * Removed the copy constructors for BasicBlock and Function and replaced them with Clone functions * Reworked BuildModule to return an IRContext owning the built module * Since all instructions require a context, the context now becomes the basic unit for IR * Added a constructor to context to create an owned module internally * Replaced uses of Instruction's copy constructor with Clone whereever I found them * Reworked the linker functionality to perform clones into a different context instead of moves * Updated many tests to be consistent with the above changes * Still need to add new tests to cover added functionality * Added comparison operators to Instruction * Added an internal option to LinkerOptions to verify merged ids are unique * Added a test for the linker to verify merged ids are unique * Updated MergeReturnPass to supply a context * Updated DecorationManager to supply a context for cloned decorations * Reworked several portions of the def use tests in anticipation of next set of changes	2017-11-20 17:49:10 -05:00
Steven Perron	3214c3b0ca	Add dead function elimination to -O and -Os This pass is very useful in reducing the size of the code, and reducing the amount of work done by other optimizations.	2017-11-20 09:41:03 -05:00
Steven Perron	eb4653a67f	Add the decoration manager to the IRContext. To make the decoration manger available everywhere, and to reduce the number of times it needs to be build, I add one the IRContext. As the same time, I move code that modifies decoration instruction into the IRContext from mempass and the decoration manager. This will make it easier to keep everything up to date. This should take care of issue #928.	2017-11-15 12:48:03 -05:00
Alan Baker	a92d69b43d	Initial implementation of merge return pass. Works with current DefUseManager infrastructure. Added merge return to the standard opts. Added validation to passes. Disabled pass for shader capabilty.	2017-11-15 10:27:04 -05:00
Diego Novillo	98281ed411	Add analysis to compute mappings between instructions and basic blocks. This analysis builds a map from instructions to the basic block that contains them. It is accessed via get_instr_block(). Once built, it is kept up-to-date by the IRContext, as long as instructions are removed via KillInst. I have not yet marked passes that preserve this analysis. I will do it in a separate change. Other changes: - Add documentation about analysis values requirement to be powers of 2. - Force a re-build of the def-use manager in tests. - Fix AllPreserveFirstOnlyAfterPassWithChange to use the DummyPassPreservesFirst pass. - Fix sentinel value for IRContext::Analysis enum. - Fix logic for checking if the instr<->block mapping is valid in KillInst.	2017-11-13 13:21:48 -05:00
Daniel Schürmann	a76d0977ac	Fix decorations of inlined functions. Fixes issue #728. Currently the inliner is not generating decorations for inlined code which corresponds to function code which has decorations. An example of decorations that are relevant: RelaxedPrecision, NoContraction. The solution is to replicate the decoration during inlining.	2017-11-13 12:49:25 -05:00
Steven Perron	efe12ff5a1	Have all MemPasses preserve the def-use manager. Originally the passes that extended from MemPass were those that are of the def-use manager. I am assuming they would be able to preserve it because of that. Added a check to verify consistency of the IRContext. The IRContext relies on the pass to tell it if something is invalidated. It is possible that the pass lied. To help identify those situations, we will check if the valid analyses are correct after each pass. This will be enabled by default for the debug build, and disabled in the production build. It can be disabled in the debug build by adding "-DSPIRV_CHECK_CONTEXT=OFF" to the cmake command.	2017-11-10 11:17:12 -05:00
Diego Novillo	d2938e4842	Re-format files in source, source/opt, source/util, source/val and tools. NFC. This just makes sure every file is formatted following the formatting definition in .clang-format. Re-formatted with: $ clang-format -i $(find source tools include -name '.cpp') $ clang-format -i $(find source tools include -name '.h')	2017-11-08 14:03:08 -05:00
Steven Perron	f32d11f74b	Add the IRContext (part 2): Add def-use manager This change will move the instances of the def-use manager to the IRContext. This allows it to persists across optimization, and does not have to be rebuilt multiple times. Added test to ensure that the IRContext is validating and invalidating the analyses correctly.	2017-11-08 13:35:34 -05:00
GregF	ac04b2faea	Opt: Fix HasLoads to not report decoration as load.	2017-11-07 17:39:58 -05:00
GregF	d86c7ce808	Opt: Remove CommonUniformElimination from -O and -Os (for now) It is causing crashes for some drivers. Will try to re-enable it once existing drivers are able to deal better with it.	2017-11-07 16:55:12 -05:00
Nuno Subtil	2dddb8193b	Validate storage class of target pointer for OpStore	2017-11-02 13:44:11 -04:00
Diego Novillo	9d6cc26226	Move class CFG from namespace opt to namespace ir. It makes more sense to have the CFG inside the ir name space, as it is descriptive of the representation.	2017-11-02 11:51:07 -04:00
Diego Novillo	fef669f30f	Add a new class opt::CFG to represent the CFG for the module. This class moves some of the CFG-related functionality into a new class opt::CFG. There is some other code related to the CFG in the inliner and in opt::LocalSingleStoreElimPass that should also be moved, but that require more changes than this pure restructuring. I will move those bits in a follow-up PR. Currently, the CFG is computed every time a pass is instantiated, but this should be later moved to the new IRContext class that @s-perron is working on. Other re-factoring: - Add BasicBlock::ContinueBlockIdIfAny. Re-factored out of MergeBlockIdIfAny - Rewrite IsLoopHeader in terms of GetLoopMergeInst. - Run clang-format on some files.	2017-11-02 10:37:03 -04:00
Steven Perron	476cae6f7d	Add the IRContext (part 1) This is the first part of adding the IRContext. This class is meant to hold the extra data that is build on top of the module that it owns. The first part will simply create the IRContext class and get it passed to the passes in place of the module. For now it does not have any functionality of its own, but it acts more as a wrapper for the module. The functions that I added to the IRContext are those that either traverse the headers or add to them. I did this because we may decide to have other ways of dealing with these sections (for example adding a type pool, or use the decoration manager). I also added the function that add to the header because the IRContext needs to know when an instruction is added to update other data structures appropriately. Note that there is still lots of work that needs to be done. There are still many places that change the module, and do not inform the context. That will be the next step.	2017-10-31 13:46:05 -04:00
Nuno Subtil	d861ceffd4	Add validation for OpBranchConditional	2017-10-31 12:05:20 -04:00
Andrey Tuganov	7299fb5b7c	Lowered initial capacity of move-to-front sequence Also fixed outdated comments.	2017-10-31 12:00:42 -04:00
GregF	94bec26afe	ADCE: Dead if elimination Mark structured conditional branches live only if one or more instructions in their associated construct is marked live. After closure, replace dead structured conditional branches with a branch to its merge and remove dead blocks. ADCE: Dead If Elim: Remove duplicate StructuredOrder code Also generalize ComputeStructuredOrder so that the caller can specify the root block for the order. Phi insertion uses pseudo_entry_block and adce and dead branch elim use the first block of the function. ADCE: Dead If Elim: Pull redundant code out of InsertPhiInstructions ADCE: Dead If Elim: Encapsulate CFG Cleanup Initialization ADCE: Dead If Elim: Remove redundant code from ADCE initialization ADCE: Dead If: Use CFGCleanup to eliminate newly dead blocks Moved bulk of CFG Cleanup code into MemPass.	2017-10-31 11:51:30 -04:00
Diego Novillo	632e2068f3	More re-factoring to simplify pass initialization. This implements two cleanups suggested by @s-perron (https://github.com/KhronosGroup/SPIRV-Tools/pull/921): - Move FindNamedOrDecoratedIds() into MemPass::InitializeProcessing(). - Remove FinalizeNextId(). Always call SetIdBound() from Pass::TakeNextId().	2017-10-30 09:06:17 -04:00
Steven Perron	716138ee14	Add option to relax validation of store types. There are a number of users of spriv-opt that are hitting errors because of stores with different types. In general, this is wrong, but, in these cases, the types are the exact same except for decorations. The options is "--relax-store-struct", and it can be used with the validator or the optimizer. We assume that if layout information is missing it is consistent. For example if one struct has a offset of one of its members, and the other one does not, we will still consider them as being layout compatible. The problem will be if both struct has and offset decoration for corresponding members, and the offset are different.	2017-10-28 18:48:21 -04:00
Andrey Tuganov	6724c27251	Compression: removed 'presumed index' feature The feature used to improve compression of const integers which were presumed to be indices. Now obsolete as descriptor-based compression does this in a more generalized way.	2017-10-28 18:38:13 -04:00
Jesus Carabano	f063f91d24	Use std::lower_bound for opcode lookup Use std::lower_bound for opcode-to-string Stable sort the generated instruction table.	2017-10-28 18:34:01 -04:00
Diego Novillo	1040a95b3f	Re-factor Phi insertion code out of LocalMultiStoreElimPass Including a re-factor of common behaviour into class Pass: The following functions are now in class Pass: - IsLoopHeader. - ComputeStructuredOrder - ComputeStructuredSuccessors (annoyingly, I could not re-factor all instances of this function, the copy in common_uniform_elim_pass.cpp is slightly different and fails with the common implementation). - GetPointeeTypeId - TakeNextId - FinalizeNextId - MergeBlockIdIfAny This is a NFC (non-functional change)	2017-10-27 15:28:08 -04:00
Steven Perron	94dc66b74d	Change the sections in the module to use the InstructionList class. This change will replace a number of the std::vector<std::unique_ptr<Instruction>> member of the module to InstructionList. This is for consistency and to make it easier to delete instructions that are no longer needed.	2017-10-25 15:52:06 -04:00
Lei Zhang	063dbea0f1	Turn all function static non-POD variables into global POD variables Function static non-POD data causes problems with DLL lifetime. This pull request turns all static info tables into strict POD tables. Specifically, the capabilities/extensions field of opcode/operand/extended-instruction table are turned into two fields, one for the count and the other a pointer to an array of capabilities/extensions. CapabilitySet/EnumSet are not used in the static table anymore, but they are still used for checking inclusion by constructing on the fly, which should be cheap for the majority cases. Also moves all these tables into the global namespace to avoid C++11 function static thread-safe initialization overhead.	2017-10-25 15:44:19 -04:00
Józef Kucia	90862fe4b1	Validate SpvOpVectorShuffle	2017-10-24 11:45:03 -04:00
Jesus Carabano	13e6598947	restrict opcodes targeting OpDecorationGroup	2017-10-24 11:39:08 -04:00
Daniel Schürmann	97990dc907	Fixed --eliminate-common-uniform so that it does not eliminate loads of volatile variables.	2017-10-24 11:17:33 -04:00
David Neto	98072b749f	Optimizer: Line and NoLine are not debug1 or debug2 Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/911	2017-10-24 10:54:23 -04:00
Andrey Tuganov	cfd95f3d5a	Refactored compression debugger Markv codec now receives two optional callbacks: LogConsumer for internal codec logging DebugConsumer for testing if encoding->decoding produces the original results.	2017-10-23 22:12:40 -04:00
Steven Perron	8d6e4dbc72	Run dead variable elimination when using -O and -Os We want to run the optimization when using -O and -Os, but it was not added at part of https://github.com/KhronosGroup/SPIRV-Tools/pull/905. This change will add that a well as some minor formatting changes requested in that same pull request.	2017-10-23 22:09:12 -04:00
GregF	e3a7209330	DeadBranchElim: Fix dead block elimination The previous algorithm would leave invalid code in the case of unreachable blocks pointing into a dead branch. It would leave the unreachable blocks branching to labels that no longer exist. The previous algorithm also left unreachable blocks in some cases (a loop following an orphaned merge block). This fix also addresses that. This code will soon be replaced with the coming CFG cleanup.	2017-10-23 22:04:17 -04:00
Steven Perron	5834719fc1	Add pass to remove dead variables at the module level. There does not seem to be any pass that remove global variables. I think we could use one. This pass will look specifically for global variables that are not referenced and are not exported. Any decoration associated with the variable will also be removed. However, this could cause types or constants to become unreferenced. They will not be removed. Another pass will have to be called to remove those.	2017-10-23 13:57:05 -04:00
David Neto	2436794736	Optimizer: OpModuleProcessed is in its own layout section This is a recent decision from the SPIR WG. The spec update has not yet been published. Khronos SPIR-V internal issue 199	2017-10-23 10:46:37 -04:00
David Neto	d819f513f6	Fix cfg_cleanup.cpp. My bad.	2017-10-20 16:51:20 -04:00
David Neto	e6f3416617	Remove coding redundancy in cfg_cleanup_pass.cpp	2017-10-20 16:05:38 -04:00
Andrey Tuganov	39e25fd8ab	Add validation pass for conversion instructions The pass checks correctness of operands of instruction in opcode range OpConvertFToU - OpBitset. Disabled invalid tests Disabled UConvert validation until Vulkan CTS can catch up. Add validate_conversion to Android.mk Also remove duplicate entry in CMakeLists.txt.	2017-10-20 13:51:24 -04:00
Steven Perron	bb7802b18c	Change BasicBlock to use InstructionList to hold instructions. This is the first step in replacing the std::vector of Instruction pointers to using and intrusive linked list. To this end, we created the InstructionList class. It inherites from the IntrusiveList class, but add the extra concept of ownership. An InstructionList owns the instruction that are in it. This is to be consistent with the current ownership rules where the vector owns the instruction that are in it. The other larger change is that the inst_ member of the BasicBlock class was changed to using the InstructionList class. Added test for the InsertBefore functions, and making sure that the InstructionList destructor will delete the elements that it contains. I've also add extra comments to explain ownership a little better.	2017-10-20 12:37:44 -04:00
Andrey Tuganov	ea9d1d02b7	Removed todos from validate_id.cpp Removed todos for validation of opcodes handles in other passes.	2017-10-19 19:51:31 -04:00
David Neto	863578a38d	DeadBranchElim: Slightly more defensive coding	2017-10-19 19:28:45 -04:00
David Neto	8ec62deb23	The reviewed cfg_cleanup optimize pass	2017-10-19 15:28:09 -04:00
Diego Novillo	c75704ec08	CFG cleanup pass - Remove unreachable blocks. - Adds a new pass CFGCleanupPass. This serves as an umbrella pass to remove unnecessary cruft from a CFG. - Currently, the only cleanup operation done is the removal of unreachable basic blocks. - Adds unit tests. - Adds a flag to spirvopt to execute the pass (--cfg-cleanup).	2017-10-19 15:16:29 -04:00
Diego Novillo	332a1f1422	Re-factor generic constant folding code out of FoldSpecConstantOpAndCompositePass There are no functional changes in this patch. The generic folding routines in FoldSpecConstantOpAndCompositePass are now inside opt/fold.{cpp,h}. This code will be used by the upcoming constant propagation pass. In time, we'll add more expression folding and simplification into these two files.	2017-10-17 19:41:37 -04:00
GregF	1a9061a2be	ADCE: Treat privates like locals in entry point with no calls This is needed for ongoing legalization of HLSL. It allows removal of accesses to textures/buffers that are not used.	2017-10-13 15:39:14 -04:00
GregF	1e7994c085	Opt: Move *NextId functionality into MemPass	2017-10-13 15:22:19 -04:00
Andrey Tuganov	8de8dd8c8c	Reenable validate type unique pass Vulkan CTS patch fixing the instances of non-unique type declaration in autogenerated code has recently been submitted.	2017-10-12 15:46:06 -04:00
Andrey Tuganov	2401fc0a72	Refactored MARK-V API - switched from C to C++ - moved MARK-V model creation from backend to frontend - The same MARK-V model object can be used to encode/decode multiple files - Added MARK-V model factory (currently only one option) - Added --validate option to spirv-markv (run validation while encoding/decoding)	2017-10-12 15:40:40 -04:00
Andrey Tuganov	b54997e6eb	Validator checks OpReturn called from void func Added check into validate_cfg which checks that OpReturn is not called from functions which are supposed to return a value.	2017-10-12 15:32:32 -04:00
Steven Perron	720beb161a	Generic intrusive linked list class. This commit is the initial implementation of the intrusive linked list class. It includes the implementation in the header files, and unit test. The iterators are circular: incrementing end() gives begin() and decrementing begin() gives end(). Also made it valid to decrement end(). Expliticly defines move constructor and move assignment - Visual Studio 2013 does not implicitly generate the move constructor or move assignments. So they need to be explicit, otherwise it will try to use the copy constructor, which we explicitly deleted. - Can't use "= default" either. Seems like VS2013 does not support explicitly using the default move constructors and move assignments, so I wrote them out.	2017-10-12 12:40:18 -04:00
GregF	63064bd9eb	DeadBranchElim: Add dead case elimination Expands dead branch elimination to eliminate dead switch cases. It also changes dbe to eliminate orphaned merge blocks and recursively eliminate any blocks thereby orphaned.	2017-10-12 11:44:05 -04:00
Diego Novillo	c90d7305e7	Add -O, -Os and -Oconfig flags. These flags are expanded to a series of spirv-opt flags with the following semantics: -O: expands to passes that attempt to improve the performance of the generated code. -Os: expands to passes that attempt to reduce the size of the generated code. -Oconfig=<file> expands to the sequence of passes determined by the flags specified in the user-provided file.	2017-10-10 12:14:09 -04:00
Pierre Moreau	86627f7b3f	Implement Linker (module combiner) Add extra iterators for ir::Module's sections Add extra getters to ir::Function Add a const version of BasicBlock::GetLabelInst() Use the max of all inputs' version as version Split debug in debug1 and debug2 - Debug1 instructions have to be placed before debug2 instructions. Error out if different addressing or memory models are found Exit early if no binaries were given Error out if entry points are redeclared Implement copy ctors for Function and BasicBlock - Visual Studio ends up generating copy constructors that call deleted functions while compiling the linker code, while GCC and clang do not. So explicitly write those functions to avoid Visual Studio messing up. Move removing duplicate capabilities to its own pass Add functions running on all IDs present in an instruction Remove duplicate SpvOpExtInstImport Give default options value for link functions Remove linkage capability if not making a library Check types before allowing to link Detect if two types/variables/functions have different decorations Remove decorations of imported variables/functions and their types Add a DecorationManager Add a method for removing all decorations of id Add methods for removing operands from instructions Error out if one of the modules has a non-zero schema Update README.md to talk about the linker Do not freak out if an imported built-in variable has no export	2017-10-06 18:33:53 -04:00
Andrew Woloszyn	d7f199b5d4	Hack around bug in gcc-4.8.1 templates. This keeps the previous behavior for other compilers that will throw warnings on a negative shift operation, but works around the internal compiler error in GCC.	2017-10-06 10:26:17 -04:00
GregF	da04f5640e	AggressiveDCE: Fix to not treat parameter memory refs as local This fixes a bug that incorrectly deletes stores to parameters, which can be used to return values from functions.	2017-10-05 10:59:45 -04:00
Pierre Moreau	c87e9671ab	Compact-ids pass should update the header ID bound	2017-10-03 11:24:28 -04:00
David Neto	169266e9b8	DiagnosticStream move ctor moves output duties to new object - Take over contents of the expiring message stream - Prevent the expiring object from emitting anything during destruction	2017-10-03 11:23:54 -04:00
David Neto	17a843c6b0	Cache end iterators for speed Helps scaling of DefUseManager on modules with many thousands of instructions.	2017-09-29 16:13:55 -04:00
jcaraban	6526c42603	No use to check OpBitCount result width	2017-09-29 09:14:02 +03:00
David Neto	77feb8dd03	Compact-ids pass should update instruction's result_id member Also update the result type field. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/827	2017-09-27 08:31:05 -04:00
Andrey Tuganov	64d5e5214f	Add bitwise operations validator pass The pass checks correctness of operand types of all bitwise instructions (opcode range from SpvOpShiftRightLogical to SpvOpBitCount).	2017-09-26 14:22:37 -04:00
Andrey Tuganov	dcf42433a6	Add remaining opcodes to arithmetics validation Add validation rules for: - OpIAddCarry - OpISubBorrow - OpUMulExtended - OpSMulExtended Includes some refactoring of old code.	2017-09-26 11:47:34 -04:00
Steven Perron	e43c91046b	Create the dead function elimination pass Creates a pass called eliminate dead functions that looks for functions that could never be called, and deletes them from the module. To support this change a new function was added to the Pass class to traverse the call trees from diffent starting points. Includes a test to ensure that annotations are removed when deleting a dead function. They were not, so fixed that up as well. Did some cleanup of the assembly for the test in pass_test.cpp. Trying to make them smaller and easier to read.	2017-09-26 11:18:06 -04:00
Andrey Tuganov	976e4218d5	Detach MARK-V from the validator MARK-V codec was previously dependent on the validation state. Now it doesn't need the validator to function, but can still optionally create it and validate every instruction once it's decoded.	2017-09-26 11:10:23 -04:00
Lei Zhang	16981f87fe	Avoid using global static variables Previously we have several grammar tables defined as global static variables and these grammar table entries contains non-POD struct fields (CapabilitySet/ExtensionSet). The initialization of these non-POD struct fields may require calling operator new. If used as a library and the caller defines its own operator new, things can screw up. This pull request changes all global static variables into function static variables, which is lazy evaluated in a thread safe way as guaranteed by C++11.	2017-09-26 10:59:15 -04:00
Andrey Tuganov	c25b5bea35	Add SPIRV_SPIRV_COMPRESSION option to cmake The option is off by default. cmake -DSPIRV_BUILD_COMPRESSION=ON .. enables the compression lib, executable, and test build. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/834	2017-09-25 14:37:08 -04:00
Andrey Tuganov	3f5e1a91ae	Validator: fix logicals pass for OpSelect pointers OpSelect works with pointers also when capability VariablePointersStorageBuffer is declared (before worked only with capability VariablePointers).	2017-09-21 16:12:14 -04:00
David Neto	33b879c105	elim-multi-store: only patch loop header phis that we created There can already be OpPhi instructions in a loop header that are unrelated to the optimization. We should not be patching those. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/826	2017-09-21 10:01:30 -04:00
Andrey Tuganov	cf85ad1429	Add validate logicals pass to the validator New pass checks operands of all instructions listed under 3.32.15. Relational and Logical Instructions	2017-09-20 10:37:12 -04:00
Andrey Tuganov	4e3cc2f57f	Refactored validate_aritmetics.cpp Improved error messages and readability.	2017-09-20 10:30:54 -04:00
Andrey Tuganov	9b14dd0cb4	Updated markv_autogen - now includes a table of all descriptors with coding scheme (improves performance by 5% by allowing to avoid creation of move-to-front sequences which will never be used) - increased the size of markv_autogen.inc, clang doesn't seem to have the long compilation time problem now (probably was inadvertently fixed by using Huffman codec serialization)	2017-09-20 10:23:22 -04:00
Greg Fischer	8be28f7524	ElimLocalMultiStore: Reset structured successors for each function	2017-09-19 13:47:28 -06:00
Steven Perron	e4c7d8e748	Add strength reduction; for now replace multiply by power of 2 Create a new optimization pass, strength reduction, which will replace integer multiplication by a constant power of 2 with an equivalent bit shift. More changes could be added later. - Does not duplicate constants - Adds vector \|Concat\| utility function to a common test header.	2017-09-18 17:01:36 -04:00
GregF	7be791aaaa	ExtractInsert: Handle rudimentary CompositeConstruct and ConstantComposite This optimizes a single index extract whose composite value terminates with a CompositeConstruct (or ConstantComposite) by evaluating to the correct component. This was needed for opaque legalization. This highlights the need/opportunity to improve this optimization to deal with more complex composite expressions including currently handled ops plus Null ops and special vector composition. A TODO has been added.	2017-09-15 20:33:53 -04:00
Andrey Tuganov	c6dfc11880	Add new checks to validate arithmetics pass New operations: - OpDot - OpVectorTimesScalar - OpMatrixTimesScalar - OpVectorTimesMatrix - OpMatrixTimesVector - OpMatrixTimesMatrix - OpOuterProduct	2017-09-08 11:08:41 -04:00
David Neto	c843ef8ab5	validator: OpModuleProcessed allowed in layout section 7c Recent spec fix from SPIR Working group: Allow OpModuleProcessed after debug names, but before any annotation instructions.	2017-09-07 17:45:51 -04:00
Andrey Tuganov	b36acbec0e	Update MARK-V to version 1.01 Includes: - Multi-sequence move-to-front - Coding by id descriptor - Statistical coding of non-id words - Joint coding of opcode and num_operands Removed explicit form Huffman codec constructor - The standard use case for it is to be constructed from initializer list. Using serialization for Huffman codecs	2017-09-06 16:03:16 -04:00
David Neto	25ddfec08e	Inliner: Fix LoopMerge when inline into loop header of multi block loop This adapts the fix for the single-block loop. Split the loop like before. But when we move the OpLoopMerge back to the loop header, redirect the continue target only when the original loop was a single block loop. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/800	2017-09-05 19:46:24 -04:00
Andrey Tuganov	82df4bbd68	Add validation pass for arithmetic operations The pass checks if arithmetic operations (such as OpFMul) receive correct operands.	2017-09-05 12:21:53 -04:00
Andrey Tuganov	32cf85dd5a	Fix mingw build (source/print.cpp) source/print.cpp doesn't compile due to integer conversion. Tested by @dneto0 on a Windows machine.	2017-09-01 16:07:18 -04:00
David Neto	860c4197b0	Inliner: Remap callee entry block id to single-trip loop header Otherwise cloned phis can be invalid. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/790	2017-09-01 15:56:14 -04:00
David Neto	efff5fabfa	Inline: Fix single-block loop caller cases If the caller block is a single-block loop and inlining will replace the caller block by several blocks, then: - The original OpLoopMerge instruction will end up in the last such block. That's the wrong place to put it. - Move it back to the end of the first block. - Update its Continue Target ID to point to the last block We also have to take care of cases where the inlined code begins with a structured header block. In this case we need to ensure the restored OpLoopMerge does not appear in the same block as the merge instruction from the callee's first block. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/787	2017-09-01 15:47:17 -04:00
David Neto	cff2cd3343	BasicBlock: add ctail, GetMergeInst, GetLoopMergeInst	2017-09-01 11:01:36 -04:00
Andrey Tuganov	725284c2ef	Extension allows multiple same OpTypePointer types SPV_KHR_variable_pointers allows OpTypePointer to declare multiple pointer identical types. https://github.com/KhronosGroup/SPIRV-Tools/issues/781	2017-09-01 10:14:15 -04:00
GregF	7c3de19ce7	DeadBranchElim: Fix dead block detection to ignore backedges - DeadBranchElim: Make sure to mark orphan'd merge blocks and continue targets as live. - Add test with loop in dead branch - Add test that orphan'd merge block is handled. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/776	2017-08-30 13:37:46 -04:00
GregF	a699d1ade7	Inline: Fix remapping of non-label forward references in callee phi	2017-08-29 18:35:05 -06:00
Andrey Tuganov	d41a52415a	Fix encode zero bits on word boundary bug Bit stream writer was manifesting incorrect behaviour when the following two conditions were met: - writer was on 64-bit word boundary - WriteBits was invoked with num_bits=0 (can happen when a Huffman codec has only one value) The bug was causing very rare sporadic corruption which was detected by tests after a random experimental change in MARK-V model.	2017-08-28 13:36:39 -04:00
David Neto	63e1e348b0	Show result id for CompositeInsert validation failure	2017-08-25 15:13:31 -04:00
David Neto	0167758727	Windows: Increase intensity of blue text	2017-08-24 10:40:17 -04:00
Lukas Hermanns	4fe8e389a7	Fix: background color was erroneously reset on Win32 platform. Fix: background color was erroneously reset on Win32 platform.	2017-08-24 10:40:17 -04:00
GregF	429ca05b3f	Opt: Create InlineOpaquePass Only inline calls to functions with opaque params or return TODO: Handle parameter type or return type where the opqaue type is buried within an array.	2017-08-18 18:04:30 -04:00
GregF	c8c86a0d36	Opt: Have "size" passes process full entry point call tree. Includes code to deal correctly with OpFunctionParameter. This is needed by opaque propagation which may not exhaustively inline entry point functions. Adds ProcessEntryPointCallTree: a method to do work on the functions in the entry point call trees in a deterministic order.	2017-08-18 10:16:01 -04:00
Andrey Tuganov	17d941af4f	Huffman codec can serialize to text Refactored the Huffman codec implementation and added ability to serialize to C++-like text format. This would reduce the time-complexity if loading hard-coded codecs.	2017-08-15 23:57:21 -04:00
Andrey Tuganov	78cf86150e	Add id descriptor feature to SPIR-V Id descriptors are computed as a recursive hash of all instructions used to define an id. Descriptors are invarint of actual id values and the similar code in different files would produce the same descriptors. Multiple ids can have the same descriptor. For example %1 = OpConstant %u32 1 %2 = OpConstant %u32 1 would produce two ids with the same descriptor. But %3 = OpConstant %s32 1 %4 = OpConstant %u32 2 would have descriptors different from %1 and %2. Descriptors will be used as handles of move-to-front sequences in SPIR-V compression.	2017-08-10 18:44:52 -04:00
GregF	b0310a4156	ADCE: Add support for function calls ADCE will now generate correct code in the presence of function calls. This is needed for opaque type optimization needed by glslang. Currently all function calls are marked as live. TODO: mark calls live only if they write a non-local.	2017-08-10 17:30:05 -04:00
David Neto	2a1014be9c	Inliner: callee can have early return that isn't multi-return Avoid generating an invalid OpLabel. Create the continue target for the single-trip loop only if you actually created the header for the single-trip loop. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/755	2017-08-10 11:43:44 -04:00
GregF	f0fe601dc8	AccessChainConvert: Add HasOnlySupportedRefs() This avoids conversion on variables which will not ultimately be optimized. Also removed an obsolete restriction from FindTargetVars(). Also added decorates to supported refs (eg. RelaxedPrecision). Also fixed name to IsNonTypeDecorate().	2017-08-04 18:11:44 -04:00
GregF	e28bd39997	Inline: Split out InlineExhaustivePass from InlinePass	2017-08-04 17:56:46 -04:00
GregF	d9a450121e	Mem2Reg: Allow Image and Sampler types as base target types.	2017-08-04 17:52:32 -04:00
GregF	f4b29f3bf7	Add CommonUniformElim pass - UniformElim: Only process reachable blocks - UniformElim: Don't reuse loads of samplers and images across blocks. Added a second phase which only reuses loads within a block for samplers and images. - UniformElim: Upgrade CopyObject skipping in GetPtr - UniformElim: Add extensions whitelist Currently disallowing SPV_KHR_variable_pointers because it doesn't handle extended pointer forms. - UniformElim: Do not process shaders with GroupDecorate - UniformElim: Bail on shaders with non-32-bit ints. - UniformElim: Document support for only single index and add TODO.	2017-08-03 11:34:58 -04:00
GregF	c1b46eedbd	Add MemPass, move all shared functions to it.	2017-08-02 14:24:02 -04:00
Andrey Tuganov	30bee67439	Add multi-sequence move-to-front implementation Add MultiMoveToFront class which supports multiple move-to-front sequences and allows to promote value in all sequences at once. Added caching for last accessed sequence handle and last accessed value in each sequence.	2017-08-02 14:07:24 -04:00
Andrey Tuganov	55b73a0365	Added C++ code generation to spirv-stats The tool can now generate C++ code returning some of the historgrams and Huffman codecs generated from those historgrams.	2017-08-01 15:41:42 -04:00
GregF	7954740d54	Opt: Delete names and decorations of dead instructions	2017-07-26 18:36:41 -04:00
Lei Zhang	9f6efc76c8	Opt: HasOnlySupportedRefs should consider OpCopyObject This fixes test failure after merging the previous pull request.	2017-07-25 23:22:09 -04:00
Lei Zhang	4a539d77ef	Revert "Revert "Opt: LocalBlockElim: Add HasOnlySupportedRefs"" This reverts commit `df96e243c6`.	2017-07-25 23:22:09 -04:00
GregF	1182415581	Add extension whitelists to size-reduction passes. Currently only SPV_KHR_variable_pointers is disallowed in passes which do pointer analysis. Positive and negative tests of the general extensions mechanism were added to aggressive_dce but cover all passes.	2017-07-25 19:14:02 -04:00
Lei Zhang	df96e243c6	Revert "Opt: LocalBlockElim: Add HasOnlySupportedRefs" This reverts commit `2d0f7fbc11`.	2017-07-22 10:48:56 -04:00
greg-lunarg	2d0f7fbc11	Opt: LocalBlockElim: Add HasOnlySupportedRefs Verifies that targeted variables have only access chain and direct loads and stores as references.	2017-07-22 10:32:19 -04:00
GregF	adb237f3bd	Fix handling of CopyObject in GetPtr and its call sites	2017-07-21 18:08:01 -04:00
Lenny Komow	e9e4393b1c	Fix Visual Studio size_t cast compiler warning Visual Studio was complaining about possible loss of data on 64-bit builds, due to an implicit cast from size_t to int. This changes the data to use an int with no cast.	2017-07-13 13:02:43 -06:00
Greg Fischer	fe24e0316f	LocalMultiStore: Always put varId for backedge on loop phi function. And always patch the backedge operand when patching phi functions. This approach is more correct and cleaner. The previous code was generating incorrect phis when the backedge block had no predecessors.	2017-07-12 16:42:07 -04:00
GregF	e2544ddc90	DeadBranchElim: Improve algorithm to only remove blocks with no predecessors Must be careful not to remove blocks pointed at by unreachable blocks	2017-07-12 15:58:42 -04:00
David Neto	06d4fd52c2	Minor code review feedback on AggressiveDCE	2017-07-10 11:45:59 -04:00
GregF	9de4e69856	Add AggressiveDCEPass Create aggressive dead code elimination pass This pass eliminates unused code from functions. In addition, it detects and eliminates code which may have spurious uses but which do not contribute to the output of the function. The most common cause of such code sequences is summations in loops whose result is no longer used due to dead code elimination. This optimization has additional compile time cost over standard dead code elimination. This pass only processes entry point functions. It also only processes shaders with logical addressing. It currently will not process functions with function calls. It currently only supports the GLSL.std.450 extended instruction set. It currently does not support any extensions. This pass will be made more effective by first running passes that remove dead control flow and inlines function calls. This pass can be especially useful after running Local Access Chain Conversion, which tends to cause cycles of dead code to be left after Store/Load elimination passes are completed. These cycles cannot be eliminated with standard dead code elimination. Additionally: This transform uses a whitelist of instructions that it knows do have side effects, (a.k.a. combinators). It assumes other instructions have side effects: it will not remove them, and assumes they have side effects via their ID operands.	2017-07-10 11:30:25 -04:00
GregF	cc8bad3a5b	Add LocalMultiStoreElim pass A SSA local variable load/store elimination pass. For every entry point function, eliminate all loads and stores of function scope variables only referenced with non-access-chain loads and stores. Eliminate the variables as well. The presence of access chain references and function calls can inhibit the above optimization. Only shader modules with logical addressing are currently processed. Currently modules with any extensions enabled are not processed. This is left for future work. This pass is most effective if preceeded by Inlining and LocalAccessChainConvert. LocalSingleStoreElim and LocalSingleBlockElim will reduce the work that this pass has to do.	2017-07-07 17:54:21 -04:00
GregF	52e247f221	DeadBranchElim: Add DeadBranchElimPass	2017-07-07 15:16:25 -04:00
David Neto	35a0695844	Include memory and semantics IDs when iterating over inbound IDs Fixes Instruction::ForEachInId so it covers SPV_OPERAND_TYPE_MEMORY_SEMANTICS_ID and SPV_OPERAND_TYPE_SCOPE_ID. Future proof a bit by using the common spvIsIdType routine. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/697	2017-07-05 10:36:57 -04:00
Andrey Tuganov	abc6f5a672	MARK-V decoder supports extended instructions	2017-07-04 16:31:19 -04:00
d3x0r	fd70a1d7a0	Define variable to skip installation If this is used as a static library in another project, this does not need to be installed, and otherwise will just clutter the application's install. To use, define SKIP_SPIRV_TOOLS_INSTALL which internally defines ENABLE_SPIRV_TOOLS_INSTALL to control installation. Also include GNUInstallDirs to get standard output 'lib' directory which is sometimes 'lib64' and not 'lib'	2017-07-04 12:24:44 -04:00
Chris Forbes	78338d5ba9	Convert pattern stack from deque to vector, and share it Also move various vector::reserve calls to State ctor Negligible perf benefit, but more tidy.	2017-07-04 12:02:26 -04:00
Andrey Tuganov	e842c17eb5	Added fixed width encoding to bit_stream Fixed width encoding is intended to be used for small unsigned integers when the upper bound is known both to the encoder and the decoder (for example move-to-front rank).	2017-07-04 11:57:13 -04:00
Andrey Tuganov	73e8dac5b9	Added compression tool tools/spirv-markv. Work in progress. Command line application is located at tools/spirv-markv API at include/spirv-tools/markv.h At the moment only very basic compression is implemented, mostly varint. Scope of supported SPIR-V opcodes is also limited. Using a simple move-to-front implementation instead of encoding mapped ids. Work in progress: - Does not cover all of SPIR-V - Does not promise compatibility of compression/decompression across different versions of the code.	2017-06-30 12:22:48 -04:00
Andrey Tuganov	8d3882a408	Added log(n) move-to-front implementation The implementation is based on AVL and order statistic tree. It accepts all kinds of values and the implementation doesn't expect the behaviour to be consistent with id coding. Intended by SPIR-V compression algorithms.	2017-06-29 16:16:18 -04:00
Andrey Tuganov	40a2829611	Added Huffman codec to utils Attached ids to Huffman nodes for deterministic internal node comparison.	2017-06-29 14:51:01 -04:00
Chris Forbes	d431b69c28	Don't do hash lookup twice in FindDef	2017-06-28 11:13:26 -04:00
Chris Forbes	c14966b882	Move spv_instruction_t's into vector No need to incur another copy here. These guys have embedded vectors we'd rather not copy.	2017-06-28 11:13:26 -04:00
Chris Forbes	1cd47d7af2	Reserve expected length of instructions vector	2017-06-28 11:13:26 -04:00
Chris Forbes	fcd991f081	Move some temp vectors into parser state We don't need to churn the allocations for these every instruction.	2017-06-28 11:13:26 -04:00
GregF	ad1d0351a0	BlockMerge: Add BlockMergePass Also, add BasicBlock::tail()	2017-06-27 11:31:33 -04:00
Rex Xu	5fbbadca4e	Add support for SPV AMD extensions	2017-06-21 15:08:07 -04:00
GregF	6136bf9e0b	mem2reg: Add InsertExtractElimPass	2017-06-21 08:13:15 -04:00

... 2 3 4 5 6 ...

860 Commits