SPIRV-Tools

mirror of https://github.com/KhronosGroup/SPIRV-Tools synced 2024-10-19 03:20:14 +00:00

Author	SHA1	Message	Date
Steven Perron	d52c39c37d	Do not crash when folding 16-bit OpFDiv (#5338 ) The code currently tries to get the value of the floating point constant to see if it is -0.0. However, we are not able to get the value for 16-bit floating point value, and we hit an assert. To avoid this, we add an early check for the width to make sure it is either 32 or 64. Fixes https://github.com/microsoft/DirectXShaderCompiler/issues/5413.	2023-07-21 10:17:12 -04:00
Laura Hermanns	951980e5ac	Enable vector constant folding (#4913 ) (#5272 ) - Add test case 6 to UIntVectorInstructionFoldingTest - Add test case 3 to IntVectorInstructionFoldingTest	2023-06-19 15:01:51 -04:00
Steven Perron	6b9fc79330	Fold negation of integer vectors (#5269 )	2023-06-16 10:37:21 -04:00
Steven Perron	5ed21eb1e2	Add folding rule for OpTranspose (#5241 )	2023-06-01 12:09:08 -04:00
Steven Perron	af27ece750	Check if const is zero before getting components. (#5217 ) * Check if const is zero before getting components. Two folding rules try to cast a constant to a MatrixConstant before checking if it is a Null constant. This leads to the null pointer being dereferneced. The solution is to move the check for zero earlier. Fixes https://github.com/microsoft/DirectXShaderCompiler/issues/5063	2023-05-25 09:07:22 -04:00
Steve Urquhart	44c9da6fee	Remove const zero image operands (#5232 )	2023-05-24 10:30:10 -04:00
Ben Clayton	bec566a32b	opt: Fix null deref in OpMatrixTimesVector and OpVectorTimesMatrix (#5199 ) When some (not all) of the matrix columns are OpConstantNull	2023-04-18 14:58:12 -04:00
Spencer Fricke	fa69b09cff	spirv-opt: Remove unused includes and code (#5177 )	2023-03-28 12:40:30 -04:00
Laura Hermanns	bd83b772c3	Fix operand index out of bounds when folding OpCompositeExtract. (#5107 ) GetExtractOperandsForElementOfCompositeConstruct() states "Returns the empty vector if \|result_index\| is out-of-bounds", but violates that contract for non-vector result types.	2023-03-03 15:52:49 +00:00
Laura Hermanns	cac9a5a3ee	Fix null pointer in FoldInsertWithConstants. (#5093 ) * Fix null pointer in FoldInsertWithConstants. Struct types are not supported in constant folding yet. * Added 'Test case 16' to fold_test. Tests OpCompositeInsert not to be folded on a struct type.	2023-02-03 15:03:15 +00:00
alan-baker	d35a78db57	Switch SPIRV-Tools to use spirv.hpp11 internally (#4981 ) Fixes #4960 * Switches to using enum classes with an underlying type to avoid undefined behaviour	2022-11-04 17:27:10 -04:00
gmitrano-unity	1cecf91701	Support Narrow Types in BitCast Folding Rule (#4941 ) * Support Narrow Types in BitCast Folding Rule This change adds support for narrow types in the BitCastScalarOrVector folding rule. According to Section 2.2.1 of the SPIR-V spec, types that are narrower than 32 bits are automatically either sign extended, or zero extended depending on the type. With that guaranteed, we should be able to use the first 32-bit word of any narrow type for the folding logic without performing any special conversions. In order to reduce code duplication, this change moves the GetU32BitValue and GetU64BitValue functions from IntConstant to ScalarConstant. Without this move, we would have needed an identical version of GetU32BitValue on FloatConstant. * Add Tests for 16-bit BitCast Folding This change adds several new test cases to the IntegerInstructionFoldingTest which trigger the 16-bit BitCast logic. The logic for half types was also added to the integer case since we can't easily validate half float types in C++ code. It's easier to validate them as unsigned integers instead. Pllus this also allows us to verify the SPIR-V constant sign extension logic too. * Add 8-Bit Folding Test Cases This change adds a couple more test cases to the integer instruction folding test suite in order to ensure that the BitCast logic also works correctly with the Int8 shader capability.	2022-10-06 10:35:18 -04:00
Steven Perron	0a43a84e02	Fix shuffle feeding shuffle with undef literal (#4883 ) When folding a vector shuffle with an undef literal, it is possible that the literal is adjusted so that it will then be interpreted as an index into the input operands. This is fixed by special casing that case, and not adjusting those operands. Fixes #4859	2022-08-10 09:04:35 -04:00
manas-kulkarni	fbcb6cf4c8	Ability to fold Constant Vector times Matrix and Matrix times vector instructions (#4818 )	2022-06-16 13:54:12 -04:00
Nicolas Capens	130a05d2e3	Fold multiply and subtraction into FMA with negation (#4808 ) This change adds a folding rule which transforms x * y - a and a - x * y into FMA(x, y, -a) and FMA(-x, y, a), respectively. While the SPIR-V instruction count remains the same, target instruction sets typically feature FMA instruction variants that can negate an operand. Also this transformation may unlock further optimizations which eliminate the negation. (Google bug: b/226145988)	2022-05-31 12:03:56 -04:00
Steven Perron	088cb1a5c8	Add more folding for composite instructions (#4802 ) * Add move folding for composite instructions Fold chains of insert into construct If a chain of OpCompositeInsert instruction write to every element of a composite object, then we can replace it with an OpCompositeConstruct. Fold a construct fed by extracts to a single extract We already fold an OpCompositeConstruct when it is simlpy reconstructing an object that was decomposed by a series of OpCompositeExtract instructions. However, we do not do that if that object is an element of a larger object. I have updated the rule, so that if the original object is a an element of a larger object, then the OpCompositeConstruct is replaced with a single OpCompositeExtract from the larger object. Fixes #4371.	2022-05-26 10:29:02 -04:00
Steven Perron	1295dca8e2	Reapply "Add folding rule to generate Fma instructions (#4783 )" (#4789 ) This reverts commit `671f6e633f`. PR #4783 was reverted because it caused OpenCL CTS failures for clvk. The was in clspv, which was not adding the no contract decoration when it was required. This has been fixed in https://github.com/google/clspv/pull/845. We can now reapply #4783.	2022-05-03 10:20:23 -04:00
Daniele Vettorel	671f6e633f	Revert "Add folding rule to generate Fma instructions (#4783 )" (#4785 ) This reverts commit `2b2b0282af`.	2022-04-20 10:55:20 -04:00
Steven Perron	2b2b0282af	Add folding rule to generate Fma instructions (#4783 ) Adding Fma instruction can speed up the code. This was requested by swiftshader, so they do not have to do this analysis themselves. It can also help reduce the code size, and the work the ICD compilers have to do.	2022-04-19 11:25:07 -04:00
Steven Perron	48a36c72e4	Better handling of 0xFFFFFFFF when folding vector shuffle (#4743 ) When folding a vector shuffle feeding a vector shuffle, we do not propagate an 0xFFFFFFFF, which has a special meaning, correctly. We adjust the value making it lose it meaning as an undefined value. Fixes #4581	2022-03-07 19:35:57 +00:00
luzpaz	65ecfd1093	Fix various source comment (doxygen) typos (#4680 ) Found via `codespell -q 3 -L fo,lod,parm	2022-01-26 15:13:08 -05:00
Steven Perron	8c155b364c	Manually fold floating point division by zero (#4637 ) See https://github.com/KhronosGroup/SPIRV-Tools/issues/4636 for details. Fixes #4636.	2021-11-24 14:13:58 -05:00
Steven Perron	3291b6951e	Do not fold snegate feeding sdiv. (#4600 ) When the variable value is INT_MIN, we cannot fold the negate into the divide, so we have to turn off that folding rule. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4487.	2021-10-28 10:02:57 -04:00
Steven Perron	59f51bb4f8	Fix extract with out-of-bounds index (#4529 ) * Fix extract with out-of-bounds index When folding a OpCompositeExtract that is fed by an OpCompositeConstruct, we handle and out of bounds index, but only in the case where the result of the OpCompostiteConstruct is a struct. This change refactors that folding rule and then improves it to handle an out-of-bounds access when the result of the OpCompositeConstruct is a vector.	2021-09-20 13:02:47 -04:00
Alastair Donaldson	36ff135341	spirv-opt: Avoid integer overflow during constant folding (#4511 ) In SPIR-V, integers use 2s complement representation, so that signed integer overflow and underflow is well defined. However, the constant folder was causing overflow / underflow at the C++ level. This change avoids such overflows by performing constant folding for IAdd, ISub and IMul in the context of unsigned values, which works because signedness is irrelevant according to the SPIR-V semantics for these instructions. Fixes #4510.	2021-09-14 21:09:05 +00:00
Nicolas Capens	869a550d26	Don't fold unsigned divides of an constant and a negation (#4457 ) Negating an unsigned constant results in its two's complement which is still interpreted as unsigned. For example -2u becomes 4294967294u. Fixes https://github.com/KhronosGroup/SPIRV-Tools/issues/4456	2021-08-16 09:56:05 -04:00
Jaebaek Seo	07ec4f83c5	Support folding OpBitcast with numeric constants (#4247 ) Add constant folding rule for OpBitcast with numeric scalar or vector constants.	2021-04-27 14:24:46 -04:00
Vasyl Teliman	948577c5df	Fix the bug (#3680 )	2020-08-13 09:09:57 -04:00
Diego Novillo	4dbe18b0c8	Reject folding comparisons with unfoldable types. (#3370 ) Reject folding comparisons with unfoldable types. Fixes #3343 When CCP is evaluating an instruction, it was trying to fold a comparison with 64 bit integers. This was causing a fold failure later since the folder still cannot deal with 64 bit integers.	2020-05-21 12:58:08 -04:00
Arseny Kapoulkine	0265a9d4de	Implement constant folding for many transcendentals (#3166 ) * Implement constant folding for many transcendentals This change adds support for folding of sin/cos/tan/asin/acos/atan, exp/log/exp2/log2, sqrt, atan2 and pow. The mechanism allows to use any C function to implement folding in the future; for now I limited the actual additions to the most commonly used intrinsics in the shaders. Unary folder had to be tweaked to work with extended instructions - for extended instructions, constants.size() == 2 and constants[0] == nullptr. This adjustment is similar to the one binary folder already performs. Fixes #1390. * Fix Android build On old versions of Android NDK, we don't get std::exp2/std::log2 because of partial C++11 support. We do get ::exp2, but not ::log2 so we need to emulate that.	2020-02-03 09:20:47 -05:00
Steven Perron	00ca4e5bdf	Don't crash when folding construct of empty struct (#3092 ) * Don't crash when folding construct of empty struct An OpCompositeConstruct of an empty struct will be folded to a constant under normal circumstances. However, if the id limit has been reached and the constant cannot be generated, then other folding rules will be tried. These rules do not handle the case of an empty struct. We add allow it to be handled. Fixes http://crbug/1030194 * Changes based on the review.	2019-12-10 14:58:30 -05:00
Steven Perron	3ed4586044	Folding: perform add and sub on mismatched integer types (#3084 ) Fixes #3040	2019-12-02 17:51:20 -05:00
Ehsan	12e54dae16	Update Offset to ConstOffset bitmask if operand is constant. (#3024 ) Update Offset to ConstOffset bitmask if operand is constant. Fixes #3005	2019-11-11 22:35:14 -05:00
greg-lunarg	5ea7099374	Add two new simplifications. (#2984 ) Implements the following simplifications: (a - b) + b => a (a * b) + (a * c) => a * (b + c) Also adds logic to simplification to handle rules that create new operations that might need simplification, such as the second rule above. Only perform the second simplification if the multiplies have the add as their only use. Otherwise this is a deoptimization of size and performance.	2019-10-28 08:19:38 -07:00
Jakub Kuderski	e3da3143b2	Disallow use of OpCompositeExtract/OpCompositeInsert with no indices (#2980 )	2019-10-17 13:53:34 -04:00
Steven Perron	55ea57a785	Handle extract with no indexes (#2910 ) * Handle extract with no indexes It is possible that OpCompositeExtract instructions will not have any indexes. This is not handled well by scalar replacement and instruction folding. Fixes https://crbug.com/1006435 * Fix typo.	2019-09-24 16:19:31 -04:00
Steven Perron	b218ad1994	Fold Min, Max, and Clamp instructions. (#2836 ) Fixes #2830.	2019-09-05 13:30:03 -04:00
Steven Perron	b54d950298	Fold Fmix should accept vector operands. (#2826 ) Fixes #2819	2019-09-03 09:17:18 -04:00
Steven Perron	15fc19d091	Refactor instruction folders (#2815 ) * Refactor instruction folders We want to refactor the instruction folder to allow different sets of rules to be added to the instruction folder. We might want different sets of rules in different circumstances. We also need a way to add rules for extended instructions. Changes are made to the FoldingRules class and ConstFoldingRules class to enable that. We added tests to check that we can fold extended instructions using the new framework. At the same time, I noticed that there were two tests that did not tests what they were suppose to. They could not be easily salvaged. #2813 was opened to track adding the new tests.	2019-08-26 18:54:11 -04:00
Diego Novillo	49797609b7	Protect against out-of-bounds references when folding OpCompositeExtract (#2774 ) This fixes #2608. The original test case had an out-of-bounds reference that ended up folding into OpCompositeExtract that was indexing right outside the constant composite. The returned constant would then cause a segfault during constant propagation.	2019-07-29 13:27:40 -07:00
Steven Perron	d9c00e1d2d	Add folding rules for OpQuantizeToF16 (#2614 ) Adding the folding rules for OpQuantizeToF16, and fixed some matching tests to check identify new lines.	2019-05-21 23:15:01 -07:00
alan-baker	87c4ef8a9c	Do not fold floating point if float controls used (#2569 ) Fixes #2558 * Mark floating point instructions as non-foldable if any SPV_KHR_float_controls capabilities are present * tests	2019-05-10 11:03:22 -04:00
alan-baker	cc3e93c4e6	Add tests for folding 1.4 selects (#2568 ) Fixes #2554 * Folding rules already handle 1.4 selects so I simply added some tests	2019-05-08 14:06:04 -04:00
Steven Perron	5186ffedb3	Remove duplicates from list of interface IDs in OpEntryPoint instruction (#2449 ) * Remove duplicates from list of interface IDs in OpEntryPoint instruction Fixes #2002.	2019-03-13 15:46:31 -04:00
Steven Perron	fde69dcd80	Fix OpDot folding of half float vectors. (#2411 ) * Fix OpDot folding of half float vectors. The code that folds OpDot does not handle half floats correctly. After trying to multiple the first components, we get a nullptr because we don't fold half float values. This nullptr gets passed to the code that does the addition, and causes an assert. Fixes #2405.	2019-02-20 20:05:08 -05:00
Steven Perron	464111eaef	Remove use of deprecated googletest macro (#2286 ) * Remove use of deprecated googletest macro INSTANTIATE_TEST_CASE_P has been deprecated. We need to use INSTANTIATE_TEST_SUITE_P instead. * Remove extra commas from test suites.	2019-01-29 18:56:52 -05:00
Steven Perron	213e15e100	Fix overflow when negating INT_MIN. (#2293 ) When doing (-INT_MIN) is considered overflow, so we cannot fold it by actually performing the negation. Fixes https://crbug.com/917991	2019-01-17 17:01:55 -05:00
Steven Perron	49b5b0abc6	Fix up bit shifts by 32. (#2292 ) In C++, a bit shift of the same size as the type is undefined, but it is defined in spir-v. When folding those cases, we have to be careful. We cannot simply do the shift in C++. Fixes https://crbug.com/917697.	2019-01-16 15:52:23 -05:00
Steven Perron	17cba4695c	Remove undefined behaviour when folding shifts. (#2157 ) We currently simulate all shift operations when the two operand are constants. The problem is that if the shift amount is larger than 32, the result is undefined. I'm changing the folder to return 0 if the shift value is too high. That way, we will have defined behaviour. https://crbug.com/910937.	2018-12-04 10:04:02 -05:00
Steven Perron	dc9d155d62	Fix folding of volatile store. (#2048 ) When looking for the Volatile mask on a store, the instruction folder accesses an out-of-bounds element. We fix that up. Fixes crbug.com/903530.	2018-11-14 13:52:18 -05:00

1 2

95 Commits