SPIRV-Cross

Author	SHA1	Message	Date
Lukas Hermanns	a9f3c981d9	Adjustments after rebase of ue4_dev branch.	2019-09-13 14:03:02 -04:00
Mark Satterthwaite	c4f9704af0	OpImageTexelPointer needs to use an int coordinate type for GLSL, but not for MSL.	2019-09-12 08:52:08 -04:00
Mark Satterthwaite	e4c6388571	More fixes to handling packing & access elements in an array. Made in two parts. 1. Don't allow AccessChain operations to add duplicated swizzles when accessing packed arrays. 2. Only pack arrays when there is the proper amount of space between members in a struct, otherwise it will definitely be wrong.	2019-09-11 16:15:10 -04:00
Mark Satterthwaite	a80c74b40e	There are occasions where phi-variable copies are introduced for original variables which are fully declared, which coud result in the phi-variable never being declared and the shader not compiling, so declare the phi-variables when this happens. Change made in two parts. 1. Ensure that we declare phi-variable copies even if the original declaration isn't deferred. 2. Only flush phi variables once, avoids duplicate definitions.	2019-09-11 14:00:49 -04:00
Mark Satterthwaite	2af70b837c	When converting from HLSL the dxc SPIRV output often contains variables that are written through (e.g. a = b = c;) which seems to break the tracking of expressions in SPIRV-Cross, so don't reset everything once configured.	2019-09-10 13:25:20 -04:00
Mark Satterthwaite	42b8a62870	Fixes to the generation of Metal tessellation shaders from SPIRV so that it works correctly in more complicated cases. First, when generating from HLSL before invoking the code that comes from the HLSL patch-function a control-flow and full memory-barrier are required to ensure that all the temporary values in thread-local storage for the patch are available. Second, the inputs to control and evaluation shaders must be properly forwarded from the global variables in SPIRV to the member variables in the relevant input structure. Finally when arrays of interpolators are used for input or output we need to add an extra level of array indirection because Metal works at a different granularity than SPIRV. Five parts. 1. Fix tessellation patch function processing. 2. Fix loads from tessellation control inputs not being forwarded to the gl_in structure array. 3. Fix loads from tessellation evaluation inputs not being forwarded to the stage_in structure array. 4. Workaround SPIRV losing an array indirection in tessellation shaders - not the best solution but enough to keep things progressing. 5. Apparently gl_TessLevelInner/Outer is special and needs to not be placed into the input array.	2019-09-10 10:37:07 -04:00
Mark Satterthwaite	d50659af92	Rework the way arrays are handled in Metal to remove the array copies as they are unnecessary from Metal 1.2. There were cases where copies were not being inserted and others appeared unncessary, using the template type should allow the 'metal' compiler to do the best possible optimisation. The changes are broken into three stages. 1. Allow Metal to use the array<T> template to make arrays a value type. 2. Force the use of C style array declaration for some cases which cannot be wrapped with a template. 3. Threadgroup arrays can't have a wrapper type. 4. Tweak the code to use unsafe_array in a few more places so that we can handle passing arrays of resources into the shader and then through shaders into sub-functions. 5. Handle packed matrix types inside arrays within structs. 6. Make sure that builtin arguments still retain their array qualifiers when used in leaf functions. 7. Fix declaration of array-of-array constants for Metal so we can use the array<T> template.	2019-09-05 12:39:44 -04:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Chip Davis	5fe1ecc324	GLSL: Fix post-depth coverage for ESSL. ESSL does not support `GL_ARB_post_depth_coverage`. There, we must use `GL_EXT_post_depth_coverage`. I've added this as a fallback for desktop as well. Note that `GL_EXT_post_depth_coverage` also requires the fragment shader to set `early_fragment_tests` explicitly, while `GL_ARB_post_depth_coverage` does not. It doesn't really matter either way, since `SPV_KHR_post_depth_coverage` also requires both execution modes to be explicitly set.	2019-08-28 13:40:13 -05:00
Hans-Kristian Arntzen	3ccfbce264	Run format_all.sh.	2019-08-28 14:25:26 +02:00
Hans-Kristian Arntzen	563e994486	Merge pull request #1135 from KhronosGroup/fix-1119 MSL: Deal with array copies from and to threadgroup.	2019-08-27 15:48:08 +02:00
Hans-Kristian Arntzen	aec826222d	Merge pull request #1134 from KhronosGroup/fix-1117 Do not allow base expressions for non-native row-major matrices.	2019-08-27 15:47:33 +02:00
Hans-Kristian Arntzen	9436cd3036	MSL: Deal with array copies from and to threadgroup.	2019-08-27 13:18:01 +02:00
Hans-Kristian Arntzen	1017a02aad	Merge pull request #1133 from KhronosGroup/fix-1115 Deal with ldexp taking uint input.	2019-08-27 13:17:43 +02:00
Hans-Kristian Arntzen	7ff2db4570	Do not allow base expressions for non-native row-major matrices.	2019-08-27 11:41:54 +02:00
Hans-Kristian Arntzen	2f7848dcda	Deal with ldexp taking uint input. Need to value cast to int first.	2019-08-27 11:19:54 +02:00
Hans-Kristian Arntzen	5d97dae1eb	Move branchless analysis to CFG. Traverse backwards instead, far more robust. Should elide basically all redundant continue; statements now.	2019-08-27 10:19:19 +02:00
Hans-Kristian Arntzen	55c2ca90ae	Elide branches to continue block when continue block is also a merge.	2019-08-27 10:19:01 +02:00
Hans-Kristian Arntzen	b3305799a8	Deal correctly with sign on bitfield operations. Need a lot of special purpose implementation functions for these.	2019-08-26 11:36:36 +02:00
Hans-Kristian Arntzen	b97e9b0499	Fix severe performance issue with invariant expression invalidation. We were going down a tree of expressions multiple times and this caused an exponential explosion in time, which was not caught until recently. Fix this by blocking any traversal going through an ID more than one time. This fix overall improves performance by almost an order of magnitude on a particular test shader rather than slowing it down by ~75x.	2019-08-01 09:55:21 +02:00
Hans-Kristian Arntzen	c3e8e728d8	MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 11:16:43 +02:00
Hans-Kristian Arntzen	301eab1b7a	Merge pull request #1099 from KhronosGroup/fix-1091 Missed case where DoWhile continue block deals with Phi.	2019-07-25 17:44:17 +02:00
Hans-Kristian Arntzen	e06efb7259	Missed case where DoWhile continue block deals with Phi.	2019-07-25 12:30:50 +02:00
Hans-Kristian Arntzen	12ca9d1982	Vulkan GLSL: Support disabling samplerless texture function EXT. Some platforms support Vulkan GLSL, but not this extension apparently ...	2019-07-25 11:07:14 +02:00
Hans-Kristian Arntzen	d90eeddcf1	Fix some typos in comments.	2019-07-24 12:14:19 +02:00
Hans-Kristian Arntzen	461f1506e7	Do not eagerly invalidate all active variables on a branch. This is not necessary, as we must emit an invalidating store before we potentially consume an invalid expression. In fact, we're a bit conservative here in this case for example: int tmp = variable; if (...) { variable = 10; } else { // Consuming tmp here is fine, but it was // invalidated while emitting other branch. // Technically, we need to study if there is an invalidating store // in the CFG between the loading block and this block, and the other // branch will not be a part of that analysis. int tmp2 = tmp * tmp; } Fixing this case means complex CFG traversal everywhere, and it feels like overkill. Fixing this exposed a bug with access chains, so fix a bug where expression dependencies were not inherited properly in access chains. Access chains are now considered forwarded if there is at least one dependency which is also forwarded.	2019-07-24 11:17:30 +02:00
Hans-Kristian Arntzen	18bcc9b790	Do not disable temporary forwarding when we suppress usage tracking. This subtle bug removed any expression validation for trivially swizzled variables. Make usage suppression a more explicit concept rather than just hacking off forwarded_temporaries. There is some fallout here with loop generation since our expression invalidation is currently a bit too naive to handle loops properly. The forwarding bug masked this problem until now. If part of the loop condition is also used in the body, we end up reading an invalid expression, which in turn forces a temporary to be generated in the condition block, not good. We'll need to be smarter here ...	2019-07-23 19:18:44 +02:00
Hans-Kristian Arntzen	1ece67a050	Look at pointee type when unpacking expressions. We might be unpacking in OpLoad, so don't want any pointer types from access chains creeping in.	2019-07-23 17:07:15 +02:00
Hans-Kristian Arntzen	ebe109d91d	Deal correctly with non-forwarded packed loads. Need to unpack the expression if we're not forwarding.	2019-07-23 16:25:19 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	ef1fa71bba	Unpack vector expression in Matrix-Vector multiplies.	2019-07-23 12:22:40 +02:00
Hans-Kristian Arntzen	46e757b278	GLSL/HLSL: Verify member alignment for explicit offset as well.	2019-07-23 11:53:33 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	6c1f97b4a9	Fix unpacking of packed but not remapped types on load.	2019-07-19 14:50:35 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	dd7ebaf9f7	Start considering how to emit physical type ID.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Hans-Kristian Arntzen	2e32d4c0db	Merge pull request #1079 from cdavis5e/msl-boolean-mix MSL: Use the select() function for OpSelect.	2019-07-12 09:52:57 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00

1 2 3 4 5 ...

665 Commits