SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	00d5c78447	Remove obsolete use of AtomicCounterMemoryMask.	2019-12-04 15:30:07 +01:00
Hans-Kristian Arntzen	67b2991451	Don't emit memoryBarrierShared() in workgroup control barriers. This is implied in both GL and GLES. Emitting memoryBarrierShared() was based on earlier confusion in the spec which has since been fixed and clarified.	2019-12-04 15:06:19 +01:00
Hans-Kristian Arntzen	4edb99d476	Fix sign handling for S/UToF.	2019-11-28 13:55:28 +01:00
Hans-Kristian Arntzen	f5cb08c42f	Mark loop headers as complex as early as possible. We had a case where loops were marked complex in a cascading fashion where each loop iteration would discover one new complex loop. This was a problem with three nested loops.	2019-11-26 11:01:39 +01:00
Hans-Kristian Arntzen	0b417b586a	HLSL: Report more explicitly which member failed validation. This will be awkward to report in GLSL where we check multiple packing standards, but for HLSL it should be easy since there's only CBuffer packing standard to worry about.	2019-11-06 11:21:39 +01:00
Hans-Kristian Arntzen	a8d676f2e4	GLSL: Fix issue with array-of-array inputs in tess. Only one dimension can be unsized and wrong dimension was used for unrolling purposes.	2019-11-04 10:34:49 +01:00
Hans-Kristian Arntzen	8f13a3f4b1	MSL: Remove workaround for passing constant arrays to functions. Arrays are value-types now, so remove the old workaround.	2019-10-28 12:14:43 +01:00
Hans-Kristian Arntzen	fa011f8547	MSL: Declare arrays with proper type wrapper. Need to construct with value type spvUnsafeArray<T, N>({ elem0, elem1 }) to make array initialization work in complex scenarios.	2019-10-26 17:57:34 +02:00
Hans-Kristian Arntzen	3f569ed5ec	GLSL: Minor nit, check flushed_phi_variables with count().	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	9d18c82364	Clean up call to builtin_translates_to_nonarray. get_decoration is better than poking in ir.meta manually.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	3b5c4c7316	Implement constant empty struct correctly on all backends. MSL actually supports empty structs, so enable that path as well.	2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen	8066d13599	MSL: Rewrite propagated depth comparison state handling. Far cleaner, and more correct to run the traversal twice. Fixes a case where we propagate depth state through multiple functions.	2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen	6edbf0c9e9	MSL: Minor cleanups for texture atomic emulation. Storing pointers to internal objects is generally not done, IDs are preferred.	2019-10-24 11:30:20 +02:00
Lukas Hermanns	6673a675ba	Simplified overriding of 'access_chain_internal' function in CompilerMSL.	2019-10-22 11:06:16 -04:00
Lukas Hermanns	84351d3aed	Merge remote-tracking branch 'upstream/master'	2019-10-21 18:55:36 -04:00
Hans-Kristian Arntzen	4550f18b37	Fix OpVectorExtractDynamic with spec constant op index. We cannot lower this to a swizzle, keep it dynamically indexed.	2019-10-17 11:12:14 +02:00
Lukas Hermanns	2482ff708c	Merge remote-tracking branch 'upstream/master'	2019-10-14 11:06:15 -04:00
Hans-Kristian Arntzen	a9be92569f	HLSL: Fix unrolled S/G LE/LT/GE/GT opcodes. Need to bitcast the unrolled expressions as well.	2019-10-14 16:08:39 +02:00
Hans-Kristian Arntzen	3bf9fa7ed6	GLSL: Deal correctly with bitwidth on integer compares.	2019-10-14 15:23:38 +02:00
Hans-Kristian Arntzen	b960ae3b70	HLSL: Partially implement Unordered compare. We cannot correctly implement unordered equal/ordered not equal without a lot of extra instructions which slows normal code down.	2019-10-14 15:15:03 +02:00
Hans-Kristian Arntzen	14a4b087fb	GLSL: Support unordered floating point compare. There is no direct way to express this, so invert boolean results to force any NaN -> true. glslang emits Ordered compare instructions everywhere, and the GLSL spec is not clear on this, so assume this is fine.	2019-10-14 13:48:22 +02:00
Hans-Kristian Arntzen	07e9501ae1	MSL: Fix regression with OpCompositeConstruct from std140 float[]. Simple fix, just need to use to_unpacked_expression rather than to_expression here to deal with this.	2019-10-11 11:21:43 +02:00
Lukas Hermanns	688a39e7f8	Merge remote-tracking branch 'upstream/master'	2019-10-09 10:12:04 -04:00
Hans-Kristian Arntzen	f59688b5d1	Workaround MSVC issue.	2019-10-07 12:40:21 +02:00
Hans-Kristian Arntzen	a0c13e4ee8	Do not consider aliased struct types if the master is not a block. It is possible for a shader to declare two plain struct types which simply share the same OpName without there being an implicit value/buffer alias relationship. For to_member_name(), make sure to use the type alias master when resolving member names. The member name may be different in a type alias master if the SPIR-V is being intentionally difficult.	2019-10-07 10:52:16 +02:00
Lukas Hermanns	f3a6d28a1d	Further updates for pull request #1162; also added two test cases for the spvCubemapTo2DArrayFace function and added the '--msl-framebuffer-fetch' and '--msl-emulate-cube-array' compiler options.	2019-09-27 15:49:54 -04:00
Lukas Hermanns	7ad0a84778	Updates for pull request #1162	2019-09-24 14:35:25 -04:00
Lukas Hermanns	37df74035b	Merge branch 'ue4_dev'	2019-09-20 09:42:42 -04:00
Hans-Kristian Arntzen	3c11254ece	MSL: Fix 16-bit integer literals. There is no suffix, so bitcasts failed.	2019-09-19 10:19:51 +02:00
Lukas Hermanns	50ac6862ac	Rearranged all 'UE Change' comments to match the project's coding style.	2019-09-18 14:03:54 -04:00
Hans-Kristian Arntzen	c3ff67c3f0	Fix -Wshorten-64-to-32 warnings.	2019-09-17 10:18:38 +02:00
Lukas Hermanns	a9f3c981d9	Adjustments after rebase of ue4_dev branch.	2019-09-13 14:03:02 -04:00
Mark Satterthwaite	c4f9704af0	OpImageTexelPointer needs to use an int coordinate type for GLSL, but not for MSL.	2019-09-12 08:52:08 -04:00
Mark Satterthwaite	e4c6388571	More fixes to handling packing & access elements in an array. Made in two parts. 1. Don't allow AccessChain operations to add duplicated swizzles when accessing packed arrays. 2. Only pack arrays when there is the proper amount of space between members in a struct, otherwise it will definitely be wrong.	2019-09-11 16:15:10 -04:00
Mark Satterthwaite	a80c74b40e	There are occasions where phi-variable copies are introduced for original variables which are fully declared, which could result in the phi-variable never being declared and the shader not compiling, so declare the phi-variables when this happens. Change made in two parts. 1. Ensure that we declare phi-variable copies even if the original declaration isn't deferred. 2. Only flush phi variables once, which avoids duplicate definitions.	2019-09-11 14:00:49 -04:00
Mark Satterthwaite	2af70b837c	When converting from HLSL the dxc SPIRV output often contains variables that are written through (e.g. a = b = c;) which seems to break the tracking of expressions in SPIRV-Cross, so don't reset everything once configured.	2019-09-10 13:25:20 -04:00
Mark Satterthwaite	42b8a62870	Fixes to the generation of Metal tessellation shaders from SPIRV so that it works correctly in more complicated cases. First, when generating from HLSL before invoking the code that comes from the HLSL patch-function a control-flow and full memory-barrier are required to ensure that all the temporary values in thread-local storage for the patch are available. Second, the inputs to control and evaluation shaders must be properly forwarded from the global variables in SPIRV to the member variables in the relevant input structure. Finally when arrays of interpolators are used for input or output we need to add an extra level of array indirection because Metal works at a different granularity than SPIRV. Five parts. 1. Fix tessellation patch function processing. 2. Fix loads from tessellation control inputs not being forwarded to the gl_in structure array. 3. Fix loads from tessellation evaluation inputs not being forwarded to the stage_in structure array. 4. Workaround SPIRV losing an array indirection in tessellation shaders - not the best solution but enough to keep things progressing. 5. Apparently gl_TessLevelInner/Outer is special and needs to not be placed into the input array.	2019-09-10 10:37:07 -04:00
Hans-Kristian Arntzen	333980ae91	Refactor into stronger types in public API. Some fallout where internal functions are using stronger types. Overkill to move everything over to strong types right now, but perhaps move over to it slowly over time.	2019-09-06 12:29:47 +02:00
Chip Davis	cb35934248	MSL: Support dynamic offsets for buffers in argument buffers. Vulkan has two types of buffer descriptors, `VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC` and `VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC`, which allow the client to offset the buffers by an amount given when the descriptor set is bound to a pipeline. Metal provides no direct support for this when the buffer in question is in an argument buffer, so once again we're on our own. These offsets cannot be stored or associated in any way with the argument buffer itself, because they are set at bind time. Different pipelines may have different offsets set. Therefore, we must use a separate buffer, not in any argument buffer, to hold these offsets. Then the shader must manually offset the buffer pointer. This change fully supports arrays, including arrays of arrays, even though Vulkan forbids them. It does not, however, support runtime arrays. Perhaps later.	2019-09-05 23:29:00 -05:00
Mark Satterthwaite	d50659af92	Rework the way arrays are handled in Metal to remove the array copies as they are unnecessary from Metal 1.2. There were cases where copies were not being inserted and others appeared unnecessary, using the template type should allow the 'metal' compiler to do the best possible optimisation. The changes are broken into seven stages. 1. Allow Metal to use the array<T> template to make arrays a value type. 2. Force the use of C style array declaration for some cases which cannot be wrapped with a template. 3. Threadgroup arrays can't have a wrapper type. 4. Tweak the code to use unsafe_array in a few more places so that we can handle passing arrays of resources into the shader and then through shaders into sub-functions. 5. Handle packed matrix types inside arrays within structs. 6. Make sure that builtin arguments still retain their array qualifiers when used in leaf functions. 7. Fix declaration of array-of-array constants for Metal so we can use the array<T> template.	2019-09-05 12:39:44 -04:00
Hans-Kristian Arntzen	1dc7e938d0	Make sure not to propagate loads outside interlock region.	2019-09-04 12:33:20 +02:00
Hans-Kristian Arntzen	261b46982a	Deal with complex interlock cases in GLSL.	2019-09-04 12:18:04 +02:00
Chip Davis	2eff420d9a	Support the SPV_EXT_fragment_shader_interlock extension. This was straightforward to implement in GLSL. The `ShadingRateInterlockOrderedEXT` and `ShadingRateInterlockUnorderedEXT` modes aren't implemented yet, because we don't support `SPV_NV_shading_rate` or `SPV_EXT_fragment_invocation_density` yet. HLSL and MSL were more interesting. They don't support this directly, but they do support marking resources as "rasterizer ordered," which does roughly the same thing. So this implementation scans all accesses inside the critical section and marks all storage resources found therein as rasterizer ordered. They also don't support the fine-grained controls on pixel- vs. sample-level interlock and disabling ordering guarantees that GLSL and SPIR-V do, but that's OK. "Unordered" here merely means the order is undefined; that it just so happens to be the same as rasterizer order is immaterial. As for pixel- vs. sample-level interlock, Vulkan explicitly states: "With sample shading enabled, [the `PixelInterlockOrderedEXT` and `PixelInterlockUnorderedEXT`] execution modes are treated like `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT` respectively." and: "If [the `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT`] execution modes are used in single-sample mode they are treated like `PixelInterlockOrderedEXT` or `PixelInterlockUnorderedEXT` respectively." So this will DTRT for MoltenVK and gfx-rs, at least. MSL additionally supports multiple raster order groups; resources that are not accessed together can be placed in different ROGs to allow them to be synchronized separately. A more sophisticated analysis might be able to place resources optimally, but that's outside the scope of this change. For now, we assign all resources to group 0, which should do for our purposes. `glslang` doesn't support the `RasterizerOrdered` UAVs this implementation produces for HLSL, so the test case needs `fxc.exe`. It also insists on GLSL 4.50 for `GL_ARB_fragment_shader_interlock`, even though the spec says it needs either 4.20 or `GL_ARB_shader_image_load_store`; and it doesn't support the `GL_NV_fragment_shader_interlock` extension at all. So I haven't been able to test those code paths. Fixes #1002.	2019-09-02 12:31:10 -05:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Chip Davis	5fe1ecc324	GLSL: Fix post-depth coverage for ESSL. ESSL does not support `GL_ARB_post_depth_coverage`. There, we must use `GL_EXT_post_depth_coverage`. I've added this as a fallback for desktop as well. Note that `GL_EXT_post_depth_coverage` also requires the fragment shader to set `early_fragment_tests` explicitly, while `GL_ARB_post_depth_coverage` does not. It doesn't really matter either way, since `SPV_KHR_post_depth_coverage` also requires both execution modes to be explicitly set.	2019-08-28 13:40:13 -05:00
Hans-Kristian Arntzen	3ccfbce264	Run format_all.sh.	2019-08-28 14:25:26 +02:00
Hans-Kristian Arntzen	563e994486	Merge pull request #1135 from KhronosGroup/fix-1119 MSL: Deal with array copies from and to threadgroup.	2019-08-27 15:48:08 +02:00
Hans-Kristian Arntzen	aec826222d	Merge pull request #1134 from KhronosGroup/fix-1117 Do not allow base expressions for non-native row-major matrices.	2019-08-27 15:47:33 +02:00
Hans-Kristian Arntzen	9436cd3036	MSL: Deal with array copies from and to threadgroup.	2019-08-27 13:18:01 +02:00
Hans-Kristian Arntzen	1017a02aad	Merge pull request #1133 from KhronosGroup/fix-1115 Deal with ldexp taking uint input.	2019-08-27 13:17:43 +02:00
Hans-Kristian Arntzen	7ff2db4570	Do not allow base expressions for non-native row-major matrices.	2019-08-27 11:41:54 +02:00
Hans-Kristian Arntzen	2f7848dcda	Deal with ldexp taking uint input. Need to value cast to int first.	2019-08-27 11:19:54 +02:00
Hans-Kristian Arntzen	5d97dae1eb	Move branchless analysis to CFG. Traverse backwards instead, far more robust. Should elide basically all redundant continue; statements now.	2019-08-27 10:19:19 +02:00
Hans-Kristian Arntzen	55c2ca90ae	Elide branches to continue block when continue block is also a merge.	2019-08-27 10:19:01 +02:00
Hans-Kristian Arntzen	b3305799a8	Deal correctly with sign on bitfield operations. Need a lot of special purpose implementation functions for these.	2019-08-26 11:36:36 +02:00
Hans-Kristian Arntzen	b97e9b0499	Fix severe performance issue with invariant expression invalidation. We were going down a tree of expressions multiple times and this caused an exponential explosion in time, which was not caught until recently. Fix this by blocking any traversal going through an ID more than one time. This fix overall improves performance by almost an order of magnitude on a particular test shader rather than slowing it down by ~75x.	2019-08-01 09:55:21 +02:00
Hans-Kristian Arntzen	c3e8e728d8	MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 11:16:43 +02:00
Hans-Kristian Arntzen	301eab1b7a	Merge pull request #1099 from KhronosGroup/fix-1091 Missed case where DoWhile continue block deals with Phi.	2019-07-25 17:44:17 +02:00
Hans-Kristian Arntzen	e06efb7259	Missed case where DoWhile continue block deals with Phi.	2019-07-25 12:30:50 +02:00
Hans-Kristian Arntzen	12ca9d1982	Vulkan GLSL: Support disabling samplerless texture function EXT. Some platforms support Vulkan GLSL, but not this extension apparently ...	2019-07-25 11:07:14 +02:00
Hans-Kristian Arntzen	d90eeddcf1	Fix some typos in comments.	2019-07-24 12:14:19 +02:00
Hans-Kristian Arntzen	461f1506e7	Do not eagerly invalidate all active variables on a branch. This is not necessary, as we must emit an invalidating store before we potentially consume an invalid expression. In fact, we're a bit conservative here in this case for example: int tmp = variable; if (...) { variable = 10; } else { /* Consuming tmp here is fine, but it was invalidated while emitting other branch. Technically, we need to study if there is an invalidating store in the CFG between the loading block and this block, and the other branch will not be a part of that analysis. */ int tmp2 = tmp * tmp; } Fixing this case means complex CFG traversal everywhere, and it feels like overkill. Fixing this exposed a bug with access chains, so fix a bug where expression dependencies were not inherited properly in access chains. Access chains are now considered forwarded if there is at least one dependency which is also forwarded.	2019-07-24 11:17:30 +02:00
Hans-Kristian Arntzen	18bcc9b790	Do not disable temporary forwarding when we suppress usage tracking. This subtle bug removed any expression validation for trivially swizzled variables. Make usage suppression a more explicit concept rather than just hacking off forwarded_temporaries. There is some fallout here with loop generation since our expression invalidation is currently a bit too naive to handle loops properly. The forwarding bug masked this problem until now. If part of the loop condition is also used in the body, we end up reading an invalid expression, which in turn forces a temporary to be generated in the condition block, not good. We'll need to be smarter here ...	2019-07-23 19:18:44 +02:00
Hans-Kristian Arntzen	1ece67a050	Look at pointee type when unpacking expressions. We might be unpacking in OpLoad, so don't want any pointer types from access chains creeping in.	2019-07-23 17:07:15 +02:00
Hans-Kristian Arntzen	ebe109d91d	Deal correctly with non-forwarded packed loads. Need to unpack the expression if we're not forwarding.	2019-07-23 16:25:19 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	ef1fa71bba	Unpack vector expression in Matrix-Vector multiplies.	2019-07-23 12:22:40 +02:00
Hans-Kristian Arntzen	46e757b278	GLSL/HLSL: Verify member alignment for explicit offset as well.	2019-07-23 11:53:33 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	6c1f97b4a9	Fix unpacking of packed but not remapped types on load.	2019-07-19 14:50:35 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	dd7ebaf9f7	Start considering how to emit physical type ID.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Hans-Kristian Arntzen	2e32d4c0db	Merge pull request #1079 from cdavis5e/msl-boolean-mix MSL: Use the select() function for OpSelect.	2019-07-12 09:52:57 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00
Chip Davis	1df47db6ba	Support the SPV_KHR_post_depth_coverage extension. Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn` variable is to contain the computed coverage mask following the early fragment tests, which this mode requires and implicitly enables. Note that unlike Vulkan and OpenGL, Metal places this on the sample mask input itself, and furthermore does not implicitly enable early fragment testing. If it isn't enabled explicitly with an `[[early_fragment_tests]]` attribute, the compiler will error out. So we have to enable that mode explicitly if `PostDepthCoverage` is enabled but `EarlyFragmentTests` isn't. For Metal, only iOS supports this; for some reason, Apple has yet to implement it on macOS, even though many desktop cards support it.	2019-07-11 10:28:43 -05:00
Hans-Kristian Arntzen	63bcbd511e	GLSL: Need extension to use bitcast on GLSL < 330.	2019-07-11 15:32:57 +02:00
Hans-Kristian Arntzen	25c74b324e	Forget loop variable enables after emitting block chain.	2019-07-10 12:57:12 +02:00
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Hans-Kristian Arntzen	53ab2144b9	Merge pull request #1064 from KhronosGroup/fix-1062 Fall back to complex loop if non-trivial continue block is found.	2019-07-08 13:58:35 +02:00
Hans-Kristian Arntzen	50342966c0	Fall back to complex loop if non-trivial continue block is found. There is a case where we can deduce a for/while loop, but the continue block is actually very painful to deal with, so handle that case as well. Removes an exceptional case.	2019-07-08 11:54:29 +02:00
Hans-Kristian Arntzen	d12b54bbb4	Propagate NonUniformEXT to dependent expressions. This decoration might only be present for the very last ID which is consumed by a sampling or Load/Store instruction. To make sure our access chains are emitted correctly, we have to back-propagate this decoration.	2019-07-08 11:19:38 +02:00
Lifeng Pan	5ca8779044	Parse SPIR-V debug information extended instructions, as well as OpNoLine. No impact on result shader string.	2019-07-04 16:21:44 +08:00
Hans-Kristian Arntzen	581ed0fd59	HLSL: Does not support case-fallthrough. Disable any fallthrough on HLSL. Risky business if fallthrough blocks had a barrier(), but can't do anything about that ...	2019-06-27 15:10:17 +02:00
Hans-Kristian Arntzen	c76b99b711	Handle more cases with FP16 and texture sampling.	2019-06-27 15:04:22 +02:00
Hans-Kristian Arntzen	bcef66fbf3	Fix declaration of loop variables with a Phi helper copy. Certain Phi variables need to maintain a temporary copy, but we forgot to declare them when the master variable is a loop variable itself.	2019-06-25 10:45:15 +02:00
Hans-Kristian Arntzen	7557ff5567	Workaround GCC 9 bug.	2019-06-24 10:17:25 +02:00
Hans-Kristian Arntzen	b4e0163749	Run format_all.sh.	2019-06-21 16:02:22 +02:00
Hans-Kristian Arntzen	bcec5cb370	Old MSVC does not like +[] constructs.	2019-06-21 14:59:51 +02:00
Hans-Kristian Arntzen	c365cc1b43	Deal with OpPhi and case fallthrough. This is quite complex since we cannot flush Phi inside the case labels, we have to do it outside by emitting a lot of manual branches ourselves. This should be extremely rare, but we need to handle this case.	2019-06-21 13:38:23 +02:00
Hans-Kristian Arntzen	22e3beaab9	Deal with switch block fallthrough more correctly ...	2019-06-20 12:14:19 +02:00
Hans-Kristian Arntzen	bc3bf47446	Rewrite how switch block case labels are emitted.	2019-06-20 11:57:05 +02:00
Hans-Kristian Arntzen	7fdb418f18	Merge pull request #1028 from KhronosGroup/fix-1010 MSL: Support barycentrics and PrimitiveID in fragment shaders	2019-06-19 15:29:14 +02:00
Hans-Kristian Arntzen	707312b83a	GLSL: Support NV barycentrics.	2019-06-19 09:52:35 +02:00
Hans-Kristian Arntzen	f171d82590	MSL: Support MinLod operand.	2019-06-19 09:43:03 +02:00
Hans-Kristian Arntzen	d81bfc5b58	MSL: Fix regression with Private parameter declaration. If we compile multiple times due to forced_recompile, we had deferred_declaration = true while emitting function prototypes which broke an assumption. Fix this by clearing out stale state before leaving a function.	2019-06-13 10:36:21 +02:00
Hans-Kristian Arntzen	a9da59b0b8	GLSL: Support GL_ARB_shader_stencil_export.	2019-06-12 10:06:54 +02:00
Hans-Kristian Arntzen	bf56dc88b9	Rewrite how loop dominators are propagated. Do this analysis in the CFG stage rather than last minute with the ad-hoc algorithm we had in place before CFG was introduced.	2019-06-06 12:17:46 +02:00
Hans-Kristian Arntzen	720681da39	Merge pull request #1006 from KhronosGroup/fix-1003 Deal with case where a block is somehow emitted in a duplicated fashion.	2019-06-05 16:11:06 +02:00
Patrick Mours	8d64d5e776	Fix storage packing qualifiers missing on "shaderRecordNV" buffers	2019-06-05 13:31:24 +02:00
Patrick Mours	be3035db26	Fix callable data variables	2019-06-05 13:31:24 +02:00
Patrick Mours	789178666f	Add support for "shaderRecordNV" qualifier	2019-06-05 13:31:24 +02:00
Hans-Kristian Arntzen	c09ca74c61	Deal with case where a block is somehow emitted in a duplicated fashion. We seem to have to deal with a case where a block is used multiple times without any "proper" structured control flow, so we risk losing deferred declaration state.	2019-06-05 12:39:40 +02:00
Hans-Kristian Arntzen	65af09d2d1	Support emitting OpLine directive. Facilitates easier mapping from source language to cross-compiled output in tooling.	2019-05-28 13:44:24 +02:00
Hans-Kristian Arntzen	23889f7b87	GLSL: Support std430 in UBOs with scalar layout.	2019-05-28 12:22:44 +02:00
Hans-Kristian Arntzen	b3094cd02a	Run format_all.sh.	2019-05-27 16:54:13 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	96492648d4	MSL: Fix struct declaration order with complex type aliases. MSL generally emits the aliases, which means we cannot always place the master type first, unlike GLSL and HLSL. The logic fix is just to reorder after we have tagged types with packing information, rather than doing it in the parser fixup.	2019-05-23 14:54:04 +02:00
Hans-Kristian Arntzen	45a36ad034	Run format_all.sh.	2019-05-14 09:54:35 +02:00
Hans-Kristian Arntzen	c52d6bcd0c	Merge pull request #975 from alpqr/master GLSL: Add option to disable buffer blocks regardless of version	2019-05-14 09:51:39 +02:00
Laszlo Agocs	7bc31491be	GLSL: Add option to disable buffer blocks regardless of version	2019-05-13 21:29:06 +02:00
Hans-Kristian Arntzen	647ddaee42	HLSL/MSL: Deal correctly with nonuniformEXT qualifier. MSL does not seem to have a qualifier for this, but HLSL SM 5.1 does. glslangValidator for HLSL does not support this, so skip any validation, but it passes in FXC.	2019-05-13 14:58:27 +02:00
Hans-Kristian Arntzen	6fcf8c83d9	GLSL: Support OpBitcast for buffer references. Update glslang/SPIRV-Tools/SPIRV-Headers references.	2019-05-09 10:29:31 +02:00
Hans-Kristian Arntzen	b6f8a20624	GLSL: Return correct sign for OpArrayLength. .length() returns int, not uint ...	2019-05-07 19:02:32 +02:00
Hans-Kristian Arntzen	3186701739	GLSL: Support GL_EXT_nonuniform_qualifier.	2019-05-02 11:15:51 +02:00
Hans-Kristian Arntzen	6f091e7c8f	GLSL: Support GL_EXT_scalar_block_layout.	2019-04-26 15:43:37 +02:00
Hans-Kristian Arntzen	758427e127	Fix GCC 4.x warning.	2019-04-26 13:09:54 +02:00
Hans-Kristian Arntzen	2cc374a0c8	GLSL: Implement GL_EXT_buffer_reference. Buffer objects can contain arbitrary pointers to blocks. We can also implement ConvertPtrToU and ConvertUToPtr. The latter can cast a uint64_t to any type as it pleases, so we will need to generate fake buffer reference blocks to be able to cast the type.	2019-04-26 11:43:51 +02:00
Hans-Kristian Arntzen	8b236f24f1	Fix infinite loop when OpAtomic* temporaries are used in other blocks. We made the mistake of registering a dependency on the atomic variable even if the atomic result was forced to a temporary. There is no need to register reads from atomic variables like this as we always force atomic results to a temporary and argument read/writes do not need to be tracked.	2019-04-24 09:33:39 +02:00
Hans-Kristian Arntzen	e23c9ea700	Force complex loop in certain rare access chain scenarios. If we generate an access chain in a loop body, and it is consumed in the loop continue block, we have a problem because we cannot emit a temporary here holding the access chain reference. Force a complex loop body to work around this exceptionally rare case.	2019-04-10 16:02:03 +02:00
Hans-Kristian Arntzen	9ae91c2d1e	Deal with mismatched signs in S/U/F conversion opcodes.	2019-04-10 14:03:58 +02:00
Hans-Kristian Arntzen	a489ba7fd1	Reduce pressure on global allocation. - Replace ostringstream with custom implementation. ~30% performance uplift on vector-shuffle-oom test. Allocations are measurably reduced in Valgrind. - Replace std::vector with SmallVector. Classic malloc optimization, small vectors are backed by inline data. ~ 7-8% gain on vector-shuffle-oom on GCC 8 on Linux. - Use an object pool for IVariant type. We generally allocate a lot of SPIR* objects. We can amortize these allocations neatly by pooling them. - ~15% overall uplift on ./test_shaders.py --iterations 10000 shaders/.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	b2c2f724f4	Merge pull request #938 from KhronosGroup/fix-937 MSL: Fix OpLoad of array which is forced to a temporary.	2019-04-09 15:08:53 +02:00
Hans-Kristian Arntzen	bf07e5fa7b	MSL: Fix OpLoad of array which is forced to a temporary.	2019-04-09 11:50:45 +02:00
lifpan	876627df3b	Add OpUndef instruction to block's instruction list for completeness.	2019-04-08 19:45:31 +08:00
Hans-Kristian Arntzen	3ca8bc5e0d	Support fma() in older GLSL targets.	2019-04-08 10:38:32 +02:00
Hans-Kristian Arntzen	317144a59c	Detect invalid DoWhileLoop early. We had a bug where error conditions in DoWhileLoop emit path would not detect that statements were being emitted due to the masking behavior which happens when force_recompile is true. Fix this. Also, refactor force_recompile into member functions so we can properly break on any situation where this is set, without having to rely on watchpoints in debuggers.	2019-04-05 12:19:32 +02:00
Hans-Kristian Arntzen	44834f2115	Merge pull request #927 from KhronosGroup/fix-925 GLSL: Fix OpImageFetch with uint coordinates and LOD.	2019-04-03 12:32:43 +02:00
Hans-Kristian Arntzen	e4d5c6183a	GLSL: Fix OpImageFetch with uint coordinates and LOD. Also fix some minor issues with too many coordinate dimensions in HLSL and GLSL.	2019-04-03 10:50:32 +02:00
Hans-Kristian Arntzen	7e37623e82	MSL: Fix depth2d 4-component fixup. Need to look at the backing image for the image. We might have found diverging use at the image variable level, not just expression level.	2019-04-03 10:24:22 +02:00
Hans-Kristian Arntzen	9b92e68d71	Add an option to override the namespace used for spirv_cross. This is a pragmatic trick to avoid symbol collision where a project links against SPIRV-Cross statically, while linking to other projects which also use SPIRV-Cross statically. We can end up with very awkward symbol collisions which can resolve themselves silently because SPIRV-Cross is pulled in as necessary. To fix this, we must use different symbols and embed two copies of SPIRV-Cross in this scenario, now with different namespaces, which in turn leads to different symbols.	2019-03-29 10:29:44 +01:00
Bill Hollings	c48702d8c2	Fix crash when backend.int16_t_literal_suffix set to null. The design of backend.int16_t_literal_suffix and backend.uint16_t_literal_suffix allows them to be set to null, but that was not always tested for. I have removed the expectation that they can be null and set backend.int16_t_literal_suffix to "" when no suffix is needed. That has the same effect, and seemed to be a more usable and defensive approach.	2019-03-28 14:23:32 -04:00
Hans-Kristian Arntzen	eeb3f24991	Properly deal with sign-dependent GLSL opcodes. The GLSLstd450 spec is very lax about input signs, so we need to do the bitcasting dance to implement it correctly.	2019-03-27 12:20:53 +01:00
Patrick Mours	c96bab0659	Replace usage of "require_extension" with "require_extension_internal" and "to_func_call_arg" with "to_expression"	2019-03-26 14:04:39 +01:00
Patrick Mours	c74d7a412c	Add "GL_NV_ray_tracing" extension to output when ray tracing execution model is found	2019-03-25 15:06:01 +01:00
Patrick Mours	b2651d01e5	Merge branch master into SPV_NV_ray_tracing	2019-03-25 14:09:15 +01:00
Hans-Kristian Arntzen	8eb33c8017	Support -1 index in OpVectorShuffle. -1 (0xffffffff) literal means the component should be undefined. Since we cannot express undefined directly, just use a 0 literal in the appropriate type.	2019-03-25 10:17:05 +01:00
Hans-Kristian Arntzen	2a0365c813	GLSL/HLSL: Implement NMin/NMax/NClamp. Need to emulate these calls for correctness.	2019-03-21 15:26:46 +01:00
Hans-Kristian Arntzen	0b20180537	GLSL: Deal with array loads from input in tessellation. We have an edge case where the array is declared with a concrete size, but in GLSL we must emit an unsized array, which breaks array copies. Deal explicitly with this.	2019-03-21 11:50:53 +01:00
Hans-Kristian Arntzen	d2961b30db	GLSL: Unroll loads from builtin pos/point arrays. Odd-ball case for certain geometry shaders coming from HLSL.	2019-03-21 11:25:41 +01:00
Hans-Kristian Arntzen	45baf24a17	Move check for structured OpSwitch to CompilerGLSL. Can still parse correctly.	2019-03-20 10:42:38 +01:00
Hans-Kristian Arntzen	a94490498d	Merge pull request #894 from KhronosGroup/fix-882 GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 11:56:24 +01:00
Hans-Kristian Arntzen	1389aa34e4	GLSL: Check target version for push constant location = N.	2019-03-19 11:20:53 +01:00
Hans-Kristian Arntzen	0474848d4a	GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 10:58:52 +01:00
Hans-Kristian Arntzen	7310274a4f	Fix build on Android API < 26.	2019-03-18 10:14:04 +01:00
Hans-Kristian Arntzen	cff057ca5a	We emit loop header variables even for while and dowhile. Make the name clearer.	2019-03-06 12:30:11 +01:00
Hans-Kristian Arntzen	8bfb04d29d	Run format_all.sh. Disable clang format in C wrapper for now. Some weird formatting bug with the try/catch macro.	2019-03-06 12:20:13 +01:00
Hans-Kristian Arntzen	ef24337849	Support do-while where test is negative.	2019-03-06 12:17:38 +01:00
Hans-Kristian Arntzen	70ff96b03f	Deal with more for loop candidate cases. We can trivially deal with cases where the loop tests are simply inverted. We can also deal with cases where the condition block branches to the merge block via other noop blocks. This makes SPIR-V codegen easier when targeting SPIRV-Cross.	2019-03-06 11:24:43 +01:00
Hans-Kristian Arntzen	4096552c26	Use RADIXCHAR, which is the portable variant of DECIMAL_POINT.	2019-02-28 12:32:52 +01:00
Hans-Kristian Arntzen	8255dd3ed6	Use nl_langinfo on POSIX systems. localeconv is not MT-safe.	2019-02-28 11:51:08 +01:00
Hans-Kristian Arntzen	825ff4af7e	Replace locale handling. We were using std::locale::global() to force a C locale which is not safe when SPIRV-Cross is used in a multi-threaded environment. To fix this, we could tap into various per-platform specific locale handling to get safe thread-local locales, but since locales only affect the decimal point in floats, we simply query the locale instead and do the necessary radix replacement ourselves, without touching the locale. This should be much safer and cleaner than the alternative.	2019-02-28 11:28:31 +01:00
Patrick Mours	da39a7b02f	Add support for SPV_NV_ray_tracing	2019-02-26 15:43:03 +01:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Hans-Kristian Arntzen	58f264c99d	Merge pull request #865 from KhronosGroup/fix-863 Always value-cast FP16 constants instead of using literals.	2019-02-20 14:58:44 +01:00
Hans-Kristian Arntzen	4ef51331b2	Always value-cast FP16 constants instead of using literals. GL_NV_gpu_shader5 doesn't support "hf", so to avoid lots of complicated workarounds, just value-cast the half literals.	2019-02-20 12:30:01 +01:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Chip Davis	e75add42c9	MSL: Add support for tessellation evaluation shaders. These are mapped to Metal's post-tessellation vertex functions. The semantic difference is much less here, so this change should be simpler than the previous one. There are still some hairy parts, though. In MSL, the array of control point data is represented by a special type, `patch_control_point<T>`, where `T` is a valid stage-input type. This object must be embedded inside the patch-level stage input. For this reason, I've added a new type to the type system to represent this. On Mac, the number of input control points to the function must be specified in the `patch()` attribute. This is optional on iOS. SPIRV-Cross takes this from the `OutputVertices` execution mode; the intent is that if it's not set in the shader itself, MoltenVK will set it from the tessellation control shader. If you're translating these offline, you'll have to update the control point count manually, since this number must match the number that is passed to the `drawPatches:...` family of methods. Fixes #120.	2019-02-14 10:00:08 -06:00
Hans-Kristian Arntzen	d7090b8322	GLSL: Fix block name shenanigans in edge cases. When we force recompile, the old var.self name we used as a fallback name might have been disturbed, so we should recover certain names back to their original form in case we are forced to take a recompile to make the naming algorithm more deterministic.	2019-02-13 16:39:59 +01:00
Hans-Kristian Arntzen	3e584f2c3f	Support LUTs in single-function CFGs on Private storage class. Fairly common pattern in unoptimized SPIR-V. Support this case as well.	2019-02-06 10:38:59 +01:00
Chip Davis	ef0b1fc841	Move assertions after the check for equal types. `bitcast_glsl_op()` is sometimes called for `Boolean` types, e.g. for specialization constants. We don't want the assert to trip if this is going to be a no-op anyway.	2019-01-31 14:28:21 -06:00
Hans-Kristian Arntzen	2ed171e525	GLSL/MSL: Implement 8-bit part of VK_KHR_shader_float16_int8. Storage was in place already, so mostly just dealing with bitcasts and constants. Simplifies some of the bitcasting logic, and this exposed some bugs in the implementation. Refactor to use correct width integers with explicit bitcast opcodes.	2019-01-30 15:45:24 +01:00
Hans-Kristian Arntzen	2edee351f0	Run format_all.sh.	2019-01-30 13:42:50 +01:00
Hans-Kristian Arntzen	3e09879131	Support initializers on StorageClassOutput.	2019-01-30 10:29:08 +01:00
Hans-Kristian Arntzen	8c632da461	MSL: Use correct alignment rule for whole structs. Structs are aligned as you would expect in MSL (maximum member alignment), and it is not minimum 16 bytes like in std140. Also rename the dummy "pad" members to a reserved naming scheme.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	3aa08f764e	MSL: Fix image load/store for short vectors. Same fixes as for GLSL.	2019-01-17 14:54:29 +01:00
Hans-Kristian Arntzen	73d9da7070	Avoid unintentional name conflict with HLSL backend.	2019-01-17 12:21:16 +01:00
Hans-Kristian Arntzen	432aaed737	Need to know the original packed type when unpacking loads.	2019-01-17 11:39:46 +01:00
Hans-Kristian Arntzen	40e7723051	Run format_all.sh.	2019-01-17 11:29:50 +01:00
Hans-Kristian Arntzen	de7e5ccd8b	Refactor out packed expressions to extended decorations. Can't safely just cast to the original enum without lots of hacks.	2019-01-17 11:28:51 +01:00
Hans-Kristian Arntzen	72377366d3	Replace custom use of DecorationCPacked with an explicit one. Will need to use more variants of this decoration, so might as well make it clearer what is going on with CPacked.	2019-01-17 10:36:56 +01:00
Hans-Kristian Arntzen	f4026a5618	Refactor access_chain_internal to be more readable from callsite.	2019-01-17 10:30:13 +01:00
Hans-Kristian Arntzen	15b52bee48	Deal with packing/unpacking on store. Still a bit buggy, since we cannot deduce between float2[] and packed_float2. Need a deeper refactor to plumb this through ...	2019-01-17 10:06:23 +01:00
Hans-Kristian Arntzen	7ee04936ac	MSL: Fix case where we pass arrays to functions by value. MSL does not support value semantics for arrays (sigh), so we need to force constant references and deal with copies if we have a different address space than what we end up guessing.	2019-01-14 11:00:14 +01:00
Hans-Kristian Arntzen	6e1c3ccb72	Run format_all.sh.	2019-01-11 12:56:00 +01:00
Hans-Kristian Arntzen	2fb9aa251e	Workaround bugs on MSVC. Bug: https://developercommunity.visualstudio.com/content/problem/303996/c-error-c2668-ambiguous-overloaded-in-lambda-with.html	2019-01-11 09:29:28 +01:00
Hans-Kristian Arntzen	b629878f45	Make meta a hashmap. A flat array was consuming way too much memory and was far too slow to initialize properly with a very large ID bound (8 million IDs, showed up as #1 hotspot in perf). Meta struct does not have to be in-order as we never iterate over it in a meaningful way, so using a hashmap here is reasonable. Very few IDs should need decorations or meta-data, so this should also be a quite decent memory save. For the pathological case, a 6x uplift was observed.	2019-01-10 14:04:01 +01:00
Hans-Kristian Arntzen	d92de00cc1	Rewrite how IDs are iterated over. This is a fairly fundamental change on how IDs are handled. It serves many purposes: - Improve performance. We only need to iterate over IDs which are relevant at any one time. - Makes sure we iterate through IDs in SPIR-V module declaration order rather than ID space. IDs don't have to be monotonically increasing, which was an assumption SPIRV-Cross used to have. It has apparently never been a problem until now. - Support LUTs of structs. We do this by interleaving declaration of constants and struct types in SPIR-V module order. To support this, the ParsedIR interface needed to change slightly. Before setting any ID with variant_set<T> we let ParsedIR know that an ID with a specific type has been added. The surface for change should be minimal. ParsedIR will maintain a per-type list of IDs which the cross-compiler will need to consider for later. Instead of looping over ir.ids[] (which can be extremely large), we loop over types now, using: ir.for_each_typed_id<SPIRVariable>([&](uint32_t id, SPIRVariable &var) { handle_variable(var); }); Now we make sure that we're never looking at irrelevant types.	2019-01-10 12:52:56 +01:00
Hans-Kristian Arntzen	ddfd261776	Fix input array size in tessellation evaluation shaders.	2019-01-09 10:47:16 +01:00
Chip Davis	fc02b3d656	Rename get_non_pointer_type() methods. This better reflects their purpose now.	2019-01-08 12:55:22 -06:00
Chip Davis	3bfb2f94d4	MSL: Support SPV_KHR_variable_pointers. This allows shaders to declare and use pointer-type variables. Pointers may be loaded and stored, be the result of an `OpSelect`, be passed to and returned from functions, and even be passed as inputs to the `OpPhi` instruction. All types of pointers may be used as variable pointers. Variable pointers to storage buffers and workgroup memory may even be loaded from and stored to, as though they were ordinary variables. In addition, this enables using an interior pointer to an array as though it were an array pointer itself using the `OpPtrAccessChain` instruction. This is a rather large and involved change, mostly because this is somewhat complicated with a lot of moving parts. It's a wonder SPIRV-Cross's output is largely unchanged. Indeed, many of these changes are to accomplish exactly that! Perhaps the largest source of changes was the violation of the assumption that, when emitting types, the pointer type didn't matter. One of the test cases added by the change doesn't optimize very well; the output of `spirv-opt` here is invalid SPIR-V. I need to file a bug with SPIRV-Tools about this. I wanted to test that variable pointers to images worked too, but I couldn't figure out how to propagate the access qualifier properly--in MSL, it's part of the type, so getting this right is important. I've punted on that for now.	2019-01-07 11:19:10 -06:00
Hans-Kristian Arntzen	d4926a0405	Deal with phi copies which happen inside continue blocks.	2019-01-07 14:24:07 +01:00
Hans-Kristian Arntzen	c8ddf7e7d5	Fix case where OpPhi is used to swap values.	2019-01-07 13:54:16 +01:00
Hans-Kristian Arntzen	cacfeef89e	Merge pull request #804 from KhronosGroup/fix-788 Forward meta information in OpCompositeExtract.	2019-01-07 11:43:43 +01:00
Hans-Kristian Arntzen	66263d4569	Forward meta information in OpCompositeExtract. Just like OpAccessChain we need to make use of the meta information available to use from access_chain_internal as we can extract a packed vector or transposed vector from a composite, not just memory load.	2019-01-07 10:43:55 +01:00
Hans-Kristian Arntzen	5b8762223d	Run format_all.sh.	2019-01-07 10:01:28 +01:00
Hans-Kristian Arntzen	649ce3c7bb	MSL: Workaround missing gradient2d() for sampler_compare.	2019-01-07 10:01:00 +01:00
Sidney Just	fbb4df3f1a	Added support for sampler2DRect and legacy texture2DRect() sampling function	2019-01-06 12:21:59 -08:00
Hans-Kristian Arntzen	211abfb7ef	Merge pull request #799 from KhronosGroup/fix-780 Use correct block-name / other-name aliasing rules.	2019-01-04 16:08:10 +01:00
Hans-Kristian Arntzen	9728f9c1b7	Use correct block-name / other-name aliasing rules. A block name cannot alias with any name in its own scope, and it cannot alias with any other "global" name. To solve this, we need to complicate the name cache updates a little bit where we have a "primary" namespace and "secondary" namespace.	2019-01-04 15:02:54 +01:00
Hans-Kristian Arntzen	acae607703	Register implied expression reads in OpLoad/OpAccessChain. This is required to avoid relying on complex sub-expression elimination in compilers, and generates cleaner code. The problem case is if a complex expression is used in an access chain, like: Composite comp = buffer[texture(...)]; vec4 a = comp.a + comp.b + comp.c; Before, we did not have common subexpression tracking for OpLoad/OpAccessChain, so we easily ended up with code like: vec4 a = buffer[texture(...)].a + buffer[texture(...)].b + buffer[texture(...)].c; A good compiler will optimize this, but we should not rely on it, and forcing texture(...) to a temporary also looks better. The solution is to add a vector "implied_expression_reads", which works similarly to expression_dependencies. We also need an extra mechanism in to_expression which lets us skip expression read checking and do it later. E.g. for expr -> access chain -> load, we should only trigger a read of expr when using the loaded expression.	2019-01-04 14:56:12 +01:00
Hans-Kristian Arntzen	318c17cbb2	Nonfunctional: Update copyright headers for 2019.	2019-01-04 12:38:35 +01:00
Hans-Kristian Arntzen	61f1d8b2cf	Support gl_HelperInvocation on GLSL and MSL. There is no obvious builtin for this on HLSL.	2018-11-28 15:18:43 +01:00
Hans-Kristian Arntzen	d0b937206f	Keep track of pointer-to-pointer depth in parser. Defer failure of pointer-to-pointer to compilation time, so we can still reflect VK_KHR_variable_pointer shaders.	2018-11-26 12:23:28 +01:00
Hans-Kristian Arntzen	04f410d35c	Fix unsigned switch case selectors.	2018-11-26 10:36:50 +01:00
Hans-Kristian Arntzen	816c1167ce	Handle invariant decoration more robustly. Avoids certain cases of variance between translation units by forcing every dependent expression of a store to be temporary. Should avoid the major failure cases where invariance matters.	2018-11-22 11:55:57 +01:00
Hans-Kristian Arntzen	2a8a4fe706	GLSL: Support extended arithmetic opcodes. - uaddCarry - usubBorrow - umulExtended - imulExtended	2018-11-13 14:50:46 +01:00
Hans-Kristian Arntzen	4e5c8d7199	Deal with depth_greater/depth_less qualifiers. Adds support on HLSL SM 5.0, and fixes bug on GLSL. Makes sure early fragment tests is tested on MSL as well.	2018-11-12 10:35:36 +01:00
Chip Davis	0d949e11ff	Support bitcasts of 16-bit types.	2018-11-05 14:56:36 -06:00
Chip Davis	ca4744ab72	Support constants of 16-bit integral type in GLSL and MSL. Constants of 8-bit type aren't supported in GLSL, since there's no extension letting you use them.	2018-11-02 14:39:55 -05:00
Chip Davis	117ccf407c	Use specific base types for 8- and 16-bit integers.	2018-11-01 17:45:10 -05:00
Chip Davis	1fb27b4cda	Add support for 8- and 16-bit types to GLSL and MSL. In GLSL, 8-bit types require GL_EXT_shader_8bit_storage. 16-bit types can use either GL_AMD_gpu_shader_int16/GL_AMD_gpu_shader_half_float or GL_EXT_shader_16bit_storage.	2018-11-01 10:20:57 -05:00
Hans-Kristian Arntzen	480acdad18	Deal with OpSpecConstantOp used as array size. When trying to validate buffer sizes, we usually need to bail out when using SpecConstantOps, but for some very specific cases where we allow unsized arrays currently, we can safely allow "unknown" sized arrays as well. This is probably the best we can do, when we have even more difficult cases than this, we throw a more sensible error message.	2018-11-01 14:58:02 +01:00
Hans-Kristian Arntzen	6e99fcf695	Run format_all.sh.	2018-11-01 11:23:48 +01:00
Hans-Kristian Arntzen	fd6ff3617a	Support macro overrides for spec constants in HLSL.	2018-11-01 11:23:48 +01:00
Grigory Dzhavadyan	a5d82d1138	Alter the handling of spec consts in non-Vulkan GLSL. Previously, when generating non-Vulkan GLSL, each use of a spec constant would be replaced by its default value and the declaration of the constant itself would be omitted completely. This change slightly alters this behavior. The uses of the constant are kept, as well as the declaration, although the latter is stripped of the layout qualifier. The declaration is also prepended with the following code: #ifndef <constant name>_value #define <constant name>_value <default constant value> #endif and the constant itself now looks like const <constant type> <constant name> = <constant name>_value; The rationale for this change is that it gives the user a way to provide custom values for specialization constants even when the target does not support them.	2018-11-01 00:39:09 -07:00
Arseny Kapoulkine	7f055e8a68	Fix Options::force_temporary to work with OpenGL GLSL. Setting force_temporary to true produces invalid GLSL because sampler variables are copied: highp sampler2D _377 = DiffuseMapTexture; This change fixes the problem by always forwarding forwardable variables. I also took the opportunity to restructure the code to make it easier to read and to add extra conditions in the future.	2018-10-30 10:49:18 -07:00
Hans-Kristian Arntzen	6157bf3cae	Add Windows support in Travis CI. - Add new Windows support - Use CMake/CTest instead of Make + shell scripts - Use --parallel in CTest - Fix CTest on Windows - Cleanups in test_shaders.py - Force specific commit for SPIRV-Headers - Fix Inf/NaN odd-ball case by moving to ASM	2018-10-27 00:22:30 +02:00
Hans-Kristian Arntzen	5bcf02f7c9	Hoist out parsing module from spirv_cross::Compiler. This is a large refactor which splits out the SPIR-V parser from Compiler and moves it into its more appropriately named Parser module. The Parser is responsible for building a ParsedIR structure which is then consumed by one or more compilers. Compiler can take a ParsedIR by value or move reference. This should allow for optimal case for both multiple compilations and single compilation scenarios.	2018-10-19 12:01:31 +02:00
Chip Davis	2506046cb4	Merge remote-tracking branch 'origin' into resource-arrays-msl	2018-09-27 10:50:16 -05:00
Hans-Kristian Arntzen	c07c303999	Use GL_EXT_samplerless_texture_functions in Vulkan GLSL.	2018-09-27 13:36:38 +02:00
Chip Davis	3a9af9681c	MSL: Expand arrays of buffers passed as input. Even as of Metal 2.1, MSL still doesn't support arrays of buffers directly. Therefore, we must manually expand them. In the prologue, we define arrays holding the argument pointers; these arrays are what the transpiled code ends up referencing. We might be able to do similar things for textures and samplers prior to MSL 2.0. Speaking of which, also enable texture arrays on iOS MSL 1.2.	2018-09-26 20:48:09 -05:00
Hans-Kristian Arntzen	de365f2e21	Merge branch 'master' of git://github.com/lifpan/SPIRV-Cross	2018-09-18 10:52:26 +02:00
Hans-Kristian Arntzen	3b5968bb26	Deal with switch cases which break out of a loop. Need some pretty hideous ladder variable system, but high level languages do not support breaking out of a loop. break in switch blocks and break in loops alias each other.	2018-09-18 10:50:48 +02:00
lifpan	e4d8ef2044	Propagate loop dominator to switch-default block. This is necessary if OpSwitch is inside a loop.	2018-09-18 15:53:02 +08:00
Hans-Kristian Arntzen	737715214e	Implement atomic increment/decrement in GLSL and HLSL.	2018-09-17 15:54:21 +02:00
Hans-Kristian Arntzen	340957a3ab	Make fixup_hooks more flexible. No reason why it needs to return a string. Callbacks can just do one or more statements themselves.	2018-09-17 14:06:44 +02:00
Hans-Kristian Arntzen	d310060f92	MSL: Support global I/O block and struct Input/Output usage. Implement this by flattening outputs and unflattening inputs explicitly. This allows us to pass down a single struct instead of dealing with the insanity that would be passing down each flattened member separately. Remove stage_uniforms_var_id. Seems to be dead code. Naked uniforms do not exist in SPIR-V for Vulkan, which this seems to have been intended for. It was also unused elsewhere.	2018-09-13 16:04:24 +02:00
Hans-Kristian Arntzen	89e3b8ff0d	Run format_all.sh.	2018-09-12 10:53:50 +02:00
Hans-Kristian Arntzen	2f65a1583e	MSL: Support array-of-arrays composite construction.	2018-09-12 10:25:51 +02:00
Hans-Kristian Arntzen	32a0d05e05	Bitcast loads from builtin compute variables.	2018-09-11 09:43:28 +02:00
Hans-Kristian Arntzen	63f6466065	Support Component decoration in GLSL.	2018-09-10 12:13:26 +02:00
Hans-Kristian Arntzen	57a15dfb0c	Run format_all.sh.	2018-09-10 10:08:02 +02:00
Hans-Kristian Arntzen	b114889102	Only declare typed initializer list for non-array types. Also, cleanup now redundant constant_expression virtualization for MSL.	2018-09-10 10:04:17 +02:00
Chip Davis	3dc23615dd	Fix formatting.	2018-08-29 10:08:33 -05:00
Chip Davis	fcad019e11	Support the shader_draw_parameters extension.	2018-08-29 10:07:21 -05:00
Hans-Kristian Arntzen	87de951105	MSL: Fix naming issue of aliased global variables. When the name of an alias global variable collides with a global declaration, MSL would emit inconsistent names, sometimes with the naming fix, sometimes without, because names were being tracked in two separate meta blocks. Fix this by always redirecting parameter naming to the original base variable as necessary.	2018-08-27 09:59:55 +02:00
Hans-Kristian Arntzen	ae859934ca	Use GL_NV_gpu_shader5 as a fallback for AMD_gpu_shader_half_float.	2018-08-23 15:37:09 +02:00
Hans-Kristian Arntzen	20c8e6787c	Get fallback name for block if name is empty.	2018-08-21 12:17:40 +02:00
Hans-Kristian Arntzen	f6ec83e5d4	GLSL: Allow blocks to have their own namespace.	2018-08-21 11:29:08 +02:00
Hans-Kristian Arntzen	3a268796e2	Deal with loop variable initializers for non-for loops.	2018-08-06 12:52:22 +02:00
Hans-Kristian Arntzen	cc7679ee45	Workaround NOMINMAX issues on Windows. ::max() can be overridden if you forget NOMINMAX on Windows. Hardcode literals instead. UINT32_MAX also requires weird macros in C++.	2018-07-17 00:10:12 +02:00
Hans-Kristian Arntzen	18b82caf83	Properly track read dependencies for UAV access chain.	2018-07-09 14:02:50 +02:00
Hans-Kristian Arntzen	e1367e609a	Fix a lot of redundant code when loading flattened composites.	2018-07-06 10:57:23 +02:00
Hans-Kristian Arntzen	2bf57d6dff	Deal with composite constants in variable initializer.	2018-07-05 15:29:49 +02:00
Hans-Kristian Arntzen	8c314112b4	Run format_all.sh.	2018-07-05 14:18:34 +02:00
Hans-Kristian Arntzen	5143695080	Don't need to enclose expression for arrays.	2018-07-05 14:09:25 +02:00
Hans-Kristian Arntzen	d29f48ef06	Deduce constant LUTs from read-write variables.	2018-07-05 13:25:57 +02:00
Hans-Kristian Arntzen	b5ed706860	Hoist out variable scope analysis.	2018-07-05 10:42:05 +02:00

901 Commits