Commit Graph

470 Commits

Author SHA1 Message Date
Chip Davis
688c5fcbda MSL: Add support for processing more than one patch per workgroup.
This should hopefully reduce underutilization of the GPU, especially on
GPUs where the thread execution width is greater than the number of
control points.

This also simplifies initialization by reading the buffer directly
instead of using Metal's vertex-attribute-in-compute support. It turns
out the only way in which shader stages are allowed to differ in their
interfaces is in the number of components per vector; the base type must
be the same. Since we are using the raw buffer instead of attributes, we
can now also emit arrays and matrices directly into the buffer, instead
of flattening them and then unpacking them. Structs are still flattened,
however; this is due to the need to handle vectors with fewer components
than were output, and I think handling this while also directly emitting
structs could get ugly.

Another advantage of this scheme is that the extra invocations that were
needed to read the attributes when there were more input than output
points are no longer necessary. The number of threads per workgroup is
now lcm(SIMD-size, output control points). This should ensure we always
process a whole number of patches per workgroup.

To avoid complexity handling indices in the tessellation control shader,
I've also changed the way vertex shaders for tessellation are handled.
They are now compute kernels using Metal's support for vertex-style
stage input. This lets us always emit vertices into the buffer in order
of vertex shader execution. Now we no longer have to deal with indexing
in the tessellation control shader. This also fixes a long-standing
issue where if an index were greater than the number of vertices to
draw, the vertex shader would wind up writing outside the buffer, and
the vertex would be lost.

This is a breaking change, and I know SPIRV-Cross has other clients, so
I've hidden this behind an option for now. In the future, I want to
remove this option and make it the default.
2020-07-23 17:59:54 -05:00
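A minimal sketch, in C++, of the workgroup-size calculation described in the commit above; the function name is illustrative and not part of the change.

#include <cstdint>
#include <numeric>

// Threads per threadgroup chosen so that every workgroup processes a whole
// number of patches: lcm(SIMD width, number of output control points).
uint32_t threads_per_threadgroup(uint32_t simd_width, uint32_t output_control_points)
{
    return std::lcm(simd_width, output_control_points);
}

For example, a SIMD width of 32 with 4 output control points gives 32 threads (8 patches per workgroup), while 3 control points give 96 threads (32 patches per workgroup).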
dan sinclair
c4f3d4ae29 Roll GLSLang, SPIRV-Headers and SPIRV-Tools.
This CL updates the various dependencies and the test file outputs.
2020-07-22 23:03:11 -04:00
dan sinclair
63fbdaca93 Roll deps.
This CL updates the GLSLang and SPIRV-Tools dependencies and updates test
files as needed.
2020-07-06 11:24:30 -04:00
Hans-Kristian Arntzen
711300baed MSL: Do not emit swizzled writes in packing fixups.
Similar to the scalar access chain fix: emitting these causes a
read-modify-write on memory we're not supposed to write to.
2020-07-06 10:03:46 +02:00
Hans-Kristian Arntzen
fa5b206d97 MSL: Workaround broken vector -> scalar access chain in MSL.
On MSL, the compiler refuses to allow access chains into a normal vector type.
What happens in practice instead is a read-modify-write where a vector type is
loaded, modified and written back.

The workaround is to convert a vector into a pointer-to-scalar before
the access chain continues to add the scalar index.
2020-07-06 10:03:44 +02:00
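An illustrative MSL fragment of the workaround above (buffer and variable names are hypothetical): rather than letting the access chain index the vector member directly, the vector is first reinterpreted as a pointer to scalar, so only the addressed component is written.

// Indexing the vector directly makes the MSL compiler load, modify and
// store the whole float4:
//     buf.v[index] = value;
// The workaround writes through a pointer-to-scalar instead:
((device float*)&buf.v)[index] = value;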
Hans-Kristian Arntzen
3afbfdb090 Implement context-sensitive expression read tracking.
When inside a loop, treat any read of outer expressions as happening
multiple times, forcing a temporary of said outer expressions.
This avoids the problem where we can end up relying on loop-invariant code motion to happen in the
compiler when converting optimized shaders.
2020-06-29 12:20:35 +02:00
Hans-Kristian Arntzen
02db4c1f16 MSL: Add tests for array copies in and out of buffers. 2020-06-18 11:59:02 +02:00
Chip Davis
5281d9997e MSL: Fix up input variables' vector lengths in all stages.
Metal is picky about interface matching. If the types don't match
exactly, down to the number of vector components, Metal fails pipeline
compilation. To support pipelines where the number of components
consumed by the fragment shader is less than that produced by the vertex
shader, we have to fix up the fragment shader to accept all the
components produced.
2020-06-16 14:50:30 -05:00
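A hedged MSL sketch of the fix-up described above, using hypothetical names: if the vertex stage writes a float4 at location 0, the fragment input struct is widened to float4 even though the shader only consumes three components.

#include <metal_stdlib>
using namespace metal;

struct main0_in
{
    float4 color [[user(locn0)]];   // widened from float3 to match the vertex output
};

fragment float4 main0(main0_in in [[stage_in]])
{
    float3 color = in.color.xyz;    // the shader still reads only three components
    return float4(color, 1.0);
}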
Hans-Kristian Arntzen
107ab7c2b7 MSL: Avoid packed arrays in more cases.
Extend the array stride relaxation to non-packed arrays as well, as
long as the array in question contains a single array element.
2020-05-06 10:27:12 +02:00
Le Hoang Quyen
9ddfe6db6d Fix #1359: MSL: If the packed type is scalar, don't emit "pack_" prefix.
Scalar type is already packed in metal.
2020-05-06 00:43:34 +08:00
Hans-Kristian Arntzen
17ad62eea4 MSL: Support edge case with DX layout in scalar block layout.
DX may emit ArrayStride and MatrixStride of 16, but the size of the
object does not align with that, and it expects to pack other members
inside its last member.

The workaround is to emit array size/col/row one less than we expect and
rely on padding to carve out a "dead zone" for the last member.
2020-04-20 15:29:24 +02:00
Hans-Kristian Arntzen
ebf463674d MSL: Allow removing clip distance user varyings.
Only safe if user knows that subsequent shader stage will not read clip
distance.
2020-04-20 09:58:40 +02:00
Chip Davis
96f7008aa8 MSL: Force disabled fragment builtins to have the right name.
DXVK emits SPIR-V where fragment shader builtins have names derived from
DXBC assembly, e.g. `oDepth` for `FragDepth`. When we declared the
disabled output, we used this name, but when referencing it, we
continued to use the GLSL name. This breaks compilation.
2020-04-15 19:25:18 -05:00
Chip Davis
495e48de44 MSL: Only disable output variables in fragment shaders.
Forgot to do this in #1319.

Fixes #1322.
2020-04-15 12:14:57 -05:00
Chip Davis
b29f83c383 MSL: Add options to control emission of fragment outputs.
Like with `point_size` when not rendering points, Metal complains when
writing to a variable with the `[[depth]]` qualifier when no depth
buffer is attached. In that case, we must avoid emitting `FragDepth`,
just like with `PointSize`.

I assume it will also complain if there is no stencil attachment and the
shader writes to `[[stencil]]`, or if it writes to `[[color(n)]]` but
there is no color attachment at n.
2020-04-13 15:29:11 -05:00
Hans-Kristian Arntzen
d91e134500 MSL: Add native array test for composite array initialization. 2020-02-24 13:34:51 +01:00
Hans-Kristian Arntzen
20b28f72fa MSL: Reinstate workaround for returning arrays. 2020-02-24 13:04:10 +01:00
Hans-Kristian Arntzen
c9d4f9cd74 MSL: Add a workaround path to force native arrays for everything. 2020-02-24 12:47:14 +01:00
Chip Davis
ae6c05f6f4 MSL: Move inline uniform blocks to the end of the argument buffer.
Limit inline blocks to one per descriptor set.

This should avoid the need for complicated code to calculate the
argument buffer ID stride of an inline uniform block. If there's demand
for more inline blocks, we can revisit this.
2020-01-25 13:40:51 -06:00
Chip Davis
fedbc35315 MSL: Support inline uniform blocks in argument buffers.
Here, the inline uniform block is explicit: we instantiate the buffer
block itself in the argument buffer, instead of a pointer to the buffer.
I just hope this will work with the `MTLArgumentDescriptor` API...

Note that Metal recursively assigns IDs to the individual members of
embedded structs. This means that, for automatic assignment, we have to
calculate the binding stride for a given buffer block. For MoltenVK,
we'll simply increment the ID by the size of the inline uniform block.
Then the later IDs will never conflict with the inline uniform block. We
can get away with this because Metal doesn't require that IDs be
contiguous, only monotonically increasing.
2020-01-24 18:51:24 -06:00
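A sketch of the layout described above, with illustrative struct and member names: the inline uniform block is instantiated by value inside the argument buffer struct, after the other resources.

#include <metal_stdlib>
using namespace metal;

struct InlineBlock
{
    float4 tint;
};

struct spvDescriptorSet0
{
    device float4* positions [[id(0)]];
    texture2d<float> tex     [[id(1)]];
    InlineBlock inline_data  [[id(2)]];   // embedded directly, not behind a pointer
};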
Hans-Kristian Arntzen
93f3265fe0 MSL: Deal with packing vectors for vertex input/fragment output. 2020-01-07 14:14:31 +01:00
Hans-Kristian Arntzen
a3fe9756d2 MSL: Support ClipDistance as an input stage variable.
MSL does not support this, so we have to emulate it by passing it around
as a varying between stages. We use a special "user(clipN)" attribute
for this rather than locN, which is used for user varyings.
2019-12-02 13:19:42 +01:00
Hans-Kristian Arntzen
b85ab5f5ff MSL: Fix automatic binding allocation for image atomic buffers.
The Primary decoration was used by the atomic buffer, causing the
texture binding to be potentially overlapping with other resources.
2019-11-28 11:07:44 +01:00
Hans-Kristian Arntzen
b56c2f4271 Merge branch 'deps' of git://github.com/dj2/SPIRV-Cross 2019-11-06 10:43:17 +01:00
Dan Sinclair
d409210ee5 Move all .invalid shaders into no-opt folders. 2019-11-05 13:19:19 -05:00
Dan Sinclair
9da21c24a4 Roll GLSLang and SPIRV-Tools
This CL rolls the GLSLang and SPIRV-Tools dependencies to tip-of-tree
and updates the tests as required.
2019-11-05 10:07:15 -05:00
Hans-Kristian Arntzen
32bd1329b6 Remove another dead reference file. 2019-10-28 11:51:31 +01:00
Hans-Kristian Arntzen
8670f7cbab Remove some more dead reference files. 2019-10-28 11:49:57 +01:00
Hans-Kristian Arntzen
fa011f8547 MSL: Declare arrays with proper type wrapper.
Need to construct with value type spvUnsafeArray<T, N>({ elem0, elem1 })
to make array initialization work in complex scenarios.
2019-10-26 17:57:34 +02:00
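A small illustration of the construction pattern from the commit above; it relies on the spvUnsafeArray<T, N> helper that SPIRV-Cross emits, and the values are arbitrary.

// Constructing with the value type disambiguates the initializer list,
// including for nested arrays:
spvUnsafeArray<float, 2> flat = spvUnsafeArray<float, 2>({ 1.0, 2.0 });
spvUnsafeArray<spvUnsafeArray<float, 2>, 2> nested =
    spvUnsafeArray<spvUnsafeArray<float, 2>, 2>({ spvUnsafeArray<float, 2>({ 1.0, 2.0 }),
                                                  spvUnsafeArray<float, 2>({ 3.0, 4.0 }) });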
Hans-Kristian Arntzen
e1acbd3dcf MSL: Declare struct type explicitly.
Disambiguates initializer list.
2019-10-26 16:21:46 +02:00
Hans-Kristian Arntzen
1fc3347873 MSL: Fix array of array declaration.
Arrays-of-arrays were declared in wrong order.
2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen
27d6d45671 MSL: Rewrite tessellation_access_chain.
To support loading arrays of arrays properly in tessellation, we need a
rewrite of how tessellation access chains are handled.

The major change is to remove the implicit unflatten step inside
access_chain which does not take into account the case where you load
directly from a control point array variable.

We defer the unflatten step until OpLoad time instead.
This fixes cases where we load array of {array,matrix,struct}.

Removes the hacky path for MSL access chain index workaround.
2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen
3b5c4c7316 Implement constant empty struct correctly on all backends.
MSL actually supports empty structs, so enable that path as well.
2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen
8066d13599 MSL: Rewrite propagated depth comparison state handling.
It is far cleaner, and more correct, to run the traversal twice.
Fixes a case where we propagate depth state through multiple functions.
2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen
2767257adc MSL: Do not declare array of UBO/SSBO as spvUnsafeArray<T>.
There is no need for these to be copied, and this cuts down on template
stamping bloat.
2019-10-26 16:10:08 +02:00
Hans-Kristian Arntzen
a82ecbeba9 MSL: Enable proper value types for return and value-passing of arrays.
Now that we have spvUnsafeArray<T> there is no need to deal with these
special purpose cases.
2019-10-24 13:03:38 +02:00
Hans-Kristian Arntzen
d1479f871a MSL: Do not generate UnsafeArray<> for any array inside buffer objects.
This avoids a lot of huge code changes.
Arrays generally cannot be copied in and out of buffers, at least no
compiler frontend seems to do it.

Also avoids a lot of issues surrounding packed vectors and matrices.
2019-10-24 12:22:30 +02:00
Hans-Kristian Arntzen
db55d474f9 MSL: Do not declare complex composite array in main for non-inlined.
Need to consider that complex composite arrays may be used in leaf
functions, and avoid the MSL library link fix unless everything is
nicely inlined.
2019-10-24 11:12:01 +02:00
Lukas Hermanns
c236ca4572 Moved all UE4 test shaders into 'shaders-ue4/' folder. 2019-10-23 17:39:05 -04:00
Lukas Hermanns
b0d616aa6d Removed 'argument_buffer_offset' and fixed packed matrix Metal output. 2019-10-23 16:28:32 -04:00
Lukas Hermanns
84351d3aed Merge remote-tracking branch 'upstream/master' 2019-10-21 18:55:36 -04:00
Lukas Hermanns
e1b161b54b Removed bounds checks in favor of SPIRV-Tools pass '--graphics-robust-access' 2019-10-21 16:39:53 -04:00
Hans-Kristian Arntzen
4d6a223cbe
Merge pull request #1175 from KhronosGroup/fix-1164
Implement unordered compare on GLSL/HLSL.
2019-10-14 19:01:00 +02:00
Hans-Kristian Arntzen
4bb673a626 MSL: Add opt-in support for huge IABs.
If there are enough members in an IAB, we cannot use the constant
address space, as the MSL compiler complains about there being too many
members. Support emitting the device address space instead.
2019-10-14 16:20:34 +02:00
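An illustrative MSL sketch of the opt-in above (struct and entry-point names are hypothetical): when the IAB has too many members for the constant address space, it is bound through the device address space instead.

#include <metal_stdlib>
using namespace metal;

struct spvDescriptorSet0
{
    // ... a very large number of [[id(n)]] members ...
    device float4* data [[id(0)]];
};

// With the opt-in, the argument buffer is taken by device reference rather
// than the usual constant reference.
fragment float4 main0(const device spvDescriptorSet0& set0 [[buffer(0)]])
{
    return set0.data[0];
}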
Hans-Kristian Arntzen
a9be92569f HLSL: Fix unrolled S/G LE/LT/GE/GT opcodes.
Need to bitcast the unrolled expressions as well.
2019-10-14 16:08:39 +02:00
Lukas Hermanns
0853bcaee1 Disabled spvUnsafeArray<> type for packed vectors and added test cases for those arrays. 2019-10-09 17:59:47 -04:00
Lukas Hermanns
ffbd801853 Added '--msl-invariant-float-math' option and new test case for it. 2019-10-09 14:03:06 -04:00
Lukas Hermanns
f3a6d28a1d Further updates for pull request #1162; also added two test cases for spvCubemapTo2DArrayFace function and added '--msl-framebuffer-fetch'/ '--msl-emulate-cube-array' compiler options. 2019-09-27 15:49:54 -04:00
Lukas Hermanns
c3d6022956 Update for pull request #1162 rev. 1 2019-09-24 18:13:04 -04:00
Lukas Hermanns
7ad0a84778 Updates for pull request #1162 2019-09-24 14:35:25 -04:00
Lukas Hermanns
37df74035b Merge branch 'ue4_dev' 2019-09-20 09:42:42 -04:00
Hans-Kristian Arntzen
3c11254ece MSL: Fix 16-bit integer literals.
There is no suffix, so bitcasts failed.
2019-09-19 10:19:51 +02:00
Ryan Harrison
cf1bf1c6ae Update external/ to SPIR-V 1.5
Rolled the hashes used for glslang, SPIRV-Tools, and SPIRV-Headers to
HEAD, which includes the update to 1.5.

Added '--amb' to the glslang invocation, so I didn't have to explicitly
set bindings in the large number of test shaders that currently don't
set them and that glslang now considers invalid.

Marked all shaders that no longer pass spirv-val as .invalid.
2019-09-18 16:04:27 -04:00
Lukas Hermanns
744cc3e595 Updated test shaders. 2019-09-18 14:18:22 -04:00
Lukas Hermanns
cb3ecb9e1b Updated reference Metal shaders. 2019-09-17 15:11:19 -04:00
Lukas Hermanns
0be20cd933 Renamed new test shaders to fit the naming convention in SPIRV-Cross. 2019-09-16 10:33:45 -04:00
Lukas Hermanns
9573faa56d Removed all '.DS_Store' files. 2019-09-13 14:04:32 -04:00
Mark Satterthwaite
564cb3c08d Update the Metal shaders to account for changes in the shader compilation. 2019-09-11 15:06:05 -04:00
Chip Davis
cb35934248 MSL: Support dynamic offsets for buffers in argument buffers.
Vulkan has two types of buffer descriptors,
`VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC` and
`VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC`, which allow the client to
offset the buffers by an amount given when the descriptor set is bound
to a pipeline. Metal provides no direct support for this when the buffer
in question is in an argument buffer, so once again we're on our own.
These offsets cannot be stored or associated in any way with the
argument buffer itself, because they are set at bind time.  Different
pipelines may have different offsets set. Therefore, we must use a
separate buffer, not in any argument buffer, to hold these offsets. Then
the shader must manually offset the buffer pointer.

This change fully supports arrays, including arrays of arrays, even
though Vulkan forbids them. It does not, however, support runtime
arrays. Perhaps later.
2019-09-05 23:29:00 -05:00
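A hedged MSL sketch of the mechanism described above; binding indices and names are made up. The dynamic offsets live in their own buffer outside any argument buffer, and the shader rebases the pointer itself.

#include <metal_stdlib>
using namespace metal;

struct UBO
{
    float4 color;
};

struct spvDescriptorSet0
{
    device UBO* ubo [[id(0)]];
};

fragment float4 main0(device spvDescriptorSet0& set0 [[buffer(0)]],
                      constant uint* spvDynamicOffsets [[buffer(23)]])
{
    // Apply the offset that was supplied when the descriptor set was bound.
    device UBO* ubo = (device UBO*)((device char*)set0.ubo + spvDynamicOffsets[0]);
    return ubo->color;
}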
Chip Davis
103817009c MSL: Force storage images on iOS to use discrete descriptors.
Writable textures cannot use argument buffers on iOS. They must be
passed as arguments directly to the shader function. Since we won't know
if a given storage image will have the `NonWritable` decoration at the
time we encode the argument buffer, we must therefore pass all storage
images as discrete arguments. Previously, we were throwing an error if
we encountered an argument buffer with a writable texture in it on iOS.
2019-09-05 11:01:05 -05:00
Mark Satterthwaite
b5ad5d4e2f UE4 shader reference for those shaders that will compile without the changes. 2019-09-04 13:02:34 -04:00
Chip Davis
2eff420d9a Support the SPV_EXT_fragment_shader_interlock extension.
This was straightforward to implement in GLSL. The
`ShadingRateInterlockOrderedEXT` and `ShadingRateInterlockUnorderedEXT`
modes aren't implemented yet, because we don't support
`SPV_NV_shading_rate` or `SPV_EXT_fragment_invocation_density` yet.

HLSL and MSL were more interesting. They don't support this directly,
but they do support marking resources as "rasterizer ordered," which
does roughly the same thing. So this implementation scans all accesses
inside the critical section and marks all storage resources found
therein as rasterizer ordered. They also don't support the fine-grained
controls on pixel- vs. sample-level interlock and disabling ordering
guarantees that GLSL and SPIR-V do, but that's OK. "Unordered" here
merely means the order is undefined; that it just so happens to be the
same as rasterizer order is immaterial. As for pixel- vs. sample-level
interlock, Vulkan explicitly states:

> With sample shading enabled, [the `PixelInterlockOrderedEXT` and
> `PixelInterlockUnorderedEXT`] execution modes are treated like
> `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT`
> respectively.

and:

> If [the `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT`]
> execution modes are used in single-sample mode they are treated like
> `PixelInterlockOrderedEXT` or `PixelInterlockUnorderedEXT`
> respectively.

So this will DTRT for MoltenVK and gfx-rs, at least.

MSL additionally supports multiple raster order groups; resources that
are not accessed together can be placed in different ROGs to allow them
to be synchronized separately. A more sophisticated analysis might be
able to place resources optimally, but that's outside the scope of this
change. For now, we assign all resources to group 0, which should do for
our purposes.

`glslang` doesn't support the `RasterizerOrdered` UAVs this
implementation produces for HLSL, so the test case needs `fxc.exe`.

It also insists on GLSL 4.50 for `GL_ARB_fragment_shader_interlock`,
even though the spec says it needs either 4.20 or
`GL_ARB_shader_image_load_store`; and it doesn't support the
`GL_NV_fragment_shader_interlock` extension at all. So I haven't been
able to test those code paths.

Fixes #1002.
2019-09-02 12:31:10 -05:00
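A short MSL sketch of the mapping described above, with hypothetical resource names: every storage resource accessed inside the critical section is marked rasterizer ordered, all in group 0.

#include <metal_stdlib>
using namespace metal;

fragment void main0(device atomic_uint* counters [[buffer(0), raster_order_group(0)]],
                    texture2d<float, access::read_write> img [[texture(0), raster_order_group(0)]],
                    float4 gl_FragCoord [[position]])
{
    // Accesses between the interlock begin/end markers become ordered per
    // pixel because the resources share a raster order group.
    uint2 coord = uint2(gl_FragCoord.xy);
    float4 v = img.read(coord);
    img.write(v + 1.0, coord);
    atomic_fetch_add_explicit(&counters[0], 1u, memory_order_relaxed);
}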
Chip Davis
39dce88d3b MSL: Add support for sampler Y'CbCr conversion.
This change introduces functions and in one case, a class, to support
the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of
GBGR8 and BGRG8 formats, for which Metal natively supports implicit
chroma reconstruction, we're on our own here. We have to do everything
ourselves. Much of the complexity comes from the need to support
multiple planes, which must now be passed to functions that use the
corresponding combined image-samplers. The rest is from the actual
Y'CbCr conversion itself, which requires additional post-processing of
the sample retrieved from the image.

Passing sampled images to a function was a particular problem. To
support this, I've added a new class which is emitted to MSL shaders
that pass sampled images with Y'CbCr conversions attached around. It
can handle sampled images with or without Y'CbCr conversion. This is an
awful abomination that should not exist, but I'm worried that there's
some shader out there which does this. This support requires Metal 2.0
to work properly, because it uses default-constructed texture objects,
which were only added in MSL 2. I'm not even going to get into arrays of
combined image-samplers--that's a whole other can of worms.  They are
deliberately unsupported in this change.

I've taken the liberty of refactoring the support for texture swizzling
while I'm at it. It's now treated as a post-processing step similar to
Y'CbCr conversion. I'd like to think this is cleaner than having
everything in `to_function_name()`/`to_function_args()`. It still looks
really hairy, though. I did, however, get rid of the explicit type
arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`.

Update the C API. In addition to supporting this new functionality, add
some compiler options that I added in previous changes, but for which I
neglected to update the C API.
2019-09-01 18:35:53 -05:00
Hans-Kristian Arntzen
9b845a4788
Merge pull request #1141 from troughton/inline-everything
MSL: Inline all non-entry-point functions
2019-08-30 11:05:04 +02:00
Thomas Roughton
91b2f34a3d Update tests to account for all non-entry-point functions being inlined 2019-08-30 09:39:06 +12:00
Hans-Kristian Arntzen
07c76f66b5 MSL: Add {Base,}{Vertex,Instance}Index to bitcast_from_builtin_load.
Totally missed these, so float(index) would not work correctly for
negative numbers.
2019-08-29 13:56:37 +02:00
Hans-Kristian Arntzen
9436cd3036 MSL: Deal with array copies from and to threadgroup. 2019-08-27 13:18:01 +02:00
Chip Davis
fb5ee4cb5c MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase().
This command allows the caller to set the base value of
`BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal
provides no direct support for this... but it does provide a builtin,
`[[grid_origin]]`, normally used to pass the base values for the stage
input region, which we will now abuse to pass the dispatch base and
avoid burning a buffer binding.

`[[grid_origin]]`, as part of Metal's support for compute stage input,
requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer.

(Curiously, this builtin was undocumented until the MSL 2.2 release. Go
figure.)
2019-07-24 08:56:15 -05:00
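A brief MSL sketch of the trick described above (variable names are illustrative): the dispatch base arrives through [[grid_origin]] and is added to the threadgroup position to reconstruct BuiltInWorkgroupId.

#include <metal_stdlib>
using namespace metal;

kernel void main0(uint3 gl_WorkGroupID [[threadgroup_position_in_grid]],
                  uint3 spvDispatchBase [[grid_origin]])
{
    // adjusted_id reflects the base passed to vkCmdDispatchBase();
    // a real shader would use it to index its resources.
    uint3 adjusted_id = gl_WorkGroupID + spvDispatchBase;
}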
Hans-Kristian Arntzen
c62503bca7 Do not attempt to pack types which are already scalar. 2019-07-24 11:52:28 +02:00
Hans-Kristian Arntzen
461f1506e7 Do not eagerly invalidate all active variables on a branch.
This is not necessary, as we must emit an invalidating store before we
potentially consume an invalid expression. In fact, we're a bit
conservative here in this case for example:

int tmp = variable;
if (...)
{
    variable = 10;
}
else
{
    // Consuming tmp here is fine, but it was
    // invalidated while emitting other branch.
    // Technically, we need to study if there is an invalidating store
    // in the CFG between the loading block and this block, and the other
    // branch will not be a part of that analysis.
    int tmp2 = tmp * tmp;
}

Fixing this case means complex CFG traversal *everywhere*, and it feels like overkill.

Fixing this exposed a bug with access chains where expression dependencies were not
inherited properly, so fix that as well. Access chains are now considered forwarded if there
is at least one dependency which is also forwarded.
2019-07-24 11:17:30 +02:00
Hans-Kristian Arntzen
18bcc9b790 Do not disable temporary forwarding when we suppress usage tracking.
This subtle bug removed any expression validation for trivially swizzled
variables. Make usage suppression a more explicit concept rather than
just hacking off forwarded_temporaries.

There is some fallout here with loop generation since our expression
invalidation is currently a bit too naive to handle loops properly.
The forwarding bug masked this problem until now.

If part of the loop condition is also used in the body, we end up
reading an invalid expression, which in turn forces a temporary to be
generated in the condition block, which is not good. We'll need to be smarter
here ...
2019-07-23 19:18:44 +02:00
Hans-Kristian Arntzen
7277c7ac46 Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL. 2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen
47a18b9f1b Simplify row-major matrix/vector multiplies. 2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen
6057ffcbb1 Deal correctly with complete stores to row_major matrices. 2019-07-22 15:49:17 +02:00
Hans-Kristian Arntzen
19f5cd3e90 Declare correct matrix type when unpacking. 2019-07-22 13:25:45 +02:00
Hans-Kristian Arntzen
180a6b38c5 Fix some row-major column store cases. 2019-07-22 12:56:14 +02:00
Hans-Kristian Arntzen
249f8e5180 MSL: Support storing to row-major column.
Defer transposes to actual Load or Store.
2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen
be2fccd837 Tests run clean. 2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen
6c1f97b4a9 Fix unpacking of packed but not remapped types on load. 2019-07-19 14:50:35 +02:00
Hans-Kristian Arntzen
b66a53a979 Traverse correct types when checking scalar layout. 2019-07-19 14:43:42 +02:00
Hans-Kristian Arntzen
e90d816cdd Deal with scalar layout of entire structs.
Mark all candidate struct types.
2019-07-19 14:18:14 +02:00
Chip Davis
12a8654784 Don't forward uses of an OpIsHelperInvocationEXT op.
If this is computed *before* a `demote`, but used *after*, forwarding it
will produce the wrong value. This does make for uglier shaders, but
it's necessary right now to ensure correctness.

I needed to use an assembly shader to produce the test for this.
`spirv-opt` is not smart enough (or too smart?) to eliminate the
variable that would be used in GLSL to express this.
2019-07-18 17:32:35 -05:00
Chip Davis
50dce10c5d Support the SPV_EXT_demote_to_helper_invocation extension.
This extension provides a new operation which causes a fragment to be
discarded without terminating the fragment shader invocation. The
invocation for the discarded fragment becomes a helper invocation, so
that derivatives will remain defined. The old `HelperInvocation` builtin
becomes undefined when this occurs, so a second new instruction queries
the current helper invocation status.

This is only fully supported for GLSL. HLSL doesn't support the
`IsHelperInvocation` operation and MSL doesn't support the
`DemoteToHelperInvocation` op.

Fixes #1052.
2019-07-17 09:12:22 -05:00
Chip Davis
bc646574a6 MSL: Support the SPV_INTEL_shader_integer_functions2 extension.
This provides a few functions normally available in OpenCL to the SPIR-V
shader environment. These functions happen to be available in Metal as
well.

No GLSL, unfortunately. Intel has yet to publish a
`GL_INTEL_shader_integer_functions2` spec.
2019-07-15 09:42:36 -05:00
Hans-Kristian Arntzen
33d2bbcf69 Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross 2019-07-15 09:46:31 +02:00
Hans-Kristian Arntzen
4d7aae1050 Merge branch 'device-group' of git://github.com/cdavis5e/SPIRV-Cross 2019-07-15 09:46:16 +02:00
Chip Davis
343c6f4ff4 Update external repos.
Fix fallout from changes.

There's a bug in glslang that prevents `float16_t`, `[u]int16_t`, and
`[u]int8_t` constants from adding the corresponding SPIR-V capabilities.
SPIRV-Tools, meanwhile, tightened validation so that these constants are
only valid if the corresponding `Float16`, `Int16`, and `Int8` caps are
on. This affects the `16bit-constants.frag` test for GLSL and MSL.
2019-07-13 16:50:21 -05:00
Chip Davis
6a58554568 Support the SPV_KHR_device_group extension.
The only piece added by this extension is the `DeviceIndex` builtin,
which tells the shader which device in a grouped logical device it is
running on.

Metal's pipeline state objects are owned by the `MTLDevice` that created
them. Since Metal doesn't support logical grouping of devices the way
Vulkan does, we'll thus have to create a pipeline state for each device
in a grouped logical device. The upcoming peer group support in Metal 3
will not change this. For this reason, for Metal, the device index is
supplied as a constant at pipeline compile time.

There's an interaction between `VK_KHR_device_group` and
`VK_KHR_multiview` in the
`VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the
view index to be the same as the device index. The new
`view_index_from_device_index` MSL option supports this functionality.
2019-07-13 16:45:54 -05:00
Chip Davis
ca91fcfe5f MSL: Support the SPV_AMD_shader_trinary_minmax extension.
This requires MSL 2.1.
2019-07-13 16:43:57 -05:00
Hans-Kristian Arntzen
932ee0e328 Deal correctly with return sign of bitscan operations. 2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen
19ebbd48c7
Merge pull request #1077 from cdavis5e/msl-spirv-qualifiers
MSL: Handle coherent, volatile, and restrict.
2019-07-12 10:03:06 +02:00
Hans-Kristian Arntzen
ad5eae46ed
Merge pull request #1078 from cdavis5e/post-depth-coverage
Support the SPV_KHR_post_depth_coverage extension.
2019-07-12 09:56:26 +02:00
Chip Davis
6628ea6e48 MSL: Use the select() function for OpSelect.
This significantly improves codegen for vector `OpSelect` in MSL.
2019-07-11 10:30:37 -05:00
Chip Davis
1df47db6ba Support the SPV_KHR_post_depth_coverage extension.
Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn`
variable is to contain the computed coverage mask following the early
fragment tests, which this mode requires and implicitly enables.

Note that unlike Vulkan and OpenGL, Metal places this on the sample mask
input itself, and furthermore does *not* implicitly enable early
fragment testing. If it isn't enabled explicitly with an
`[[early_fragment_tests]]` attribute, the compiler will error out. So we
have to enable that mode explicitly if `PostDepthCoverage` is enabled
but `EarlyFragmentTests` isn't.

For Metal, only iOS supports this; for some reason, Apple has yet to
implement it on macOS, even though many desktop cards support it.
2019-07-11 10:28:43 -05:00
Chip Davis
058f1a0933 MSL: Handle coherent, volatile, and restrict.
This maps them to their MSL equivalents. I've mapped `Coherent` to
`volatile` since MSL doesn't have anything weaker than `volatile` but
stronger than nothing.

As part of this, I had to remove the implicit `volatile` added for
atomic operation casts. If the buffer is already `coherent` or
`volatile`, then we would add a second `volatile`, which would be
redundant. I think this is OK even when the buffer *doesn't* have
`coherent`: `T *` is implicitly convertible to `volatile T *`, but not
vice-versa. It seems to compile OK at any rate. (Note that the
non-`volatile` overloads of the atomic functions documented in the spec
aren't present in the MSL 2.2 stdlib headers.)

`restrict` is tricky, because in MSL, as in C++, it needs to go *after*
the asterisk or ampersand for the pointer type it's modifying.

Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory
models, `Restrict` is the default (i.e. does not need to be specified);
but MSL likely follows the `OpenCL` model where `Aliased` is the
default. We probably need to implicitly set either `Restrict` or
`Aliased` depending on the module's declared memory model.
2019-07-11 10:22:30 -05:00
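Two illustrative parameter declarations matching the mapping above (buffer names and bindings are made up):

#include <metal_stdlib>
using namespace metal;

// Coherent maps to volatile; restrict is placed after the * or & it qualifies.
fragment void main0(volatile device uint* shared_counters [[buffer(0)]],
                    device float* restrict vertex_data [[buffer(1)]])
{
}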
Hans-Kristian Arntzen
1a592b7c0f
Merge pull request #1067 from cdavis5e/msl-scalar-block-layout
MSL: Support scalar block layout.
2019-07-11 13:03:03 +02:00
Chip Davis
28454facbb MSL: Handle packed matrices.
The old method of using a different unpacked matrix type doesn't work
for scalar alignment. It certainly wouldn't have any effect for a square
matrix, since the numbers of columns and rows are the same. So now we'll
store them as arrays of packed vectors.
2019-07-10 18:37:31 -05:00
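A small MSL sketch of the storage scheme described above, with illustrative names: a packed 3x3 matrix becomes an array of packed column vectors, and the full matrix is rebuilt where it is used.

#include <metal_stdlib>
using namespace metal;

struct UBO
{
    packed_float3 model[3];   // packed matrix stored as an array of packed columns
};

// At a use site the matrix is reassembled, e.g.:
//     float3x3 m = float3x3(float3(ubo.model[0]), float3(ubo.model[1]), float3(ubo.model[2]));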
Hans-Kristian Arntzen
f6f849397e MSL: Re-roll array expressions in initializers.
We cannot rely on the copy path when using an array as part of a struct
initializer, so reroll such expressions to an initializer list again.
2019-07-10 11:19:33 +02:00
Chip Davis
e5fa7edfd6 MSL: Support scalar block layout.
Relaxed block layout relaxed the restrictions on vector alignment,
allowing them to be aligned on scalar boundaries. Scalar block layout
relaxes this further, allowing *any* member to be aligned on a scalar
boundary. The requirement that a vector not improperly straddle a
16-byte boundary is also relaxed.

I've also added a test showing that `std430` layout works with UBOs.

I'm troubled by the dual meaning of the `Packed` extended decoration. In
some instances (struct, `float[]`, and `vec2[]` members), it actually
means the exact opposite, that the member needs extra padding. This is
especially problematic for `vec2[]`, because now we need to distinguish
the two cases by checking the array stride. I wonder if this should
actually be split into two decorations.
2019-07-09 20:59:32 -05:00
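A minimal sketch of what this means for codegen, with made-up member names: under scalar block layout a vector may start on a scalar boundary, so it has to be emitted as a packed type.

#include <metal_stdlib>
using namespace metal;

struct SSBO
{
    float a;           // offset 0
    packed_float3 b;   // offset 4: only scalar alignment, so a packed vector is required
};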
Hans-Kristian Arntzen
50342966c0 Fall back to complex loop if non-trivial continue block is found.
There is a case where we can deduce a for/while loop, but the continue
block is actually very painful to deal with, so handle that case as
well. Removes an exceptional case.
2019-07-08 11:54:29 +02:00