SPIRV-Cross

Author	SHA1	Message	Date
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations Fixes #1513.	2020-11-09 11:23:01 +01:00
Hans-Kristian Arntzen	6fc2a0581a	Run format_all.sh.	2020-11-08 13:59:52 +01:00
Hans-Kristian Arntzen	46bf1e99d6	Merge pull request #1525 from cdavis5e/msl-interpolation-functions MSL: Support pull-model interpolation on MSL 2.3+.	2020-11-07 17:04:56 +01:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemented for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00
Hans-Kristian Arntzen	244839d350	Merge pull request #1516 from billhollings/VK_EXT_descriptor_indexing MSL: Support run-time sized image and sampler arrays	2020-11-03 10:15:36 +01:00
Bill Hollings	4bdd49df3f	Syntax and format updates from code review.	2020-11-02 22:15:20 -05:00
Bill Hollings	7f67abe0fe	Minor format and typo updates from code review.	2020-10-30 16:05:44 -04:00
Chip Davis	547c29f7bb	MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3. I kept the code to replace constant zero arguments, because `Bias` and `Grad` still have some problems on desktop GPUs. `Bias` works on AMD GPUs. `Grad` does not. Both work on Intel. Still needs testing on NV. It will definitely work with Apple GPUs.	2020-10-30 11:14:59 -05:00
Hans-Kristian Arntzen	541a801fed	Merge pull request #1514 from cdavis5e/msl-mac-framebuffer-fetch MSL: Allow framebuffer fetch on Mac in MSL 2.3.	2020-10-30 08:09:41 +01:00
Yuwen Wu	c8a43876c7	added metal keyword: "level" (#1501) * added metal keyword: "level" * added more metal keywords * updated test case.	2020-10-30 08:07:25 +01:00
Bill Hollings	b7b0e804e5	MSL: Support run-time sized image and sampler arrays (GL_EXT_nonuniform_qualifier/SPV_EXT_descriptor_indexing). MSLResourceBinding includes array size through API, and substitutes in that size if the image or sampler array is not explicitly sized. OpCopyObject supports SPIRCombinedImageSampler type in MSL.	2020-10-29 18:50:42 -04:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Chip Davis	d48d2a95c7	MSL: Allow post-depth coverage on Mac in MSL 2.3. It's still only supported on Apple GPUs, but Macs will have those soon.	2020-10-27 22:07:01 -05:00
Chip Davis	1b6b9705e6	MSL: For 2.1+, don't disable rasterization for vertex writes. Writing to buffers actually works starting in MSL 2.1 (macOS 10.14, iOS 12). Writing to textures works starting in MSL 2.2 (macOS 10.15, iOS 13). No tests unfortunately, because the MSL 2.2 compiler and above produce a warning that cannot be disabled, because it has no associated option.	2020-10-26 11:43:13 -05:00
Hans-Kristian Arntzen	f65f259ab7	MSL: Do not use component::x gather for depth2d textures.	2020-10-26 10:18:17 +01:00
Chip Davis	1264e2705e	MSL: Cast broadcast booleans to ushort. Metal doesn't support broadcasting or shuffling boolean values, but we can work around that by casting it to `ushort`, then casting it back to `bool`. I used `ushort` instead of `uint` because 16-bit values give better throughput on Apple GPUs.	2020-10-23 21:55:46 -05:00
Chip Davis	065b5bda3c	MSL: Mask ballots passed to Ballot bit ops. Only the least n bits are significant, where n is the subgroup size. The Vulkan CTS actually checks this. The `FindLSB` tests weren't actually failing, but I masked that anyway, in case there's some corner case the CTS is missing.	2020-10-23 21:55:46 -05:00
Chip Davis	781367d083	MSL: Support vectors with OpGroupNonUniformAllEqual. This was not tested here in SPIRV-Cross. Predictably, it broke when I tried it in the CTS.	2020-10-23 21:55:46 -05:00
Chip Davis	6ccb902462	MSL: Correct definitions of subgroup ballot mask variables. `SubgroupEqMask` had a fencepost error that gave wrong values for invocation ID 32. For `SubgroupGeMask` and `SubgroupGtMask`, I forgot to shift the values from `extract_bits()` up so that the mask is in the correct position. Using `insert_bits()` instead should fold these two operations into one. `SubgroupLtMask` and `SubgroupLeMask` were already correct.	2020-10-23 21:54:55 -05:00
Chip Davis	064ed448b9	MSL: Don't remove periods from swizzle buffer index exprs.	2020-10-20 17:47:40 -05:00
Chip Davis	5845e009ea	MSL: Handle Offset and Grad operands for 1D-as-2D textures.	2020-10-15 12:51:00 -05:00
Hans-Kristian Arntzen	9c220a8247	Merge pull request #1490 from KhronosGroup/fix-1488 MSL: Support querying and modifying generated combined sampler suffix.	2020-10-15 10:52:28 +02:00
Chip Davis	3e6010d8c5	MSL: Don't use a bitcast for tessellation levels in tesc shaders. `half` cannot be bitcasted to `float`, because the two types are not the same size. Use an expanding cast instead. We were already doing this for stores to the tessellation levels; why I didn't also do this for loads is beyond me.	2020-10-14 18:35:59 -05:00
Hans-Kristian Arntzen	bd1ee4344e	MSL: Support querying and modifying generated combined sampler suffix.	2020-10-14 14:52:18 +02:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	7a5d0d6b29	MSL: Add missing interlock handling to atomic image buffers.	2020-10-13 11:44:17 -05:00
Hans-Kristian Arntzen	fab6ad234e	Merge pull request #1486 from cdavis5e/atomic-image-argument-buffer MSL: Support atomic access to images from argument buffers.	2020-10-13 12:55:43 +02:00
Chip Davis	9cafea6cf8	MSL: Support atomic access to images from argument buffers. This was not added when Epic contributed atomic image support. Fixes #1484.	2020-10-13 02:37:18 -05:00
Chip Davis	2219c4a392	MSL: Support SPV_EXT_demote_to_helper_invocation for MSL 2.3. MSL 2.3 has everything needed to support this extension on all platforms. The existing `discard_fragment()` function was given demote semantics, similar to Direct3D, and the `simd_is_helper_thread()` function was finally added to iOS. I've left the old test alone. Should I remove it in favor of these?	2020-10-13 00:25:32 -05:00
Hans-Kristian Arntzen	5ea576ece2	Allow flip_vert_y in all relevant stages.	2020-09-28 14:10:08 +02:00
Hans-Kristian Arntzen	66afe8c499	Implement a simple evaluator of specialization constants. In some cases, we need to get a literal value from a spec constant op. Mostly relevant when emitting buffers, so implement a 32-bit integer scalar subset of the evaluator. Can be extended as needed to support evaluating any specialization constant operation.	2020-09-14 11:45:59 +02:00
Hans-Kristian Arntzen	bdbef7b1f3	Merge pull request #1461 from Kangz/fix-warnings Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 12:59:22 +02:00
Corentin Wallez	bcd71536e2	Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 11:13:21 +02:00
Hans-Kristian Arntzen	18d03b3ea6	Handle OpUndef %void. Apparently this is legal and spirv-opt is starting to emit this kind of code now.	2020-09-04 09:29:44 +02:00
Chip Davis	4cf840ee7b	MSL: Support layered input attachments. These need to use arrayed texture types, or Metal will complain when binding the resource. The target layer is addressed relative to the Layer output by the vertex pipeline, or to the ViewIndex if in a multiview pipeline. Unlike with the s/t coordinates, Vulkan does not forbid non-zero layer coordinates here, though this cannot be expressed in Vulkan GLSL. Supporting 3D textures will require additional work. Part of the problem is that Metal does not allow texture views to subset a 3D texture, so we need some way to pass the base depth to the shader.	2020-09-02 09:18:25 -05:00
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Chip Davis	53080ecca8	MSL: Fix multiview view index calculation with a non-zero base instance. Account for a non-zero base instance when calculating the view index and the "real" instance index. Before, it was likely broken with a non-zero base instance, since the calculated instance index could be less than the base instance.	2020-08-31 20:33:44 -05:00
Hans-Kristian Arntzen	95993f78af	Run format_all.sh.	2020-08-24 11:02:49 +02:00
Hans-Kristian Arntzen	a07441568e	Overhaul how we deal with reserved identifiers. - Do not silently drop reserved identifiers in the parser. This makes it possible to reflect identifiers which are reserved by the cross-compiler module. - Instead of dropping the name, emit _RESERVED_IDENTIFIER_FIXUP in the source to make it clear that a name has been rewritten. - Document what is reserved and not.	2020-08-21 16:33:27 +02:00
Le Hoang Quyen	ab8eb70af1	Fix #1445: MSL: Enclose args when converting distance(a,b) to abs(a-b)	2020-08-13 21:16:08 +08:00
Chip Davis	3347b1076d	MSL: Fix handling of matrices and structs in the output control point array. Prior to this point, we were treating them as flattened, as they are in old-style tessellation control shaders, and still are for structs in new-style shaders. This is not true for outputs; output composites are not flattened at all. This semantic mismatch broke a Vulkan CTS test. It should now pass.	2020-08-03 17:18:18 -05:00
Tomek Ponitka	18f23c47d9	Enabling setting a fixed sampleMask in Metal fragment shaders. In Metal, render pipelines don't have an option to set a sampleMask parameter; the only way to get that functionality is to set the sample_mask output of the fragment shader to this value directly. We also need to take care to combine the fixed sample mask with the one that the shader might possibly output.	2020-07-24 11:19:46 +02:00
Chip Davis	688c5fcbda	MSL: Add support for processing more than one patch per workgroup. This should hopefully reduce underutilization of the GPU, especially on GPUs where the thread execution width is greater than the number of control points. This also simplifies initialization by reading the buffer directly instead of using Metal's vertex-attribute-in-compute support. It turns out the only way in which shader stages are allowed to differ in their interfaces is in the number of components per vector; the base type must be the same. Since we are using the raw buffer instead of attributes, we can now also emit arrays and matrices directly into the buffer, instead of flattening them and then unpacking them. Structs are still flattened, however; this is due to the need to handle vectors with fewer components than were output, and I think handling this while also directly emitting structs could get ugly. Another advantage of this scheme is that the extra invocations needed to read the attributes when there were more input than output points are now no more. The number of threads per workgroup is now lcm(SIMD-size, output control points). This should ensure we always process a whole number of patches per workgroup. To avoid complexity handling indices in the tessellation control shader, I've also changed the way vertex shaders for tessellation are handled. They are now compute kernels using Metal's support for vertex-style stage input. This lets us always emit vertices into the buffer in order of vertex shader execution. Now we no longer have to deal with indexing in the tessellation control shader. This also fixes a long-standing issue where if an index were greater than the number of vertices to draw, the vertex shader would wind up writing outside the buffer, and the vertex would be lost. This is a breaking change, and I know SPIRV-Cross has other clients, so I've hidden this behind an option for now. In the future, I want to remove this option and make it the default.	2020-07-23 17:59:54 -05:00
Chip Davis	884bc6df65	MSL: Factor creating a uint type into its own method. This is so common for artificially created variables that it's worth it to create it once and save it for later use.	2020-07-22 16:25:14 -05:00
Chip Davis	5e13f7fdf2	MSL: Factor a really gnarly condition into its own method. That branch has become nigh unreadable. This new method should make it readable again.	2020-07-22 16:25:10 -05:00
Thomas Roughton	b74a84e4cb	MSL: Ensure OpStore source operands are marked for inclusion in function arguments Without this change, code such as: ``` OpStore %param_var_mipLevelSizes_0 %heightmapMipSizes ``` within a function that then forwards the value `%param_var_mipLevelSizes_0` to another function will not have `%heightmapMipSizes` registered as an argument to the function.	2020-07-07 17:10:36 +12:00
Hans-Kristian Arntzen	711300baed	MSL: Do not emit swizzled writes in packing fixups. Similar to scalar access chain fix, this causes a read-modify-write on memory we're not supposed to write to.	2020-07-06 10:03:46 +02:00
Hans-Kristian Arntzen	fa5b206d97	MSL: Workaround broken vector -> scalar access chain in MSL. On MSL, the compiler refuses to allow access chains into a normal vector type. What happens in practice instead is a read-modify-write where a vector type is loaded, modified and written back. The workaround is to convert a vector into a pointer-to-scalar before the access chain continues to add the scalar index.	2020-07-06 10:03:44 +02:00
Hans-Kristian Arntzen	e1600d4df8	MSL: Use input attachment index directly for resource index fallback.	2020-07-06 09:49:46 +02:00
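
Note on commit 688c5fcbda: the message gives the new workgroup size as lcm(SIMD-size, output control points), chosen so that each workgroup covers a whole number of patches. The arithmetic can be sketched roughly as below. This is an illustration only, not code from SPIRV-Cross: the struct `TescWorkgroupLayout` and the function `compute_tesc_workgroup_layout` are hypothetical names, and the sketch ignores any device limit on threads per workgroup.

```cpp
#include <cassert>
#include <cstdint>
#include <numeric> // std::lcm (C++17)

// Hypothetical helper type for illustration; not part of the SPIRV-Cross API.
struct TescWorkgroupLayout
{
	uint32_t threads_per_workgroup; // lcm(SIMD width, output control points)
	uint32_t patches_per_workgroup; // whole patches covered by one workgroup
};

// Picks the smallest thread count that is a multiple of both the SIMD width
// and the number of output control points, as the commit message describes.
// Assumption: one thread per output control point of each patch.
inline TescWorkgroupLayout compute_tesc_workgroup_layout(uint32_t simd_width, uint32_t output_control_points)
{
	assert(simd_width > 0 && output_control_points > 0);
	uint32_t threads = std::lcm(simd_width, output_control_points);
	// threads is a multiple of output_control_points, so this division is exact.
	return { threads, threads / output_control_points };
}
```

For instance, with an assumed SIMD width of 32 and 3 output control points, the sketch yields 96 threads covering 32 patches; with 4 control points it yields 32 threads covering 8 patches.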


764 Commits