SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	99ae0d32e9	MSL: Handle array with component when we cannot rely on user() attrib. In these cases, we emit one variable per location, and so we must flatten stuff.	2021-05-21 13:46:33 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Lukas Taparauskas	72a2ec4c1b	MSL: Fix '--msl-multi-patch-workgroup' out of bounds reads when dispatching more threads than control points (#1662) * Fix '--msl-multi-patch-workgroup' cases where thread count exceeds data bounds Fix gl_PrimitiveID off by one error when computing last valid index Point gl_out to the last patch's data when threads exceed input data bounds Point patchOut to the last patch's data when threads exceed input data bounds Update MSL test expectations. * Undo change to MSL multi-patch hull output bound checks * Update MSL multi-patch test expectations.	2021-04-29 20:01:26 +02:00
Hans-Kristian Arntzen	82a77e534e	MSL: Use proper array for quad tess levels. We need to handle loads from array as well, so the float4 hack doesn't work.	2021-04-23 14:12:00 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	986196030d	MSL: Don't use native arrays for tess level inputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	4a379a00f3	MSL: Don't emit native array for masked clip/cull distance.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	682a227f4b	MSL: Make builtin argument type declaration context sensitive. Sometimes we'll need array template, sometimes not 🤷.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c1edd35d57	MSL: Use spvUnsafeArray for builtin arrays after all. It will get too messy to deal with constant initializers any other way, so just deal with complexity in argument_decl instead ...	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5826298697	MSL: Handle CullDistance better.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	23da445bd4	MSL: Emit multiple threadgroup slices for multi-patch. Multiple patches can run in the same workgroup when using multi-patch mode, so we need to allocate enough storage to avoid false sharing.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e32c474911	MSL: Handle masking of TESC IO block members.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	dc54f75eec	MSL: Fixup gl_PerVertex names if we're emitting masked builtins.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	40f628f49c	MSL: Add test for complex control point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	46c48ee6b5	MSL: Rewrite how IO blocks are emitted in multi-patch mode. Firstly, never flatten inputs or outputs in multi-patch mode. The main scenario where we do need to care is Block IO. In this case, we should only flatten the top-level member, and after that we use access chains as normal. Using structs in Input storage class is now possible as well. We don't need to consider per-location fixups at all here. In Vulkan, IO structs must match exactly. Only plain vectors can have smaller vector sizes as a special case.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ff3f5bcba5	MSL: Handle masking of builtin control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	436b1250da	MSL: Do not perform scalar fixups for control-point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	74b2acab9b	MSL: Always emit block variable for block types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	857295a9ab	MSL: Add tests for masking with --for-tess.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	43b6ea2c9a	MSL: Remove position mask tests. They will fail compilation.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	65b5ff7ece	MSL: Don't emit weird reference type for spvUnsafeArray types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	50a6bc058a	MSL: Force builtin arrays for builtin array types. Handles argument_decl() correctly.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	88b54f5dab	MSL: Add tests for vertex output masking.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	0997e81118	MSL: Sort builtin IO block members by builtin type. Ensures consistent block matching.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	0ad12a0036	MSL: Always return [[position]] when required.	2021-02-15 12:57:37 +01:00
Hans-Kristian Arntzen	7ab3f3f74e	Deal better with CompositeExtract from constant composite. There is no good reason for applications to emit this kind of code, but some do. Special case this scenario.	2021-01-22 12:30:16 +01:00
Hans-Kristian Arntzen	5d82d32e0f	Roll dependencies.	2021-01-08 10:41:51 +01:00
Hans-Kristian Arntzen	a4a9b53b5b	MSL: Always enable Outputs in vertex stages. Subsequent stages can legally attempt to read from these variables, which causes compilation failure. Always make sure we emit user outputs in vertex shaders if they are active in the entry point.	2021-01-07 11:24:47 +01:00
Chip Davis	fd738e3387	MSL: Adjust FragCoord for sample-rate shading. In Metal, the `[[position]]` input to a fragment shader remains at fragment center, even at sample rate, like OpenGL and Direct3D. In Vulkan, however, when the fragment shader runs at sample rate, the `FragCoord` builtin moves to the sample position in the framebuffer, instead of the fragment center. To account for this difference, adjust the `FragCoord`, if present, by the sample position. The -0.5 offset is because the fragment center is at (0.5, 0.5). Also, add an option to force sample-rate shading in a fragment shader. Since Metal has no explicit control for this, this is done by adding a dummy `[[sample_id]]` which is otherwise unused, if none is already present. This is intended to be used from e.g. MoltenVK when a pipeline's `minSampleShading` value is nonzero. Instead of checking if any `Input` variables have `Sample` interpolation, I've elected to check that the `SampleRateShading` capability is present. Since `SampleId`, `SamplePosition`, and the `Sample` interpolation decoration require this cap, this should be equivalent for any valid SPIR-V module. If this isn't acceptable, let me know.	2020-11-23 10:30:24 -06:00
Hans-Kristian Arntzen	6a614cc7f7	Normalize all internal workaround methods to use spv prefix. We have been interchanging spv and SPIRV_Cross_ for a while, which causes weirdness since we don't explicitly ban SPIRV_Cross identifiers, as these identifiers are generally used for interface variable workarounds.	2020-11-23 15:42:27 +01:00
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations Fixes #1513.	2020-11-09 11:23:01 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemented for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset of 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00
Chip Davis	547c29f7bb	MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3. I kept the code to replace constant zero arguments, because `Bias` and `Grad` still have some problems on desktop GPUs. `Bias` works on AMD GPUs. `Grad` does not. Both work on Intel. Still needs testing on NV. It will definitely work with Apple GPUs.	2020-10-30 11:14:59 -05:00
Yuwen Wu	c8a43876c7	added metal keyword: "level" (#1501) * added metal keyword: "level" * added more metal keywords * updated test case.	2020-10-30 08:07:25 +01:00
Chip Davis	d48d2a95c7	MSL: Allow post-depth coverage on Mac in MSL 2.3. It's still only supported on Apple GPUs, but Macs will have those soon.	2020-10-27 22:07:01 -05:00
Chip Davis	064ed448b9	MSL: Don't remove periods from swizzle buffer index exprs.	2020-10-20 17:47:40 -05:00
Chip Davis	5845e009ea	MSL: Handle Offset and Grad operands for 1D-as-2D textures.	2020-10-15 12:51:00 -05:00
Chip Davis	3e6010d8c5	MSL: Don't use a bitcast for tessellation levels in tesc shaders. `half` cannot be bitcasted to `float`, because the two types are not the same size. Use an expanding cast instead. We were already doing this for stores to the tessellation levels; why I didn't also do this for loads is beyond me.	2020-10-14 18:35:59 -05:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	7a5d0d6b29	MSL: Add missing interlock handling to atomic image buffers.	2020-10-13 11:44:17 -05:00
Hans-Kristian Arntzen	fab6ad234e	Merge pull request #1486 from cdavis5e/atomic-image-argument-buffer MSL: Support atomic access to images from argument buffers.	2020-10-13 12:55:43 +02:00
Chip Davis	9cafea6cf8	MSL: Support atomic access to images from argument buffers. This was not added when Epic contributed atomic image support. Fixes #1484.	2020-10-13 02:37:18 -05:00
Chip Davis	2219c4a392	MSL: Support SPV_EXT_demote_to_helper_invocation for MSL 2.3. MSL 2.3 has everything needed to support this extension on all platforms. The existing `discard_fragment()` function was given demote semantics, similar to Direct3D, and the `simd_is_helper_thread()` function was finally added to iOS. I've left the old test alone. Should I remove it in favor of these?	2020-10-13 00:25:32 -05:00
Chip Davis	4cf840ee7b	MSL: Support layered input attachments. These need to use arrayed texture types, or Metal will complain when binding the resource. The target layer is addressed relative to the Layer output by the vertex pipeline, or to the ViewIndex if in a multiview pipeline. Unlike with the s/t coordinates, Vulkan does not forbid non-zero layer coordinates here, though this cannot be expressed in Vulkan GLSL. Supporting 3D textures will require additional work. Part of the problem is that Metal does not allow texture views to subset a 3D texture, so we need some way to pass the base depth to the shader.	2020-09-02 09:18:25 -05:00
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Chip Davis	53080ecca8	MSL: Fix multiview view index calculation with a non-zero base instance. Account for a non-zero base instance when calculating the view index and the "real" instance index. Before, it was likely broken with a non-zero base instance, since the calculated instance index could be less than the base instance.	2020-08-31 20:33:44 -05:00
Hans-Kristian Arntzen	a07441568e	Overhaul how we deal with reserved identifiers. - Do not silently drop reserved identifiers in the parser. This makes it possible to reflect identifiers which are reserved by the cross-compiler module. - Instead of dropping the name, emit _RESERVED_IDENTIFIER_FIXUP in the source to make it clear that a name has been rewritten. - Document what is reserved and not.	2020-08-21 16:33:27 +02:00

473 Commits