SPIRV-Cross

Author	SHA1	Message	Date
Chip Davis	e8d419854f	MSL: Add a workaround for broken `level()` arguments. Some Metal devices have a bug with depth array textures using comparison with explicit LoD, where the LoD given will be biased by some amount. For these devices, we can use a gradient instead, which does not exhibit this problem. As with the fragment demote workaround, this is only expected to be needed until the bug is fixed in Metal.	2023-02-02 22:01:46 -08:00
Bill Hollings	643b7be196	MSL: Add support for writable images in iOS Tier2 argument buffers. - Add CompilerMSL::Options::argument_buffers_tier as an enumeration to allow calling app to specify platform argument buffer tier capabilities. - Support iOS writable images in Tier2 argument buffers when specified. Tier capabilities based on recommendations from Apple engineering.	2022-12-28 12:40:37 -05:00
Hans-Kristian Arntzen	3c997e12eb	Add C API option for enable row major workaround.	2022-12-13 15:04:55 +01:00
Chip Davis	aa5a8c482e	MSL: Prevent stores to storage resources in discarded fragments. Some Metal devices have a bug where storage resources can still be written to even if the fragment is discarded. This is obviously a bug in Metal, but bothering Apple to fix it will only fix it for newer versions; therefore, a workaround is needed for older versions. I have made this an option so that, in case the bug is ever fixed, the workaround can be disabled. This workaround is simple: if a fragment shader may discard its fragment and writes to a storage resource, a variable representing the `HelperInvocation` built-in is created and passed to all functions. The flag is checked on all resource writes; writes do not occur when `HelperInvocation` is `true`. This relies on the earlier workaround to update `HelperInvocation` when the fragment is discarded. Fixes at least 3 failures in the CTS.	2022-11-20 01:29:41 -08:00
Chip Davis	c7ce92a95b	MSL: Manually update `BuiltInHelperInvocation` when a fragment is discarded. Some Metal devices have a bug where `simd_is_helper_thread()` won't return true after a fragment has been discarded. We can work around this by manually setting `gl_HelperInvocation` upon discarding a fragment. This is fairly unintrusive, so it is enabled by default. I've made it an option so that, when the bug is fixed, we can disable it.	2022-11-19 23:48:26 -08:00
Chip Davis	0b679334e4	MSL: Don't flatten arrayed per-patch output blocks in tessellation shaders. Flattening doesn't play well with dynamic indices. In this case, it's better to leave it as an array of structs. (I wanted to do this for named blocks generally. Trouble is, the builtin `gl_out` block is also a named block...) Fixes six more CTS tests, under `dEQP-VK.tessellation.user_defined_io.per_patch_block_array.*`.	2022-10-18 15:04:42 -07:00
Chip Davis	a171087180	MSL: Support "raw" buffer input in tessellation evaluation shaders. Using vertex-style stage input is complex, and it doesn't support nesting of structures or arrays. By using raw buffer input instead, we get this support "for free," and everything becomes much simpler. Arguably, this is the way I should've done this in the first place. Eventually, I'd like to make this the default, and then remove the option altogether. (And I still need to do that with `multi_patch_workgroup`...) Should help fix 66 tests in the Vulkan CTS, under the following trees: - `dEQP-VK.pipeline..interface_matching.` - `dEQP-VK.tessellation.user_defined_io.` - `dEQP-VK.clipping.user_defined.`	2022-10-18 14:58:59 -07:00
Hans-Kristian Arntzen	f3b1375b13	Add reflection support for shader record buffers. Reflect naming scheme in a context sensitive way that matches the frontend. GLSL -> use block name HLSL (DXC) -> use instance name.	2022-10-03 12:20:08 +02:00
Chip Davis	064eaebe72	MSL: Add a mechanism to fix up shader outputs. This is analogous to the existing support for fixing up shader inputs. It is intended to be used with tessellation to add implicit builtins that are read from a later stage, despite not being written in an earlier stage. (Believe it or not, this is in fact legal in Vulkan.) Helps fix 8 CTS tests under `dEQP-VK.pipeline.*.no_position`. (Eight other tests work solely by accident without this change.)	2022-09-09 17:06:34 -07:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Jon Leech	f2a65545b8	Finish adding SPDX tags and setup a reuse checked in Github Actions CI	2021-06-29 11:03:52 +02:00
Hans-Kristian Arntzen	d75666b170	GLSL: Emit num_views for OVR_multiview2.	2021-06-28 12:56:27 +02:00
Hans-Kristian Arntzen	406af8ff4d	c: Add C API for builtin stage IO reflection.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	adc5fe3615	C: Add C api for stage output masking.	2021-04-19 12:10:49 +02:00
lukas.taparauskas	215f31b33f	c: Add missing API to query active builtins.	2021-04-14 15:44:00 +03:00
Hans-Kristian Arntzen	09dc76f68a	c: Add missing IOS_SUPPORT_BASE_VERTEX_INSTANCE option.	2021-02-15 11:43:46 +01:00
Hans-Kristian Arntzen	4704482bbc	meta: Update copyright headers to 2021.	2021-01-14 16:07:49 +01:00
Hans-Kristian Arntzen	cf1e9e0643	Add MIT dual license for the SPIRV-Cross API.	2020-12-01 16:47:08 +01:00
Chip Davis	fd738e3387	MSL: Adjust FragCoord for sample-rate shading. In Metal, the `[[position]]` input to a fragment shader remains at fragment center, even at sample rate, like OpenGL and Direct3D. In Vulkan, however, when the fragment shader runs at sample rate, the `FragCoord` builtin moves to the sample position in the framebuffer, instead of the fragment center. To account for this difference, adjust the `FragCoord`, if present, by the sample position. The -0.5 offset is because the fragment center is at (0.5, 0.5). Also, add an option to force sample-rate shading in a fragment shader. Since Metal has no explicit control for this, this is done by adding a dummy `[[sample_id]]` which is otherwise unused, if none is already present. This is intended to be used from e.g. MoltenVK when a pipeline's `minSampleShading` value is nonzero. Instead of checking if any `Input` variables have `Sample` interpolation, I've elected to check that the `SampleRateShading` capability is present. Since `SampleId`, `SamplePosition`, and the `Sample` interpolation decoration require this cap, this should be equivalent for any valid SPIR-V module. If this isn't acceptable, let me know.	2020-11-23 10:30:24 -06:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	b3344174f7	HLSL: Add option to flatten matrix vertex input semantics. Helps translation layers where we expect inputs to be multiple float vectors rather than an indexed matrix.	2020-11-03 11:18:32 +01:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Hans-Kristian Arntzen	bd1ee4344e	MSL: Support querying and modifying generated combined sampler suffix.	2020-10-14 14:52:18 +02:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	4cf840ee7b	MSL: Support layered input attachments. These need to use arrayed texture types, or Metal will complain when binding the resource. The target layer is addressed relative to the Layer output by the vertex pipeline, or to the ViewIndex if in a multiview pipeline. Unlike with the s/t coordinates, Vulkan does not forbid non-zero layer coordinates here, though this cannot be expressed in Vulkan GLSL. Supporting 3D textures will require additional work. Part of the problem is that Metal does not allow texture views to subset a 3D texture, so we need some way to pass the base depth to the shader.	2020-09-02 09:18:25 -05:00
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Hans-Kristian Arntzen	57c93d44ac	GLSL: Add option to force flattening IO blocks. It is not always desirable to use actual blocks. A prime example in the case where EXT_shader_io_blocks is not supported on the target implementation.	2020-07-28 15:16:06 +02:00
Chip Davis	688c5fcbda	MSL: Add support for processing more than one patch per workgroup. This should hopefully reduce underutilization of the GPU, especially on GPUs where the thread execution width is greater than the number of control points. This also simplifies initialization by reading the buffer directly instead of using Metal's vertex-attribute-in-compute support. It turns out the only way in which shader stages are allowed to differ in their interfaces is in the number of components per vector; the base type must be the same. Since we are using the raw buffer instead of attributes, we can now also emit arrays and matrices directly into the buffer, instead of flattening them and then unpacking them. Structs are still flattened, however; this is due to the need to handle vectors with fewer components than were output, and I think handling this while also directly emitting structs could get ugly. Another advantage of this scheme is that the extra invocations needed to read the attributes when there were more input than output points are now no more. The number of threads per workgroup is now lcm(SIMD-size, output control points). This should ensure we always process a whole number of patches per workgroup. To avoid complexity handling indices in the tessellation control shader, I've also changed the way vertex shaders for tessellation are handled. They are now compute kernels using Metal's support for vertex-style stage input. This lets us always emit vertices into the buffer in order of vertex shader execution. Now we no longer have to deal with indexing in the tessellation control shader. This also fixes a long-standing issue where if an index were greater than the number of vertices to draw, the vertex shader would wind up writing outside the buffer, and the vertex would be lost. This is a breaking change, and I know SPIRV-Cross has other clients, so I've hidden this behind an option for now. In the future, I want to remove this option and make it the default.	2020-07-23 17:59:54 -05:00
Hans-Kristian Arntzen	f9da366ae6	MSL: Remove the old VertexAttr API. Too many issues with deprecated declarations on various compilers, just get rid of it.	2020-06-22 11:14:24 +02:00
Chip Davis	5281d9997e	MSL: Fix up input variables' vector lengths in all stages. Metal is picky about interface matching. If the types don't match exactly, down to the number of vector components, Metal fails pipline compilation. To support pipelines where the number of components consumed by the fragment shader is less than that produced by the vertex shader, we have to fix up the fragment shader to accept all the components produced.	2020-06-16 14:50:30 -05:00
Hans-Kristian Arntzen	3ce81c0025	Merge pull request #1384 from KhronosGroup/fix-1380 MSL: Remove obsolete MSLVertexAttr members.	2020-06-04 15:55:47 +02:00
Hans-Kristian Arntzen	6600793884	MSL: Remove obsolete MSLVertexAttr members. These are completely unused. Need to keep the members around for ABI compatbility however ...	2020-06-04 12:43:04 +02:00
Hans-Kristian Arntzen	2d5200650a	HLSL: Add native support for 16-bit types. Adds support for templated load/store in SM 6.2 to deal with small types.	2020-06-04 12:33:56 +02:00
Hans-Kristian Arntzen	6b0e558169	Handle RayQueryKHR type. Do not error out in parsing in shaders which use ray queries.	2020-04-21 14:25:18 +02:00
Hans-Kristian Arntzen	ebf463674d	MSL: Allow removing clip distance user varyings. Only safe if user knows that subsequent shader stage will not read clip distance.	2020-04-20 09:58:40 +02:00
Chip Davis	b29f83c383	MSL: Add options to control emission of fragment outputs. Like with `point_size` when not rendering points, Metal complains when writing to a variable using the `[[depth]]` qualifier when no depth buffer be attached. In that case, we must avoid emitting `FragDepth`, just like with `PointSize`. I assume it will also complain if there be no stencil attachment and the shader write to `[[stencil]]`, or it write to `[[color(n)]]` but there be no color attachment at n.	2020-04-13 15:29:11 -05:00
Hans-Kristian Arntzen	941cceedb4	Expose a query if samplers or images are comparison resources.	2020-04-03 17:43:42 +02:00
Hans-Kristian Arntzen	28bf9057df	HLSL: Add support for treating NonWritable UAV texture as SRV instead.	2020-04-03 11:50:50 +02:00
Hans-Kristian Arntzen	b8905bbd95	Add support for forcefully zero-initialized variables. Useful to better support certain platforms which require all variables to be initialized to something.	2020-03-26 13:38:27 +01:00
Hans-Kristian Arntzen	c27e1efbf1	HLSL: Add option to always treat SSBO as UAV, even with readonly. This can make codegen more predictable since ByteAddressBuffer is SRV and not UAV.	2020-03-04 16:42:31 +01:00
Hans-Kristian Arntzen	01968c4486	Add option to disable storage image qualifier deduction.	2020-03-04 16:42:31 +01:00
Hans-Kristian Arntzen	16796e92be	MSL: Add C API for force native arrays.	2020-02-24 13:51:08 +01:00
Chip Davis	fedbc35315	MSL: Support inline uniform blocks in argument buffers. Here, the inline uniform block is explicit: we instantiate the buffer block itself in the argument buffer, instead of a pointer to the buffer. I just hope this will work with the `MTLArgumentDescriptor` API... Note that Metal recursively assigns individual members of embedded structs IDs. This means for automatic assignment that we have to calculate the binding stride for a given buffer block. For MoltenVK, we'll simply increment the ID by the size of the inline uniform block. Then the later IDs will never conflict with the inline uniform block. We can get away with this because Metal doesn't require that IDs be contiguous, only monotonically increasing.	2020-01-24 18:51:24 -06:00
Hans-Kristian Arntzen	f9818f0804	Update license headers to 2020.	2020-01-16 15:24:37 +01:00
Hans-Kristian Arntzen	c3bd136df1	MSL: Add support for force-activating IAB resources. Important for ABI compatibility on MSL in certain cases.	2020-01-16 11:12:06 +01:00
Hans-Kristian Arntzen	cc153f8d7f	HLSL: Add a resource remapping API similar to MSL. Allows more flexibility of how resources are assigned without having to remap decorations.	2020-01-09 12:41:06 +01:00
Hans-Kristian Arntzen	b9e5fe01b0	HLSL: Add support to remove register() bindings. Sometimes it's useful to get automatic binding assignment from the D3D compiler instead.	2019-11-11 11:23:21 +01:00
Hans-Kristian Arntzen	e8ed10d445	Add spvc_type_get_base_type_id. Wraps SPIRType::self which is necessary for advanced reflection.	2019-11-04 10:53:09 +01:00
Hans-Kristian Arntzen	e9ad6398de	C API: Add missing boolean options.	2019-11-04 10:42:20 +01:00
Hans-Kristian Arntzen	4bb673a626	MSL: Add opt-in support for huge IABs. If there are enough members in an IAB, we cannot use the constant address space as MSL compiler complains about there being too many members. Support emitting the device address space instead.	2019-10-14 16:20:34 +02:00

1 2

73 Commits