SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	e07f0a9df5	GLSL: Fix buffer_reference with aliased names.	2020-11-23 16:36:49 +01:00
Hans-Kristian Arntzen	c5826b4b69	GLSL: Emit storage qualifiers for buffer_reference.	2020-11-23 16:26:33 +01:00
Hans-Kristian Arntzen	650b5e1b12	HLSL: Fix validation with FXC for test.	2020-11-23 16:03:35 +01:00
Hans-Kristian Arntzen	6a614cc7f7	Normalize all internal workaround methods to use spv prefix. We have been interchanging spv and SPIRV_Cross_ for a while, which causes weirdness since we don't explicitly ban SPIRV_Cross identifiers, as these identifiers are generally used for interface variable workarounds.	2020-11-23 15:42:27 +01:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	1ee2d13873	MSL: Add missing reference file.	2020-11-11 16:25:01 +01:00
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations Fixes #1513.	2020-11-09 11:23:01 +01:00
Hans-Kristian Arntzen	71fcf0d9e6	Update texture gather test result.	2020-11-08 13:54:30 +01:00
Hans-Kristian Arntzen	46bf1e99d6	Merge pull request #1525 from cdavis5e/msl-interpolation-functions MSL: Support pull-model interpolation on MSL 2.3+.	2020-11-07 17:04:56 +01:00
Hans-Kristian Arntzen	683c3f5c3f	Merge pull request #1530 from rdb/legacy-glsl-round GLSL: Provide round/roundEven for legacy GLSL	2020-11-07 16:40:18 +01:00
Hans-Kristian Arntzen	ea334c14bc	Merge pull request #1527 from rdb/legacy-transpose GLSL: implement transpose() in GLSL 1.10 / ES 1.00	2020-11-07 16:37:59 +01:00
Hans-Kristian Arntzen	2417010046	Merge pull request #1528 from rdb/fix-legacy-vertex-shader-lod GLSL: Fix support for textureLod in legacy vertex shaders	2020-11-07 16:33:50 +01:00
rdb	bf71994dae	GLSL: implement transpose() in GLSL 1.10 / ES 1.00	2020-11-06 22:27:54 +01:00
rdb	9e6e5d2738	GLSL: Fix round/roundEven for legacy GLSL.	2020-11-06 17:34:38 +01:00
rdb	e8c500ceef	GLSL: Fix support for textureLod in legacy vertex shaders	2020-11-06 16:37:27 +01:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemeted for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00
rdb	135933d59e	HLSL: Add regression test for SM3.0 texture samplers	2020-11-03 18:15:05 +01:00
Hans-Kristian Arntzen	fc644b50e6	Merge pull request #1523 from KhronosGroup/fix-1512 HLSL: Add option to flatten matrix vertex input semantics.	2020-11-03 13:16:54 +01:00
Hans-Kristian Arntzen	b3344174f7	HLSL: Add option to flatten matrix vertex input semantics. Helps translation layers where we expect inputs to be multiple float vectors rather than an indexed matrix.	2020-11-03 11:18:32 +01:00
Hans-Kristian Arntzen	1f018b0fb8	Parser: Don't assume OpTypePointer will always take a SPIRType. Possible to receive a function prototype here. Don't try to do anything smart here, just don't crash during parsing.	2020-11-03 10:53:37 +01:00
Hans-Kristian Arntzen	c5a3f37a1c	Merge pull request #1519 from cdavis5e/msl-mac-comparison-bias-grad MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3.	2020-11-02 20:01:14 +01:00
criss	6402586015	Updated ref file for subgroups_basicvoteballot.vk.comp	2020-11-02 18:40:56 +01:00
Chip Davis	547c29f7bb	MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3. I kept the code to replace constant zero arguments, because `Bias` and `Grad` still have some problems on desktop GPUs. `Bias` works on AMD GPUs. `Grad` does not. Both work on Intel. Still needs testing on NV. It will definitely work with Apple GPUs.	2020-10-30 11:14:59 -05:00
Hans-Kristian Arntzen	439b666829	GLSL: Fix nonuniformEXT injection. Needs to consider that other expressions might be using brackets as well ...	2020-10-30 14:11:16 +01:00
Hans-Kristian Arntzen	541a801fed	Merge pull request #1514 from cdavis5e/msl-mac-framebuffer-fetch MSL: Allow framebuffer fetch on Mac in MSL 2.3.	2020-10-30 08:09:41 +01:00
Yuwen Wu	c8a43876c7	added metal keyworld: "level" (#1501 ) * added metal keyworld: "level" * added more metal keywords * updated test case.	2020-10-30 08:07:25 +01:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Hans-Kristian Arntzen	78c6d2d628	Merge pull request #1509 from cdavis5e/mac-post-depth-coverage MSL: Allow post-depth coverage on Mac in MSL 2.3.	2020-10-29 09:50:44 +01:00
Hans-Kristian Arntzen	08e49bfd67	Merge pull request #1508 from KhronosGroup/fix-1507 Handle case where block is loop header, continue AND break block.	2020-10-28 16:04:14 +01:00
Chip Davis	d48d2a95c7	MSL: Allow post-depth coverage on Mac in MSL 2.3. It's still only supported on Apple GPUs, but Macs will have those soon.	2020-10-27 22:07:01 -05:00
Hans-Kristian Arntzen	542d460364	Handle case where block is loop header, continue AND break block.	2020-10-27 12:29:08 +01:00
Hans-Kristian Arntzen	e47561a28b	GLSL: Support a workaround for loading row-major matrices. On AMD Windows OpenGL, it has been reported that we need to load matrices via a wrapper function.	2020-10-27 12:07:09 +01:00
Hans-Kristian Arntzen	f65f259ab7	MSL: Do not use component::x gather for depth2d textures.	2020-10-26 10:18:17 +01:00
Chip Davis	1264e2705e	MSL: Cast broadcast booleans to ushort. Metal doesn't support broadcasting or shuffling boolean values, but we can work around that by casting it to `ushort`, then casting it back to `bool`. I used `ushort` instead of `uint` because 16-bit values give better throughput on Apple GPUs.	2020-10-23 21:55:46 -05:00
Chip Davis	065b5bda3c	MSL: Mask ballots passed to Ballot bit ops. Only the least n bits are significant, where n is the subgroup size. The Vulkan CTS actually checks this. The `FindLSB` tests weren't actually failing, but I masked that anyway, in case there's some corner case the CTS is missing.	2020-10-23 21:55:46 -05:00
Chip Davis	781367d083	MSL: Support vectors with OpGroupNonUniformAllEqual. This was not tested here in SPIRV-Cross. Predictably, it broke when I tried it in the CTS.	2020-10-23 21:55:46 -05:00
Chip Davis	6ccb902462	MSL: Correct definitions of subgroup ballot mask variables. `SubgroupEqMask` had a fencepost error that gave wrong values for invocation ID 32. For `SubgroupGeMask` and `SubgroupGtMask`, I forgot to shift the values from `extract_bits()` up so that the mask is in the correct position. Using `insert_bits()` instead should fold these two operations into one. `SubgroupLtMask` and `SubgroupLeMask` were already correct.	2020-10-23 21:54:55 -05:00
Chip Davis	064ed448b9	MSL: Don't remove periods from swizzle buffer index exprs.	2020-10-20 17:47:40 -05:00
Chip Davis	5845e009ea	MSL: Handle Offset and Grad operands for 1D-as-2D textures.	2020-10-15 12:51:00 -05:00
Chip Davis	3e6010d8c5	MSL: Don't use a bitcast for tessellation levels in tesc shaders. `half` cannot be bitcasted to `float`, because the two types are not the same size. Use an expanding cast instead. We were already doing this for stores to the tessellation levels; why I didn't also do this for loads is beyond me.	2020-10-14 18:35:59 -05:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	7a5d0d6b29	MSL: Add missing interlock handling to atomic image buffers.	2020-10-13 11:44:17 -05:00
Hans-Kristian Arntzen	fab6ad234e	Merge pull request #1486 from cdavis5e/atomic-image-argument-buffer MSL: Support atomic access to images from argument buffers.	2020-10-13 12:55:43 +02:00
Chip Davis	9cafea6cf8	MSL: Support atomic access to images from argument buffers. This was not added when Epic contributed atomic image support. Fixes #1484.	2020-10-13 02:37:18 -05:00
Chip Davis	2219c4a392	MSL: Support SPV_EXT_demote_to_helper_invocation for MSL 2.3. MSL 2.3 has everything needed to support this extension on all platforms. The existing `discard_fragment()` function was given demote semantics, similar to Direct3D, and the `simd_is_helper_thread()` function was finally added to iOS. I've left the old test alone. Should I remove it in favor of these?	2020-10-13 00:25:32 -05:00
Hans-Kristian Arntzen	5619329665	Style nits for GL subgroup implementation.	2020-10-08 13:25:29 +02:00
Hans-Kristian Arntzen	a6f6547cf1	Add missing VK variant of the test file.	2020-10-08 12:22:45 +02:00
Hans-Kristian Arntzen	28994a3186	Update GL subgroup test file.	2020-10-08 12:22:24 +02:00
Hans-Kristian Arntzen	819c599ecd	Merge branch 'issues1350-2' of git://github.com/devshgraphicsprogramming/SPIRV-Cross into master	2020-10-08 12:20:07 +02:00

1 2 3 4 5 ...

1025 Commits