SPIRV-Cross

Author	SHA1	Message	Date
Bill Hollings	0c0fd98322	MSL: Use var name instead of var-type name for flattened interface members. This allows two variables of the same struct type to be flattened into the same interface struct without a member name conflict. Add shaders-msl/frag/in_block_with_multiple_structs_of_same_type.frag unit test shader to demonstrate this.	2022-03-04 11:38:53 -05:00
Hans-Kristian Arntzen	b192b8887a	MSL: Consider that gl_IsHelperInvocation can be Volatile. Just emit simd_is_helper_thread() directly.	2022-03-04 11:46:35 +01:00
Hans-Kristian Arntzen	15d29f00e2	Add test for SPIR-V 1.6 Volatile HelperInvocation.	2022-03-04 11:19:33 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	5b952d2cbf	MSL: Rethink how opaque descriptors are passed to leaf functions. We were passing arrays by value which the compiler fails to optimize, causing abysmal performance. To fix this, we need to consider that descriptors can be in constant or const device address spaces. Also, lone descriptors are passed by value, so we explicitly remove address space qualifiers. One failure case is when shader passes a texture/sampler array as an argument. It's all UniformConstant in SPIR-V, but in MSL it might be thread, const device or constant, so that won't work ... Global variable use works fine though, and that should cover 99.9999999% of use cases.	2022-01-18 14:40:52 +01:00
Bill Hollings	ec054dad7f	MSL: Support synthetic functions in function constants. Emit synthetic functions before function constants. Support use of spvQuantizeToF16() in function constants for numerical behavior consistency with the op code. Ensure subnormal results from OpQuantizeToF16 are flushed to zero per SPIR-V spec. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes. Any MSL reference shader that includes a synthetic function is affected, since the location it is emitted has changed.	2021-09-28 19:10:16 -04:00
Bill Hollings	40141ffddf	MSL: Selectively enable fast-math in MSL code to match Vulkan CTS results. Based on CTS testing, math optimizations between MSL and Vulkan are inconsistent. In some cases, enabling MSL's fast-math compilation option matches Vulkan's math results. In other cases, disabling it does. Broadly enabling or disabling fast-math across all shaders results in some CTS test failures either way. To fix this, selectively enable/disable fast-math optimizations in the MSL code, using metal::fast and metal::precise function namespaces, where supported, and the [[clang::optnone]] function attribute otherwise. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-22 18:58:31 -04:00
Chip Davis	03ad13bae6	MSL: Simplify spvSubgroupBallot(). A bitcast to `uint2` will do just fine. I honestly don't know why I didn't do it this way earlier.	2021-07-21 00:25:09 -05:00
Hans-Kristian Arntzen	71b83a18f4	MSL: Add test for scalar access chain pull interpolant.	2021-07-13 12:25:18 +02:00
Hans-Kristian Arntzen	165dbff228	Handle odd type for textureGather component.	2021-06-03 11:37:45 +02:00
Hans-Kristian Arntzen	7ab3f3f74e	Deal better with CompositeExtract from constant composite. There is no good reason for applications to emit this kind of code, but some do. Special case this scenario.	2021-01-22 12:30:16 +01:00
Hans-Kristian Arntzen	893a011299	MSL: Fix various bugs with framebuffer fetch on macOS and argument buffers. Introduce a helper to make it clearer if a resource can be considered for argument buffers or not.	2021-01-08 10:19:18 +01:00
Hans-Kristian Arntzen	3136e34215	MSL: Always use input_attachment_index for framebuffer fetch binding. --msl-decoration-binding would end up overriding the input attachment index to binding which is very unexpected and broken.	2021-01-08 10:17:42 +01:00
Hans-Kristian Arntzen	a11c4780d0	GLSL: Emit nonuniformEXT in correct place for late-combined samplers. Need to emit nonuniformEXT(sampler2D()) since constructor expressions in Vulkan GLSL do not propagate the nonuniform qualifier.	2020-12-07 13:00:15 +01:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Hans-Kristian Arntzen	f65f259ab7	MSL: Do not use component::x gather for depth2d textures.	2020-10-26 10:18:17 +01:00
Chip Davis	1264e2705e	MSL: Cast broadcast booleans to ushort. Metal doesn't support broadcasting or shuffling boolean values, but we can work around that by casting it to `ushort`, then casting it back to `bool`. I used `ushort` instead of `uint` because 16-bit values give better throughput on Apple GPUs.	2020-10-23 21:55:46 -05:00
Chip Davis	065b5bda3c	MSL: Mask ballots passed to Ballot bit ops. Only the least n bits are significant, where n is the subgroup size. The Vulkan CTS actually checks this. The `FindLSB` tests weren't actually failing, but I masked that anyway, in case there's some corner case the CTS is missing.	2020-10-23 21:55:46 -05:00
Chip Davis	781367d083	MSL: Support vectors with OpGroupNonUniformAllEqual. This was not tested here in SPIRV-Cross. Predictably, it broke when I tried it in the CTS.	2020-10-23 21:55:46 -05:00
Chip Davis	6ccb902462	MSL: Correct definitions of subgroup ballot mask variables. `SubgroupEqMask` had a fencepost error that gave wrong values for invocation ID 32. For `SubgroupGeMask` and `SubgroupGtMask`, I forgot to shift the values from `extract_bits()` up so that the mask is in the correct position. Using `insert_bits()` instead should fold these two operations into one. `SubgroupLtMask` and `SubgroupLeMask` were already correct.	2020-10-23 21:54:55 -05:00
Hans-Kristian Arntzen	e1600d4df8	MSL: Use input attachment index directly for resource index fallback.	2020-07-06 09:49:46 +02:00
dan sinclair	0abc017501	Roll deps and update tests. This CL rolls the GLSlang, SPIRV-Tools and SPIRV-Headers dependencies and updates the various test files.	2020-06-22 09:33:29 -04:00
Hans-Kristian Arntzen	0ebb88cc39	MSL: Redirect member indices when buffer has been sorted by Offset. If a buffer rewrites its Offsets, all member references to that struct are invalidated and must be redirected. Do so in to_member_reference; there might be other places where this is needed, to be fixed as required. SPIR-V code relying on this is somewhat questionable, but seems to be in-spec.	2020-04-30 11:48:53 +02:00
Hans-Kristian Arntzen	6ef47d6657	MSL: Fix case where subpassInput is passed to leaf functions.	2020-04-27 11:29:21 +02:00
Hans-Kristian Arntzen	3cb6aeb480	MSL: Fix access chain for deep struct hierarchy on array of buffers.	2020-03-31 14:17:29 +02:00
Hans-Kristian Arntzen	b8905bbd95	Add support for forcefully zero-initialized variables. Useful to better support certain platforms which require all variables to be initialized to something.	2020-03-26 13:38:27 +01:00
Hans-Kristian Arntzen	c3bd136df1	MSL: Add support for force-activating IAB resources. Important for ABI compatibility on MSL in certain cases.	2020-01-16 11:12:06 +01:00
Hans-Kristian Arntzen	d4ca91f6c2	Move .invalid. test shaders to the more appropriate subfolders.	2019-11-06 10:40:37 +01:00
Hans-Kristian Arntzen	d1479f871a	MSL: Do not generate UnsafeArray<> for any array inside buffer objects. This avoids a lot of huge code changes. Arrays generally cannot be copied in and out of buffers, at least no compiler frontend seems to do it. Also avoids a lot of issues surrounding packed vectors and matrices.	2019-10-24 12:22:30 +02:00
Lukas Hermanns	c3d6022956	Update for pull request #1162 rev. 1	2019-09-24 18:13:04 -04:00
Lukas Hermanns	7ad0a84778	Updates for pull request #1162	2019-09-24 14:35:25 -04:00
Lukas Hermanns	37df74035b	Merge branch 'ue4_dev'	2019-09-20 09:42:42 -04:00
Lukas Hermanns	744cc3e595	Updated test shaders.	2019-09-18 14:18:22 -04:00
Lukas Hermanns	cb3ecb9e1b	Updated reference Metal shaders.	2019-09-17 15:11:19 -04:00
Mark Satterthwaite	564cb3c08d	Update the Metal shaders to account for changes in the shader compilation.	2019-09-11 15:06:05 -04:00
Hans-Kristian Arntzen	63a770ed5c	Add test shader for simple case of interlocked callstack.	2019-09-04 11:56:19 +02:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Thomas Roughton	91b2f34a3d	Update tests to account for all non-entry-point functions being inlined	2019-08-30 09:39:06 +12:00
Hans-Kristian Arntzen	e2c95bdcbc	MSL: Rewrite how resource indices are fallback-assigned. We used to use the Binding decoration for this, but this method is hopelessly broken. If no explicit MSL resource remapping exists, we remap automatically in a manner which should always "just work".	2019-06-21 12:54:08 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	eaf7afed97	MSL: Support argument buffers and image swizzling. Change aux buffer to swizzle buffer. There is no good reason to expand the aux buffer, so name it appropriately. Make the code cleaner by emitting a straight pointer to uint rather than a dummy struct which only contains a single unsized array member anyways. This will also end up being very similar to how we implement swizzle buffers for argument buffers. Do not use implied binding if it overflows int32_t.	2019-05-18 10:30:06 +02:00
Hans-Kristian Arntzen	d9ed3dcc7a	Merge pull request #848 from cdavis5e/capture-output-buffer MSL: Add a setting to capture vertex shader output to a buffer.	2019-02-07 15:11:41 +01:00
Chip Davis	056c0e207d	Take the vertex count from any indirect parameters passed. This is necessary to deal with indirect draws, where the draw parameters are given in a buffer instead of passed by the CPU. For normal draws, the draw parameters are set with Metal's `setVertexBytes:` method. This undoes the change to add the vertex count to the aux buffer, rendering that entire discussion largely moot. Oh well. It was a discussion that needed to happen anyway.	2019-02-06 15:17:14 -06:00
Chip Davis	0757fae511	MSL: Stop passing the aux buffer around. Since we pass the component swizzle around now, there's no need to pass it to every function that takes a sampled image.	2019-02-05 20:04:32 -06:00
Chip Davis	c51e5b7911	MSL: Add a setting to capture vertex shader output to a buffer. This will be necessary to support transform feedback, as well as tessellation shaders.	2019-02-05 20:00:10 -06:00
Hans-Kristian Arntzen	18a4accd2f	HLSL/MSL: Fix texture projection with Dref. We need to divide the Dref by q.	2019-01-28 10:25:13 +01:00
Chip Davis	664df22d12	MSL: Fix passing a sampled image to a function. In the past, SPIRV-Cross threw an error in this case because it couldn't work out which swizzle from the auxiliary buffer needs to be passed. Now, we pass the swizzle around with the texture object, like a combined image-sampler and its associated sampler.	2019-01-14 09:29:31 -06:00
Chip Davis	3394f53734	MSL: Fix mapping of identity-swizzled components. Before, if any component was not identity-mapped, those components that were still identity-mapped were set to 0. Now we properly leave them alone.	2019-01-07 11:20:13 -06:00
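
The ballot-mask fix in commit 6ccb902462 above is easier to follow with the mask definitions written out. The following is a minimal host-side sketch, not SPIRV-Cross or MSL code: the `Mask` alias and every function name are hypothetical, and the mask is modelled as four 32-bit words in the manner of a `uvec4`. It assumes the usual Vulkan meaning of the builtins, where only bits below the subgroup size may be set. Invocation ID 32 is the interesting case because its bit is bit 0 of the second word.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical model of a 128-bit ballot mask: four 32-bit words, like a uvec4.
using Mask = std::array<uint32_t, 4>;

// Set every bit in the half-open range [lo, hi), clamped to 128 bits.
Mask range_mask(uint32_t lo, uint32_t hi) {
    Mask m{};
    for (uint32_t bit = lo; bit < hi && bit < 128; ++bit)
        m[bit / 32] |= 1u << (bit % 32);
    return m;
}

// id is the invocation index within the subgroup; size is the subgroup size.
Mask eq_mask(uint32_t id) { return range_mask(id, id + 1); }
Mask ge_mask(uint32_t id, uint32_t size) { return range_mask(id, size); }
Mask gt_mask(uint32_t id, uint32_t size) { return range_mask(id + 1, size); }
Mask le_mask(uint32_t id) { return range_mask(0, id + 1); }
Mask lt_mask(uint32_t id) { return range_mask(0, id); }

// Self-check for invocation 32 in a subgroup of 64 invocations.
bool masks_look_right() {
    const Mask eq = eq_mask(32);
    const Mask ge = ge_mask(32, 64);
    const Mask gt = gt_mask(32, 64);
    const Mask lt = lt_mask(32);
    assert(eq[0] == 0u && eq[1] == 1u);  // bit 32 lives in word 1, bit 0
    return eq == Mask{0u, 1u, 0u, 0u} &&
           ge == Mask{0u, 0xFFFFFFFFu, 0u, 0u} &&
           gt == Mask{0u, 0xFFFFFFFEu, 0u, 0u} &&
           lt == Mask{0xFFFFFFFFu, 0u, 0u, 0u};
}
```

Calling `masks_look_right()` from a test harness exercises the word boundary at bit 32, which is where an off-by-one in the word index or shift amount would show up.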

59 Commits