SPIRV-Cross

Author	SHA1	Message	Date
Erfan Ahmadi	43eecb2360	SPIRV-Cross contribution needed for `INTEL_fragment_shader_ordering`	2021-10-25 10:50:10 +02:00
Hans-Kristian Arntzen	2b5e17eca5	MSL: Never used templated array for RayQuery objects. Not supported and compiler derps out.	2021-10-21 22:02:01 +02:00
Hans-Kristian Arntzen	5afb3d313f	MSL: Fix some trivial bugs not caught by CI when adding ray query.	2021-10-21 21:53:41 +02:00
丛越	d52ec1e196	Fix all requested changes, test_shaders.py supports compiling MSL 2.4 shaders, and the Intersection Query currently only supports MSL 2.4 on the iOS platform.	2021-10-21 17:46:45 +08:00
丛越	597f29d09d	Support Metal 2.4 Intersection Query, Implement GL_EXT_ray_query.	2021-10-19 18:45:10 +08:00
Hans-Kristian Arntzen	6382f15470	Test behavior around OpSelect with matrices.	2021-10-13 16:08:29 +02:00
Hans-Kristian Arntzen	6071df5840	Fix wrong detection of trivial_mix_op. Effectively, only the last component of the select was considered, need to correctly early out if any case is hit.	2021-10-13 15:34:00 +02:00
Hans-Kristian Arntzen	97a438d214	Merge pull request #1757 from KhronosGroup/fix-1754 Improve handling of INT_MIN/INT64_MIN literals.	2021-09-30 17:04:30 +02:00
Hans-Kristian Arntzen	f72bb3c6f5	Improve handling of INT_MIN/INT64_MIN literals. We cannot naively convert these to decimal literals. C/C++ (and thus MSL) has extremely awkward literal promotion rules.	2021-09-30 16:29:30 +02:00
Hans-Kristian Arntzen	9b2a8c7622	HLSL: Ensure synthetic NumWorkgroups variable is considered active. In SPIR-V 1.4+, active global variables must be marked as such.	2021-09-30 14:39:42 +02:00
Hans-Kristian Arntzen	bb04156d3c	CLI/HLSL: Don't set explicit binding for synthesized NumWorkgroups CBV.	2021-09-30 14:30:49 +02:00
Bill Hollings	ec054dad7f	MSL: Support synthetic functions in function constants. Emit synthetic functions before function constants. Support use of spvQuantizeToF16() in function constants for numerical behavior consistency with the op code. Ensure subnormal results from OpQuantizeToF16 are flushed to zero per SPIR-V spec. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes. Any MSL reference shader that inclues a synthetic function is affected, since the location it is emitted has changed.	2021-09-28 19:10:16 -04:00
Bill Hollings	ba66a91402	MSL: Use vec<T, n> in template SpvHalfTypeSelector for function spvQuantizeToF16(). Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-25 14:36:42 -04:00
Bill Hollings	a2671e35b0	MSL: Consolidate spvQuantizeToF16() functions into a single template function. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 14:41:15 -04:00
Bill Hollings	5742047b24	MSL: Honor infinities in OpQuantizeToF16 when compiling using fast-math. Add spvQuantizeToF16() family of synthetic functions to convert from float to half and back again, and add function attribute [[clang::optnone]] to honor infinities during conversions. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 11:22:05 -04:00
Bill Hollings	fb3defc9ef	MSL: Honor DecorationNoContraction when compiling using fast-math. Add [[clang::optnone]] attribute to spvF*() functions used for handling floating point operations decorated with DecorationNoContraction. Just using precise::fma() did not work. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-23 14:37:08 -04:00
Bill Hollings	40141ffddf	MSL: Selectively enable fast-math in MSL code to match Vulkan CTS results. Based on CTS testing, math optimizations between MSL and Vulkan are inconsistent. In some cases, enabling MSL's fast-math compilation option matches Vulkan's math results. In other cases, disabling it does. Broadly enabling or disabling fast-math across all shaders results in some CTS test failures either way. To fix this, selectively enable/disable fast-math optimizations in the MSL code, using metal::fast and metal::precise function namespaces, where supported, and the [[clang::optnone]] function attribute otherwise. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-22 18:58:31 -04:00
Bill Hollings	35e92e6ffb	MSL: Return fragment function value even when last SPIR-V Op is discard (OpKill). Add test shader for new functionality. Add legacy test reference shader for unrelated buffer-bitcast test, that doesn't seem to have been added previously.	2021-09-12 16:28:21 -04:00
Bill Hollings	472f9d4f6d	Add tests for OpSpecConstantOp ops OpQuantizeToF16 and OpSRem. Tests provided by @cdavis5e.	2021-09-05 16:51:04 -04:00
Hans-Kristian Arntzen	b8f1e71907	GLSL: Emit GL_EXT_buffer_reference_uvec2 as required.	2021-09-02 13:17:13 +02:00
Hans-Kristian Arntzen	23c4480d8e	Fix switch fallthrough case in some cases.	2021-08-31 17:24:09 +02:00
Hans-Kristian Arntzen	2eea6a579b	MSL: Consider that function/private variables can be block-like. Handles a special case with array copies. The implementation of this fix is not perfect, but should be good enough for time being.	2021-08-23 13:26:45 +02:00
Hans-Kristian Arntzen	c062b6b852	Merge pull request #1725 from billhollings/fix-duplicate-glposition MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused.	2021-08-23 11:37:10 +02:00
Hans-Kristian Arntzen	fad1590786	Merge pull request #1722 from billhollings/row-maj-mtx-store-from-const MSL: Support row-major transpose when storing matrix from constant RHS matrix.	2021-08-23 11:29:01 +02:00
Bill Hollings	e76fcf9309	MSL: Add test for fixes to MSL constant expression type down-casting.	2021-08-16 13:56:05 -04:00
Bill Hollings	3105e82b2e	MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused. When gl_Position is defined by SPIR-V, but neither used nor initialized, it appeared twice in the MSL output, as gl_Position and glPosition_1. The existing tests for whether an output is active check only that it is used by an op, or initialized. Adding the implicit gl_Position also marked the existing gl_Position as active, duplicating the output variable. Fix is that when checking for the need to add an implicit gl_Position output, also check if the var is already defined in the shader, and just needs to be marked as active. Add test shader.	2021-08-16 11:23:15 -04:00
Bill Hollings	9552ca5473	MSL: Support row-major transpose when storing matrix from constant RHS matrix. Remove test and exception when storing row-major matrix from RHS that is not a SPIRExpression. Add test shaders.	2021-08-12 09:08:35 -04:00
Hans-Kristian Arntzen	cb613eb675	Handle value access in terminators. Fixes case where value is created inside loop body and consumed by a return outside it.	2021-07-29 15:27:52 +02:00
Hans-Kristian Arntzen	cd22336a38	Merge pull request #1712 from cdavis5e/msl-subgroup-ballot-simplify MSL: Simplify spvSubgroupBallot().	2021-07-22 12:15:33 +02:00
Chip Davis	03ad13bae6	MSL: Simplify spvSubgroupBallot(). A bitcast to `uint2` will do just fine. I honestly don't know why I didn't do it this way earlier.	2021-07-21 00:25:09 -05:00
Hans-Kristian Arntzen	18f3cd6810	GLSL: Ensure ray query object decls are flushed if allocated in Function. glslang always emits Private variables, but DXC not so much.	2021-07-20 12:04:00 +02:00
Hans-Kristian Arntzen	5b227cc57c	GLSL: Implement GL_EXT_ray_query.	2021-07-19 14:01:21 +02:00
Bill Hollings	ebb5098def	MSL: Adjust gl_SampleMaskIn for sample-shading and/or fixed sample mask. Vulkan specifies that the Sample Mask Test occurs before fragment shading. This means gl_SampleMaskIn should be influenced by both sample-shading and VkPipelineMultisampleStateCreateInfo::pSampleMask. CTS tests dEQP-VK.pipeline.multisample_shader_builtin.* bear this out. For sample-shading, gl_SampleMaskIn should only have a single bit set, Since Metal does not filter for this, apply a bitmask based on gl_SampleID. For a fixed sample mask, since Metal is unaware of VkPipelineMultisampleStateCreateInfo::pSampleMask, we need to ensure that we apply it to both gl_SampleMaskIn and gl_SampleMask. This has the side effect of a redundant application of pSampleMask if the shader already includes gl_SampleMaskIn when setting gl_SampleMask, but I don't see an easy way around this. Also, simplify the logic for including the fixed sample mask in gl_ShaderMask, and print the fixed sample mask as a hex value for readability of bits.	2021-07-13 21:22:13 -04:00
Hans-Kristian Arntzen	71b83a18f4	MSL: Add test for scalar access chain pull interpolant.	2021-07-13 12:25:18 +02:00
Hans-Kristian Arntzen	206ee8f171	GLSL: Support pervertexNV in NV barycentric extension.	2021-06-30 16:27:46 +02:00
Hans-Kristian Arntzen	d6b29ab017	HLSL: Rewrite how block IO is emitted. Emit block members directly in the IO structs and sort them. Ensures we can get some kind of stable order between stages. To complete the story, we'll need to be able to inject unused inputs / builtins, or eliminate unused outputs (probably easiest solution).	2021-06-28 15:04:49 +02:00
Hans-Kristian Arntzen	8216e87f02	Handle SPIR-V 1.4 selection constructs. Fix bug in to_trivial_mix_op, where we made a pre-1.4 assumption that component count of selector is equal to value component count.	2021-06-28 12:23:44 +02:00
Hans-Kristian Arntzen	2e1b5fb39e	Merge pull request #1686 from KhronosGroup/fix-1684 GLSL: Support control flow hints	2021-06-03 14:13:18 +02:00
Hans-Kristian Arntzen	449f68ef3b	Ensure loop control flow hints only appear above loops.	2021-06-03 12:19:10 +02:00
Hans-Kristian Arntzen	d62b3c2b92	GLSL: Implement control flow hints.	2021-06-03 12:01:49 +02:00
Hans-Kristian Arntzen	165dbff228	Handle odd type for textureGather component.	2021-06-03 11:37:45 +02:00
xndcn	02fb8f2a24	Add comment after inf/nan float number for clarifying.	2021-05-27 02:40:41 +08:00
Hans-Kristian Arntzen	bf3793dd35	MSL: Improve handling of split tessellation access chains.	2021-05-21 16:32:03 +02:00
Hans-Kristian Arntzen	a6c9514856	Merge pull request #1676 from KhronosGroup/fix-1671 GLSL: Implement noncoherent framebuffer fetch.	2021-05-21 15:43:58 +02:00
Hans-Kristian Arntzen	26a4986009	GLSL: Implement noncoherent framebuffer fetch.	2021-05-21 14:22:57 +02:00
Hans-Kristian Arntzen	99ae0d32e9	MSL: Handle array with component when we cannot rely on user() attrib. In these cases, we emit one variable per location, and so we must flatten stuff.	2021-05-21 13:46:33 +02:00
Hans-Kristian Arntzen	b8115ffbe0	HLSL: Implement invariant as precise. Only option we have.	2021-05-07 13:15:55 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Hans-Kristian Arntzen	6dbab0df47	Update reference output.	2021-05-07 11:12:22 +02:00
Lukas Taparauskas	72a2ec4c1b	MSL: Fix '--msl-multi-patch-workgroup' out of bounds reads when dispatching more threads than control points (#1662 ) * Fix '--msl-multi-patch-workgroup' cases where thread count exceeds data bounds Fix gl_PrimitiveID off by one error when computing last valid index Point gl_out to the last patch's data when threads exceed input data bounds Point patchOut to the last patch's data when threads exceed input data bounds Update MSL test expectations. * Undo change to MSL multi-patch hull output bound checks * Update MSL multi-patch test expectations.	2021-04-29 20:01:26 +02:00
Hans-Kristian Arntzen	c624d5387c	Merge pull request #1660 from KhronosGroup/fix-1658 MSL: Use proper array for quad tess levels.	2021-04-23 15:21:00 +02:00
Hans-Kristian Arntzen	82a77e534e	MSL: Use proper array for quad tess levels. We need to handle loads from array as well, so the float4 hack doesn't work.	2021-04-23 14:12:00 +02:00
Hans-Kristian Arntzen	0e963c62b6	HLSL: Support Shuffle wave ops. WaveReadLaneAt is no longer restricted to dynamically uniform index, so can implement the other shuffle ops.	2021-04-23 13:03:35 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00
Hans-Kristian Arntzen	d137abeef5	Merge pull request #1655 from KhronosGroup/fix-1640 GLSL: Support shading rate builtins.	2021-04-20 16:35:02 +02:00
Hans-Kristian Arntzen	8e24e0b224	Merge pull request #1654 from KhronosGroup/fix-1641 GLSL: Implement gl_FragFullyCoveredNV.	2021-04-20 16:34:53 +02:00
Hans-Kristian Arntzen	71eb1754e3	Merge pull request #1653 from KhronosGroup/fix-1638 GLSL: Support GL_EXT_shader_image_load_formatted.	2021-04-20 16:34:44 +02:00
Hans-Kristian Arntzen	c89b5a1a3f	GLSL: Support shading rate builtins.	2021-04-20 13:58:07 +02:00
Hans-Kristian Arntzen	3fd148450a	GLSL: Implement gl_FragFullyCoveredNV.	2021-04-20 13:44:52 +02:00
Hans-Kristian Arntzen	f93a8fb1fe	GLSL: Support GL_EXT_shader_image_load_formatted.	2021-04-20 13:36:51 +02:00
Hans-Kristian Arntzen	96ba044f01	HLSL: Fix automatic location assignment in block IO.	2021-04-20 13:04:26 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	986196030d	MSL: Don't use native arrays for tess level inputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	4a379a00f3	MSL: Don't emit native array for masked clip/cull distance.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	682a227f4b	MSL: Make builtin argument type declaration context sensitive. Sometimes we'll need array template, sometimes not 🤷.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c1edd35d57	MSL: Use spvUnsafeArray for builtin arrays after all. It will get too messy to deal with constant initializers any other way, so just deal with complexity in argument_decl instead ...	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	7b9a591aa7	MSL: Hoist out to_tesc_invocation_id() in more places. When emitting fixup code, we might not have gl_InvocationID yet.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	75ed73818c	MSL: Handle loading Clip/CullDistance in TESE. Need to allow the flattened space to go through in some edge cases where we cannot reasonably unflatten.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a159334895	MSL: Correctly analyze if builtin block is active. Need to consider all members, bi_type is invalid for Blocks, need to look at member decorations.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	cea934c03f	MSL: Test that we can capture cull distance to buffer.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5826298697	MSL: Handle CullDistance better.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	23da445bd4	MSL: Emit multiple threadgroup slices for multi-patch. Multiple patches can run in the same workgroup when using multi-patch mode, so we need to allocate enough storage to avoid false sharing.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	b442500204	MSL: Unroll initializations of CullDistance/ClipDistance control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c9946296dd	MSL: Fix initialization of masked threadgroup variables.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5e9c2d060e	MSL: Cleanup fallback IO block emission. Need to emit in add_variable_to_iface(). Unifies the code paths a fair bit.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e32c474911	MSL: Handle masking of TESC IO block members.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	dc54f75eec	MSL: Fixup gl_PerVertex names if we're emitting masked builtins.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	40f628f49c	MSL: Add test for complex control point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	46c48ee6b5	MSL: Rewrite how IO blocks are emitted in multi-patch mode. Firstly, never flatten inputs or outputs in multi-patch mode. The main scenario where we do need to care is Block IO. In this case, we should only flatten the top-level member, and after that we use access chains as normal. Using structs in Input storage class is now possible as well. We don't need to consider per-location fixups at all here. In Vulkan, IO structs must match exactly. Only plain vectors can have smaller vector sizes as a special case.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ff3f5bcba5	MSL: Handle masking of builtin control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	436b1250da	MSL: Do not perform scalar fixups for control-point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	74b2acab9b	MSL: Always emit block variable for block types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	857295a9ab	MSL: Add tests for masking with --for-tess.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	43b6ea2c9a	MSL: Remove position mask tests. They will fail compilation.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	65b5ff7ece	MSL: Don't emit weird reference type for spvUnsafeArray types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	50a6bc058a	MSL: Force builtin arrays for builtin array types. Handles argument_decl() correctly.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	88b54f5dab	MSL: Add tests for vertex output masking.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	0997e81118	MSL: Sort builtin IO block members by builtin type. Ensures consistent block matching.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ee31e84e30	GLSL: Handle complex load/store scenarios to gl_SampleMask. Need special workarounds to handle array load/store since array size is unsized in GLSL, and array copy is not possible. Also, consider bitcast for scalar loads and stores.	2021-03-09 10:25:03 +01:00
Hans-Kristian Arntzen	fb1f295aaf	Merge pull request #1635 from KhronosGroup/fix-1627 Handle edge cases in OpCopyMemory.	2021-03-09 10:21:35 +01:00
Hans-Kristian Arntzen	4ca06c7278	Handle edge cases in OpCopyMemory. Implement this by synthesizing an OpLoad/OpStore pair instead.	2021-03-08 14:15:27 +01:00
Hans-Kristian Arntzen	aea6d29aa8	MSL: Add test for logical subgroup arith ops.	2021-03-08 12:57:37 +01:00
Hans-Kristian Arntzen	d6c2c1b39a	HLSL: Support logical subgroup ops.	2021-03-08 12:52:03 +01:00
Hans-Kristian Arntzen	5570043af3	GLSL: Add support for Logical subgroup ops. Completely missed these ...	2021-03-08 12:06:46 +01:00
Hans-Kristian Arntzen	97796e0609	MSL: Deal with pointer-to-pointer qualifier ordering.	2021-02-26 13:37:14 +01:00
Hans-Kristian Arntzen	621884d709	Merge pull request #1622 from KhronosGroup/fix-1619 MSL: Handle load and store to TessLevel array in TESC.	2021-02-17 20:46:06 +01:00
Hans-Kristian Arntzen	85704f70bc	MSL: Handle load and store to TessLevel array in TESC. More edge cases ... :(	2021-02-17 13:26:08 +01:00
Hans-Kristian Arntzen	ce552f4f91	MSL: Gracefully assign automatic input locations to builtin attributes.	2021-02-17 12:29:19 +01:00
Hans-Kristian Arntzen	bae17e8204	Merge pull request #1617 from KhronosGroup/fix-1608 MSL: Fixup type when using tessellation levels in TESC functions.	2021-02-16 11:10:07 +01:00
Hans-Kristian Arntzen	daddbd4078	MSL: Fixup type when using tessellation levels in TESC functions. Need to rewrite array size depending on execution mode.	2021-02-15 13:28:11 +01:00
Hans-Kristian Arntzen	0ad12a0036	MSL: Always return [[position]] when required.	2021-02-15 12:57:37 +01:00
Hans-Kristian Arntzen	ea02a0c03a	Check entry point variables in is_hidden_variables. Need to be careful not to emit globals we're not supposed to.	2021-01-22 13:53:22 +01:00
Hans-Kristian Arntzen	4bedad3860	Handle nonuniformEXT qualifier for acceleration structures.	2021-01-22 13:13:56 +01:00
Hans-Kristian Arntzen	7ab3f3f74e	Deal better with CompositeExtract from constant composite. There is no good reason for applications to emit this kind of code, but some do. Special case this scenario.	2021-01-22 12:30:16 +01:00
Hans-Kristian Arntzen	66fb0bd9df	GLSL: Handle tracing against incoming payload/callable.	2021-01-22 11:23:04 +01:00
Hans-Kristian Arntzen	2097c30985	GLSL: Support both SPV_KHR_ray_tracing and NV_ray_tracing. Fairly minor differences, so can keep them side by side without too much effort. NV support is effectively deprecated now however. - Add OpConvertUToAccelerationStructureKHR - Ignore/Terminate ray is now a terminator in KHR, but a call in NV. - Fix some bugs with reportIntersection.	2021-01-08 14:59:04 +01:00
Hans-Kristian Arntzen	5d82d32e0f	Roll dependencies.	2021-01-08 10:41:51 +01:00
Hans-Kristian Arntzen	893a011299	MSL: Fix various bugs with framebuffer fetch on macOS and argument buffers. Introduce a helper to make it clearer if a resource can be considered for argument buffers or not.	2021-01-08 10:19:18 +01:00
Hans-Kristian Arntzen	3136e34215	MSL: Always use input_attachment_index for framebuffer fetch binding. --msl-decoration-binding would end up overriding the input attachment index to binding which is very unexpected and broken.	2021-01-08 10:17:42 +01:00
Hans-Kristian Arntzen	03ee71e86c	Add test for pure initializer gl_FragDepth. Tests that the builtin is considered active.	2021-01-07 15:32:15 +01:00
Hans-Kristian Arntzen	3776d8978c	GLSL: Force block declaration if clip/cull is used in tesc.	2021-01-07 15:32:15 +01:00
Hans-Kristian Arntzen	014b3bc5ea	MSL: Make sure initialized output builtins are considered active.	2021-01-07 15:32:13 +01:00
Hans-Kristian Arntzen	a4a9b53b5b	MSL: Always enable Outputs in vertex stages. Subsequent stages can legally attempt to read from these variables, which causes compilation failure. Always make sure we emit user outputs in vertex shaders if they are active in the entry point.	2021-01-07 11:24:47 +01:00
Hans-Kristian Arntzen	fa76d01203	MSL: Only consider builtin variables if they are part of IO interface.	2021-01-07 10:50:29 +01:00
Hans-Kristian Arntzen	efed4c9738	MSL: Fix initializer for tess level outputs. It's an array, not vector.	2021-01-06 10:39:39 +01:00
Hans-Kristian Arntzen	ab9200ffdf	MSL: Don't flatten builtin arrays unless they're part of IO interface.	2021-01-06 10:33:17 +01:00
Hans-Kristian Arntzen	df4f8ef8fe	MSL: Emit correct initializer for tessellation control points.	2021-01-05 15:16:49 +01:00
Hans-Kristian Arntzen	ad3e1584f9	MSL: Handle initializers for tess levels.	2021-01-05 13:25:50 +01:00
Hans-Kristian Arntzen	6a3ea0385e	GLSL: Add test for initializing tess level output.	2021-01-05 12:12:26 +01:00
Hans-Kristian Arntzen	175381fe08	GLSL: Handle some extreme edge cases in Output variable initialization. Deal with patch blocks, arrays of patch blocks, arrays of blocks, etc.	2021-01-05 12:06:36 +01:00
Hans-Kristian Arntzen	a1c784f002	More robust handling of initialized output builtin variables.	2021-01-04 19:12:43 +01:00
Hans-Kristian Arntzen	9a304fe931	Handle output IO block initializers more robustly.	2021-01-04 19:04:10 +01:00
Hans-Kristian Arntzen	ddb3c65648	Handle reserved identifiers for functions. gl_ identifiers are already handled by fixups, so remove redundant code.	2021-01-04 10:00:12 +01:00
Hans-Kristian Arntzen	c4ff129fe3	MSL: Handle reserved identifiers for entry point. We only considered invalid names, and overwrote the alias for the function. The correct fix is to replace illegal names early, do the reserved fixup, then copy back alias to entry point name.	2021-01-04 09:40:11 +01:00
Hans-Kristian Arntzen	c8765a75f2	GLSL: Fix KHR subgroup extension table for subgroups.	2020-12-11 12:26:43 +01:00
Hans-Kristian Arntzen	762c3082ae	Merge pull request #1564 from KhronosGroup/fix-1558 GLSL: Emit nonuniformEXT in correct place for late-combined samplers.	2020-12-07 14:07:38 +01:00
Hans-Kristian Arntzen	a11c4780d0	GLSL: Emit nonuniformEXT in correct place for late-combined samplers. Need to emit nonuniformEXT(sampler2D()) since constructor expressions in Vulkan GLSL do not propgate the nonuniform qualifier.	2020-12-07 13:00:15 +01:00
Hans-Kristian Arntzen	dc940846d7	GLSL/HLSL: Disallow VariablePointers capability outright. Cannot be supported, error out early.	2020-12-07 12:16:02 +01:00
comex	c80cbde7aa	spirv_msl: Don't add fixup hooks for builtin variables if they're unused. This is necessary to avoid invalid output because of how implicit dependencies on builtins work. For example, the fixup for `BuiltInSubgroupEqMask` initializes the variable based on `builtin_subgroup_invocation_id_id`, a field storing the ID for a variable with decoration `BuiltInSubgroupLocalInvocationId`. This could be either a variable that already exists in the input (spirv_msl.cpp:300) or, if necessary, a newly created one (spirv_msl.cpp:621). In both cases, though, `builtin_subgroup_invocation_id_id` is only set under the condition `need_subgroup_mask \|\| needs_subgroup_invocation_id`. `need_subgroup_mask` is true if any of the `BuiltInSubgroupXXMask` are set in `active_input_builtins`. Normally, if the program contains `BuiltInSubgroupEqMask`, `Compiler::ActiveBuiltinHandler` will set it in `active_input_builtins`. But this only happens if the variable is actually used, whereas `fix_up_shader_inputs_outputs` loops over all variables in the program regardless of whether they're used. If `BuiltInSubgroupEqMask` is not used, `builtin_subgroup_invocation_id_id` is never set, but before this patch the fixup hook would try to use it anyway, producing MSL that references a nonexistent variable named `_0`. Avoid this by changing `fix_up_shader_inputs_outputs` to skip builtins which are not set in `active_input_builtins` or `active_output_builtins`. And add a test case.	2020-11-25 13:41:12 -05:00
Chip Davis	1e67b21ee9	MSL: Don't mask off inactive bits in ballot masks. This was based on my misreading the spec. The Vulkan CTS expects the bits to be set, even if the invocations corresponding to them are inactive.	2020-11-25 09:29:51 -06:00
Chip Davis	fd738e3387	MSL: Adjust FragCoord for sample-rate shading. In Metal, the `[[position]]` input to a fragment shader remains at fragment center, even at sample rate, like OpenGL and Direct3D. In Vulkan, however, when the fragment shader runs at sample rate, the `FragCoord` builtin moves to the sample position in the framebuffer, instead of the fragment center. To account for this difference, adjust the `FragCoord`, if present, by the sample position. The -0.5 offset is because the fragment center is at (0.5, 0.5). Also, add an option to force sample-rate shading in a fragment shader. Since Metal has no explicit control for this, this is done by adding a dummy `[[sample_id]]` which is otherwise unused, if none is already present. This is intended to be used from e.g. MoltenVK when a pipeline's `minSampleShading` value is nonzero. Instead of checking if any `Input` variables have `Sample` interpolation, I've elected to check that the `SampleRateShading` capability is present. Since `SampleId`, `SamplePosition`, and the `Sample` interpolation decoration require this cap, this should be equivalent for any valid SPIR-V module. If this isn't acceptable, let me know.	2020-11-23 10:30:24 -06:00
Hans-Kristian Arntzen	e07f0a9df5	GLSL: Fix buffer_reference with aliased names.	2020-11-23 16:36:49 +01:00
Hans-Kristian Arntzen	c5826b4b69	GLSL: Emit storage qualifiers for buffer_reference.	2020-11-23 16:26:33 +01:00
Hans-Kristian Arntzen	650b5e1b12	HLSL: Fix validation with FXC for test.	2020-11-23 16:03:35 +01:00
Hans-Kristian Arntzen	6a614cc7f7	Normalize all internal workaround methods to use spv prefix. We have been interchanging spv and SPIRV_Cross_ for a while, which causes weirdness since we don't explicitly ban SPIRV_Cross identifiers, as these identifiers are generally used for interface variable workarounds.	2020-11-23 15:42:27 +01:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	1ee2d13873	MSL: Add missing reference file.	2020-11-11 16:25:01 +01:00
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations Fixes #1513.	2020-11-09 11:23:01 +01:00
Hans-Kristian Arntzen	71fcf0d9e6	Update texture gather test result.	2020-11-08 13:54:30 +01:00
Hans-Kristian Arntzen	46bf1e99d6	Merge pull request #1525 from cdavis5e/msl-interpolation-functions MSL: Support pull-model interpolation on MSL 2.3+.	2020-11-07 17:04:56 +01:00
Hans-Kristian Arntzen	683c3f5c3f	Merge pull request #1530 from rdb/legacy-glsl-round GLSL: Provide round/roundEven for legacy GLSL	2020-11-07 16:40:18 +01:00
Hans-Kristian Arntzen	ea334c14bc	Merge pull request #1527 from rdb/legacy-transpose GLSL: implement transpose() in GLSL 1.10 / ES 1.00	2020-11-07 16:37:59 +01:00
Hans-Kristian Arntzen	2417010046	Merge pull request #1528 from rdb/fix-legacy-vertex-shader-lod GLSL: Fix support for textureLod in legacy vertex shaders	2020-11-07 16:33:50 +01:00
rdb	bf71994dae	GLSL: implement transpose() in GLSL 1.10 / ES 1.00	2020-11-06 22:27:54 +01:00
rdb	9e6e5d2738	GLSL: Fix round/roundEven for legacy GLSL.	2020-11-06 17:34:38 +01:00
rdb	e8c500ceef	GLSL: Fix support for textureLod in legacy vertex shaders	2020-11-06 16:37:27 +01:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemeted for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00

1 2 3 4 5 ...

1258 Commits