SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	df76a14056	MSL: Refactor member reference in terms of one boolean. ptr_chain was really just masking the proper i == 0 check. Be more explicit about what the check is actually doing and comment this.	2022-11-21 13:40:27 +01:00
Dunfan Lu	e75c496ec6	Fix MSL Access Chain	2022-11-21 13:29:18 +01:00
Chip Davis	aa5a8c482e	MSL: Prevent stores to storage resources in discarded fragments. Some Metal devices have a bug where storage resources can still be written to even if the fragment is discarded. This is obviously a bug in Metal, but bothering Apple to fix it will only fix it for newer versions; therefore, a workaround is needed for older versions. I have made this an option so that, in case the bug is ever fixed, the workaround can be disabled. This workaround is simple: if a fragment shader may discard its fragment and writes to a storage resource, a variable representing the `HelperInvocation` built-in is created and passed to all functions. The flag is checked on all resource writes; writes do not occur when `HelperInvocation` is `true`. This relies on the earlier workaround to update `HelperInvocation` when the fragment is discarded. Fixes at least 3 failures in the CTS.	2022-11-20 01:29:41 -08:00
Chip Davis	c7ce92a95b	MSL: Manually update `BuiltInHelperInvocation` when a fragment is discarded. Some Metal devices have a bug where `simd_is_helper_thread()` won't return true after a fragment has been discarded. We can work around this by manually setting `gl_HelperInvocation` upon discarding a fragment. This is fairly unintrusive, so it is enabled by default. I've made it an option so that, when the bug is fixed, we can disable it.	2022-11-19 23:48:26 -08:00
Hans-Kristian Arntzen	2a49f7e82d	MSL: Fix restrict vs __restrict incompatibility. restrict was supported, but it broke in MSL 3.0. __restrict works on all versions, so opt for that instead. Also check for RestrictPointer decoration and refactor to_restrict() to not take optional parameter to make it more obvious when implied space character is added.	2022-10-26 17:52:47 +02:00
Chip Davis	0b679334e4	MSL: Don't flatten arrayed per-patch output blocks in tessellation shaders. Flattening doesn't play well with dynamic indices. In this case, it's better to leave it as an array of structs. (I wanted to do this for named blocks generally. Trouble is, the builtin `gl_out` block is also a named block...) Fixes six more CTS tests, under `dEQP-VK.tessellation.user_defined_io.per_patch_block_array.*`.	2022-10-18 15:04:42 -07:00
Chip Davis	a171087180	MSL: Support "raw" buffer input in tessellation evaluation shaders. Using vertex-style stage input is complex, and it doesn't support nesting of structures or arrays. By using raw buffer input instead, we get this support "for free," and everything becomes much simpler. Arguably, this is the way I should've done this in the first place. Eventually, I'd like to make this the default, and then remove the option altogether. (And I still need to do that with `multi_patch_workgroup`...) Should help fix 66 tests in the Vulkan CTS, under the following trees: - `dEQP-VK.pipeline..interface_matching.` - `dEQP-VK.tessellation.user_defined_io.` - `dEQP-VK.clipping.user_defined.`	2022-10-18 14:58:59 -07:00
Hans-Kristian Arntzen	4ecdb24e59	MSL: Expose way to query if a buffer needs array length.	2022-10-03 12:30:15 +02:00
Hans-Kristian Arntzen	24dc49e692	MSL: Handle descriptor aliasing of raw buffer descriptors. It is allowed to redeclare descriptors with different types in Vulkan. MSL in general does not allow this, but for raw buffers, we can cast the reference type at the very least. For typed resources we are kinda hosed. Without descriptor indexing's PARTIALLY_BOUND_BIT, descriptors must be valid if they are statically accessed, so it would not be valid to access differently typed aliases unless that flag is used. There might be a way to reinterpret cast descriptors, but that seems very sketchy. Implements support for: - Single discrete descriptor - Single argument buffer descriptor - Array of argument buffer descriptors Other cases are unimplemented for now since they are extremely painful to unroll.	2022-09-20 15:21:56 +02:00
Bill Hollings	5493b3030e	MSL: Support OpPtrEqual, OpPtrNotEqual, and OpPtrDiff. - Add CompilerMSL::emit_binary_ptr_op() and to_ptr_expression() to emit binary pointer op. Compare matrix addresses without automatic transpose() conversion, to avoid error taking address of temporary copy. - Add Compiler::add_active_interface_variable() to also track active interface vars in the entry point for SPIR-V 1.4 and above. - For OpPtrAccessChain that ends in array element, use Element as offset to existing index, otherwise it will access into array dimension that doesn't exist. - Dereference pointer function call arguments. Ultimately, this dereferencing is actually backwards, and in future, we should aim to properly support passing pointer variables between functions, but such a refactoring was beyond the scope here. - Use [] to declare array of pointers, as array<T*> is not supported in MSL. - Add unit test shaders.	2022-09-14 15:19:15 -04:00
Chip Davis	064eaebe72	MSL: Add a mechanism to fix up shader outputs. This is analogous to the existing support for fixing up shader inputs. It is intended to be used with tessellation to add implicit builtins that are read from a later stage, despite not being written in an earlier stage. (Believe it or not, this is in fact legal in Vulkan.) Helps fix 8 CTS tests under `dEQP-VK.pipeline.*.no_position`. (Eight other tests work solely by accident without this change.)	2022-09-09 17:06:34 -07:00
Chip Davis	fc4a12fd4f	MSL: Use a wrapper type for matrices in workgroup storage. The standard `matrix` type in MSL lacked a constructor in the `threadgroup` AS. This means that it was impossible to declare a `threadgroup` variable that contains a matrix. This appears to have been an oversight that was corrected in macOS 13/Xcode 14 beta 4. This workaround continues to be required, however, for older systems. To avoid changing interfaces unnecessarily (which shouldn't be a problem regardless because the old and new types take up the same amount of storage), only do this for structs if the struct is positively identified as being used for workgroup storage. I'm entirely aware this is inconsistent with the way packed matrices are handled. One of them should be changed to match the other. Not sure which one. Fixes 23 CTS tests under `dEQP-VK.memory_model.shared`.	2022-08-07 17:31:41 -07:00
Chip Davis	faea931de3	MSL: Also replace `bool` with `short` in structures. Since `bool` is a logical type, it cannot be used in uniform or storage buffers. Therefore, replacing it in structures should not change the shader interface. We leave it alone for builtins. (FIXME: Should we also leave it for I/O varyings?) Fixes 24 CTS tests under `dEQP-VK.memory_model.shared`.	2022-08-05 11:43:21 -07:00
Bill Hollings	4185acc70d	MSL: Fixes from review for SPV_KHR_physical_storage_buffer extension. - Assign ulongn physical type to buffer pointers in short arrays when array stride is larger than pointer size. - Support GL_EXT_buffer_reference_uvec2 casting buffer reference pointers to and from uvec2 values. - When packing structs, include structs inside physical buffers. - Update mechanism for traversing pointer arrays when calculating type sizes. - Added unit test shaders.	2022-07-01 16:10:41 -04:00
Roy.li	749be80389	Use types have same widths in loop condition. In case comparisons between types of different widths in a loop condition caused the loop to behave unexpectedly.	2022-03-24 14:26:03 +08:00
Hans-Kristian Arntzen	7b594c125e	Fix formatting nits from review.	2022-03-03 10:26:09 +01:00
Bill Hollings	3bb3b22b34	MSL: Non-functional fixes from PR code review.	2022-03-03 10:19:03 +01:00
Bill Hollings	3d4daab29d	MSL: Support input/output blocks containing nested struct arrays Fixes numerous CTS tests of types dEQP-VK.pipeline.interface_matching.vector_length.member_of_*, passing complex nested structs between stages as stage I/O. - Make add_composite_member_variable_to_interface_block() recursive to allow struct members to contain nested structs, building up member names and access chains recursively, and only add the resulting flattened leaf members to the synthetic input and output interface blocks. - Recursively generate individual location numbers for the flattened members of the input/output block. - Replace to_qualified_member_name() with append_member_name(). - Update add_variable_to_interface_block() to support arrays as struct members, adding a member to input and output interface blocks for each element of the array. - Pass name qualifiers to add_plain_member_variable_to_interface_block() to allow struct members to be arrays of structs, building up member names and access chains, and adding multiple distinct flattened leaf members to the synthetic input and output interface blocks. - Generate individual location numbers for the individual array members of the input/output block. - SPIRVCrossDecorationInterfaceMemberIndex references the index of a member of a variable that is a struct type. The value is relative to the variable, and for structs nested within that top-level struct, the index value needs to take into consideration the members within those nested structs. - Pass var_mbr_idx to add_plain_member_variable_to_interface_block() and add_composite_member_variable_to_interface_block(), start at zero for each variable, and increment for each member or nested member within that variable. - Add unit test shaders-msl/vert/out-block-with-nested-struct-array.vert - Add unit test shaders-msl/vert/out-block-with-struct-array.vert - Add unit test shaders-msl/tese/in-block-with-nested-struct.tese	2022-03-03 10:18:40 +01:00
Hans-Kristian Arntzen	5555f2784b	MSL: Refactor and fix use of quadgroup vs simdgroup.	2022-02-28 11:58:33 +01:00
Hans-Kristian Arntzen	5b952d2cbf	MSL: Rethink how opaque descriptors are passed to leaf functions. We were passing arrays by value which the compiler fails to optimize, causing abyssal performance. To fix this, we need to consider that descriptors can be in constant or const device address spaces. Also, lone descriptors are passed by value, so we explicitly remove address space qualifiers. One failure case is when shader passes a texture/sampler array as an argument. It's all UniformConstant in SPIR-V, but in MSL it might be thread, const device or constant, so that won't work ... Global variable use works fine though, and that should cover 99.9999999% of use cases.	2022-01-18 14:40:52 +01:00
Hans-Kristian Arntzen	5a5be7f9b9	MSL: Handle signed atomic min/max. C++ deduces this based on the pointer type, so cast to atomic_uint/int if we have to.	2022-01-17 15:40:58 +01:00
Bill Hollings	248e9ae9ed	MSL: Don't output depth and stencil values with explicit early fragment tests. Fragment shaders that require explicit early fragment tests are incompatible with specifying depth and stencil values within the shader. If explicit early fragment tests is specified, remove the depth and stencil outputs from the output structure, and replace them with dummy local variables. Add CompilerMSL:uses_explicit_early_fragment_test() function to consolidate testing for whether early fragment tests are required. Add two unit tests for depth-out with, and without, early fragment tests.	2021-11-12 14:17:00 -05:00
Hans-Kristian Arntzen	edf247fb1c	MSL: Workaround compiler crashes when using threadgroup bool. Promote to short instead and do simple casts on load/store instead. Not 100% complete fix since structs can contain booleans, but this is getting into pretty ridiculously complicated territory.	2021-10-25 10:55:11 +02:00
丛越	d52ec1e196	Fix all requested changes, test_shaders.py supports compiling MSL 2.4 shaders, and the Intersection Query currently only supports MSL 2.4 on the iOS platform.	2021-10-21 17:46:45 +08:00
丛越	597f29d09d	Support Metal 2.4 Intersection Query, Implement GL_EXT_ray_query.	2021-10-19 18:45:10 +08:00
Hans-Kristian Arntzen	325f107c5b	Merge pull request #1745 from billhollings/location-component-vecsize MSL: Track location component to match vecsize between shader stages.	2021-09-30 14:02:25 +02:00
Bill Hollings	5742047b24	MSL: Honor infinities in OpQuantizeToF16 when compiling using fast-math. Add spvQuantizeToF16() family of synthetic functions to convert from float to half and back again, and add function attribute [[clang::optnone]] to honor infinities during conversions. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 11:22:05 -04:00
Bill Hollings	548a23da34	MSL: Track location component to match vecsize between shader stages. Matching output/input struct member types between shader stages could fail if a location is shared between members, each using different components of that location, because the member vecsize was only stored once for the location. Add MSLShaderInput::component member. Use LocationComponentPair to key inputs_by_location, instead of just location. ensure_correct_input_type() pass component value as well as location.	2021-09-23 09:56:04 -04:00
Bill Hollings	86dfac12c8	MSL: Fix location and component variable matching between shader stages. Consolidate derivation of Metal 'user(locnL_C)' output/input location and component attribute qualifier, to establish SVOT across stages.	2021-09-18 18:55:12 -04:00
Bill Hollings	5fb1ca4f0d	Add support for additional ops in OpSpecConstantOp. MSL: Support op OpQuantizeToF16 in OpSpecConstantOp. All: Support op OpSRem in OpSpecConstantOp.	2021-09-03 18:20:49 -04:00
Bill Hollings	ebb5098def	MSL: Adjust gl_SampleMaskIn for sample-shading and/or fixed sample mask. Vulkan specifies that the Sample Mask Test occurs before fragment shading. This means gl_SampleMaskIn should be influenced by both sample-shading and VkPipelineMultisampleStateCreateInfo::pSampleMask. CTS tests dEQP-VK.pipeline.multisample_shader_builtin.* bear this out. For sample-shading, gl_SampleMaskIn should only have a single bit set, Since Metal does not filter for this, apply a bitmask based on gl_SampleID. For a fixed sample mask, since Metal is unaware of VkPipelineMultisampleStateCreateInfo::pSampleMask, we need to ensure that we apply it to both gl_SampleMaskIn and gl_SampleMask. This has the side effect of a redundant application of pSampleMask if the shader already includes gl_SampleMaskIn when setting gl_SampleMask, but I don't see an easy way around this. Also, simplify the logic for including the fixed sample mask in gl_ShaderMask, and print the fixed sample mask as a hex value for readability of bits.	2021-07-13 21:22:13 -04:00
Jon Leech	f2a65545b8	Finish adding SPDX tags and setup a reuse checked in Github Actions CI	2021-06-29 11:03:52 +02:00
Hans-Kristian Arntzen	d62b3c2b92	GLSL: Implement control flow hints.	2021-06-03 12:01:49 +02:00
Hans-Kristian Arntzen	99ae0d32e9	MSL: Handle array with component when we cannot rely on user() attrib. In these cases, we emit one variable per location, and so we must flatten stuff.	2021-05-21 13:46:33 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Hans-Kristian Arntzen	96ba044f01	HLSL: Fix automatic location assignment in block IO.	2021-04-20 13:04:26 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	7b9a591aa7	MSL: Hoist out to_tesc_invocation_id() in more places. When emitting fixup code, we might not have gl_InvocationID yet.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	75ed73818c	MSL: Handle loading Clip/CullDistance in TESE. Need to allow the flattened space to go through in some edge cases where we cannot reasonably unflatten.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	23da445bd4	MSL: Emit multiple threadgroup slices for multi-patch. Multiple patches can run in the same workgroup when using multi-patch mode, so we need to allocate enough storage to avoid false sharing.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	faf80b08fc	MSL: Don't report fallback location allocations as being "used". It may shadow unused real inputs and confuse applications.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5e9c2d060e	MSL: Cleanup fallback IO block emission. Need to emit in add_variable_to_iface(). Unifies the code paths a fair bit.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e32c474911	MSL: Handle masking of TESC IO block members.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	40f628f49c	MSL: Add test for complex control point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	46c48ee6b5	MSL: Rewrite how IO blocks are emitted in multi-patch mode. Firstly, never flatten inputs or outputs in multi-patch mode. The main scenario where we do need to care is Block IO. In this case, we should only flatten the top-level member, and after that we use access chains as normal. Using structs in Input storage class is now possible as well. We don't need to consider per-location fixups at all here. In Vulkan, IO structs must match exactly. Only plain vectors can have smaller vector sizes as a special case.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	425e968720	MSL: Handle flattening of patch block outputs as well. Always propagate InterfaceMember decoration.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	6ecdd64a91	MSL: Emit a masked builtin IO block if necessary.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f2b5fb3f45	MSL: Emit threadgroup storage class for masked control point outputs. Shader can still rely on writes to threadgroup memory to be visible.	2021-04-19 12:10:49 +02:00

1 2 3 4 5 ...

375 Commits