SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	b192b8887a	MSL: Consider that gl_IsHelperInvocation can be Volatile. Just emit simd_is_helper_thread() directly.	2022-03-04 11:46:35 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	2ce1e5c140	Merge pull request #1878 from mysterymath/move Qualify std::move.	2022-03-03 11:06:13 +01:00
Hans-Kristian Arntzen	7b594c125e	Fix formatting nits from review.	2022-03-03 10:26:09 +01:00
Bill Hollings	bb4ced0280	MSL: Document clarification of location incrementing.	2022-03-03 10:19:04 +01:00
Bill Hollings	3bb3b22b34	MSL: Non-functional fixes from PR code review.	2022-03-03 10:19:03 +01:00
Bill Hollings	3d4daab29d	MSL: Support input/output blocks containing nested struct arrays Fixes numerous CTS tests of types dEQP-VK.pipeline.interface_matching.vector_length.member_of_*, passing complex nested structs between stages as stage I/O. - Make add_composite_member_variable_to_interface_block() recursive to allow struct members to contain nested structs, building up member names and access chains recursively, and only add the resulting flattened leaf members to the synthetic input and output interface blocks. - Recursively generate individual location numbers for the flattened members of the input/output block. - Replace to_qualified_member_name() with append_member_name(). - Update add_variable_to_interface_block() to support arrays as struct members, adding a member to input and output interface blocks for each element of the array. - Pass name qualifiers to add_plain_member_variable_to_interface_block() to allow struct members to be arrays of structs, building up member names and access chains, and adding multiple distinct flattened leaf members to the synthetic input and output interface blocks. - Generate individual location numbers for the individual array members of the input/output block. - SPIRVCrossDecorationInterfaceMemberIndex references the index of a member of a variable that is a struct type. The value is relative to the variable, and for structs nested within that top-level struct, the index value needs to take into consideration the members within those nested structs. - Pass var_mbr_idx to add_plain_member_variable_to_interface_block() and add_composite_member_variable_to_interface_block(), start at zero for each variable, and increment for each member or nested member within that variable. - Add unit test shaders-msl/vert/out-block-with-nested-struct-array.vert - Add unit test shaders-msl/vert/out-block-with-struct-array.vert - Add unit test shaders-msl/tese/in-block-with-nested-struct.tese	2022-03-03 10:18:40 +01:00
Daniel Thornburgh	44c3333a1c	Qualify std::move. Clang added -Wunqualified-std-cast-call in https://reviews.llvm.org/D119670, which warns on unqualified std::move and std::forward calls. This change qualifies these calls to allow the project to build on HEAD Clang -Werror.	2022-03-02 23:17:58 +00:00
Hans-Kristian Arntzen	5555f2784b	MSL: Refactor and fix use of quadgroup vs simdgroup.	2022-02-28 11:58:33 +01:00
xndcn	1b5c406a3d	MSL: append entry point args to local variable names to avoid conflicts	2022-02-24 22:52:21 +08:00
Rodolphe	1a36968e5d	[Spirv_msl] Fix normalize on half3/half2 (#1856 ) When emitting a normalize OP, conditionally stop using the fast namespace	2022-02-01 23:17:22 +01:00
Hans-Kristian Arntzen	53d94a982e	Merge pull request #1852 from KhronosGroup/fix-1850 MSL: Rethink how opaque descriptors are passed to leaf functions.	2022-01-18 15:25:32 +01:00
Hans-Kristian Arntzen	5b952d2cbf	MSL: Rethink how opaque descriptors are passed to leaf functions. We were passing arrays by value which the compiler fails to optimize, causing abyssal performance. To fix this, we need to consider that descriptors can be in constant or const device address spaces. Also, lone descriptors are passed by value, so we explicitly remove address space qualifiers. One failure case is when shader passes a texture/sampler array as an argument. It's all UniformConstant in SPIR-V, but in MSL it might be thread, const device or constant, so that won't work ... Global variable use works fine though, and that should cover 99.9999999% of use cases.	2022-01-18 14:40:52 +01:00
Hans-Kristian Arntzen	48b5a9069f	Handle aliased names in spec constants. Need to register resource name.	2022-01-18 12:31:28 +01:00
Hans-Kristian Arntzen	9b25581d49	MSL: Handle constant construct of block-like array types. Need this to be context sensitive, since array of block-like struct is template, but struct of block-like array is C-style. Also, test a mix and match, so we have constant array of block-like struct with array inside. :v	2022-01-17 18:28:25 +01:00
Hans-Kristian Arntzen	5a5be7f9b9	MSL: Handle signed atomic min/max. C++ deduces this based on the pointer type, so cast to atomic_uint/int if we have to.	2022-01-17 15:40:58 +01:00
Hans-Kristian Arntzen	1d13a3e36a	Rework how loop iteration counts are validated. Introduces an idea of a recompilation making forward progress. There are some extreme edge cases where we need more than 3 loops, but only allow this in specific circumstances where we can reason about forward progress being made.	2022-01-17 14:12:01 +01:00
Hans-Kristian Arntzen	a10d72e3b1	Merge pull request #1834 from chirsz-ever/chirsz/fix-msl-copy-array MSL: Allow copy array from UniformConstant storage	2022-01-05 16:10:18 +01:00
Hans-Kristian Arntzen	be333e0cab	MSL: Move float2->3 TessCoord fixup to a better location.	2022-01-05 13:32:17 +01:00
chirsz-ever	90c252274d	MSL: Allow copy array from UniformConstant storage	2022-01-05 17:45:21 +08:00
Nikita Fediuchin	2acf0e73dd	Fix gl_TessCoord arguments presence. Update reference shaders. * Added check for "gl_TessCoord" presence in the entry point arguments. * Updated reference tessellation evaluation shaders.	2021-12-20 22:58:21 +02:00
Nikita Fediuchin	a66984072b	Fix incorrect MSL gl_TessCoord variable type Adds "In" postfix to the "gl_TessCoord" on entry point arguments decompilation stage if built in type is "BuiltInTessCoord" and execution mode is "ExecutionModeQuads". And declares shadow variable "gl_TessCoord" on entry point declarations.	2021-12-17 22:51:18 +02:00
Sebastián Aedo	6d8302ef14	MSL: Add 64 bit switch support Add 64 bit switch support for MSL version 2.2. * Also fixes a wrong endianness conversion. Signed-off-by: Sebastián Aedo <saedo@codeweavers.com>	2021-11-26 15:54:56 -03:00
Bill Hollings	248e9ae9ed	MSL: Don't output depth and stencil values with explicit early fragment tests. Fragment shaders that require explicit early fragment tests are incompatible with specifying depth and stencil values within the shader. If explicit early fragment tests is specified, remove the depth and stencil outputs from the output structure, and replace them with dummy local variables. Add CompilerMSL:uses_explicit_early_fragment_test() function to consolidate testing for whether early fragment tests are required. Add two unit tests for depth-out with, and without, early fragment tests.	2021-11-12 14:17:00 -05:00
Bill Hollings	fd252b21ff	Separate (partially) the tracking of depth images from depth compare ops. SPIR-V allows an image to be marked as a depth image, but with a non-depth format. Such images should be read or sampled as vectors instead of scalars, except when they are subject to compare operations. Don't mark an OpSampledImage as using a compare operation just because the image contains a depth marker. Instead, require that a compare operation is actually used on that image. Compiler::image_is_comparison() was really testing whether an image is a depth image, since it incorporates the depth marker. Rename that function to is_depth_image(), to clarify what it is really testing. In Compiler::is_depth_image(), do not treat an image as a depth image if it has been explicitly marked with a color format, unless the image is subject to compare operations. In CompilerMSL::to_function_name(), test for compare operations specifically, rather than assuming them from the depth-image marker. CompilerGLSL and CompilerMSL still contain a number of internal tests that use is_depth_image() both for testing for a depth image, and for testing whether compare operations are being used. I've left these as they are for now, but these should be cleaned up at some point. Add unit tests for fetch/sample depth images with color formats and no compare ops.	2021-11-08 15:59:45 -05:00
Hans-Kristian Arntzen	21a15b90e5	Merge pull request #1804 from KhronosGroup/fix-1759 MSL: Handle non-thread storage class in Modf/Frexp pointer versions.	2021-11-07 13:36:32 +01:00
Hans-Kristian Arntzen	e40d19bdbf	MSL: Handle non-thread storage class in Modf/Frexp pointer versions.	2021-11-07 12:51:15 +01:00
Hans-Kristian Arntzen	f1d4aff85e	Merge pull request #1805 from KhronosGroup/fix-1776 Handle Modf/Frexp in more cases.	2021-11-07 12:36:42 +01:00
Hans-Kristian Arntzen	4561ecddbd	Handle Modf/Frexp in more cases. Consider it a write to a variable, similar to OpStore.	2021-11-07 11:36:44 +01:00
Bill Hollings	4ebd56bf9e	MSL: Correctly emit user(clip/cullN) for clip/cull builtins in tess output struct. Only emit user(locnN) for tess builtin input variables, and allow output builtin to emit user(clip/cullN). Previously, output builtin would emit location if input builtin also existed.	2021-11-03 00:03:26 -04:00
Bill Hollings	be812c45e5	MSL: Remove over-zealous check for struct packing compatibility. Previous test for SPIRVCrossDecorationPhysicalTypePacked on parent struct when unpacking member struct was too restrictive, and not needed as long as padding compensates.	2021-10-28 19:36:32 -04:00
Bill Hollings	76cb807c19	MSL: Fix type redirection when struct members are reordered to align with offsets. Populate member_type_index_redirection as reverse lookup, not forward lookup. Move use of member_type_index_redirection from CompilerMSL::to_member_reference() to CompilerGLSL::access_chain_internal() to access all redirected type info, not just name.	2021-10-28 10:16:34 -04:00
Hans-Kristian Arntzen	edf247fb1c	MSL: Workaround compiler crashes when using threadgroup bool. Promote to short instead and do simple casts on load/store instead. Not 100% complete fix since structs can contain booleans, but this is getting into pretty ridiculously complicated territory.	2021-10-25 10:55:11 +02:00
Hans-Kristian Arntzen	2a56c2bc04	Merge pull request #1783 from billhollings/more-unpacked-vectors MSL: Support more usecases for unpacked vectors.	2021-10-21 23:00:14 +02:00
Bill Hollings	974a0818b8	MSL: Support more usecases for unpacked vectors. Additional usecases include array and vector indexing, variable declarations, loop initializers, function return values, switch statement evaluations, and various specialized MSL operations. Ultimately, we might consider refactoring CompilerMSL::to_expression() to always take into consideration possible unpacking behavior. Refactor CompilerGLSL::to_enclosed_unpacked_expression() for conciseness and consistency with similar functionality.	2021-10-21 16:11:33 -04:00
Hans-Kristian Arntzen	2b5e17eca5	MSL: Never used templated array for RayQuery objects. Not supported and compiler derps out.	2021-10-21 22:02:01 +02:00
Hans-Kristian Arntzen	bc338710e1	MSL: Remove some redundant breaks.	2021-10-21 22:01:50 +02:00
Hans-Kristian Arntzen	5afb3d313f	MSL: Fix some trivial bugs not caught by CI when adding ray query.	2021-10-21 21:53:41 +02:00
丛越	2895a43274	Fix unsupported exceptions for ConvertUToAccelerationStructure & BindingTableRecordOffset.	2021-10-21 19:01:38 +08:00
丛越	d52ec1e196	Fix all requested changes, test_shaders.py supports compiling MSL 2.4 shaders, and the Intersection Query currently only supports MSL 2.4 on the iOS platform.	2021-10-21 17:46:45 +08:00
丛越	597f29d09d	Support Metal 2.4 Intersection Query, Implement GL_EXT_ray_query.	2021-10-19 18:45:10 +08:00
Hans-Kristian Arntzen	9462b90067	Merge pull request #1752 from billhollings/function-constants-opquantize MSL: Support synthetic functions in function constants.	2021-09-30 14:06:10 +02:00
Hans-Kristian Arntzen	325f107c5b	Merge pull request #1745 from billhollings/location-component-vecsize MSL: Track location component to match vecsize between shader stages.	2021-09-30 14:02:25 +02:00
Bill Hollings	ec054dad7f	MSL: Support synthetic functions in function constants. Emit synthetic functions before function constants. Support use of spvQuantizeToF16() in function constants for numerical behavior consistency with the op code. Ensure subnormal results from OpQuantizeToF16 are flushed to zero per SPIR-V spec. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes. Any MSL reference shader that inclues a synthetic function is affected, since the location it is emitted has changed.	2021-09-28 19:10:16 -04:00
Bill Hollings	ba66a91402	MSL: Use vec<T, n> in template SpvHalfTypeSelector for function spvQuantizeToF16(). Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-25 14:36:42 -04:00
Bill Hollings	a2671e35b0	MSL: Consolidate spvQuantizeToF16() functions into a single template function. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 14:41:15 -04:00
Bill Hollings	5742047b24	MSL: Honor infinities in OpQuantizeToF16 when compiling using fast-math. Add spvQuantizeToF16() family of synthetic functions to convert from float to half and back again, and add function attribute [[clang::optnone]] to honor infinities during conversions. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 11:22:05 -04:00
Bill Hollings	fb3defc9ef	MSL: Honor DecorationNoContraction when compiling using fast-math. Add [[clang::optnone]] attribute to spvF*() functions used for handling floating point operations decorated with DecorationNoContraction. Just using precise::fma() did not work. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-23 14:37:08 -04:00
Bill Hollings	548a23da34	MSL: Track location component to match vecsize between shader stages. Matching output/input struct member types between shader stages could fail if a location is shared between members, each using different components of that location, because the member vecsize was only stored once for the location. Add MSLShaderInput::component member. Use LocationComponentPair to key inputs_by_location, instead of just location. ensure_correct_input_type() pass component value as well as location.	2021-09-23 09:56:04 -04:00
Hans-Kristian Arntzen	d579444a82	Merge pull request #1744 from billhollings/location-attribute-fix MSL: Fix location and component variable matching between shader stages.	2021-09-23 12:45:02 +02:00
Bill Hollings	40141ffddf	MSL: Selectively enable fast-math in MSL code to match Vulkan CTS results. Based on CTS testing, math optimizations between MSL and Vulkan are inconsistent. In some cases, enabling MSL's fast-math compilation option matches Vulkan's math results. In other cases, disabling it does. Broadly enabling or disabling fast-math across all shaders results in some CTS test failures either way. To fix this, selectively enable/disable fast-math optimizations in the MSL code, using metal::fast and metal::precise function namespaces, where supported, and the [[clang::optnone]] function attribute otherwise. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-22 18:58:31 -04:00
Bill Hollings	86dfac12c8	MSL: Fix location and component variable matching between shader stages. Consolidate derivation of Metal 'user(locnL_C)' output/input location and component attribute qualifier, to establish SVOT across stages.	2021-09-18 18:55:12 -04:00
Bill Hollings	2a150f5a2f	MSL: Add explanatory comment requested in PR review.	2021-09-14 14:58:42 -04:00
Bill Hollings	35e92e6ffb	MSL: Return fragment function value even when last SPIR-V Op is discard (OpKill). Add test shader for new functionality. Add legacy test reference shader for unrelated buffer-bitcast test, that doesn't seem to have been added previously.	2021-09-12 16:28:21 -04:00
Bill Hollings	5fb1ca4f0d	Add support for additional ops in OpSpecConstantOp. MSL: Support op OpQuantizeToF16 in OpSpecConstantOp. All: Support op OpSRem in OpSpecConstantOp.	2021-09-03 18:20:49 -04:00
Hans-Kristian Arntzen	2eea6a579b	MSL: Consider that function/private variables can be block-like. Handles a special case with array copies. The implementation of this fix is not perfect, but should be good enough for time being.	2021-08-23 13:26:45 +02:00
Hans-Kristian Arntzen	c062b6b852	Merge pull request #1725 from billhollings/fix-duplicate-glposition MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused.	2021-08-23 11:37:10 +02:00
Hans-Kristian Arntzen	fad1590786	Merge pull request #1722 from billhollings/row-maj-mtx-store-from-const MSL: Support row-major transpose when storing matrix from constant RHS matrix.	2021-08-23 11:29:01 +02:00
Bill Hollings	3105e82b2e	MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused. When gl_Position is defined by SPIR-V, but neither used nor initialized, it appeared twice in the MSL output, as gl_Position and glPosition_1. The existing tests for whether an output is active check only that it is used by an op, or initialized. Adding the implicit gl_Position also marked the existing gl_Position as active, duplicating the output variable. Fix is that when checking for the need to add an implicit gl_Position output, also check if the var is already defined in the shader, and just needs to be marked as active. Add test shader.	2021-08-16 11:23:15 -04:00
Bill Hollings	a75fe07546	MSL: Fix casting in constant expressions with different sizes. Previous casting in constant expressions used as_type<> between types of different overall sizes. Add check for overall size (width * vecsize) to ensure as_type<> will work, otherwise use regular cast. Also beef up test of integer values to also check vecsize, and use regular casts for those.	2021-08-12 16:19:46 -04:00
Bill Hollings	9552ca5473	MSL: Support row-major transpose when storing matrix from constant RHS matrix. Remove test and exception when storing row-major matrix from RHS that is not a SPIRExpression. Add test shaders.	2021-08-12 09:08:35 -04:00
Chip Davis	03ad13bae6	MSL: Simplify spvSubgroupBallot(). A bitcast to `uint2` will do just fine. I honestly don't know why I didn't do it this way earlier.	2021-07-21 00:25:09 -05:00
Hans-Kristian Arntzen	6196e3b029	MSL: Remove redundant path for SampleMask.	2021-07-15 16:34:06 +02:00
Bill Hollings	ebb5098def	MSL: Adjust gl_SampleMaskIn for sample-shading and/or fixed sample mask. Vulkan specifies that the Sample Mask Test occurs before fragment shading. This means gl_SampleMaskIn should be influenced by both sample-shading and VkPipelineMultisampleStateCreateInfo::pSampleMask. CTS tests dEQP-VK.pipeline.multisample_shader_builtin.* bear this out. For sample-shading, gl_SampleMaskIn should only have a single bit set, Since Metal does not filter for this, apply a bitmask based on gl_SampleID. For a fixed sample mask, since Metal is unaware of VkPipelineMultisampleStateCreateInfo::pSampleMask, we need to ensure that we apply it to both gl_SampleMaskIn and gl_SampleMask. This has the side effect of a redundant application of pSampleMask if the shader already includes gl_SampleMaskIn when setting gl_SampleMask, but I don't see an easy way around this. Also, simplify the logic for including the fixed sample mask in gl_ShaderMask, and print the fixed sample mask as a hex value for readability of bits.	2021-07-13 21:22:13 -04:00
Bill Hollings	3e04eee491	MSL: Fix setting SPIRVCrossDecorationInterpolantComponentExpr decoration. When setting SPIRVCrossDecorationInterpolantComponentExpr decoration, take into consideration a possible array of vectors in an access chain, and don't apply the decoration until we traverse through the array into the vector itself.	2021-07-12 14:52:07 -04:00
Peter Kasting	7cdab07efe	Fix -Wunreachable-code-aggressive. Bug: chromium:1066980	2021-06-30 09:17:59 -07:00
Jon Leech	f2a65545b8	Finish adding SPDX tags and setup a reuse checked in Github Actions CI	2021-06-29 11:03:52 +02:00
Hans-Kristian Arntzen	d62b3c2b92	GLSL: Implement control flow hints.	2021-06-03 12:01:49 +02:00
Hans-Kristian Arntzen	bf3793dd35	MSL: Improve handling of split tessellation access chains.	2021-05-21 16:32:03 +02:00
Hans-Kristian Arntzen	99ae0d32e9	MSL: Handle array with component when we cannot rely on user() attrib. In these cases, we emit one variable per location, and so we must flatten stuff.	2021-05-21 13:46:33 +02:00
Hans-Kristian Arntzen	a64ddcdd49	MSL: Handle array of IO variable with Component decoration.	2021-05-21 12:31:33 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Lukas Taparauskas	72a2ec4c1b	MSL: Fix '--msl-multi-patch-workgroup' out of bounds reads when dispatching more threads than control points (#1662 ) * Fix '--msl-multi-patch-workgroup' cases where thread count exceeds data bounds Fix gl_PrimitiveID off by one error when computing last valid index Point gl_out to the last patch's data when threads exceed input data bounds Point patchOut to the last patch's data when threads exceed input data bounds Update MSL test expectations. * Undo change to MSL multi-patch hull output bound checks * Update MSL multi-patch test expectations.	2021-04-29 20:01:26 +02:00
Bill Hollings	098cdd64f6	MSL: Padding for Metal argument buffers should not double-count SampledImages.	2021-04-26 13:19:23 -04:00
Hans-Kristian Arntzen	82a77e534e	MSL: Use proper array for quad tess levels. We need to handle loads from array as well, so the float4 hack doesn't work.	2021-04-23 14:12:00 +02:00
Hans-Kristian Arntzen	96ba044f01	HLSL: Fix automatic location assignment in block IO.	2021-04-20 13:04:26 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	986196030d	MSL: Don't use native arrays for tess level inputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	4a379a00f3	MSL: Don't emit native array for masked clip/cull distance.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	682a227f4b	MSL: Make builtin argument type declaration context sensitive. Sometimes we'll need array template, sometimes not 🤷.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c1edd35d57	MSL: Use spvUnsafeArray for builtin arrays after all. It will get too messy to deal with constant initializers any other way, so just deal with complexity in argument_decl instead ...	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	7b9a591aa7	MSL: Hoist out to_tesc_invocation_id() in more places. When emitting fixup code, we might not have gl_InvocationID yet.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	75ed73818c	MSL: Handle loading Clip/CullDistance in TESE. Need to allow the flattened space to go through in some edge cases where we cannot reasonably unflatten.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a159334895	MSL: Correctly analyze if builtin block is active. Need to consider all members, bi_type is invalid for Blocks, need to look at member decorations.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5826298697	MSL: Handle CullDistance better.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	23da445bd4	MSL: Emit multiple threadgroup slices for multi-patch. Multiple patches can run in the same workgroup when using multi-patch mode, so we need to allocate enough storage to avoid false sharing.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	b442500204	MSL: Unroll initializations of CullDistance/ClipDistance control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c9946296dd	MSL: Fix initialization of masked threadgroup variables.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	faf80b08fc	MSL: Don't report fallback location allocations as being "used". It may shadow unused real inputs and confuse applications.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5e9c2d060e	MSL: Cleanup fallback IO block emission. Need to emit in add_variable_to_iface(). Unifies the code paths a fair bit.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e32c474911	MSL: Handle masking of TESC IO block members.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	dc54f75eec	MSL: Fixup gl_PerVertex names if we're emitting masked builtins.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	40f628f49c	MSL: Add test for complex control point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	46c48ee6b5	MSL: Rewrite how IO blocks are emitted in multi-patch mode. Firstly, never flatten inputs or outputs in multi-patch mode. The main scenario where we do need to care is Block IO. In this case, we should only flatten the top-level member, and after that we use access chains as normal. Using structs in Input storage class is now possible as well. We don't need to consider per-location fixups at all here. In Vulkan, IO structs must match exactly. Only plain vectors can have smaller vector sizes as a special case.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	425e968720	MSL: Handle flattening of patch block outputs as well. Always propagate InterfaceMember decoration.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	8e2dbe0d38	MSL: Do not declare patch variables on stack.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ff3f5bcba5	MSL: Handle masking of builtin control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	6ecdd64a91	MSL: Emit a masked builtin IO block if necessary.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	22c9b63e78	MSL: Fix argument_decl check for builtin.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c635c35c12	MSL: Temporarily allow empty output struct.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a59e25db18	MSL: Small refactors.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	74b2acab9b	MSL: Always emit block variable for block types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	3255d6cef0	MSL: Explicitly only consider masked variables to be thread-group-like.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a393de31e6	MSL: Refactor out variable/block member masking.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e7b37392bf	MSL: Emit correct address space for masked arguments.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	65b5ff7ece	MSL: Don't emit weird reference type for spvUnsafeArray types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e7824c8b6e	MSL: Handle masked outputs in extract_global_variables. Need to conditionally add gl_in or gl_out.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	50a6bc058a	MSL: Force builtin arrays for builtin array types. Handles argument_decl() correctly.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f2b5fb3f45	MSL: Emit threadgroup storage class for masked control point outputs. Shader can still rely on writes to threadgroup memory to be visible.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ea91579a7f	MSL: Do not redirect tess access chains on masked outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f682e89188	MSL: Correctly emit array type for masked outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	2a2d57df13	MSL: Sketch out API to aid LTO-style optimization.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	9a144bb2b9	Clean up member sorting.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	0997e81118	MSL: Sort builtin IO block members by builtin type. Ensures consistent block matching.	2021-04-19 12:10:49 +02:00
Bill Hollings	b3bfe22eaa	MSL: Fixes to support padding Metal argument buffer entries based on argument index. For buffers, support all MSLResourceBinding::basetype pointers, not just void*. Rename MSLResourceBinding::base_type to basetype for consistent use in other structs.	2021-04-18 17:34:55 -04:00
Bill Hollings	daba0dfba6	MSL: Fixes to support padding Metal argument buffer entries based on argument index. For completeness, add [[id(N)]] qualifier to padding struct members. Run clang-format.	2021-04-17 15:20:53 -04:00
Bill Hollings	9060e5a13c	MSL: Fixes to support padding Metal argument buffer entries based on argument index. Use separate lookups for texture and sampler members when padding for SamplerImages. Remove unreachable code following SPIRV_CROSS_THROW.	2021-04-16 15:00:59 -04:00
Bill Hollings	9866cf4496	MSL: Fixes to support padding Metal argument buffer entries based on argument index. Add lookup from argument buffer argument index to resource binding for efficiency. Fix error in advancing padding counts with combined image samplers. Run clang-format.	2021-04-16 09:05:15 -04:00
Bill Hollings	17dab614dc	MSL: Support padding Metal argument buffer entries based on argument index. If CompilerMSL::Options::pad_argument_buffer_resources enabled, Metal argument buffer struct members are positionally aligned to their argument indexes by adding synthetic padding members when needed. The types and sizes of these synthetic members are identified in the resource_bindings vector provided through the API. Add CompilerMSL::Options::pad_argument_buffer_resources to enable padding Metal argument buffer structs to positionally match members to argument indexes. Add MSLResourceBinding::base_type to identify resource type through API.	2021-04-13 19:01:20 -04:00
Mehmet Oguz Derin	e2f7a753d2	Move condition to default block	2021-03-26 16:29:44 +03:00
Mehmet Oguz Derin	0a0c9db9be	MSL: Support long ulong types in buffers in 2.3+. [Metal Shading Language Specification](https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf) states that: > Note: As of Metal 2.3, Metal supports buffers that contain long or ulong data types. This PR aims to support long and ulong data types in buffers for Metal 2.3+.	2021-03-26 02:41:26 +03:00
Hans-Kristian Arntzen	97796e0609	MSL: Deal with pointer-to-pointer qualifier ordering.	2021-02-26 13:37:14 +01:00
Hans-Kristian Arntzen	621884d709	Merge pull request #1622 from KhronosGroup/fix-1619 MSL: Handle load and store to TessLevel array in TESC.	2021-02-17 20:46:06 +01:00
Hans-Kristian Arntzen	85704f70bc	MSL: Handle load and store to TessLevel array in TESC. More edge cases ... :(	2021-02-17 13:26:08 +01:00
Hans-Kristian Arntzen	ce552f4f91	MSL: Gracefully assign automatic input locations to builtin attributes.	2021-02-17 12:29:19 +01:00
Hans-Kristian Arntzen	aa271c1460	MSL: Refactor out location consumption count computation.	2021-02-17 11:29:33 +01:00
Hans-Kristian Arntzen	bae17e8204	Merge pull request #1617 from KhronosGroup/fix-1608 MSL: Fixup type when using tessellation levels in TESC functions.	2021-02-16 11:10:07 +01:00
Hans-Kristian Arntzen	daddbd4078	MSL: Fixup type when using tessellation levels in TESC functions. Need to rewrite array size depending on execution mode.	2021-02-15 13:28:11 +01:00
Hans-Kristian Arntzen	0ad12a0036	MSL: Always return [[position]] when required.	2021-02-15 12:57:37 +01:00
Hans-Kristian Arntzen	4704482bbc	meta: Update copyright headers to 2021.	2021-01-14 16:07:49 +01:00
Hans-Kristian Arntzen	893a011299	MSL: Fix various bugs with framebuffer fetch on macOS and argument buffers. Introduce a helper to make it clearer if a resource can be considered for argument buffers or not.	2021-01-08 10:19:18 +01:00
Hans-Kristian Arntzen	3136e34215	MSL: Always use input_attachment_index for framebuffer fetch binding. --msl-decoration-binding would end up overriding the input attachment index to binding which is very unexpected and broken.	2021-01-08 10:17:42 +01:00
Hans-Kristian Arntzen	fa76d01203	MSL: Only consider builtin variables if they are part of IO interface.	2021-01-07 10:50:29 +01:00
Hans-Kristian Arntzen	c8837d7d80	MSL: Very slight refactor.	2021-01-07 10:27:14 +01:00
Hans-Kristian Arntzen	efed4c9738	MSL: Fix initializer for tess level outputs. It's an array, not vector.	2021-01-06 10:39:39 +01:00
Hans-Kristian Arntzen	ab9200ffdf	MSL: Don't flatten builtin arrays unless they're part of IO interface.	2021-01-06 10:33:17 +01:00
Hans-Kristian Arntzen	df4f8ef8fe	MSL: Emit correct initializer for tessellation control points.	2021-01-05 15:16:49 +01:00
Hans-Kristian Arntzen	ad3e1584f9	MSL: Handle initializers for tess levels.	2021-01-05 13:25:50 +01:00
Hans-Kristian Arntzen	1a38fec382	Minor redundant nit.	2021-01-04 19:12:46 +01:00
Hans-Kristian Arntzen	a1c784f002	More robust handling of initialized output builtin variables.	2021-01-04 19:12:43 +01:00
Hans-Kristian Arntzen	c4ff129fe3	MSL: Handle reserved identifiers for entry point. We only considered invalid names, and overwrote the alias for the function. The correct fix is to replace illegal names early, do the reserved fixup, then copy back alias to entry point name.	2021-01-04 09:40:11 +01:00
Hans-Kristian Arntzen	6d10da0224	Merge pull request #1553 from comex/no-subgroups-in-vertex-shaders msl: Don't try to use [[thread_index_in_simdgroup]] in vertex shaders.	2020-12-02 19:29:19 +01:00
Hans-Kristian Arntzen	cf1e9e0643	Add MIT dual license for the SPIRV-Cross API.	2020-12-01 16:47:08 +01:00
Jan Sikorski	c09a65c12d	MSL: Added fmin3 and fmax3 library functions to the illegal name list.	2020-11-27 15:25:34 +01:00
comex	5a85fa9400	msl: Don't try to use [[thread_index_in_simdgroup]] in vertex shaders. According to the Metal Shading Language Specification, it's not supported for vertex functions in any Metal version, only fragment and kernel functions.	2020-11-26 20:19:08 -05:00
comex	c80cbde7aa	spirv_msl: Don't add fixup hooks for builtin variables if they're unused. This is necessary to avoid invalid output because of how implicit dependencies on builtins work. For example, the fixup for `BuiltInSubgroupEqMask` initializes the variable based on `builtin_subgroup_invocation_id_id`, a field storing the ID for a variable with decoration `BuiltInSubgroupLocalInvocationId`. This could be either a variable that already exists in the input (spirv_msl.cpp:300) or, if necessary, a newly created one (spirv_msl.cpp:621). In both cases, though, `builtin_subgroup_invocation_id_id` is only set under the condition `need_subgroup_mask \|\| needs_subgroup_invocation_id`. `need_subgroup_mask` is true if any of the `BuiltInSubgroupXXMask` are set in `active_input_builtins`. Normally, if the program contains `BuiltInSubgroupEqMask`, `Compiler::ActiveBuiltinHandler` will set it in `active_input_builtins`. But this only happens if the variable is actually used, whereas `fix_up_shader_inputs_outputs` loops over all variables in the program regardless of whether they're used. If `BuiltInSubgroupEqMask` is not used, `builtin_subgroup_invocation_id_id` is never set, but before this patch the fixup hook would try to use it anyway, producing MSL that references a nonexistent variable named `_0`. Avoid this by changing `fix_up_shader_inputs_outputs` to skip builtins which are not set in `active_input_builtins` or `active_output_builtins`. And add a test case.	2020-11-25 13:41:12 -05:00
Chip Davis	1e67b21ee9	MSL: Don't mask off inactive bits in ballot masks. This was based on my misreading the spec. The Vulkan CTS expects the bits to be set, even if the invocations corresponding to them are inactive.	2020-11-25 09:29:51 -06:00
Chip Davis	fd738e3387	MSL: Adjust FragCoord for sample-rate shading. In Metal, the `[[position]]` input to a fragment shader remains at fragment center, even at sample rate, like OpenGL and Direct3D. In Vulkan, however, when the fragment shader runs at sample rate, the `FragCoord` builtin moves to the sample position in the framebuffer, instead of the fragment center. To account for this difference, adjust the `FragCoord`, if present, by the sample position. The -0.5 offset is because the fragment center is at (0.5, 0.5). Also, add an option to force sample-rate shading in a fragment shader. Since Metal has no explicit control for this, this is done by adding a dummy `[[sample_id]]` which is otherwise unused, if none is already present. This is intended to be used from e.g. MoltenVK when a pipeline's `minSampleShading` value is nonzero. Instead of checking if any `Input` variables have `Sample` interpolation, I've elected to check that the `SampleRateShading` capability is present. Since `SampleId`, `SamplePosition`, and the `Sample` interpolation decoration require this cap, this should be equivalent for any valid SPIR-V module. If this isn't acceptable, let me know.	2020-11-23 10:30:24 -06:00
Hans-Kristian Arntzen	6a614cc7f7	Normalize all internal workaround methods to use spv prefix. We have been interchanging spv and SPIRV_Cross_ for a while, which causes weirdness since we don't explicitly ban SPIRV_Cross identifiers, as these identifiers are generally used for interface variable workarounds.	2020-11-23 15:42:27 +01:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	b3c59263a0	Merge pull request #1541 from cdavis5e/msl-ios-features MSL: Expose some more features on iOS.	2020-11-16 09:43:40 +01:00
Chip Davis	88e25e60ec	MSL: Expose some more features on iOS. `min_lod_clamp()` was actually added in MSL 2.2 on iOS 13. The restriction was based on the beta versions which didn't have it. Since the beta versions didn't support family 6, this leads me to suspect that the reason they lacked `min_lod_clamp()` is that it requires family 6. This does not seem to be documented anywhere. `simd_is_helper_thread()` was added in MSL 2.3 to iOS. I neglected to update this when I finished up `SPV_EXT_demote_to_helper_invocation`. `barycentric_coord` and `primitive_id` were added in MSL 2.3 on iOS 14. They are only supported on family 7.	2020-11-14 01:57:11 -06:00
Hans-Kristian Arntzen	ef0256c23e	Fix switch fallthrough	2020-11-11 16:21:18 +01:00
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations Fixes #1513.	2020-11-09 11:23:01 +01:00
Hans-Kristian Arntzen	6fc2a0581a	Run format_all.sh.	2020-11-08 13:59:52 +01:00
Hans-Kristian Arntzen	46bf1e99d6	Merge pull request #1525 from cdavis5e/msl-interpolation-functions MSL: Support pull-model interpolation on MSL 2.3+.	2020-11-07 17:04:56 +01:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemeted for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00
Hans-Kristian Arntzen	244839d350	Merge pull request #1516 from billhollings/VK_EXT_descriptor_indexing MSL: Support run-time sized image and sampler arrays	2020-11-03 10:15:36 +01:00
Bill Hollings	4bdd49df3f	Syntax and format updates from code review.	2020-11-02 22:15:20 -05:00
Bill Hollings	7f67abe0fe	Minor format and typo updates from code review.	2020-10-30 16:05:44 -04:00
Chip Davis	547c29f7bb	MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3. I kept the code to replace constant zero arguments, because `Bias` and `Grad` still have some problems on desktop GPUs. `Bias` works on AMD GPUs. `Grad` does not. Both work on Intel. Still needs testing on NV. It will definitely work with Apple GPUs.	2020-10-30 11:14:59 -05:00
Hans-Kristian Arntzen	541a801fed	Merge pull request #1514 from cdavis5e/msl-mac-framebuffer-fetch MSL: Allow framebuffer fetch on Mac in MSL 2.3.	2020-10-30 08:09:41 +01:00
Yuwen Wu	c8a43876c7	added metal keyworld: "level" (#1501 ) * added metal keyworld: "level" * added more metal keywords * updated test case.	2020-10-30 08:07:25 +01:00
Bill Hollings	b7b0e804e5	MSL: Support run-time sized image and sampler arrays (GL_EXT_nonuniform_qualifier/SPV_EXT_descriptor_indexing). MSLResourceBinding includes array size through API, and substitutes in that size if the image or sampler array is not explicitly sized. OpCopyObject supports SPIRCombinedImageSampler type in MSL.	2020-10-29 18:50:42 -04:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Chip Davis	d48d2a95c7	MSL: Allow post-depth coverage on Mac in MSL 2.3. It's still only supported on Apple GPUs, but Macs will have those soon.	2020-10-27 22:07:01 -05:00
Chip Davis	1b6b9705e6	MSL: For 2.1+, don't disable rasterization for vertex writes. Writing to buffers actually works starting in MSL 2.1 (macOS 10.14, iOS 12). Writing to textures works starting in MSL 2.2 (macOS 10.15, iOS 13). No tests unfortunately, because the MSL 2.2 compiler and above produce a warning that cannot be disabled, because it has no associated option.	2020-10-26 11:43:13 -05:00
Hans-Kristian Arntzen	f65f259ab7	MSL: Do not use component::x gather for depth2d textures.	2020-10-26 10:18:17 +01:00
Chip Davis	1264e2705e	MSL: Cast broadcast booleans to ushort. Metal doesn't support broadcasting or shuffling boolean values, but we can work around that by casting it to `ushort`, then casting it back to `bool`. I used `ushort` instead of `uint` because 16-bit values give better throughput on Apple GPUs.	2020-10-23 21:55:46 -05:00
Chip Davis	065b5bda3c	MSL: Mask ballots passed to Ballot bit ops. Only the least n bits are significant, where n is the subgroup size. The Vulkan CTS actually checks this. The `FindLSB` tests weren't actually failing, but I masked that anyway, in case there's some corner case the CTS is missing.	2020-10-23 21:55:46 -05:00
Chip Davis	781367d083	MSL: Support vectors with OpGroupNonUniformAllEqual. This was not tested here in SPIRV-Cross. Predictably, it broke when I tried it in the CTS.	2020-10-23 21:55:46 -05:00
Chip Davis	6ccb902462	MSL: Correct definitions of subgroup ballot mask variables. `SubgroupEqMask` had a fencepost error that gave wrong values for invocation ID 32. For `SubgroupGeMask` and `SubgroupGtMask`, I forgot to shift the values from `extract_bits()` up so that the mask is in the correct position. Using `insert_bits()` instead should fold these two operations into one. `SubgroupLtMask` and `SubgroupLeMask` were already correct.	2020-10-23 21:54:55 -05:00
Chip Davis	064ed448b9	MSL: Don't remove periods from swizzle buffer index exprs.	2020-10-20 17:47:40 -05:00
Chip Davis	5845e009ea	MSL: Handle Offset and Grad operands for 1D-as-2D textures.	2020-10-15 12:51:00 -05:00
Hans-Kristian Arntzen	9c220a8247	Merge pull request #1490 from KhronosGroup/fix-1488 MSL: Support querying and modifying generated combined sampler suffix.	2020-10-15 10:52:28 +02:00
Chip Davis	3e6010d8c5	MSL: Don't use a bitcast for tessellation levels in tesc shaders. `half` cannot be bitcasted to `float`, because the two types are not the same size. Use an expanding cast instead. We were already doing this for stores to the tessellation levels; why I didn't also do this for loads is beyond me.	2020-10-14 18:35:59 -05:00
Hans-Kristian Arntzen	bd1ee4344e	MSL: Support querying and modifying generated combined sampler suffix.	2020-10-14 14:52:18 +02:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	7a5d0d6b29	MSL: Add missing interlock handling to atomic image buffers.	2020-10-13 11:44:17 -05:00
Hans-Kristian Arntzen	fab6ad234e	Merge pull request #1486 from cdavis5e/atomic-image-argument-buffer MSL: Support atomic access to images from argument buffers.	2020-10-13 12:55:43 +02:00
Chip Davis	9cafea6cf8	MSL: Support atomic access to images from argument buffers. This was not added when Epic contributed atomic image support. Fixes #1484.	2020-10-13 02:37:18 -05:00
Chip Davis	2219c4a392	MSL: Support SPV_EXT_demote_to_helper_invocation for MSL 2.3. MSL 2.3 has everything needed to support this extension on all platforms. The existing `discard_fragment()` function was given demote semantics, similar to Direct3D, and the `simd_is_helper_thread()` function was finally added to iOS. I've left the old test alone. Should I remove it in favor of these?	2020-10-13 00:25:32 -05:00
Hans-Kristian Arntzen	5ea576ece2	Allow flip_vert_y in all relevant stages.	2020-09-28 14:10:08 +02:00
Hans-Kristian Arntzen	66afe8c499	Implement a simple evaluator of specialization constants. In some cases, we need to get a literal value from a spec constant op. Mostly relevant when emitting buffers, so implement a 32-bit integer scalar subset of the evaluator. Can be extended as needed to support evaluating any specialization constant operation.	2020-09-14 11:45:59 +02:00
Hans-Kristian Arntzen	bdbef7b1f3	Merge pull request #1461 from Kangz/fix-warnings Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 12:59:22 +02:00
Corentin Wallez	bcd71536e2	Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 11:13:21 +02:00
Hans-Kristian Arntzen	18d03b3ea6	Handle OpUndef %void. Apparently this is legal and spirv-opt is starting to emit this kind of code now.	2020-09-04 09:29:44 +02:00
Chip Davis	4cf840ee7b	MSL: Support layered input attachments. These need to use arrayed texture types, or Metal will complain when binding the resource. The target layer is addressed relative to the Layer output by the vertex pipeline, or to the ViewIndex if in a multiview pipeline. Unlike with the s/t coordinates, Vulkan does not forbid non-zero layer coordinates here, though this cannot be expressed in Vulkan GLSL. Supporting 3D textures will require additional work. Part of the problem is that Metal does not allow texture views to subset a 3D texture, so we need some way to pass the base depth to the shader.	2020-09-02 09:18:25 -05:00
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Chip Davis	53080ecca8	MSL: Fix multiview view index calculation with a non-zero base instance. Account for a non-zero base instance when calculating the view index and the "real" instance index. Before, it was likely broken with a non-zero base instance, since the calculated instance index could be less than the base instance.	2020-08-31 20:33:44 -05:00
Hans-Kristian Arntzen	95993f78af	Run format_all.sh.	2020-08-24 11:02:49 +02:00
Hans-Kristian Arntzen	a07441568e	Overhaul how we deal with reserved identifiers. - Do not silently drop reserved identifiers in the parser. This makes it possible to reflect identifiers which are reserved by the cross-compiler module. - Instead of dropping the name, emit _RESERVED_IDENTIFIER_FIXUP in the source to make it clear that a name has been rewritten. - Document what is reserved and not.	2020-08-21 16:33:27 +02:00
Le Hoang Quyen	ab8eb70af1	Fix #1445 : MSL: Enclose args when convert distance(a,b) to abs(a-b)	2020-08-13 21:16:08 +08:00
Chip Davis	3347b1076d	MSL: Fix handling of matrices and structs in the output control point array. Prior to this point, we were treating them as flattened, as they are in old-style tessellation control shaders, and still are for structs in new-style shaders. This is not true for outputs; output composites are not flattened at all. This semantic mismatch broke a Vulkan CTS test. It should now pass.	2020-08-03 17:18:18 -05:00
Tomek Ponitka	18f23c47d9	Enabling setting a fixed sampleMask in Metal fragment shaders. In Metal render pipelines don't have an option to set a sampleMask parameter, the only way to get that functionality is to set the sample_mask output of the fragment shader to this value directly. We also need to take care to combine the fixed sample mask with the one that the shader might possibly output.	2020-07-24 11:19:46 +02:00
Chip Davis	688c5fcbda	MSL: Add support for processing more than one patch per workgroup. This should hopefully reduce underutilization of the GPU, especially on GPUs where the thread execution width is greater than the number of control points. This also simplifies initialization by reading the buffer directly instead of using Metal's vertex-attribute-in-compute support. It turns out the only way in which shader stages are allowed to differ in their interfaces is in the number of components per vector; the base type must be the same. Since we are using the raw buffer instead of attributes, we can now also emit arrays and matrices directly into the buffer, instead of flattening them and then unpacking them. Structs are still flattened, however; this is due to the need to handle vectors with fewer components than were output, and I think handling this while also directly emitting structs could get ugly. Another advantage of this scheme is that the extra invocations needed to read the attributes when there were more input than output points are now no more. The number of threads per workgroup is now lcm(SIMD-size, output control points). This should ensure we always process a whole number of patches per workgroup. To avoid complexity handling indices in the tessellation control shader, I've also changed the way vertex shaders for tessellation are handled. They are now compute kernels using Metal's support for vertex-style stage input. This lets us always emit vertices into the buffer in order of vertex shader execution. Now we no longer have to deal with indexing in the tessellation control shader. This also fixes a long-standing issue where if an index were greater than the number of vertices to draw, the vertex shader would wind up writing outside the buffer, and the vertex would be lost. This is a breaking change, and I know SPIRV-Cross has other clients, so I've hidden this behind an option for now. In the future, I want to remove this option and make it the default.	2020-07-23 17:59:54 -05:00
Chip Davis	884bc6df65	MSL: Factor creating a uint type into its own method. This is so common for artificially created variables that it's worth it to create it once and save it for later use.	2020-07-22 16:25:14 -05:00

... 2 3 4 5 6 ...

1069 Commits