SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	5b952d2cbf	MSL: Rethink how opaque descriptors are passed to leaf functions. We were passing arrays by value which the compiler fails to optimize, causing abysmal performance. To fix this, we need to consider that descriptors can be in constant or const device address spaces. Also, lone descriptors are passed by value, so we explicitly remove address space qualifiers. One failure case is when a shader passes a texture/sampler array as an argument. It's all UniformConstant in SPIR-V, but in MSL it might be thread, const device or constant, so that won't work ... Global variable use works fine though, and that should cover 99.9999999% of use cases.	2022-01-18 14:40:52 +01:00
Hans-Kristian Arntzen	fe8848a6f2	Roll dependencies.	2022-01-05 14:56:01 +01:00
Hans-Kristian Arntzen	be333e0cab	MSL: Move float2->3 TessCoord fixup to a better location.	2022-01-05 13:32:17 +01:00
Nikita Fediuchin	2acf0e73dd	Fix gl_TessCoord arguments presence. Update reference shaders. * Added check for "gl_TessCoord" presence in the entry point arguments. * Updated reference tessellation evaluation shaders.	2021-12-20 22:58:21 +02:00
Sebastián Aedo	6d8302ef14	MSL: Add 64 bit switch support. Add 64 bit switch support for MSL version 2.2. * Also fixes a wrong endianness conversion. Signed-off-by: Sebastián Aedo <saedo@codeweavers.com>	2021-11-26 15:54:56 -03:00
Bill Hollings	248e9ae9ed	MSL: Don't output depth and stencil values with explicit early fragment tests. Fragment shaders that require explicit early fragment tests are incompatible with specifying depth and stencil values within the shader. If explicit early fragment tests is specified, remove the depth and stencil outputs from the output structure, and replace them with dummy local variables. Add CompilerMSL:uses_explicit_early_fragment_test() function to consolidate testing for whether early fragment tests are required. Add two unit tests for depth-out with, and without, early fragment tests.	2021-11-12 14:17:00 -05:00
Bill Hollings	fd252b21ff	Separate (partially) the tracking of depth images from depth compare ops. SPIR-V allows an image to be marked as a depth image, but with a non-depth format. Such images should be read or sampled as vectors instead of scalars, except when they are subject to compare operations. Don't mark an OpSampledImage as using a compare operation just because the image contains a depth marker. Instead, require that a compare operation is actually used on that image. Compiler::image_is_comparison() was really testing whether an image is a depth image, since it incorporates the depth marker. Rename that function to is_depth_image(), to clarify what it is really testing. In Compiler::is_depth_image(), do not treat an image as a depth image if it has been explicitly marked with a color format, unless the image is subject to compare operations. In CompilerMSL::to_function_name(), test for compare operations specifically, rather than assuming them from the depth-image marker. CompilerGLSL and CompilerMSL still contain a number of internal tests that use is_depth_image() both for testing for a depth image, and for testing whether compare operations are being used. I've left these as they are for now, but these should be cleaned up at some point. Add unit tests for fetch/sample depth images with color formats and no compare ops.	2021-11-08 15:59:45 -05:00
Hans-Kristian Arntzen	4561ecddbd	Handle Modf/Frexp in more cases. Consider it a write to a variable, similar to OpStore.	2021-11-07 11:36:44 +01:00
Bill Hollings	be812c45e5	MSL: Remove over-zealous check for struct packing compatibility. Previous test for SPIRVCrossDecorationPhysicalTypePacked on parent struct when unpacking member struct was too restrictive, and not needed as long as padding compensates.	2021-10-28 19:36:32 -04:00
Bill Hollings	76cb807c19	MSL: Fix type redirection when struct members are reordered to align with offsets. Populate member_type_index_redirection as reverse lookup, not forward lookup. Move use of member_type_index_redirection from CompilerMSL::to_member_reference() to CompilerGLSL::access_chain_internal() to access all redirected type info, not just name.	2021-10-28 10:16:34 -04:00
Hans-Kristian Arntzen	edf247fb1c	MSL: Workaround compiler crashes when using threadgroup bool. Promote to short instead and do simple casts on load/store instead. Not 100% complete fix since structs can contain booleans, but this is getting into pretty ridiculously complicated territory.	2021-10-25 10:55:11 +02:00
Hans-Kristian Arntzen	2b5e17eca5	MSL: Never use templated array for RayQuery objects. Not supported and compiler derps out.	2021-10-21 22:02:01 +02:00
Hans-Kristian Arntzen	5afb3d313f	MSL: Fix some trivial bugs not caught by CI when adding ray query.	2021-10-21 21:53:41 +02:00
丛越	d52ec1e196	Fix all requested changes, test_shaders.py supports compiling MSL 2.4 shaders, and the Intersection Query currently only supports MSL 2.4 on the iOS platform.	2021-10-21 17:46:45 +08:00
丛越	597f29d09d	Support Metal 2.4 Intersection Query, Implement GL_EXT_ray_query.	2021-10-19 18:45:10 +08:00
Bill Hollings	ec054dad7f	MSL: Support synthetic functions in function constants. Emit synthetic functions before function constants. Support use of spvQuantizeToF16() in function constants for numerical behavior consistency with the op code. Ensure subnormal results from OpQuantizeToF16 are flushed to zero per SPIR-V spec. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes. Any MSL reference shader that includes a synthetic function is affected, since the location where it is emitted has changed.	2021-09-28 19:10:16 -04:00
Bill Hollings	ba66a91402	MSL: Use vec<T, n> in template SpvHalfTypeSelector for function spvQuantizeToF16(). Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-25 14:36:42 -04:00
Bill Hollings	a2671e35b0	MSL: Consolidate spvQuantizeToF16() functions into a single template function. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 14:41:15 -04:00
Bill Hollings	5742047b24	MSL: Honor infinities in OpQuantizeToF16 when compiling using fast-math. Add spvQuantizeToF16() family of synthetic functions to convert from float to half and back again, and add function attribute [[clang::optnone]] to honor infinities during conversions. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-24 11:22:05 -04:00
Bill Hollings	fb3defc9ef	MSL: Honor DecorationNoContraction when compiling using fast-math. Add [[clang::optnone]] attribute to spvF*() functions used for handling floating point operations decorated with DecorationNoContraction. Just using precise::fma() did not work. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-23 14:37:08 -04:00
Bill Hollings	40141ffddf	MSL: Selectively enable fast-math in MSL code to match Vulkan CTS results. Based on CTS testing, math optimizations between MSL and Vulkan are inconsistent. In some cases, enabling MSL's fast-math compilation option matches Vulkan's math results. In other cases, disabling it does. Broadly enabling or disabling fast-math across all shaders results in some CTS test failures either way. To fix this, selectively enable/disable fast-math optimizations in the MSL code, using metal::fast and metal::precise function namespaces, where supported, and the [[clang::optnone]] function attribute otherwise. Adjust SPIRV-Cross unit test reference shaders to accommodate these changes.	2021-09-22 18:58:31 -04:00
Bill Hollings	35e92e6ffb	MSL: Return fragment function value even when last SPIR-V Op is discard (OpKill). Add test shader for new functionality. Add legacy test reference shader for unrelated buffer-bitcast test, that doesn't seem to have been added previously.	2021-09-12 16:28:21 -04:00
Bill Hollings	472f9d4f6d	Add tests for OpSpecConstantOp ops OpQuantizeToF16 and OpSRem. Tests provided by @cdavis5e.	2021-09-05 16:51:04 -04:00
Hans-Kristian Arntzen	c062b6b852	Merge pull request #1725 from billhollings/fix-duplicate-glposition MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused.	2021-08-23 11:37:10 +02:00
Hans-Kristian Arntzen	fad1590786	Merge pull request #1722 from billhollings/row-maj-mtx-store-from-const MSL: Support row-major transpose when storing matrix from constant RHS matrix.	2021-08-23 11:29:01 +02:00
Bill Hollings	e76fcf9309	MSL: Add test for fixes to MSL constant expression type down-casting.	2021-08-16 13:56:05 -04:00
Bill Hollings	3105e82b2e	MSL: Fix duplicate gl_Position outputs when gl_Position defined but unused. When gl_Position is defined by SPIR-V, but neither used nor initialized, it appeared twice in the MSL output, as gl_Position and glPosition_1. The existing tests for whether an output is active check only that it is used by an op, or initialized. Adding the implicit gl_Position also marked the existing gl_Position as active, duplicating the output variable. Fix is that when checking for the need to add an implicit gl_Position output, also check if the var is already defined in the shader, and just needs to be marked as active. Add test shader.	2021-08-16 11:23:15 -04:00
Bill Hollings	9552ca5473	MSL: Support row-major transpose when storing matrix from constant RHS matrix. Remove test and exception when storing row-major matrix from RHS that is not a SPIRExpression. Add test shaders.	2021-08-12 09:08:35 -04:00
Bill Hollings	ebb5098def	MSL: Adjust gl_SampleMaskIn for sample-shading and/or fixed sample mask. Vulkan specifies that the Sample Mask Test occurs before fragment shading. This means gl_SampleMaskIn should be influenced by both sample-shading and VkPipelineMultisampleStateCreateInfo::pSampleMask. CTS tests dEQP-VK.pipeline.multisample_shader_builtin.* bear this out. For sample-shading, gl_SampleMaskIn should only have a single bit set. Since Metal does not filter for this, apply a bitmask based on gl_SampleID. For a fixed sample mask, since Metal is unaware of VkPipelineMultisampleStateCreateInfo::pSampleMask, we need to ensure that we apply it to both gl_SampleMaskIn and gl_SampleMask. This has the side effect of a redundant application of pSampleMask if the shader already includes gl_SampleMaskIn when setting gl_SampleMask, but I don't see an easy way around this. Also, simplify the logic for including the fixed sample mask in gl_ShaderMask, and print the fixed sample mask as a hex value for readability of bits.	2021-07-13 21:22:13 -04:00
Hans-Kristian Arntzen	8216e87f02	Handle SPIR-V 1.4 selection constructs. Fix bug in to_trivial_mix_op, where we made a pre-1.4 assumption that component count of selector is equal to value component count.	2021-06-28 12:23:44 +02:00
xndcn	02fb8f2a24	Add comment after inf/nan float number for clarifying.	2021-05-27 02:40:41 +08:00
Hans-Kristian Arntzen	99ae0d32e9	MSL: Handle array with component when we cannot rely on user() attrib. In these cases, we emit one variable per location, and so we must flatten stuff.	2021-05-21 13:46:33 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Lukas Taparauskas	72a2ec4c1b	MSL: Fix '--msl-multi-patch-workgroup' out of bounds reads when dispatching more threads than control points (#1662). * Fix '--msl-multi-patch-workgroup' cases where thread count exceeds data bounds. Fix gl_PrimitiveID off by one error when computing last valid index. Point gl_out to the last patch's data when threads exceed input data bounds. Point patchOut to the last patch's data when threads exceed input data bounds. Update MSL test expectations. * Undo change to MSL multi-patch hull output bound checks. * Update MSL multi-patch test expectations.	2021-04-29 20:01:26 +02:00
Hans-Kristian Arntzen	82a77e534e	MSL: Use proper array for quad tess levels. We need to handle loads from array as well, so the float4 hack doesn't work.	2021-04-23 14:12:00 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	986196030d	MSL: Don't use native arrays for tess level inputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	4a379a00f3	MSL: Don't emit native array for masked clip/cull distance.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	682a227f4b	MSL: Make builtin argument type declaration context sensitive. Sometimes we'll need array template, sometimes not 🤷.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	c1edd35d57	MSL: Use spvUnsafeArray for builtin arrays after all. It will get too messy to deal with constant initializers any other way, so just deal with complexity in argument_decl instead ...	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	5826298697	MSL: Handle CullDistance better.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	23da445bd4	MSL: Emit multiple threadgroup slices for multi-patch. Multiple patches can run in the same workgroup when using multi-patch mode, so we need to allocate enough storage to avoid false sharing.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e32c474911	MSL: Handle masking of TESC IO block members.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	dc54f75eec	MSL: Fixup gl_PerVertex names if we're emitting masked builtins.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	40f628f49c	MSL: Add test for complex control point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	46c48ee6b5	MSL: Rewrite how IO blocks are emitted in multi-patch mode. Firstly, never flatten inputs or outputs in multi-patch mode. The main scenario where we do need to care is Block IO. In this case, we should only flatten the top-level member, and after that we use access chains as normal. Using structs in Input storage class is now possible as well. We don't need to consider per-location fixups at all here. In Vulkan, IO structs must match exactly. Only plain vectors can have smaller vector sizes as a special case.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ff3f5bcba5	MSL: Handle masking of builtin control points.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	436b1250da	MSL: Do not perform scalar fixups for control-point outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	74b2acab9b	MSL: Always emit block variable for block types.	2021-04-19 12:10:49 +02:00
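
The gl_SampleMaskIn adjustment described in commit ebb5098def comes down to a small piece of bit arithmetic, which can be sketched in plain C++ as below. This is only an illustration under stated assumptions: the helper name `adjust_sample_mask_in` and all of its parameters are hypothetical, and it is neither a SPIRV-Cross API nor the MSL that the commit actually emits.

```cpp
#include <cstdint>

// Hypothetical helper for illustration only; not part of SPIRV-Cross.
// raw_mask:       coverage mask as delivered by the platform.
// sample_id:      index of the sample being shaded (must be < 32);
//                 only consulted when sample_shading is true.
// sample_shading: whether the fragment shader runs once per sample.
// fixed_mask:     stand-in for VkPipelineMultisampleStateCreateInfo::pSampleMask.
inline uint32_t adjust_sample_mask_in(uint32_t raw_mask, uint32_t sample_id,
                                      bool sample_shading, uint32_t fixed_mask)
{
    // Apply the fixed sample mask that the platform does not know about.
    uint32_t mask = raw_mask & fixed_mask;

    // With sample shading, keep only the bit of the sample being shaded.
    if (sample_shading)
        mask &= (1u << sample_id);

    return mask;
}
```

With sample shading off, the result is simply the raw mask filtered by the fixed mask; with sample shading on, at most one bit survives.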

504 Commits