SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	c77b09b57c	Merge pull request #2063 from KhronosGroup/fix-2060 Merge #2061	2022-11-21 14:29:47 +01:00
Hans-Kristian Arntzen	df76a14056	MSL: Refactor member reference in terms of one boolean. ptr_chain was really just masking the proper i == 0 check. Be more explicit about what the check is actually doing and comment this.	2022-11-21 13:40:27 +01:00
Dunfan Lu	e75c496ec6	Fix MSL Access Chain	2022-11-21 13:29:18 +01:00
Chip Davis	5547b25afe	Interleave undef values with constants and types. Undef values may be of struct type and may be used in constants. Therefore, they must be interleaved with constants and types. Fixes the rest of the Vulkan CTS test `dEQP-VK.spirv_assembly.instruction.compute.opundef.undefined_spec_constant_composite`. (Please excuse the churn in the reference output; it's an inevitable result of this change.)	2022-11-20 02:08:37 -08:00
Try	80146a20da	HLSL: Implement VK_EXT_mesh_shader	2022-11-02 11:48:58 +01:00
Chip Davis	8cf99e7d44	MSL: Implement `CompositeInsert` `OpSpecConstantOp`. This op creates a new composite constant with one element replaced. So, we reconstruct the `SPIRConstant` for the composite constant, but with one of the IDs replaced. Constant initializer lists are memoized for when the result of a `CompositeInsert` is used in another `CompositeInsert`. (I wanted to add a test case for GLSL as well, but for two things: 1. `glslang` in Vulkan mode chokes on the first constant array, insisting that its initializer needs to be a constant. [Bug in glslang?] 2. The declarations for the buffers used by the shader aren't emitted, regardless of whether Vulkan mode is enabled.) Fixes five tests under `dEQP-VK.spirv_assembly.instruction.*.opspecconstantop.vector_related`.	2022-11-01 18:11:39 -07:00
Hans-Kristian Arntzen	4de9d6c2b6	MSL: Handle implicit integer promotion rules. MSL inherits the behavior of C, where operands of small integer types are implicitly promoted to int in arithmetic. SPIR-V does not have this behavior, so make sure that arithmetic results are handled correctly.	2022-10-31 13:33:46 +01:00
Hans-Kristian Arntzen	5762617729	GLSL: Implement GL_EXT_mesh_shader.	2022-09-05 11:25:04 +02:00
Yuwen Wu	f40dba4919	GLSL: added an option to disable row-major-load workaround.	2022-08-24 11:07:12 +08:00
Hans-Kristian Arntzen	4dfac510ed	Handle multiple breaks out of switches. Use a switch stack instead.	2022-07-22 15:31:40 +02:00
Bill Hollings	4185acc70d	MSL: Fixes from review for SPV_KHR_physical_storage_buffer extension. - Assign ulongn physical type to buffer pointers in short arrays when array stride is larger than pointer size. - Support GL_EXT_buffer_reference_uvec2 casting buffer reference pointers to and from uvec2 values. - When packing structs, include structs inside physical buffers. - Update mechanism for traversing pointer arrays when calculating type sizes. - Added unit test shaders.	2022-07-01 16:10:41 -04:00
Hans-Kristian Arntzen	e45d01c41f	Emit KHR barycentrics if source enables the KHR extension. For roundtrip purposes, need to match KHR or NV extension.	2022-05-27 13:28:25 +02:00
Hans-Kristian Arntzen	23662668dd	Attempt more optimal codegen for OpCompositeInsert. Speculate that we can modify the SSA value in-place. As long as it is not used after the modify, this is fine. Also need to make sure we don't attempt to RMW something that is impossible to modify.	2022-05-18 16:37:33 +02:00
Hans-Kristian Arntzen	7a6c2da9aa	GLSL: Handle more proper semantics for RelaxedPrecision. GLSL and RelaxedPrecision are quite different in what they affect. RelaxedPrecision affects operations, while this is merely implied in GLSL based on inputs. This leads to situations where we have to promote mediump inputs to highp, and the simplest approach is to force highp temporaries for inputs which are consumed in a highp context. For completeness, we also demote RelaxedPrecision inputs to mediump variables. PHI is handled by copying the PHI into a temporary. We have to be very careful with hoisted temporaries, since the child temporary will not be analyzed up-front. We inherit the hoisted-ness state and emit the hoisted child temporary as necessary. When faking the temporaries with OpCopyObject, we make sure to block any variable hoisting. Hoisting children of PHI variables is fine, since PHIs are not hoisted with the same framework as other temporaries.	2022-05-02 15:11:24 +02:00
Hans-Kristian Arntzen	4ab5bbb4e5	Fixup names of anonymous inner structs. Just like we try to fixup struct names for block types, inner structs can be "anonymous" structs. HLSL codegen from DXC tends to emit this, and emitting dummy struct names tends to break GL linkage on some drivers.	2022-03-10 15:45:38 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	a56b22bf4e	Add more scenarios where we can guarantee forward progress. The patterns where we force temporary due to invalid/overused expression -> recompile should be seen as making forward progress, and there are very rare scenarios where these recompiles can cascade into many loops. Refactor this style of logic into a new function which is equivalent to handle_invalid_expression().	2022-02-16 12:12:58 +01:00
Hans-Kristian Arntzen	c716a9a5dd	Add debug option to modify maximum number of compile iterations. Should be seen as a hack, but it's pragmatic in some scenarios.	2022-02-16 12:12:27 +01:00
Hans-Kristian Arntzen	9b25581d49	MSL: Handle constant construct of block-like array types. Need this to be context sensitive, since array of block-like struct is template, but struct of block-like array is C-style. Also, test a mix and match, so we have constant array of block-like struct with array inside. :v	2022-01-17 18:28:25 +01:00
Hans-Kristian Arntzen	1d13a3e36a	Rework how loop iteration counts are validated. Introduces an idea of a recompilation making forward progress. There are some extreme edge cases where we need more than 3 loops, but only allow this in specific circumstances where we can reason about forward progress being made.	2022-01-17 14:12:01 +01:00
Hans-Kristian Arntzen	7c83fc22fa	Add support for LocalSizeId. WorkgroupSize builtin is deprecated in 1.6 and LocalSizeId is supported in Vulkan starting with maintenance4.	2022-01-06 13:57:10 +01:00
Sebastián Aedo	6d8302ef14	MSL: Add 64 bit switch support. Add 64 bit switch support for MSL version 2.2. Also fixes a wrong endianness conversion. Signed-off-by: Sebastián Aedo <saedo@codeweavers.com>	2021-11-26 15:54:56 -03:00
Hans-Kristian Arntzen	f1b411c9e8	GLSL: Deal with buffer_reference_align. This is somewhat awkward to support, but the best effort we can do here is to analyze various Load/Store opcodes and deduce the ideal overall alignment based on this. This is not a 100% perfect solution, but should be correct for any reasonable use case. Also fix various nitpicks with BDA support while I'm at it.	2021-11-07 17:11:46 +01:00
Hans-Kristian Arntzen	edf247fb1c	MSL: Workaround compiler crashes when using threadgroup bool. Promote to short instead and do simple casts on load/store instead. Not 100% complete fix since structs can contain booleans, but this is getting into pretty ridiculously complicated territory.	2021-10-25 10:55:11 +02:00
Bill Hollings	5fb1ca4f0d	Add support for additional ops in OpSpecConstantOp. MSL: Support op OpQuantizeToF16 in OpSpecConstantOp. All: Support op OpSRem in OpSpecConstantOp.	2021-09-03 18:20:49 -04:00
Jon Leech	f2a65545b8	Finish adding SPDX tags and set up a reuse checker in GitHub Actions CI	2021-06-29 11:03:52 +02:00
Hans-Kristian Arntzen	d75666b170	GLSL: Emit num_views for OVR_multiview2.	2021-06-28 12:56:27 +02:00
Hans-Kristian Arntzen	26a4986009	GLSL: Implement noncoherent framebuffer fetch.	2021-05-21 14:22:57 +02:00
Hans-Kristian Arntzen	e47a30e807	Honor NoContraction qualifier. We'll need to force a temporary and mark it as precise. MSL is a little weird here, but we can piggyback on top of the invariant float math option here to force fma() operations everywhere.	2021-05-07 12:59:47 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00
Hans-Kristian Arntzen	96ba044f01	HLSL: Fix automatic location assignment in block IO.	2021-04-20 13:04:26 +02:00
Hans-Kristian Arntzen	ae9ca7d73c	MSL: Fix copy of arrays to/from stage IO variables. Need to take into account effective storage classes and whether or not we target stage IO blocks since native arrays are conditionally enabled.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	75ed73818c	MSL: Handle loading Clip/CullDistance in TESE. Need to allow the flattened space to go through in some edge cases where we cannot reasonably unflatten.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a393de31e6	MSL: Refactor out variable/block member masking.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	394c038bfd	MSL: Do not consider effective storage for any composite.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f2b5fb3f45	MSL: Emit threadgroup storage class for masked control point outputs. Shader can still rely on writes to threadgroup memory to be visible.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	2a2d57df13	MSL: Sketch out API to aid LTO-style optimization.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ee31e84e30	GLSL: Handle complex load/store scenarios to gl_SampleMask. Need special workarounds to handle array load/store since array size is unsized in GLSL, and array copy is not possible. Also, consider bitcast for scalar loads and stores.	2021-03-09 10:25:03 +01:00
Hans-Kristian Arntzen	97796e0609	MSL: Deal with pointer-to-pointer qualifier ordering.	2021-02-26 13:37:14 +01:00
Hans-Kristian Arntzen	7ab3f3f74e	Deal better with CompositeExtract from constant composite. There is no good reason for applications to emit this kind of code, but some do. Special case this scenario.	2021-01-22 12:30:16 +01:00
Hans-Kristian Arntzen	4704482bbc	meta: Update copyright headers to 2021.	2021-01-14 16:07:49 +01:00
Hans-Kristian Arntzen	2097c30985	GLSL: Support both SPV_KHR_ray_tracing and NV_ray_tracing. Fairly minor differences, so can keep them side by side without too much effort. NV support is effectively deprecated now however. - Add OpConvertUToAccelerationStructureKHR - Ignore/Terminate ray is now a terminator in KHR, but a call in NV. - Fix some bugs with reportIntersection.	2021-01-08 14:59:04 +01:00
Hans-Kristian Arntzen	39fee93906	GLSL: Refactor out Output variable initialization.	2021-01-05 12:50:36 +01:00
Hans-Kristian Arntzen	175381fe08	GLSL: Handle some extreme edge cases in Output variable initialization. Deal with patch blocks, arrays of patch blocks, arrays of blocks, etc.	2021-01-05 12:06:36 +01:00
Hans-Kristian Arntzen	c8765a75f2	GLSL: Fix KHR subgroup extension table for subgroups.	2020-12-11 12:26:43 +01:00
Hans-Kristian Arntzen	a11c4780d0	GLSL: Emit nonuniformEXT in correct place for late-combined samplers. Need to emit nonuniformEXT(sampler2D()) since constructor expressions in Vulkan GLSL do not propagate the nonuniform qualifier.	2020-12-07 13:00:15 +01:00
Hans-Kristian Arntzen	cf1e9e0643	Add MIT dual license for the SPIRV-Cross API.	2020-12-01 16:47:08 +01:00
Hans-Kristian Arntzen	6fc2a0581a	Run format_all.sh.	2020-11-08 13:59:52 +01:00

369 Commits