SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	5b322555d6	GLSL: Fix bug with mixed precision on PHI variables.	2023-06-07 15:31:29 +02:00
Hans-Kristian Arntzen	cfd1618e31	MSL: Use templated array type when emitting BDA to arrays. Handling native array types is not really feasible since we need to fuse the variable declaration with the type declaration. This is feasible in something like variable_decl, but for plain SSA pointers, this breaks down.	2023-05-10 13:33:54 +02:00
Hans-Kristian Arntzen	50623e13c8	GLSL: Fix expression reorder bug with legacy FMA fallback.	2023-03-30 17:24:11 +02:00
Hans-Kristian Arntzen	68a012a4f2	CFG: Handle implied access to opaque loaded values. Similar concern as access chains. Objects that we cannot lower to temporaries must implicitly access all expression dependencies when they are themselves accessed.	2022-12-13 16:34:00 +01:00
Chip Davis	5547b25afe	Interleave undef values with constants and types. Undef values may be of struct type and may be used in constants. Therefore, they must be interleaved with constants and types. Fixes the rest of the Vulkan CTS test `dEQP-VK.spirv_assembly.instruction.compute.opundef.undefined_spec_constant_composite`. (Please excuse the churn in the reference output; it's an inevitable result of this change.)	2022-11-20 02:08:37 -08:00
Hans-Kristian Arntzen	94160e8890	GLSL: Add test for mesh with non-block builtin IO.	2022-11-02 13:08:56 +01:00
Hans-Kristian Arntzen	a5a7999666	Update mesh/task test references.	2022-09-12 12:46:14 +02:00
Hans-Kristian Arntzen	4c345166dc	GLSL: Implement task shaders. Due to bugged glslang / spirv-tools w.r.t. terminator instructions, add a hack to ignore invalid SPIR-V for the time being.	2022-09-05 12:31:22 +02:00
Hans-Kristian Arntzen	bc3987e59a	MSL: Fix various bugs with Ptr/U bitcasts.	2022-08-09 13:49:40 +02:00
Hans-Kristian Arntzen	05bbcc2d09	Do not optimize OpCompositeInsert for hoisted temporaries. We cannot track uses that happen in subsequent loop iterations, so we have to be very careful. Fixes regression introduced after this was added.	2022-08-09 12:44:57 +02:00
Hans-Kristian Arntzen	4dfac510ed	Handle multiple breaks out of switches. Use a switch stack instead.	2022-07-22 15:31:40 +02:00
Hans-Kristian Arntzen	c24d5a7b90	Merge pull request #1985 from KhronosGroup/fix-1973 Handle collapsed loops similar to collapsed selections.	2022-07-22 15:28:54 +02:00
Hans-Kristian Arntzen	be904dcc1e	Handle collapsed loops similar to collapsed selections. If we hit the deprecated scenario of selection with identical branches, we need to collapse loops into direct branches if need be.	2022-07-22 14:37:46 +02:00
Hans-Kristian Arntzen	f7e98c39db	Avoid detection of false loop variables. If the loop variable candidate has a dominating block that lives in an unrelated loop, we have to reject the loop variable candidate.	2022-07-22 14:08:21 +02:00
Hans-Kristian Arntzen	963fdfdf68	Handle PHI in collapsed switch constructs. Collapsed switch constructs only branch directly to the merge block.	2022-07-04 13:55:00 +02:00
Hans-Kristian Arntzen	e08e0cf28d	GLSL: Handle forced temporary OpConvertUToAccelerationStructureKHR. Need to lower to temporary before casting to RTAS.	2022-06-17 13:38:45 +02:00
Hans-Kristian Arntzen	df21a99d1a	GLSL: Add more extension ladder fallbacks for unusual integer sizes.	2022-06-07 18:30:10 +02:00
Hans-Kristian Arntzen	4b9e60273c	Add tests for OpCompositeInsert edge cases.	2022-05-18 16:43:32 +02:00
Hans-Kristian Arntzen	23662668dd	Attempt more optimal codegen for OpCompositeInsert. Speculate that we can modify the SSA value in-place. As long as it is not used after the modify, this is fine. Also need to make sure we don't attempt to RMW something that is impossible to modify.	2022-05-18 16:37:33 +02:00
Hans-Kristian Arntzen	7a6c2da9aa	GLSL: Handle more proper semantics for RelaxedPrecision. GLSL and RelaxedPrecision are quite different in what they affect. RelaxedPrecision affects operations, while this is merely implied in GLSL based on inputs. This leads to situations where we have to promote mediump inputs to highp, and the simplest approach is to force highp temporaries for inputs which are consumed in a highp context. For completeness, we also demote RelaxedPrecision inputs to mediump variables. PHI is handled by copying the PHI into a temporary. We have to be very careful with hoisted temporaries, since the child temporary will not be analyzed up-front. We inherit the hoisted-ness state and emit the hoisted child temporary as necessary. When faking the temporaries with OpCopyObject, we make sure to block any variable hoisting. Hoisting children of PHI variables is fine, since PHIs are not hoisted with the same framework as other temporaries.	2022-05-02 15:11:24 +02:00
Hans-Kristian Arntzen	40b3053b4c	Handle early reads from loop variables with initializers. Need to consider that static_expression can be 0. Fallback as necessary. Also need to clear the static expression.	2022-04-28 14:36:53 +02:00
Hans-Kristian Arntzen	d2a4f9842b	GLSL: Support GL_EXT_debug_printf.	2022-04-19 12:07:54 +02:00
Hans-Kristian Arntzen	4ab5bbb4e5	Fixup names of anonymous inner structs. Just like we try to fixup struct names for block types, inner structs can be "anonymous" structs. HLSL codegen from DXC tends to emit this, and emitting dummy struct names tends to break GL linkage on some drivers.	2022-03-10 15:45:38 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	dc62cc792b	Handle OpTerminateInvocation.	2022-03-03 10:36:07 +01:00
Hans-Kristian Arntzen	5d9a95370f	GLSL: Ensure correct signed integer type for more texture functions.	2022-02-16 11:14:07 +01:00
Hans-Kristian Arntzen	ac46140ba3	Test aliased names in declared LUTs.	2022-01-18 12:39:16 +01:00
Hans-Kristian Arntzen	48b5a9069f	Handle aliased names in spec constants. Need to register resource name.	2022-01-18 12:31:28 +01:00
Hans-Kristian Arntzen	7dd974b9db	Merge pull request #1847 from KhronosGroup/fix-1775 Rework how loop iteration counts are validated.	2022-01-17 14:55:56 +01:00
Hans-Kristian Arntzen	1d13a3e36a	Rework how loop iteration counts are validated. Introduces an idea of a recompilation making forward progress. There are some extreme edge cases where we need more than 3 loops, but only allow this in specific circumstances where we can reason about forward progress being made.	2022-01-17 14:12:01 +01:00
Hans-Kristian Arntzen	ef6bde6580	Do not forward expressions which carry a huge amount of dependencies. Need to force temporaries eventually, or compilers have a tendency to explode.	2022-01-17 13:45:12 +01:00
Hans-Kristian Arntzen	fe5a0aa72f	Merge pull request #1839 from KhronosGroup/spv1.6 Add sanity test for SPIR-V 1.6 modules.	2022-01-06 15:51:11 +01:00
Hans-Kristian Arntzen	7c12228359	Add sanity test for SPIR-V 1.6 modules. Just verify that we don't blow up on these modules.	2022-01-06 14:16:28 +01:00
Hans-Kristian Arntzen	7c83fc22fa	Add support for LocalSizeId. WorkgroupSize builtin is deprecated in 1.6 and LocalSizeId is supported in Vulkan starting with maintenance4.	2022-01-06 13:57:10 +01:00
Hans-Kristian Arntzen	f1b411c9e8	GLSL: Deal with buffer_reference_align. This is somewhat awkward to support, but the best effort we can do here is to analyze various Load/Store opcodes and deduce the ideal overall alignment based on this. This is not a 100% perfect solution, but should be correct for any reasonable use case. Also fix various nitpicks with BDA support while I'm at it.	2021-11-07 17:11:46 +01:00
Hans-Kristian Arntzen	2714f5410c	CFG: Handle degenerate selection constructs. Apparently, it's legal to use a selection construct where both paths branch to same location, but a different merge point is used. This breaks many assumptions the variable scope analyzer makes. The only logical way to generate code for this scenario is to treat the selection construct as a trivial switch construct with only a default case.	2021-11-07 13:32:29 +01:00
Erfan Ahmadi	43eecb2360	SPIRV-Cross contribution needed for `INTEL_fragment_shader_ordering`	2021-10-25 10:50:10 +02:00
Hans-Kristian Arntzen	23c4480d8e	Fix switch fallthrough case in some cases.	2021-08-31 17:24:09 +02:00
Hans-Kristian Arntzen	cb613eb675	Handle value access in terminators. Fixes case where value is created inside loop body and consumed by a return outside it.	2021-07-29 15:27:52 +02:00
Hans-Kristian Arntzen	18f3cd6810	GLSL: Ensure ray query object decls are flushed if allocated in Function. glslang always emits Private variables, but DXC not so much.	2021-07-20 12:04:00 +02:00
Hans-Kristian Arntzen	8216e87f02	Handle SPIR-V 1.4 selection constructs. Fix bug in to_trivial_mix_op, where we made a pre-1.4 assumption that component count of selector is equal to value component count.	2021-06-28 12:23:44 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00
Hans-Kristian Arntzen	ee31e84e30	GLSL: Handle complex load/store scenarios to gl_SampleMask. Need special workarounds to handle array load/store since array size is unsized in GLSL, and array copy is not possible. Also, consider bitcast for scalar loads and stores.	2021-03-09 10:25:03 +01:00
Hans-Kristian Arntzen	4ca06c7278	Handle edge cases in OpCopyMemory. Implement this by synthesizing an OpLoad/OpStore pair instead.	2021-03-08 14:15:27 +01:00
Hans-Kristian Arntzen	ea02a0c03a	Check entry point variables in is_hidden_variables. Need to be careful not to emit globals we're not supposed to.	2021-01-22 13:53:22 +01:00
Hans-Kristian Arntzen	4bedad3860	Handle nonuniformEXT qualifier for acceleration structures.	2021-01-22 13:13:56 +01:00
Hans-Kristian Arntzen	7ab3f3f74e	Deal better with CompositeExtract from constant composite. There is no good reason for applications to emit this kind of code, but some do. Special case this scenario.	2021-01-22 12:30:16 +01:00
Hans-Kristian Arntzen	03ee71e86c	Add test for pure initializer gl_FragDepth. Tests that the builtin is considered active.	2021-01-07 15:32:15 +01:00
Hans-Kristian Arntzen	3776d8978c	GLSL: Force block declaration if clip/cull is used in tesc.	2021-01-07 15:32:15 +01:00
Hans-Kristian Arntzen	6a3ea0385e	GLSL: Add test for initializing tess level output.	2021-01-05 12:12:26 +01:00

119 Commits