SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	d1285ec08e	GLSL: Use SPIR-V intrinsics for N{Min,Max,Clamp}. When targeting Vulkan semantics, we should emit code that is actually readable. Also, make sure to use plain F{Min,Max,Clamp} on HLSL. HLSL has NaN-aware semantics by default.	2024-05-21 17:56:42 +02:00
Hans-Kristian Arntzen	376a308dc7	GLSL: Handle ptr-to-array casts with unexpected array strides.	2024-01-18 14:14:10 +01:00
Hans-Kristian Arntzen	071969811b	GLSL: Add basic test for chained access chains.	2024-01-18 13:35:43 +01:00
Hans-Kristian Arntzen	32ac768c15	GLSL: Handle synthesized pointer-to-pointer.	2024-01-18 13:11:04 +01:00
Hans-Kristian Arntzen	3da5bc7a57	GLSL: Forward declare non-block BDA types as well. With non-block structs, we can have pointers in those types too.	2024-01-18 12:56:31 +01:00
Hans-Kristian Arntzen	871ac0e5b7	GLSL: Handle BDA to plain non-block struct. The implementation assumed that BDA to struct would always result in a Block-like type, but that is not necessarily the case. Treat non-Block structs as POD types as well.	2024-01-18 12:40:47 +01:00
Hans-Kristian Arntzen	ec42cb1c41	Roll deps.	2024-01-05 13:36:34 +01:00
Hans-Kristian Arntzen	56bdcfaf0e	Add test for sample mask bulk load/store.	2023-10-23 12:45:08 +02:00
Hans-Kristian Arntzen	be944258be	GLSL: Ignore bogus constant/undefs which use block types. This is banned in more recent SPIR-V. We cannot implement this in any reasonable way since block types don't participate as normal types in GLSL.	2023-10-11 13:48:25 +02:00
Hans-Kristian Arntzen	75620181bc	GLSL: Handle empty block declarations. Similar fix as for structs.	2023-10-11 13:09:55 +02:00
Hans-Kristian Arntzen	b76394e66a	GLSL: Fix Clip/Cull in mesh shaders.	2023-08-25 11:59:21 +02:00
Hans-Kristian Arntzen	79d63934a6	GLSL: Emit inline invariant position for mesh shaders. Work around missing feature from GLSL. Normally we can emit a global invariant gl_Position; and call it a day, but it does not work for mesh shaders it seems. Declaring invariance inside an explicit IO block works fine on the other hand ...	2023-08-17 12:45:54 +02:00
Hans-Kristian Arntzen	5b322555d6	GLSL: Fix bug with mixed precision on PHI variables.	2023-06-07 15:31:29 +02:00
Hans-Kristian Arntzen	cfd1618e31	MSL: Use templated array type when emitting BDA to arrays. Handling native array types is not really feasible since we need to fuse the variable declaration with the type declaration. This is feasible in something like variable_decl, but for plain SSA pointers, this breaks down.	2023-05-10 13:33:54 +02:00
Hans-Kristian Arntzen	50623e13c8	GLSL: Fix expression reorder bug with legacy FMA fallback.	2023-03-30 17:24:11 +02:00
Hans-Kristian Arntzen	68a012a4f2	CFG: Handle implied access to opaque loaded values. Similar concern as access chains. Objects that we cannot lower to temporaries must implicitly access all expression dependencies when they are themselves accessed.	2022-12-13 16:34:00 +01:00
Hans-Kristian Arntzen	94160e8890	GLSL: Add test for mesh with non-block builtin IO.	2022-11-02 13:08:56 +01:00
Hans-Kristian Arntzen	a5a7999666	Update mesh/task test references.	2022-09-12 12:46:14 +02:00
Hans-Kristian Arntzen	4c345166dc	GLSL: Implement task shaders. Due to bugged glslang / spirv-tools w.r.t. terminator instructions, add a hack to ignore invalid SPIR-V for the time being.	2022-09-05 12:31:22 +02:00
Hans-Kristian Arntzen	bc3987e59a	MSL: Fix various bugs with Ptr/U bitcasts.	2022-08-09 13:49:40 +02:00
Hans-Kristian Arntzen	05bbcc2d09	Do not optimize OpCompositeInsert for hoisted temporaries. We cannot track uses that happens in subsequent loop iterations, so have to be very careful. Fixes regression introduced after this was added.	2022-08-09 12:44:57 +02:00
Hans-Kristian Arntzen	4dfac510ed	Handle multiple breaks out of switches. Use a switch stack instead.	2022-07-22 15:31:40 +02:00
Hans-Kristian Arntzen	c24d5a7b90	Merge pull request #1985 from KhronosGroup/fix-1973 Handle collapsed loops similar to collapsed selections.	2022-07-22 15:28:54 +02:00
Hans-Kristian Arntzen	be904dcc1e	Handle collapsed loops similar to collapsed selections. If we hit the deprecated scenario of selection with identical branches, we need to collapse loops into direct branches if need be.	2022-07-22 14:37:46 +02:00
Hans-Kristian Arntzen	f7e98c39db	Avoid detection of false loop variables. If the loop variable candidate has a dominating block that lives in an unrelated loop, we have to reject the loop variable candidate.	2022-07-22 14:08:21 +02:00
Hans-Kristian Arntzen	963fdfdf68	Handle PHI in collapsed switch constructs. Collapsed switch constructs only branch directly to the merge block.	2022-07-04 13:55:00 +02:00
Hans-Kristian Arntzen	e08e0cf28d	GLSL: Handle forced temporary OpConvertUToAccelerationStructureKHR. Need to lower to temporary before casting to RTAS.	2022-06-17 13:38:45 +02:00
Hans-Kristian Arntzen	4b9e60273c	Add tests for OpCompositeInsert edge cases.	2022-05-18 16:43:32 +02:00
Hans-Kristian Arntzen	7a6c2da9aa	GLSL: Handle more proper semantics for RelaxedPrecision. GLSL and RelaxedPrecision are quite different in what they affect. RelaxedPrecision affects operations, while this is merely implied in GLSL based on inputs. This leads to situations where we have to promote mediump inputs to highp, and the simplest approach is to force highp temporaries for inputs which are consumed in a highp context. For completeness, we also demote RelaxedPrecision inputs to mediump variables. PHI is handled by copying the PHI into a temporary. We have to be very careful with hoisted temporaries, since the child temporary will not be analyzed up-front. We inherit the hoisted-ness state and emit the hoisted child temporary as necessary. When faking the temporaries with OpCopyObject, we make sure to block any variable hoisting. Hoisting children of PHI variables is fine, since PHIs are not hoisted with the same framework as other temporaries.	2022-05-02 15:11:24 +02:00
Hans-Kristian Arntzen	40b3053b4c	Handle early reads from loop variables with initializers. Need to consider that static_expression can be 0. Fallback as necessary. Also need to clear the static expression.	2022-04-28 14:36:53 +02:00
Hans-Kristian Arntzen	d2a4f9842b	GLSL: Support GL_EXT_debug_printf.	2022-04-19 12:07:54 +02:00
Hans-Kristian Arntzen	4ab5bbb4e5	Fixup names of anonymous inner structs. Just like we try to fixup struct names for block types, inner structs can be "anonymous" structs. HLSL codegen from DXC tends to emit this, and emitting dummy struct names tends to break GL linkage on some drivers.	2022-03-10 15:45:38 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	dc62cc792b	Handle OpTerminateInvocation.	2022-03-03 10:36:07 +01:00
Hans-Kristian Arntzen	5d9a95370f	GLSL: Ensure correct signed integer type for more texture functions.	2022-02-16 11:14:07 +01:00
Hans-Kristian Arntzen	ac46140ba3	Test aliased names in declared LUTs.	2022-01-18 12:39:16 +01:00
Hans-Kristian Arntzen	48b5a9069f	Handle aliased names in spec constants. Need to register resource name.	2022-01-18 12:31:28 +01:00
Hans-Kristian Arntzen	7dd974b9db	Merge pull request #1847 from KhronosGroup/fix-1775 Rework how loop iteration counts are validated.	2022-01-17 14:55:56 +01:00
Hans-Kristian Arntzen	1d13a3e36a	Rework how loop iteration counts are validated. Introduces an idea of a recompilation making forward progress. There are some extreme edge cases where we need more than 3 loops, but only allow this in specific circumstances where we can reason about forward progress being made.	2022-01-17 14:12:01 +01:00
Hans-Kristian Arntzen	ef6bde6580	Do not forward expressions which carry a huge amount of dependencies. Need to force temporaries eventually, or compilers have a tendency to explode.	2022-01-17 13:45:12 +01:00
Hans-Kristian Arntzen	fe5a0aa72f	Merge pull request #1839 from KhronosGroup/spv1.6 Add sanity test for SPIR-V 1.6 modules.	2022-01-06 15:51:11 +01:00
Hans-Kristian Arntzen	7c12228359	Add sanity test for SPIR-V 1.6 modules. Just verify that we don't blow up on these modules.	2022-01-06 14:16:28 +01:00
Hans-Kristian Arntzen	7c83fc22fa	Add support for LocalSizeId. WorkgroupSize builtin is deprecated in 1.6 and LocalSizeId is supported in Vulkan starting with maintenance4.	2022-01-06 13:57:10 +01:00
Hans-Kristian Arntzen	f1b411c9e8	GLSL: Deal with buffer_reference_align. This is somewhat awkward to support, but the best effort we can do here is to analyze various Load/Store opcodes and deduce the ideal overall alignment based on this. This is not a 100% perfect solution, but should be correct for any reasonable use case. Also fix various nitpicks with BDA support while I'm at it.	2021-11-07 17:11:46 +01:00
Hans-Kristian Arntzen	2714f5410c	CFG: Handle degenerate selection constructs. Apparently, it's legal to use a selection construct where both paths branch to same location, but a different merge point is used. This breaks many assumptions the variable scope analyzer makes. The only logical way to generate code for this scenario is to treat the selection construct as a trivial switch construct with only a default case.	2021-11-07 13:32:29 +01:00
Hans-Kristian Arntzen	23c4480d8e	Fix switch fallthrough case in some cases.	2021-08-31 17:24:09 +02:00
Hans-Kristian Arntzen	cb613eb675	Handle value access in terminators. Fixes case where value is created inside loop body and consumed by a return outside it.	2021-07-29 15:27:52 +02:00
Hans-Kristian Arntzen	18f3cd6810	GLSL: Ensure ray query object decls are flushed if allocated in Function. glslang always emits Private variables, but DXC not so much.	2021-07-20 12:04:00 +02:00
Hans-Kristian Arntzen	8216e87f02	Handle SPIR-V 1.4 selection constructs. Fix bug in to_trivial_mix_op, where we made a pre-1.4 assumption that component count of selector is equal to value component count.	2021-06-28 12:23:44 +02:00
Hans-Kristian Arntzen	532f65583e	Rewrite how non-uniform qualifiers are handled. Remove all shenanigans with propagation, and only consume nonuniform qualifiers exactly where needed (last minute).	2021-04-22 16:03:08 +02:00

1 2 3

117 Commits