SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	b76394e66a	GLSL: Fix Clip/Cull in mesh shaders.	2023-08-25 11:59:21 +02:00
Hans-Kristian Arntzen	633dc30135	Merge pull request #2187 from KhronosGroup/mesh-invariant-position GLSL: Emit inline invariant position for mesh shaders.	2023-08-17 13:51:54 +02:00
Hans-Kristian Arntzen	a3f026815a	Merge pull request #2186 from KhronosGroup/64bit-images GLSL: Implement 64-bit image support.	2023-08-17 13:50:29 +02:00
Hans-Kristian Arntzen	79d63934a6	GLSL: Emit inline invariant position for mesh shaders. Work around missing feature from GLSL. Normally we can emit a global invariant gl_Position; and call it a day, but it does not work for mesh shaders it seems. Declaring invariance inside an explicit IO block works fine on the other hand ...	2023-08-17 12:45:54 +02:00
Hans-Kristian Arntzen	5e3ea64843	GLSL: Implement 64-bit image support.	2023-08-17 12:17:55 +02:00
Hans-Kristian Arntzen	5ffc813dc4	GLSL: Require mesh shader when using perprimitiveEXT in frag shader.	2023-08-17 12:07:11 +02:00
Hans-Kristian Arntzen	5b322555d6	GLSL: Fix bug with mixed precision on PHI variables.	2023-06-07 15:31:29 +02:00
Hans-Kristian Arntzen	cfd1618e31	MSL: Use templated array type when emitting BDA to arrays. Handling native array types is not really feasible since we need to fuse the variable declaration with the type declaration. This is feasible in something like variable_decl, but for plain SSA pointers, this breaks down.	2023-05-10 13:33:54 +02:00
Hans-Kristian Arntzen	54b48a2f3a	Merge pull request #2133 from georgeouzou/main GLSL: Support GL_KHR_shader_subgroup_arithmetic (WIP)	2023-05-10 11:32:00 +02:00
George Ouzounoudis	ff962261d4	GLSL: Add tests for subgroup iadd, fadd, imul, fmul	2023-05-01 15:51:54 +03:00
Hans-Kristian Arntzen	50623e13c8	GLSL: Fix expression reorder bug with legacy FMA fallback.	2023-03-30 17:24:11 +02:00
Hans-Kristian Arntzen	1047c13d40	GLSL: Handle textureGatherOffsets properly. We forgot to pass down the offset ID, clean up coffset vs offset jank while we're at it.	2023-01-12 16:27:11 +01:00
Hans-Kristian Arntzen	457fd3db70	Add tests for loops compiled with debug semantics.	2023-01-11 15:18:19 +01:00
Hans-Kristian Arntzen	68a012a4f2	CFG: Handle implied access to opaque loaded values. Similar concern as access chains. Objects that we cannot lower to temporaries must implicitly access all expression dependencies when they are themselves accessed.	2022-12-13 16:34:00 +01:00
Hans-Kristian Arntzen	94160e8890	GLSL: Add test for mesh with non-block builtin IO.	2022-11-02 13:08:56 +01:00
Hans-Kristian Arntzen	a5a7999666	Update mesh/task test references.	2022-09-12 12:46:14 +02:00
Hans-Kristian Arntzen	4c345166dc	GLSL: Implement task shaders. Due to bugged glslang / spirv-tools w.r.t. terminator instructions, add a hack to ignore invalid SPIR-V for the time being.	2022-09-05 12:31:22 +02:00
Hans-Kristian Arntzen	bc3987e59a	MSL: Fix various bugs with Ptr/U bitcasts.	2022-08-09 13:49:40 +02:00
Hans-Kristian Arntzen	05bbcc2d09	Do not optimize OpCompositeInsert for hoisted temporaries. We cannot track uses that happen in subsequent loop iterations, so we have to be very careful. Fixes regression introduced after this was added.	2022-08-09 12:44:57 +02:00
Hans-Kristian Arntzen	4dfac510ed	Handle multiple breaks out of switches. Use a switch stack instead.	2022-07-22 15:31:40 +02:00
Hans-Kristian Arntzen	c24d5a7b90	Merge pull request #1985 from KhronosGroup/fix-1973 Handle collapsed loops similar to collapsed selections.	2022-07-22 15:28:54 +02:00
Hans-Kristian Arntzen	be904dcc1e	Handle collapsed loops similar to collapsed selections. If we hit the deprecated scenario of selection with identical branches, we need to collapse loops into direct branches if need be.	2022-07-22 14:37:46 +02:00
Hans-Kristian Arntzen	f7e98c39db	Avoid detection of false loop variables. If the loop variable candidate has a dominating block that lives in an unrelated loop, we have to reject the loop variable candidate.	2022-07-22 14:08:21 +02:00
Hans-Kristian Arntzen	963fdfdf68	Handle PHI in collapsed switch constructs. Collapsed switch constructs only branch directly to the merge block.	2022-07-04 13:55:00 +02:00
Hans-Kristian Arntzen	e08e0cf28d	GLSL: Handle forced temporary OpConvertUToAccelerationStructureKHR. Need to lower to temporary before casting to RTAS.	2022-06-17 13:38:45 +02:00
Hans-Kristian Arntzen	46e4b5a3c8	Fix control flow bug where we missed a continue statement. Case which caused failure: if (cond) { continue; } break; Only allow tracing from inner selections if the outer header never merges execution.	2022-06-07 15:04:01 +02:00
Hans-Kristian Arntzen	1c88730e12	GLSL: Implement 1D texture emulation for ES. ES does not support 1D images at all. Fake it by promoting 1D images to 2D.	2022-05-27 11:51:34 +02:00
Hans-Kristian Arntzen	4b9e60273c	Add tests for OpCompositeInsert edge cases.	2022-05-18 16:43:32 +02:00
Hans-Kristian Arntzen	7a6c2da9aa	GLSL: Handle more proper semantics for RelaxedPrecision. GLSL and RelaxedPrecision are quite different in what they affect. RelaxedPrecision affects operations, while this is merely implied in GLSL based on inputs. This leads to situations where we have to promote mediump inputs to highp, and the simplest approach is to force highp temporaries for inputs which are consumed in a highp context. For completeness, we also demote RelaxedPrecision inputs to mediump variables. PHI is handled by copying the PHI into a temporary. We have to be very careful with hoisted temporaries, since the child temporary will not be analyzed up-front. We inherit the hoisted-ness state and emit the hoisted child temporary as necessary. When faking the temporaries with OpCopyObject, we make sure to block any variable hoisting. Hoisting children of PHI variables is fine, since PHIs are not hoisted with the same framework as other temporaries.	2022-05-02 15:11:24 +02:00
Hans-Kristian Arntzen	40b3053b4c	Handle early reads from loop variables with initializers. Need to consider that static_expression can be 0. Fallback as necessary. Also need to clear the static expression.	2022-04-28 14:36:53 +02:00
Hans-Kristian Arntzen	d2a4f9842b	GLSL: Support GL_EXT_debug_printf.	2022-04-19 12:07:54 +02:00
Hans-Kristian Arntzen	4ab5bbb4e5	Fixup names of anonymous inner structs. Just like we try to fixup struct names for block types, inner structs can be "anonymous" structs. HLSL codegen from DXC tends to emit this, and emitting dummy struct names tends to break GL linkage on some drivers.	2022-03-10 15:45:38 +01:00
Hans-Kristian Arntzen	15d29f00e2	Add test for SPIR-V 1.6 Volatile HelperInvocation.	2022-03-04 11:19:33 +01:00
Hans-Kristian Arntzen	31be74a853	Add relax_nan_checks options. Makes codegen from typical D3D emulation SPIR-V more readable. Also makes cross compilation with NotEqual more sensible. It's very rare to actually need the strict NaN-checks in practice. Also, glslang now emits UnordNotEqual by default it seems, so give up trying to assume OrdNotEqual. Harmonize for UnordNotEqual as the sane default.	2022-03-03 14:50:56 +01:00
Hans-Kristian Arntzen	dc62cc792b	Handle OpTerminateInvocation.	2022-03-03 10:36:07 +01:00
Hans-Kristian Arntzen	5d9a95370f	GLSL: Ensure correct signed integer type for more texture functions.	2022-02-16 11:14:07 +01:00
Hans-Kristian Arntzen	ac46140ba3	Test aliased names in declared LUTs.	2022-01-18 12:39:16 +01:00
Hans-Kristian Arntzen	48b5a9069f	Handle aliased names in spec constants. Need to register resource name.	2022-01-18 12:31:28 +01:00
Hans-Kristian Arntzen	7dd974b9db	Merge pull request #1847 from KhronosGroup/fix-1775 Rework how loop iteration counts are validated.	2022-01-17 14:55:56 +01:00
Hans-Kristian Arntzen	1d13a3e36a	Rework how loop iteration counts are validated. Introduces an idea of a recompilation making forward progress. There are some extreme edge cases where we need more than 3 loops, but only allow this in specific circumstances where we can reason about forward progress being made.	2022-01-17 14:12:01 +01:00
Hans-Kristian Arntzen	ef6bde6580	Do not forward expressions which carry a huge amount of dependencies. Need to force temporaries eventually, or compilers have a tendency to explode.	2022-01-17 13:45:12 +01:00
Hans-Kristian Arntzen	fe5a0aa72f	Merge pull request #1839 from KhronosGroup/spv1.6 Add sanity test for SPIR-V 1.6 modules.	2022-01-06 15:51:11 +01:00
Hans-Kristian Arntzen	7c12228359	Add sanity test for SPIR-V 1.6 modules. Just verify that we don't blow up on these modules.	2022-01-06 14:16:28 +01:00
Hans-Kristian Arntzen	7c83fc22fa	Add support for LocalSizeId. WorkgroupSize builtin is deprecated in 1.6 and LocalSizeId is supported in Vulkan starting with maintenance4.	2022-01-06 13:57:10 +01:00
Hans-Kristian Arntzen	f1b411c9e8	GLSL: Deal with buffer_reference_align. This is somewhat awkward to support, but the best effort we can do here is to analyze various Load/Store opcodes and deduce the ideal overall alignment based on this. This is not a 100% perfect solution, but should be correct for any reasonable use case. Also fix various nitpicks with BDA support while I'm at it.	2021-11-07 17:11:46 +01:00
Hans-Kristian Arntzen	1adc53b107	Merge pull request #1806 from KhronosGroup/fix-1777 CFG: Handle degenerate selection constructs.	2021-11-07 14:10:48 +01:00
Hans-Kristian Arntzen	2714f5410c	CFG: Handle degenerate selection constructs. Apparently, it's legal to use a selection construct where both paths branch to the same location, but a different merge point is used. This breaks many assumptions the variable scope analyzer makes. The only logical way to generate code for this scenario is to treat the selection construct as a trivial switch construct with only a default case.	2021-11-07 13:32:29 +01:00
Hans-Kristian Arntzen	4561ecddbd	Handle Modf/Frexp in more cases. Consider it a write to a variable, similar to OpStore.	2021-11-07 11:36:44 +01:00
Hans-Kristian Arntzen	6382f15470	Test behavior around OpSelect with matrices.	2021-10-13 16:08:29 +02:00
Hans-Kristian Arntzen	6071df5840	Fix wrong detection of trivial_mix_op. Effectively, only the last component of the select was considered, need to correctly early out if any case is hit.	2021-10-13 15:34:00 +02:00


147 Commits