SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Hans-Kristian Arntzen	53ab2144b9	Merge pull request #1064 from KhronosGroup/fix-1062 Fall back to complex loop if non-trivial continue block is found.	2019-07-08 13:58:35 +02:00
Hans-Kristian Arntzen	50342966c0	Fall back to complex loop if non-trivial continue block is found. There is a case where we can deduce a for/while loop, but the continue block is actually very painful to deal with, so handle that case as well. Removes an exceptional case.	2019-07-08 11:54:29 +02:00
Hans-Kristian Arntzen	d12b54bbb4	Propagate NonUniformEXT to dependent expressions. This decoration might only be present for the very last ID which is consumed by a sampling or Load/Store instruction. To make sure our access chains are emitted correctly, we have to back-propagate this decoration.	2019-07-08 11:19:38 +02:00
Lifeng Pan	5ca8779044	Parse SPIR-V debug information extended instructions, as well as OpNoLine. No impact on result shader string.	2019-07-04 16:21:44 +08:00
Hans-Kristian Arntzen	581ed0fd59	HLSL: Does not support case-fallthrough. Disable any fallthrough on HLSL. Risky business if fallthrough blocks had a barrier(), but can't do anything about that ...	2019-06-27 15:10:17 +02:00
Hans-Kristian Arntzen	c76b99b711	Handle more cases with FP16 and texture sampling.	2019-06-27 15:04:22 +02:00
Hans-Kristian Arntzen	bcef66fbf3	Fix declaration of loop variables with a Phi helper copy. Certain Phi variables need to maintain a temporary copy, but we forgot to declare them when the master variable is a loop variable itself.	2019-06-25 10:45:15 +02:00
Hans-Kristian Arntzen	7557ff5567	Workaround GCC 9 bug.	2019-06-24 10:17:25 +02:00
Hans-Kristian Arntzen	b4e0163749	Run format_all.sh.	2019-06-21 16:02:22 +02:00
Hans-Kristian Arntzen	bcec5cb370	Old MSVC does not like +[] constructs.	2019-06-21 14:59:51 +02:00
Hans-Kristian Arntzen	c365cc1b43	Deal with OpPhi and case fallthrough. This is quite complex since we cannot flush Phi inside the case labels, we have to do it outside by emitting a lot of manual branches ourselves. This should be extremely rare, but we need to handle this case.	2019-06-21 13:38:23 +02:00
Hans-Kristian Arntzen	22e3beaab9	Deal with switch block fallthrough more correctly ...	2019-06-20 12:14:19 +02:00
Hans-Kristian Arntzen	bc3bf47446	Rewrite how switch block case labels are emitted.	2019-06-20 11:57:05 +02:00
Hans-Kristian Arntzen	7fdb418f18	Merge pull request #1028 from KhronosGroup/fix-1010 MSL: Support barycentrics and PrimitiveID in fragment shaders	2019-06-19 15:29:14 +02:00
Hans-Kristian Arntzen	707312b83a	GLSL: Support NV barycentrics.	2019-06-19 09:52:35 +02:00
Hans-Kristian Arntzen	f171d82590	MSL: Support MinLod operand.	2019-06-19 09:43:03 +02:00
Hans-Kristian Arntzen	d81bfc5b58	MSL: Fix regression with Private parameter declaration. If we compile multiple times due to forced_recompile, we had deferred_declaration = true while emitting function prototypes which broke an assumption. Fix this by clearing out stale state before leaving a function.	2019-06-13 10:36:21 +02:00
Hans-Kristian Arntzen	a9da59b0b8	GLSL: Support GL_ARB_shader_stencil_export.	2019-06-12 10:06:54 +02:00
Hans-Kristian Arntzen	bf56dc88b9	Rewrite how loop dominators are propagated. Do this analysis in the CFG stage rather than last minute with the ad-hoc algorithm we had in place before CFG was introduced.	2019-06-06 12:17:46 +02:00
Hans-Kristian Arntzen	720681da39	Merge pull request #1006 from KhronosGroup/fix-1003 Deal with case where a block is somehow emitted in a duplicated fashion.	2019-06-05 16:11:06 +02:00
Patrick Mours	8d64d5e776	Fix storage packing qualifiers missing on "shaderRecordNV" buffers	2019-06-05 13:31:24 +02:00
Patrick Mours	be3035db26	Fix callable data variables	2019-06-05 13:31:24 +02:00
Patrick Mours	789178666f	Add support for "shaderRecordNV" qualifier	2019-06-05 13:31:24 +02:00
Hans-Kristian Arntzen	c09ca74c61	Deal with case where a block is somehow emitted in a duplicated fashion. We seen to have to deal with a case where a block is used multiple times without any "proper" structured control flow, so we risk losing deferred declaration state.	2019-06-05 12:39:40 +02:00
Hans-Kristian Arntzen	65af09d2d1	Support emitting OpLine directive. Facilitates easier mapping from source language to cross-compiled output in tooling.	2019-05-28 13:44:24 +02:00
Hans-Kristian Arntzen	23889f7b87	GLSL: Support std430 in UBOs with scalar layout.	2019-05-28 12:22:44 +02:00
Hans-Kristian Arntzen	b3094cd02a	Run format_all.sh.	2019-05-27 16:54:13 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	96492648d4	MSL: Fix struct declaration order with complex type aliases. MSL generally emits the aliases, which means we cannot always place the master type first, unlike GLSL and HLSL. The logic fix is just to reorder after we have tagged types with packing information, rather than doing it in the parser fixup.	2019-05-23 14:54:04 +02:00
Hans-Kristian Arntzen	45a36ad034	Run format_all.sh.	2019-05-14 09:54:35 +02:00
Hans-Kristian Arntzen	c52d6bcd0c	Merge pull request #975 from alpqr/master GLSL: Add option to disable buffer blocks regardless of version	2019-05-14 09:51:39 +02:00
Laszlo Agocs	7bc31491be	GLSL: Add option to disable buffer blocks regardless of version	2019-05-13 21:29:06 +02:00
Hans-Kristian Arntzen	647ddaee42	HLSL/MSL: Deal correctly with nonuniformEXT qualifier. MSL does not seem to have a qualifier for this, but HLSL SM 5.1 does. glslangValidator for HLSL does not support this, so skip any validation, but it passes in FXC.	2019-05-13 14:58:27 +02:00
Hans-Kristian Arntzen	6fcf8c83d9	GLSL: Support OpBitcast for buffer references. Update glslang/SPIRV-Tools/SPIRV-Headers references.	2019-05-09 10:29:31 +02:00
Hans-Kristian Arntzen	b6f8a20624	GLSL: Return correct sign for OpArrayLength. .length() returns int, not uint ...	2019-05-07 19:02:32 +02:00
Hans-Kristian Arntzen	3186701739	GLSL: Support GL_EXT_nonuniform_qualifier.	2019-05-02 11:15:51 +02:00
Hans-Kristian Arntzen	6f091e7c8f	GLSL: Support GL_EXT_scalar_block_layout.	2019-04-26 15:43:37 +02:00
Hans-Kristian Arntzen	758427e127	Fix GCC 4.x warning.	2019-04-26 13:09:54 +02:00
Hans-Kristian Arntzen	2cc374a0c8	GLSL: Implement GL_EXT_buffer_reference. Buffer objects can contain arbitrary pointers to blocks. We can also implement ConvertPtrToU and ConvertUToPtr. The latter can cast a uint64_t to any type as it pleases, so we will need to generate fake buffer reference blocks to be able to cast the type.	2019-04-26 11:43:51 +02:00
Hans-Kristian Arntzen	8b236f24f1	Fix infinite loop when OpAtomic* temporaries are used in other blocks. We made the mistake of registering a dependency on the atomic variable even if the atomic result was forced to a temporary. There is no need to register reads from atomic variables like this as we always force atomic results to a temporary and argument read/writes do not need to be tracked.	2019-04-24 09:33:39 +02:00
Hans-Kristian Arntzen	e23c9ea700	Force complex loop in certain rare access chain scenarios. If we generate an access chain in a loop body, and it is consumed in the loop continue block, we have a problem because we cannot emit a temporary here holding the access chain reference. Force a complex loop body to workaround this exceptionally rare case.	2019-04-10 16:02:03 +02:00
Hans-Kristian Arntzen	9ae91c2d1e	Deal with mismatched signs in S/U/F conversion opcodes.	2019-04-10 14:03:58 +02:00
Hans-Kristian Arntzen	a489ba7fd1	Reduce pressure on global allocation. - Replace ostringstream with custom implementation. ~30% performance uplift on vector-shuffle-oom test. Allocations are measurably reduced in Valgrind. - Replace std::vector with SmallVector. Classic malloc optimization, small vectors are backed by inline data. ~ 7-8% gain on vector-shuffle-oom on GCC 8 on Linux. - Use an object pool for IVariant type. We generally allocate a lot of SPIR* objects. We can amortize these allocations neatly by pooling them. - ~15% overall uplift on ./test_shaders.py --iterations 10000 shaders/.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	b2c2f724f4	Merge pull request #938 from KhronosGroup/fix-937 MSL: Fix OpLoad of array which is forced to a temporary.	2019-04-09 15:08:53 +02:00
Hans-Kristian Arntzen	bf07e5fa7b	MSL: Fix OpLoad of array which is forced to a temporary.	2019-04-09 11:50:45 +02:00
lifpan	876627df3b	Add OpUndef instruction to block's instruction list for completeness.	2019-04-08 19:45:31 +08:00
Hans-Kristian Arntzen	3ca8bc5e0d	Support fma() in older GLSL targets.	2019-04-08 10:38:32 +02:00
Hans-Kristian Arntzen	317144a59c	Detect invalid DoWhileLoop early. We had a bug where error conditions in DoWhileLoop emit path would not detect that statements were being emitted due to the masking behavior which happens when force_recompile is true. Fix this. Also, refactor force_recompile into member functions so we can properly break on any situation where this is set, without having to rely on watchpoints in debuggers.	2019-04-05 12:19:32 +02:00
Hans-Kristian Arntzen	44834f2115	Merge pull request #927 from KhronosGroup/fix-925 GLSL: Fix OpImageFetch with uint coordinates and LOD.	2019-04-03 12:32:43 +02:00
Hans-Kristian Arntzen	e4d5c6183a	GLSL: Fix OpImageFetch with uint coordinates and LOD. Also fix some minor issues with too many coordinate dimensions in HLSL and GLSL.	2019-04-03 10:50:32 +02:00
Hans-Kristian Arntzen	7e37623e82	MSL: Fix depth2d 4-component fixup. Need to look at the backing image for the image. We might have found diverging use at the image variable level, not just expression level.	2019-04-03 10:24:22 +02:00
Hans-Kristian Arntzen	9b92e68d71	Add an option to override the namespace used for spirv_cross. This is a pragmatic trick to avoid symbol collision where a project links against SPIRV-Cross statically, while linking to other projects which also use SPIRV-Cross statically. We can end up with very awkward symbol collisions which can resolve themselves silently because SPIRV-Cross is pulled in as necessary. To fix this, we must use different symbols and embed two copies of SPIRV-Cross in this scenario, now with different namespaces, which in turn leads to different symbols.	2019-03-29 10:29:44 +01:00
Bill Hollings	c48702d8c2	Fix crash when backend.int16_t_literal_suffix set to null. The design of backend.int16_t_literal_suffix and backend.uint16_t_literal_suffix allows them to be set to null, but that was not always tested for. I have removed the expectation that they can be null and set backend.int16_t_literal_suffix to "" when no suffix is needed. That has the same effect, and seemed to be a more usable and defensive approach.	2019-03-28 14:23:32 -04:00
Hans-Kristian Arntzen	eeb3f24991	Properly deal with sign-dependent GLSL opcodes. The GLSLstd450 spec is very lax about input signs, so we need to do the bitcasting dance to implement it correctly.	2019-03-27 12:20:53 +01:00
Patrick Mours	c96bab0659	Replace usage of "require_extension" with "require_extension_internal" and "to_func_call_arg" with "to_expression"	2019-03-26 14:04:39 +01:00
Patrick Mours	c74d7a412c	Add "GL_NV_ray_tracing" extension to output when ray tracing execution model is found	2019-03-25 15:06:01 +01:00
Patrick Mours	b2651d01e5	Merge branch master into SPV_NV_ray_tracing	2019-03-25 14:09:15 +01:00
Hans-Kristian Arntzen	8eb33c8017	Support -1 index in OpVectorShuffle. -1 (0xffffffff) literal means the component should be undefined. Since we cannot express undefined directly, just use a 0 literal in the appropriate type.	2019-03-25 10:17:05 +01:00
Hans-Kristian Arntzen	2a0365c813	GLSL/HLSL: Implement NMin/NMax/NClamp. Need to emulate these calls for correctness.	2019-03-21 15:26:46 +01:00
Hans-Kristian Arntzen	0b20180537	GLSL: Deal with array loads from input in tessellation. We have an edge case where the array is declared with a concrete size, but in GLSL we must emit an unsized array, which breaks array copies. Deal explicitly with this.	2019-03-21 11:50:53 +01:00
Hans-Kristian Arntzen	d2961b30db	GLSL: Unroll loads from builtin pos/point arrays. Odd-ball case for certain geometry shaders coming from HLSL.	2019-03-21 11:25:41 +01:00
Hans-Kristian Arntzen	45baf24a17	Move check for structured OpSwitch to CompilerGLSL. Can still parse correctly.	2019-03-20 10:42:38 +01:00
Hans-Kristian Arntzen	a94490498d	Merge pull request #894 from KhronosGroup/fix-882 GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 11:56:24 +01:00
Hans-Kristian Arntzen	1389aa34e4	GLSL: Check target version for push constant location = N.	2019-03-19 11:20:53 +01:00
Hans-Kristian Arntzen	0474848d4a	GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 10:58:52 +01:00
Hans-Kristian Arntzen	7310274a4f	Fix build on Android API < 26.	2019-03-18 10:14:04 +01:00
Hans-Kristian Arntzen	cff057ca5a	We emit loop header variables even for while and dowhile. Make the name clearer.	2019-03-06 12:30:11 +01:00
Hans-Kristian Arntzen	8bfb04d29d	Run format_all.sh Disable clang format in C wrapper for now. Some weird formatting bug with the try/catch macro.	2019-03-06 12:20:13 +01:00
Hans-Kristian Arntzen	ef24337849	Support do-while where test is negative.	2019-03-06 12:17:38 +01:00
Hans-Kristian Arntzen	70ff96b03f	Deal with more for loop candidate cases. We can trivially deal with cases where the loop tests are simply inverted. We can also deal with cases where the condition block branches to the merge block via other noop blocks. This makes SPIR-V codegen easier when targeting SPIRV-Cross.	2019-03-06 11:24:43 +01:00
Hans-Kristian Arntzen	4096552c26	Use RADIXCHAR, which is the portable variant of DECIMAL_POINT.	2019-02-28 12:32:52 +01:00
Hans-Kristian Arntzen	8255dd3ed6	Use nl_langinfo on POSIX systems. localeconv is not MT-safe.	2019-02-28 11:51:08 +01:00
Hans-Kristian Arntzen	825ff4af7e	Replace locale handling. We were using std::locale::global() to force a C locale which is not safe when SPIRV-Cross is used in a multi-threaded environment. To fix this, we could tap into various per-platform specific locale handling to get safe thread-local locales, but since locales only affect the decimal point in floats, we simply query the locale instead and do the necessary radix replacement ourselves, without touching the locale. This should be much safer and cleaner than the alternative.	2019-02-28 11:28:31 +01:00
Patrick Mours	da39a7b02f	Add support for SPV_NV_ray_tracing	2019-02-26 15:43:03 +01:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Hans-Kristian Arntzen	58f264c99d	Merge pull request #865 from KhronosGroup/fix-863 Always value-cast FP16 constants instead of using literals.	2019-02-20 14:58:44 +01:00
Hans-Kristian Arntzen	4ef51331b2	Always value-cast FP16 constants instead of using literals. GL_NV_gpu_shader5 doesn't support "hf", so to avoid lots of complicated workarounds, just value-cast the half literals.	2019-02-20 12:30:01 +01:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Chip Davis	e75add42c9	MSL: Add support for tessellation evaluation shaders. These are mapped to Metal's post-tessellation vertex functions. The semantic difference is much less here, so this change should be simpler than the previous one. There are still some hairy parts, though. In MSL, the array of control point data is represented by a special type, `patch_control_point<T>`, where `T` is a valid stage-input type. This object must be embedded inside the patch-level stage input. For this reason, I've added a new type to the type system to represent this. On Mac, the number of input control points to the function must be specified in the `patch()` attribute. This is optional on iOS. SPIRV-Cross takes this from the `OutputVertices` execution mode; the intent is that if it's not set in the shader itself, MoltenVK will set it from the tessellation control shader. If you're translating these offline, you'll have to update the control point count manually, since this number must match the number that is passed to the `drawPatches:...` family of methods. Fixes #120.	2019-02-14 10:00:08 -06:00
Hans-Kristian Arntzen	d7090b8322	GLSL: Fix block name shenanigans in edge cases. When we force recompile, the old var.self name we used as a fallback name might have been disturbed, so we should recover certain names back to their original form in case we are forced to take a recompile to make the naming algorithm more deterministic.	2019-02-13 16:39:59 +01:00
Hans-Kristian Arntzen	3e584f2c3f	Support LUTs in single-function CFGs on Private storage class. Fairly common pattern in unoptimized SPIR-V. Support this case as well.	2019-02-06 10:38:59 +01:00
Chip Davis	ef0b1fc841	Move assertions after the check for equal types. `bitcast_glsl_op()` is sometimes called for `Boolean` types, e.g. for specialization constants. We don't want the assert to trip if this is going to be a no-op anyway.	2019-01-31 14:28:21 -06:00
Hans-Kristian Arntzen	2ed171e525	GLSL/MSL: Implement 8-bit part of VK_KHR_shader_float16_int8. Storage was in place already, so mostly just dealing with bitcasts and constants. Simplies some of the bitcasting logic, and this exposed some bugs in the implementation. Refactor to use correct width integers with explicit bitcast opcodes.	2019-01-30 15:45:24 +01:00
Hans-Kristian Arntzen	2edee351f0	Run format_all.sh.	2019-01-30 13:42:50 +01:00
Hans-Kristian Arntzen	3e09879131	Support initializers on StorageClassOutput.	2019-01-30 10:29:08 +01:00
Hans-Kristian Arntzen	8c632da461	MSL: Use correct alignment rule for whole structs. Structs are aligned as you would expect in MSL (maximum member alignment), and it is not minimum 16 bytes like in std140. Also rename the dummy "pad" members to a reserved naming scheme.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	3aa08f764e	MSL: Fix image load/store for short vectors. Same fixes as for GLSL.	2019-01-17 14:54:29 +01:00
Hans-Kristian Arntzen	73d9da7070	Avoid unintentional name conflict with HLSL backend.	2019-01-17 12:21:16 +01:00
Hans-Kristian Arntzen	432aaed737	Need to know the original packed type when unpacking loads.	2019-01-17 11:39:46 +01:00
Hans-Kristian Arntzen	40e7723051	Run format_all.sh.	2019-01-17 11:29:50 +01:00
Hans-Kristian Arntzen	de7e5ccd8b	Refactor out packed expressions to extended decorations. Can't safely just cast to the original enum without lots of hacks.	2019-01-17 11:28:51 +01:00
Hans-Kristian Arntzen	72377366d3	Replace custom use of DecorationCPacked with an explicit one. Will need to use more variants of this decoration, so might as well make it clearer what is going on with CPacked.	2019-01-17 10:36:56 +01:00
Hans-Kristian Arntzen	f4026a5618	Refactor access_chain_internal to be more readable from callsite.	2019-01-17 10:30:13 +01:00
Hans-Kristian Arntzen	15b52bee48	Deal with packing/unpacking on store. Still a bit buggy, since we cannot deduce between float2[] and packed_float2. Need a deeper refactor to plumb this through ...	2019-01-17 10:06:23 +01:00
Hans-Kristian Arntzen	7ee04936ac	MSL: Fix case where we pass arrays to functions by value. MSL does not support value semantics for arrays (sigh), so we need to force constant references and deal with copies if we have a different address space than what we end up guessing.	2019-01-14 11:00:14 +01:00
Hans-Kristian Arntzen	6e1c3ccb72	Run format_all.sh.	2019-01-11 12:56:00 +01:00
Hans-Kristian Arntzen	2fb9aa251e	Workaround bugs on MSVC. Bug: https://developercommunity.visualstudio.com/content/problem/303996/c-error-c2668-ambiguous-overloaded-in-lambda-with.html	2019-01-11 09:29:28 +01:00
Hans-Kristian Arntzen	b629878f45	Make meta a hashmap. A flat array was consuming way too much memory and was far too slow to initialize properly with a very large ID bound (8 million IDs, showed up as #1 hotspot in perf). Meta struct does not have to be in-order as we never iterate over it in a meaningful way, so using a hashmap here is reasonable. Very few IDs should need decorations or meta-data, so this should also be a quite decent memory save. For the pathological case, a 6x uplift was observed.	2019-01-10 14:04:01 +01:00
Hans-Kristian Arntzen	d92de00cc1	Rewrite how IDs are iterated over. This is a fairly fundamental change on how IDs are handled. It serves many purposes: - Improve performance. We only need to iterate over IDs which are relevant at any one time. - Makes sure we iterate through IDs in SPIR-V module declaration order rather than ID space. IDs don't have to be monotonically increasing, which was an assumption SPIRV-Cross used to have. It has apparently never been a problem until now. - Support LUTs of structs. We do this by interleaving declaration of constants and struct types in SPIR-V module order. To support this, the ParsedIR interface needed to change slightly. Before setting any ID with variant_set<T> we let ParsedIR know that an ID with a specific type has been added. The surface for change should be minimal. ParsedIR will maintain a per-type list of IDs which the cross-compiler will need to consider for later. Instead of looping over ir.ids[] (which can be extremely large), we loop over types now, using: ir.for_each_typed_id<SPIRVariable>([&](uint32_t id, SPIRVariable &var) { handle_variable(var); }); Now we make sure that we're never looking at irrelevant types.	2019-01-10 12:52:56 +01:00

1 2 3 4 5 ...

662 Commits