SPIRV-Cross

Author	SHA1	Message	Date
Patrick Mours	8d64d5e776	Fix storage packing qualifiers missing on "shaderRecordNV" buffers	2019-06-05 13:31:24 +02:00
Patrick Mours	b1d406e481	Add test for callable data	2019-06-05 13:31:24 +02:00
Patrick Mours	789178666f	Add support for "shaderRecordNV" qualifier	2019-06-05 13:31:24 +02:00
Hans-Kristian Arntzen	314efdcc42	MSL: Fix declaration of unused input variables. In multiple-entry-point modules, we declared builtin inputs which were not supposed to be used for that entry point. Fix this, by being more strict when checking which builtins to emit.	2019-05-31 13:23:34 +02:00
Hans-Kristian Arntzen	65af09d2d1	Support emitting OpLine directive. Facilitates easier mapping from source language to cross-compiled output in tooling.	2019-05-28 13:44:24 +02:00
Hans-Kristian Arntzen	23889f7b87	GLSL: Support std430 in UBOs with scalar layout.	2019-05-28 12:22:44 +02:00
Hans-Kristian Arntzen	fd0feb1ec1	MSL: Use correct address space when passing array-of-buffers. Need to check if the descriptor set is actually an argument buffer.	2019-05-27 16:53:30 +02:00
Hans-Kristian Arntzen	42e64597a7	OpArrayLength must trigger active variables.	2019-05-27 16:44:02 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	55ff233526	MSL: Add test case for complex type alias.	2019-05-23 15:05:30 +02:00
Hans-Kristian Arntzen	96492648d4	MSL: Fix struct declaration order with complex type aliases. MSL generally emits the aliases, which means we cannot always place the master type first, unlike GLSL and HLSL. The logic fix is just to reorder after we have tagged types with packing information, rather than doing it in the parser fixup.	2019-05-23 14:54:04 +02:00
Hans-Kristian Arntzen	eaf7afed97	MSL: Support argument buffers and image swizzling. Change aux buffer to swizzle buffer. There is no good reason to expand the aux buffer, so name it appropriately. Make the code cleaner by emitting a straight pointer to uint rather than a dummy struct which only contains a single unsized array member anyways. This will also end up being very similar to how we implement swizzle buffers for argument buffers. Do not use implied binding if it overflows int32_t.	2019-05-18 10:30:06 +02:00
Chip Davis	8983920edf	Remove fallback for OpGroupNonUniformElect. It's not safe to enable subgroup support without this actually working correctly.	2019-05-16 13:42:09 -05:00
Chip Davis	9d9415754b	MSL: Add support for subgroup operations. Some support for subgroups is present starting in Metal 2.0 on both iOS and macOS. macOS gains more complete support in 10.14 (Metal 2.1). Some restrictions are present. On iOS and on macOS 10.13, the implementation of `OpGroupNonUniformElect` is incorrect: if thread 0 has already terminated or is not executing a conditional branch, the first thread that is will falsely believe itself not to be. Unfortunately, this operation is part of the "basic" feature set; without it, subgroups cannot be supported at all. The `SubgroupSize` and `SubgroupLocalInvocationId` builtins are only available in compute shaders (and, by extension, tessellation control shaders), despite SPIR-V making them available in all stages. This limits the usefulness of some of the subgroup operations in fragment shaders. Although Metal on macOS supports some clustered, inclusive, and exclusive operations, it does not support them all. In particular, inclusive and exclusive min, max, and, or, and xor; as well as cluster sizes other than 4 are not supported. If this becomes a problem, they could be emulated, but at a significant performance cost due to the need for non-uniform operations.	2019-05-15 17:40:04 -05:00
Hans-Kristian Arntzen	03da32a124	Fix nonuniform test for MSL. Binding index overlaps.	2019-05-13 15:14:18 +02:00
Hans-Kristian Arntzen	647ddaee42	HLSL/MSL: Deal correctly with nonuniformEXT qualifier. MSL does not seem to have a qualifier for this, but HLSL SM 5.1 does. glslangValidator for HLSL does not support this, so skip any validation, but it passes in FXC.	2019-05-13 14:58:27 +02:00
Hans-Kristian Arntzen	ac5eea3326	MSL: Add test for passing single swizzled texture arg from array.	2019-05-09 14:19:40 +02:00
Hans-Kristian Arntzen	97d39dc9d5	MSL: Deal with texture swizzle on arrays of images.	2019-05-09 11:25:45 +02:00
Hans-Kristian Arntzen	6fcf8c83d9	GLSL: Support OpBitcast for buffer references. Update glslang/SPIRV-Tools/SPIRV-Headers references.	2019-05-09 10:29:31 +02:00
Hans-Kristian Arntzen	b6f8a20624	GLSL: Return correct sign for OpArrayLength. .length() returns int, not uint ...	2019-05-07 19:02:32 +02:00
Hans-Kristian Arntzen	e9da5ed631	HLSL: Support OpArrayLength.	2019-05-07 15:53:41 +02:00
Hans-Kristian Arntzen	3186701739	GLSL: Support GL_EXT_nonuniform_qualifier.	2019-05-02 11:15:51 +02:00
Chip Davis	01c491648b	Fix a copy-pasto.	2019-04-26 17:16:21 -05:00
Hans-Kristian Arntzen	6f091e7c8f	GLSL: Support GL_EXT_scalar_block_layout.	2019-04-26 15:43:37 +02:00
Hans-Kristian Arntzen	2cc374a0c8	GLSL: Implement GL_EXT_buffer_reference. Buffer objects can contain arbitrary pointers to blocks. We can also implement ConvertPtrToU and ConvertUToPtr. The latter can cast a uint64_t to any type as it pleases, so we will need to generate fake buffer reference blocks to be able to cast the type.	2019-04-26 11:43:51 +02:00
Hans-Kristian Arntzen	8b236f24f1	Fix infinite loop when OpAtomic* temporaries are used in other blocks. We made the mistake of registering a dependency on the atomic variable even if the atomic result was forced to a temporary. There is no need to register reads from atomic variables like this as we always force atomic results to a temporary and argument read/writes do not need to be tracked.	2019-04-24 09:33:39 +02:00
Hans-Kristian Arntzen	c2715c3908	MSL: Cast texture_buffer index to uint.	2019-04-23 12:46:48 +02:00
Hans-Kristian Arntzen	fc4f39b11f	MSL: Support native texture_buffer type, throw error on atomics. Atomics are not supported on images or texture_buffers in MSL. Properly throw an error if OpImageTexelPointer is used (since it can only be used for atomic operations anyways).	2019-04-23 12:21:43 +02:00
Michael Barriault	105bfd368a	Only use MSL constant address space for tessellation control shader.	2019-04-16 17:56:02 +01:00
Michael Barriault	16911c5a4d	Merge remote-tracking branch 'origin/master' * origin/master: Support running {,update_}test_shader.sh with CMake builds. Don't apply vertex attribute remapping other non-vertex or non-input interface blocks Force complex loop in certain rare access chain scenarios. Fix guard around [[noreturn]]. Deal with mismatched signs in S/U/F conversion opcodes. Workaround lack of lvalue/rvalue operator overload on MSVC 2013. Support direct conversions to std::vector from SmallVector. Fix some minor copy constructor issues in Variant. Make sure ids_for_types are moved correctly in move operator. Run format_all.sh. Refactor out error handling and containers to new headers. Do not use SmallVector as input type in public interfaces. Fix various bugs found in testing. Explicitly implement move operators for ParsedIR. Try another MSVC 2013 workaround. Implement edge cases in insert/end and add a simple test case. Fix GCC 4.x warnings. Workaround lack of alignas on MSVC 2013. Reduce pressure on global allocation. CLI: Make --iterations more useful.	2019-04-13 18:06:29 +01:00
Michael Barriault	d6754c5713	Fix tests for device->constant address space change in MSL tessellation control shader generation.	2019-04-10 18:37:04 +01:00
Hans-Kristian Arntzen	e23c9ea700	Force complex loop in certain rare access chain scenarios. If we generate an access chain in a loop body, and it is consumed in the loop continue block, we have a problem because we cannot emit a temporary here holding the access chain reference. Force a complex loop body to workaround this exceptionally rare case.	2019-04-10 16:02:03 +02:00
Hans-Kristian Arntzen	9ae91c2d1e	Deal with mismatched signs in S/U/F conversion opcodes.	2019-04-10 14:03:58 +02:00
Hans-Kristian Arntzen	23db744e35	Deal with case where we need to emit SpvImplArrayCopy late. We cannot deduce if OpLoad needs ArrayCopy templates early since it's heavily context dependent, and we might only know on 3rd iteration of the compile loop.	2019-04-09 12:28:46 +02:00
Hans-Kristian Arntzen	bf07e5fa7b	MSL: Fix OpLoad of array which is forced to a temporary.	2019-04-09 11:50:45 +02:00
Hans-Kristian Arntzen	3ca8bc5e0d	Support fma() in older GLSL targets.	2019-04-08 10:38:32 +02:00
Bill Hollings	efbe7ca16f	MSL: Fix infinite CAS loop on atomic_compare_exchange_weak_explicit().	2019-04-05 21:28:57 -04:00
Hans-Kristian Arntzen	317144a59c	Detect invalid DoWhileLoop early. We had a bug where error conditions in DoWhileLoop emit path would not detect that statements were being emitted due to the masking behavior which happens when force_recompile is true. Fix this. Also, refactor force_recompile into member functions so we can properly break on any situation where this is set, without having to rely on watchpoints in debuggers.	2019-04-05 12:19:32 +02:00
Frank Henigman	248e95a42f	Update to latest glslang/SPIRV-Tools.	2019-04-03 15:37:38 -04:00
Hans-Kristian Arntzen	44834f2115	Merge pull request #927 from KhronosGroup/fix-925 GLSL: Fix OpImageFetch with uint coordinates and LOD.	2019-04-03 12:32:43 +02:00
Hans-Kristian Arntzen	e4d5c6183a	GLSL: Fix OpImageFetch with uint coordinates and LOD. Also fix some minor issues with too many coordinate dimensions in HLSL and GLSL.	2019-04-03 10:50:32 +02:00
Hans-Kristian Arntzen	7e37623e82	MSL: Fix depth2d 4-component fixup. Need to look at the backing image for the image. We might have found diverging use at the image variable level, not just expression level.	2019-04-03 10:24:22 +02:00
Hans-Kristian Arntzen	18d4f67a87	Merge pull request #919 from KhronosGroup/fix-915 MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]].	2019-03-28 14:00:49 +01:00
Hans-Kristian Arntzen	0909975655	MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]]. Avoids ugly warnings on nearly every compute shader. We could do analysis to detect whether we need to emit this constant, but it's a bit tedious to figure out if an OpConstantComponent is actually used by opcodes, so just make it simple.	2019-03-28 10:54:18 +01:00
Hans-Kristian Arntzen	c37f88fea6	MSL: Fix crash where variable storage buffer pointers are passed down. Only deal with readonly decoration for actual block types.	2019-03-28 10:16:46 +01:00
Hans-Kristian Arntzen	eeb3f24991	Properly deal with sign-dependent GLSL opcodes. The GLSLstd450 spec is very lax about input signs, so we need to do the bitcasting dance to implement it correctly.	2019-03-27 12:20:53 +01:00
Hans-Kristian Arntzen	df3e21a762	Parser: Fix OpCompositeConstruct with OpUndef. Just treat any undefined argument as 0. It is risky to use the undefined variable as it might not lower to a true constant.	2019-03-27 10:51:23 +01:00
Hans-Kristian Arntzen	689a7deb3e	Make ray-tracing files .nocompat.	2019-03-27 10:04:42 +01:00
Patrick Mours	0f72199b8f	Add generated reference shaders again	2019-03-26 15:21:12 +01:00
Hans-Kristian Arntzen	8eb33c8017	Support -1 index in OpVectorShuffle. -1 (0xffffffff) literal means the component should be undefined. Since we cannot express undefined directly, just use a 0 literal in the appropriate type.	2019-03-25 10:17:05 +01:00
Hans-Kristian Arntzen	2a0365c813	GLSL/HLSL: Implement NMin/NMax/NClamp. Need to emulate these calls for correctness.	2019-03-21 15:26:46 +01:00
Hans-Kristian Arntzen	0b20180537	GLSL: Deal with array loads from input in tessellation. We have an edge case where the array is declared with a concrete size, but in GLSL we must emit an unsized array, which breaks array copies. Deal explicitly with this.	2019-03-21 11:50:53 +01:00
Hans-Kristian Arntzen	d2961b30db	GLSL: Unroll loads from builtin pos/point arrays. Odd-ball case for certain geometry shaders coming from HLSL.	2019-03-21 11:25:41 +01:00
Hans-Kristian Arntzen	0474848d4a	GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 10:58:52 +01:00
Hans-Kristian Arntzen	e2aadf8995	Rename "push descriptor set" to "discrete descriptor set". Check for case where iOS doesn't support writable argument buffer textures.	2019-03-15 21:53:21 +01:00
Hans-Kristian Arntzen	b3380ec9dd	MSL: Support VK_KHR_push_descriptor. If we have argument buffers, we also need to support using plain descriptor sets for certain cases where API wants it.	2019-03-15 14:08:47 +01:00
Hans-Kristian Arntzen	bc21ccb7ce	MSL: Emit correct SSBO constness for argument buffers.	2019-03-15 12:05:35 +01:00
Hans-Kristian Arntzen	969566aff5	MSL: Fixup buffer array case issue on MSL 1.0.	2019-03-15 11:37:34 +01:00
Hans-Kristian Arntzen	af8a9ccdcb	MSL: Need to emit two layers of address space. When passing down arrays of buffer pointers, the array itself needs an address space.	2019-03-15 11:29:17 +01:00
Hans-Kristian Arntzen	e47a77d596	MSL: Implement Metal 2.0 indirect argument buffers.	2019-03-15 11:01:27 +01:00
Hans-Kristian Arntzen	ef24337849	Support do-while where test is negative.	2019-03-06 12:17:38 +01:00
Hans-Kristian Arntzen	ee395afa83	MSL: Emit proper name for optimized UBO/SSBO arrays.	2019-02-25 11:09:00 +01:00
Hans-Kristian Arntzen	ad6134262e	Merge pull request #877 from cdavis5e/msl-tesc-early-return MSL: Return early from helper tesc invocations.	2019-02-25 09:13:06 +01:00
Chip Davis	a43dcd7b99	MSL: Return early from helper tesc invocations. Return after loading the input control point array if there are more input points than output points, and this was one of the helper invocations spun off to load the input points. I was hesitant to do this initially, since the MSL spec has this to say about barriers: > The `threadgroup_barrier` (or `simdgroup_barrier`) function must be > encountered by all threads in a threadgroup (or SIMD-group) executing > the kernel. That is, if any thread executes the barrier, then all threads must execute it, or the barrier'd invocations will hang. But, the key words here seem to be "executing the kernel;" inactive invocations, those that have already returned, need not encounter the barrier to prevent hangs. Indeed, I've encountered no problems from doing this, at least on my hardware. This also fixes a few CTS tests that were failing due to execution ordering; apparently, my assumption that the later, invalid data written by the helpers would get overwritten was wrong.	2019-02-24 12:17:47 -06:00
Chip Davis	f3c0942d10	MSL: Use vectors for the tessellation level builtins in tese shaders. The tessellation levels in Metal are stored as a densely-packed array of half-precision floating point values. But, stage-in attributes in Metal have to have offsets and strides aligned to a multiple of four, so we can't add them individually. Luckily for us, the arrays have lengths less than 4. So, let's use vectors for them! Triangles get a single attribute with a `float4`, where the outer levels are in `.xyz` and the inner levels are in `.w`. The arrays are unpacked as though we had added the elements individually. Quads get two: a `float4` with the outer levels and a `float2` with the inner levels. Further, since vectors can be indexed as arrays, there's no need to unpack them in this case. This also saves on precious vertex attributes. Before, we were using up to 6 of them. Now we need two at most.	2019-02-22 12:18:51 -06:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Chip Davis	7a7e210515	MSL: Force unnamed array builtin attributes to have a name. That way, when we refer to them, they'll have the name that we're expecting.	2019-02-20 22:16:51 -06:00
Chip Davis	285ca4c2b1	MSL: Don't bother fixing up triangle tess coords. Instead, I'm going to have MoltenVK reverse the winding order in the lower-left case. This seems to be what the test suite expects to happen anyhow.	2019-02-20 14:30:44 -06:00
Chip Davis	8095434dc4	MSL: Drop stores to nonexistent tess levels. In SPIR-V, there are always two inner levels and four outer levels, even if the input patch isn't a quad patch. But in MSL, due to requirements imposed by Metal, only one inner level and three outer levels exist when the input patch is a triangle patch. We must explicitly ignore any write to the nonexistent second inner and fourth outer levels in this case.	2019-02-20 09:11:24 -06:00
Chip Davis	c8ee9fbe76	MSL: Expand quad gl_TessCoord to a float3. This is the actual SPIR-V type of the builtin. We forced to a `float2` in the declaration because that's what Metal wants.	2019-02-20 09:11:24 -06:00
Hans-Kristian Arntzen	58f264c99d	Merge pull request #865 from KhronosGroup/fix-863 Always value-cast FP16 constants instead of using literals.	2019-02-20 14:58:44 +01:00
Hans-Kristian Arntzen	4ef51331b2	Always value-cast FP16 constants instead of using literals. GL_NV_gpu_shader5 doesn't support "hf", so to avoid lots of complicated workarounds, just value-cast the half literals.	2019-02-20 12:30:01 +01:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Hans-Kristian Arntzen	d2cc43e667	Fix edge case where opaque types can be declared on stack. In the bizarre case where the ID of a loaded opaque type aliased with a literal which was used as part of another texturing instruction, we could end up with a case where domination analysis assumed the loaded opaque type needed to be moved to a different scope. Fix the issue by never doing dominance analysis for opaque temporaries, and be more robust when analyzing texturing instructions. Also make sure reflection output is deterministic. This patch slightly alterered output for some unknown reason, but it came from an unordered_map, so it's fine.	2019-02-19 17:28:31 +01:00
Chip Davis	41d9424233	MSL: Add an option to set the tessellation domain origin. This is intended to be used to support `VK_KHR_maintenance2`'s tessellation domain origin feature. If `tess_domain_origin_lower_left` is `true`, the `v` coordinate will be inverted with respect to the domain. Additionally, in `Triangles` mode, the `v` and `w` coordinates will be swapped. This is because the winding order is interpreted differently in lower-left mode.	2019-02-18 14:25:42 -06:00
Chip Davis	08863c1e28	Don't set any aliases or do any flattening for arrayed per-vertex I/O. We already handle all that specially.	2019-02-15 17:24:16 -06:00
Chip Davis	6b7988046d	Handle blocks of patch I/O. In this case, each member of the block will be decorated with `DecorationPatch`, rather than the block variable having the decoration.	2019-02-15 17:21:38 -06:00
Chip Davis	e75add42c9	MSL: Add support for tessellation evaluation shaders. These are mapped to Metal's post-tessellation vertex functions. The semantic difference is much less here, so this change should be simpler than the previous one. There are still some hairy parts, though. In MSL, the array of control point data is represented by a special type, `patch_control_point<T>`, where `T` is a valid stage-input type. This object must be embedded inside the patch-level stage input. For this reason, I've added a new type to the type system to represent this. On Mac, the number of input control points to the function must be specified in the `patch()` attribute. This is optional on iOS. SPIRV-Cross takes this from the `OutputVertices` execution mode; the intent is that if it's not set in the shader itself, MoltenVK will set it from the tessellation control shader. If you're translating these offline, you'll have to update the control point count manually, since this number must match the number that is passed to the `drawPatches:...` family of methods. Fixes #120.	2019-02-14 10:00:08 -06:00
Hans-Kristian Arntzen	aab4a5632b	Merge branch 'tesc-shader' of git://github.com/cdavis5e/SPIRV-Cross	2019-02-14 09:17:32 +01:00
Chip Davis	13df78bebf	Unflatten inputs when copying to outputs. This should fix a whole host of issues related to structs in the `Input` class in a tessellation control shader. Also, use pointer arithmetic instead of dereferencing the `ops` array. This is critical in case we wind up stepping beyond the bounds of the array.	2019-02-13 12:37:24 -06:00
Hans-Kristian Arntzen	d7090b8322	GLSL: Fix block name shenanigans in edge cases. When we force recompile, the old var.self name we used as a fallback name might have been disturbed, so we should recover certain names back to their original form in case we are forced to take a recompile to make the naming algorithm more deterministic.	2019-02-13 16:39:59 +01:00
Chip Davis	0bb6bbda22	Never flatten outputs when capturing them. There's no need to do so, since these are not stage-out structs being returned, but regular structures being written to a buffer. This also neatly avoids issues writing to composite (e.g. arrayed) per-patch outputs from a tessellation control shader.	2019-02-11 17:18:54 -06:00
Chip Davis	1919eb1b46	Pass the original pointer type to ensure_correct_attribute_type(). This prevents us from overwriting the variable's type with a non-pointer type.	2019-02-11 16:07:43 -06:00
Chip Davis	eb89c3a428	MSL: Add support for tessellation control shaders. These are transpiled to kernel functions that write the output of the shader to three buffers: one for per-vertex varyings, one for per-patch varyings, and one for the tessellation levels. This structure is mandated by the way Metal works, where the tessellation factors are supplied to the draw method in their own buffer, while the per-patch and per-vertex varyings are supplied as though they were vertex attributes; since they have different step rates, they must be in separate buffers. The kernel is expected to be run in a workgroup whose size is the greater of the number of input or output control points. It uses Metal's support for vertex-style stage input to a compute shader to get the input values; therefore, at least one instance must run per input point. Meanwhile, Vulkan mandates that it run at least once per output point. Overrunning the output array is a concern, but any values written should either be discarded or overwritten by subsequent patches. I'm probably going to put some slop space in the buffer when I integrate this into MoltenVK to be on the safe side.	2019-02-07 08:51:22 -06:00
Hans-Kristian Arntzen	d9ed3dcc7a	Merge pull request #848 from cdavis5e/capture-output-buffer MSL: Add a setting to capture vertex shader output to a buffer.	2019-02-07 15:11:41 +01:00
Chip Davis	056c0e207d	Take the vertex count from any indirect parameters passed. This is necessary to deal with indirect draws, where the draw parameters are given in a buffer instead of passed by the CPU. For normal draws, the draw parameters are set with Metal's `setVertexBytes:` method. This undoes the change to add the vertex count to the aux buffer, rendering that entire discussion largely moot. Oh well. It was a discussion that needed to happen anyway.	2019-02-06 15:17:14 -06:00
Chip Davis	546f1ccbb5	Test that out variables still work in leaf functions with capture on.	2019-02-06 10:49:25 -06:00
Hans-Kristian Arntzen	d5385190ff	Merge pull request #850 from KhronosGroup/fix-846 Support LUTs in single-function CFGs on Private storage class.	2019-02-06 11:34:30 +01:00
Hans-Kristian Arntzen	3e584f2c3f	Support LUTs in single-function CFGs on Private storage class. Fairly common pattern in unoptimized SPIR-V. Support this case as well.	2019-02-06 10:38:59 +01:00
Chip Davis	0757fae511	MSL: Stop passing the aux buffer around. Since we pass the component swizzle around now, there's no need to pass it to every function that takes a sampled image.	2019-02-05 20:04:32 -06:00
Chip Davis	c51e5b7911	MSL: Add a setting to capture vertex shader output to a buffer. This will be necessary to support transform feedback, as well as tessellation shaders.	2019-02-05 20:00:10 -06:00
Hans-Kristian Arntzen	fcbe999d99	MSL: Fix another test incompatibility.	2019-01-30 17:22:38 +01:00
Hans-Kristian Arntzen	b78ffa1cc7	Fixup MSL iOS test.	2019-01-30 16:26:41 +01:00
Hans-Kristian Arntzen	2ed171e525	GLSL/MSL: Implement 8-bit part of VK_KHR_shader_float16_int8. Storage was in place already, so mostly just dealing with bitcasts and constants. Simplies some of the bitcasting logic, and this exposed some bugs in the implementation. Refactor to use correct width integers with explicit bitcast opcodes.	2019-01-30 15:45:24 +01:00
Hans-Kristian Arntzen	4e7777c443	Update to latest glslang/SPIRV-Tools. Fix various bugs along the way.	2019-01-30 13:41:57 +01:00
Hans-Kristian Arntzen	3e09879131	Support initializers on StorageClassOutput.	2019-01-30 10:29:08 +01:00
Hans-Kristian Arntzen	912fde95f1	MSL: Use correct size for structs. Need to align the size of structs to the natural alignment.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	217eb5b5f9	MSL: Add a preliminary check for bad arrays of structs. ArrayStride can be larger than the declared struct size. We have no obvious solution for now, but warn about it in the MSL output for the time being.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	8c632da461	MSL: Use correct alignment rule for whole structs. Structs are aligned as you would expect in MSL (maximum member alignment), and it is not minimum 16 bytes like in std140. Also rename the dummy "pad" members to a reserved naming scheme.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	18a4accd2f	HLSL/MSL: Fix texture projection with Dref. We need to divide the Dref by q.	2019-01-28 10:25:13 +01:00
Hans-Kristian Arntzen	437fc87a89	MSL: Deal with resource name aliasing. Apparently we didn't use those yet. MSL seems to be able to alias struct types and variable types to a degree, so that's why it has escaped testing until now.	2019-01-18 16:27:57 +01:00
Hans-Kristian Arntzen	1040cf6cc1	Merge pull request #831 from cdavis5e/force-recompile-hooks MSL: Hoist fixup hooks in entry_point_args() out of the compile loop.	2019-01-17 19:42:05 +01:00
Chip Davis	f500d2f70c	MSL: Hoist fixup hooks in entry_point_args() out of the compile loop. Otherwise, in the event of a forced recompile, we could end up adding them twice.	2019-01-17 10:18:38 -06:00
Hans-Kristian Arntzen	3aa08f764e	MSL: Fix image load/store for short vectors. Same fixes as for GLSL.	2019-01-17 14:54:29 +01:00
Hans-Kristian Arntzen	432aaed737	Need to know the original packed type when unpacking loads.	2019-01-17 11:39:46 +01:00
Hans-Kristian Arntzen	f4b74f2f4f	Add basic test for std140 small vector arrays.	2019-01-17 11:29:09 +01:00
Hans-Kristian Arntzen	de7e5ccd8b	Refactor out packed expressions to extended decorations. Can't safely just cast to the original enum without lots of hacks.	2019-01-17 11:28:51 +01:00
Hans-Kristian Arntzen	f4026a5618	Refactor access_chain_internal to be more readable from callsite.	2019-01-17 10:30:13 +01:00
Hans-Kristian Arntzen	15b52bee48	Deal with packing/unpacking on store. Still a bit buggy, since we cannot deduce between float2[] and packed_float2. Need a deeper refactor to plumb this through ...	2019-01-17 10:06:23 +01:00
Hans-Kristian Arntzen	9e3a41ad00	Merge pull request #821 from cdavis5e/pass-sampled-images MSL: Fix passing a sampled image to a function.	2019-01-15 09:05:54 +01:00
Chip Davis	664df22d12	MSL: Fix passing a sampled image to a function. In the past, SPIRV-Cross threw an error in this case because it couldn't work out which swizzle from the auxiliary buffer needs to be passed. Now, we pass the swizzle around with the texture object, like a combined image-sampler and its associated sampler.	2019-01-14 09:29:31 -06:00
Hans-Kristian Arntzen	b8033d7525	MSL: Add option to pad fragment outputs. If not enough components are provided in the shader, the shader MSL compiler throws an error rather than make components undefined. This hurts portability, so we need to add explicit padding here.	2019-01-14 15:11:52 +01:00
Hans-Kristian Arntzen	7ee04936ac	MSL: Fix case where we pass arrays to functions by value. MSL does not support value semantics for arrays (sigh), so we need to force constant references and deal with copies if we have a different address space than what we end up guessing.	2019-01-14 11:00:14 +01:00
Hans-Kristian Arntzen	4394249b86	Merge pull request #815 from KhronosGroup/fix-811 HLSL: Support dual-source blending.	2019-01-11 12:52:26 +01:00
Hans-Kristian Arntzen	a365ff17bd	HLSL: Support dual-source blending.	2019-01-11 10:03:45 +01:00
Hans-Kristian Arntzen	d92de00cc1	Rewrite how IDs are iterated over. This is a fairly fundamental change on how IDs are handled. It serves many purposes: - Improve performance. We only need to iterate over IDs which are relevant at any one time. - Makes sure we iterate through IDs in SPIR-V module declaration order rather than ID space. IDs don't have to be monotonically increasing, which was an assumption SPIRV-Cross used to have. It has apparently never been a problem until now. - Support LUTs of structs. We do this by interleaving declaration of constants and struct types in SPIR-V module order. To support this, the ParsedIR interface needed to change slightly. Before setting any ID with variant_set<T> we let ParsedIR know that an ID with a specific type has been added. The surface for change should be minimal. ParsedIR will maintain a per-type list of IDs which the cross-compiler will need to consider for later. Instead of looping over ir.ids[] (which can be extremely large), we loop over types now, using: ir.for_each_typed_id<SPIRVariable>([&](uint32_t id, SPIRVariable &var) { handle_variable(var); }); Now we make sure that we're never looking at irrelevant types.	2019-01-10 12:52:56 +01:00
Hans-Kristian Arntzen	ddfd261776	Fix input array size in tessellation evaluation shaders.	2019-01-09 10:47:16 +01:00
Hans-Kristian Arntzen	5345756cab	MSL: Support composites inside I/O blocks I had to refactor the existing add_interface_block as it was getting extremely large. Now it's all split up into different readable functions.	2019-01-09 09:33:10 +01:00
Hans-Kristian Arntzen	9c47b2837e	Merge pull request #807 from cdavis5e/variable-pointers MSL: Support SPV_KHR_variable_pointers.	2019-01-09 09:07:43 +01:00
Chip Davis	d6aa911156	Flush all variables after storing through a variable pointer. Since we can't know which variable was modified, we therefore have to conservatively assume that any variable might have been modified.	2019-01-08 15:16:33 -06:00
Chip Davis	3394f53734	MSL: Fix mapping of identity-swizzled components. Before, if any component was not identity-mapped, those components that were still identity-mapped were set to 0. Now we properly leave them alone.	2019-01-07 11:20:13 -06:00
Chip Davis	3bfb2f94d4	MSL: Support SPV_KHR_variable_pointers. This allows shaders to declare and use pointer-type variables. Pointers may be loaded and stored, be the result of an `OpSelect`, be passed to and returned from functions, and even be passed as inputs to the `OpPhi` instruction. All types of pointers may be used as variable pointers. Variable pointers to storage buffers and workgroup memory may even be loaded from and stored to, as though they were ordinary variables. In addition, this enables using an interior pointer to an array as though it were an array pointer itself using the `OpPtrAccessChain` instruction. This is a rather large and involved change, mostly because this is somewhat complicated with a lot of moving parts. It's a wonder SPIRV-Cross's output is largely unchanged. Indeed, many of these changes are to accomplish exactly that! Perhaps the largest source of changes was the violation of the assumption that, when emitting types, the pointer type didn't matter. One of the test cases added by the change doesn't optimize very well; the output of `spirv-opt` here is invalid SPIR-V. I need to file a bug with SPIRV-Tools about this. I wanted to test that variable pointers to images worked too, but I couldn't figure out how to propagate the access qualifier properly--in MSL, it's part of the type, so getting this right is important. I've punted on that for now.	2019-01-07 11:19:10 -06:00
Hans-Kristian Arntzen	d4926a0405	Deal with phi copies which happen inside continue blocks.	2019-01-07 14:24:07 +01:00
Hans-Kristian Arntzen	c8ddf7e7d5	Fix case where OpPhi is used to swap values.	2019-01-07 13:54:16 +01:00
Hans-Kristian Arntzen	cacfeef89e	Merge pull request #804 from KhronosGroup/fix-788 Forward meta information in OpCompositeExtract.	2019-01-07 11:43:43 +01:00
Hans-Kristian Arntzen	66263d4569	Forward meta information in OpCompositeExtract. Just like OpAccessChain we need to make use of the meta information available to use from access_chain_internal as we can extract a packed vector or transposed vector from a composite, not just memory load.	2019-01-07 10:43:55 +01:00
Hans-Kristian Arntzen	649ce3c7bb	MSL: Workaround missing gradient2d() for sampler_compare.	2019-01-07 10:01:00 +01:00
Hans-Kristian Arntzen	211abfb7ef	Merge pull request #799 from KhronosGroup/fix-780 Use correct block-name / other-name aliasing rules.	2019-01-04 16:08:10 +01:00
Hans-Kristian Arntzen	9728f9c1b7	Use correct block-name / other-name aliasing rules. A block name cannot alias with any name in its own scope, and it cannot alias with any other "global" name. To solve this, we need to complicate the name cache updates a little bit where we have a "primary" namespace and "secondary" namespace.	2019-01-04 15:02:54 +01:00
Hans-Kristian Arntzen	acae607703	Register implied expression reads in OpLoad/OpAccessChain. This is required to avoid relying on complex sub-expression elimination in compilers, and generates cleaner code. The problem case is if a complex expression is used in an access chain, like: Composite comp = buffer[texture(...)]; vec4 a = comp.a + comp.b + comp.c; Before, we did not have common subexpression tracking for OpLoad/OpAccessChain, so we easily ended up with code like: vec4 a = buffer[texture(...)].a + buffer[texture(...)].b + buffer[texture(...)].c; A good compiler will optimize this, but we should not rely on it, and forcing texture(...) to a temporary also looks better. The solution is to add a vector "implied_expression_reads", which works similarly to expression_dependencies. We also need an extra mechanism in to_expression which lets us skip expression read checking and do it later. E.g. for expr -> access chain -> load, we should only trigger a read of expr when using the loaded expression.	2019-01-04 14:56:12 +01:00
Hans-Kristian Arntzen	61f1d8b2cf	Support gl_HelperInvocation on GLSL and MSL. There is no obvious builtin for this on HLSL.	2018-11-28 15:18:43 +01:00
Hans-Kristian Arntzen	04f410d35c	Fix unsigned switch case selectors.	2018-11-26 10:36:50 +01:00
Hans-Kristian Arntzen	816c1167ce	Handle invariant decoration more robustly. Avoids certain cases of variance between translation units by forcing every dependent expression of a store to be temporary. Should avoid the major failure cases where invariance matters.	2018-11-22 11:55:57 +01:00
Hans-Kristian Arntzen	fa42ed37ba	Parser: Handle group decorations.	2018-11-15 10:51:52 +01:00
Hans-Kristian Arntzen	510e1475c6	Merge pull request #756 from cdavis5e/relaxed-block-layout-2 MSL: Also pack 2- and 4- element vectors when necessary.	2018-11-15 10:09:09 +01:00
Chip Davis	a5882da091	Test loading from and storing to packed vectors.	2018-11-14 10:47:20 -06:00
Chip Davis	6d675ae6a2	Correct carry/borrow bit checks. Don't use `addsat()`/`subsat()`; that'll erroneously flag cases where the sum is exactly the maximum integer value, or the difference is exactly 0. Also, correct the condition for the `select()` function; it's basically `mix()` with a boolean factor. (What was I thinking?)	2018-11-14 10:13:56 -06:00
Chip Davis	cf2a890e4f	MSL: Support extended arithmetic opcodes.	2018-11-13 17:33:03 -06:00
Chip Davis	bed4918cb5	MSL: Also pack 2- and 4- element vectors when necessary. This is also needed for `VK_KHR_relaxed_block_layout` support.	2018-11-13 17:31:47 -06:00
Hans-Kristian Arntzen	2a8a4fe706	GLSL: Support extended arithmetic opcodes. - uaddCarry - usubBorrow - umulExtended - imulExtended	2018-11-13 14:50:46 +01:00
Hans-Kristian Arntzen	d6be21543d	MSL: Split out early_fragment_tests. Was causing compilation failures, jumped the merge a bit too soon.	2018-11-12 16:20:49 +01:00
Hans-Kristian Arntzen	4e5c8d7199	Deal with depth_greater/depth_less qualifiers. Adds support on HLSL SM 5.0, and fixes bug on GLSL. Makes sure early fragment tests is tested on MSL as well.	2018-11-12 10:35:36 +01:00
Hans-Kristian Arntzen	b778e16e48	HLSL: Complete support for combined image samplers in legacy.	2018-11-12 09:59:39 +01:00
Connor McLaughlin	1dd676c1de	MSL: Emit wrapper for SSign (sign() for int types) Metal does not define the sign() function for integer types, only floating-point types.	2018-11-08 13:08:34 +10:00
Chip Davis	e50eecfeeb	MSL: Also pack members at unaligned offsets. This is necessary to support `VK_KHR_relaxed_block_layout`.	2018-11-07 09:42:54 -06:00
Chip Davis	0d949e11ff	Support bitcasts of 16-bit types.	2018-11-05 14:56:36 -06:00
Chip Davis	ca4744ab72	Support constants of 16-bit integral type in GLSL and MSL. Constants of 8-bit type aren't supported in GLSL, since there's no extension letting you use them.	2018-11-02 14:39:55 -05:00
Chip Davis	117ccf407c	Use specific base types for 8- and 16-bit integers.	2018-11-01 17:45:10 -05:00
Chip Davis	1fb27b4cda	Add support for 8- and 16-bit types to GLSL and MSL. In GLSL, 8-bit types require GL_EXT_shader_8bit_storage. 16-bit types can use either GL_AMD_gpu_shader_int16/GL_AMD_gpu_shader_half_float or GL_EXT_shader_16bit_storage.	2018-11-01 10:20:57 -05:00
Hans-Kristian Arntzen	480acdad18	Deal with OpSpecConstantOp used as array size. When trying to validate buffer sizes, we usually need to bail out when using SpecConstantOps, but for some very specific cases where we allow unsized arrays currently, we can safely allow "unknown" sized arrays as well. This is probably the best we can do, when we have even more difficult cases than this, we throw a more sensible error message.	2018-11-01 14:58:02 +01:00

1 2 3 4 5 ...

760 Commits