SPIRV-Cross

Author	SHA1	Message	Date
Chip Davis	01c491648b	Fix a copy-pasto.	2019-04-26 17:16:21 -05:00
Michael Barriault	d6754c5713	Fix tests for device->constant address space change in MSL tessellation control shader generation.	2019-04-10 18:37:04 +01:00
Hans-Kristian Arntzen	0909975655	MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]]. Avoids ugly warnings on nearly every compute shader. We could do analysis to detect whether we need to emit this constant, but it's a bit tedious to figure out if an OpConstantComponent is actually used by opcodes, so just make it simple.	2019-03-28 10:54:18 +01:00
Hans-Kristian Arntzen	ee395afa83	MSL: Emit proper name for optimized UBO/SSBO arrays.	2019-02-25 11:09:00 +01:00
Hans-Kristian Arntzen	ad6134262e	Merge pull request #877 from cdavis5e/msl-tesc-early-return MSL: Return early from helper tesc invocations.	2019-02-25 09:13:06 +01:00
Chip Davis	a43dcd7b99	MSL: Return early from helper tesc invocations. Return after loading the input control point array if there are more input points than output points, and this was one of the helper invocations spun off to load the input points. I was hesitant to do this initially, since the MSL spec has this to say about barriers: > The `threadgroup_barrier` (or `simdgroup_barrier`) function must be > encountered by all threads in a threadgroup (or SIMD-group) executing > the kernel. That is, if any thread executes the barrier, then all threads must execute it, or the barrier'd invocations will hang. But, the key words here seem to be "executing the kernel;" inactive invocations, those that have already returned, need not encounter the barrier to prevent hangs. Indeed, I've encountered no problems from doing this, at least on my hardware. This also fixes a few CTS tests that were failing due to execution ordering; apparently, my assumption that the later, invalid data written by the helpers would get overwritten was wrong.	2019-02-24 12:17:47 -06:00
Chip Davis	f3c0942d10	MSL: Use vectors for the tessellation level builtins in tese shaders. The tessellation levels in Metal are stored as a densely-packed array of half-precision floating point values. But, stage-in attributes in Metal have to have offsets and strides aligned to a multiple of four, so we can't add them individually. Luckily for us, the arrays have lengths less than 4. So, let's use vectors for them! Triangles get a single attribute with a `float4`, where the outer levels are in `.xyz` and the inner levels are in `.w`. The arrays are unpacked as though we had added the elements individually. Quads get two: a `float4` with the outer levels and a `float2` with the inner levels. Further, since vectors can be indexed as arrays, there's no need to unpack them in this case. This also saves on precious vertex attributes. Before, we were using up to 6 of them. Now we need two at most.	2019-02-22 12:18:51 -06:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Chip Davis	7a7e210515	MSL: Force unnamed array builtin attributes to have a name. That way, when we refer to them, they'll have the name that we're expecting.	2019-02-20 22:16:51 -06:00
Chip Davis	8095434dc4	MSL: Drop stores to nonexistent tess levels. In SPIR-V, there are always two inner levels and four outer levels, even if the input patch isn't a quad patch. But in MSL, due to requirements imposed by Metal, only one inner level and three outer levels exist when the input patch is a triangle patch. We must explicitly ignore any write to the nonexistent second inner and fourth outer levels in this case.	2019-02-20 09:11:24 -06:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Chip Davis	eb89c3a428	MSL: Add support for tessellation control shaders. These are transpiled to kernel functions that write the output of the shader to three buffers: one for per-vertex varyings, one for per-patch varyings, and one for the tessellation levels. This structure is mandated by the way Metal works, where the tessellation factors are supplied to the draw method in their own buffer, while the per-patch and per-vertex varyings are supplied as though they were vertex attributes; since they have different step rates, they must be in separate buffers. The kernel is expected to be run in a workgroup whose size is the greater of the number of input or output control points. It uses Metal's support for vertex-style stage input to a compute shader to get the input values; therefore, at least one instance must run per input point. Meanwhile, Vulkan mandates that it run at least once per output point. Overrunning the output array is a concern, but any values written should either be discarded or overwritten by subsequent patches. I'm probably going to put some slop space in the buffer when I integrate this into MoltenVK to be on the safe side.	2019-02-07 08:51:22 -06:00
Hans-Kristian Arntzen	3e584f2c3f	Support LUTs in single-function CFGs on Private storage class. Fairly common pattern in unoptimized SPIR-V. Support this case as well.	2019-02-06 10:38:59 +01:00
Hans-Kristian Arntzen	3e09879131	Support initializers on StorageClassOutput.	2019-01-30 10:29:08 +01:00
Hans-Kristian Arntzen	217eb5b5f9	MSL: Add a preliminary check for bad arrays of structs. ArrayStride can be larger than the declared struct size. We have no obvious solution for now, but warn about it in the MSL output for the time being.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	8c632da461	MSL: Use correct alignment rule for whole structs. Structs are aligned as you would expect in MSL (maximum member alignment), and it is not minimum 16 bytes like in std140. Also rename the dummy "pad" members to a reserved naming scheme.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	437fc87a89	MSL: Deal with resource name aliasing. Apparently we didn't use those yet. MSL seems to be able to alias struct types and variable types to a degree, so that's why it has escaped testing until now.	2019-01-18 16:27:57 +01:00
Hans-Kristian Arntzen	3aa08f764e	MSL: Fix image load/store for short vectors. Same fixes as for GLSL.	2019-01-17 14:54:29 +01:00
Hans-Kristian Arntzen	432aaed737	Need to know the original packed type when unpacking loads.	2019-01-17 11:39:46 +01:00
Hans-Kristian Arntzen	de7e5ccd8b	Refactor out packed expressions to extended decorations. Can't safely just cast to the original enum without lots of hacks.	2019-01-17 11:28:51 +01:00
Hans-Kristian Arntzen	f4026a5618	Refactor access_chain_internal to be more readable from callsite.	2019-01-17 10:30:13 +01:00
Hans-Kristian Arntzen	15b52bee48	Deal with packing/unpacking on store. Still a bit buggy, since we cannot deduce between float2[] and packed_float2. Need a deeper refactor to plumb this through ...	2019-01-17 10:06:23 +01:00
Hans-Kristian Arntzen	d92de00cc1	Rewrite how IDs are iterated over. This is a fairly fundamental change on how IDs are handled. It serves many purposes: - Improve performance. We only need to iterate over IDs which are relevant at any one time. - Makes sure we iterate through IDs in SPIR-V module declaration order rather than ID space. IDs don't have to be monotonically increasing, which was an assumption SPIRV-Cross used to have. It has apparently never been a problem until now. - Support LUTs of structs. We do this by interleaving declaration of constants and struct types in SPIR-V module order. To support this, the ParsedIR interface needed to change slightly. Before setting any ID with variant_set<T> we let ParsedIR know that an ID with a specific type has been added. The surface for change should be minimal. ParsedIR will maintain a per-type list of IDs which the cross-compiler will need to consider for later. Instead of looping over ir.ids[] (which can be extremely large), we loop over types now, using: ir.for_each_typed_id<SPIRVariable>([&](uint32_t id, SPIRVariable &var) { handle_variable(var); }); Now we make sure that we're never looking at irrelevant types.	2019-01-10 12:52:56 +01:00
Chip Davis	d6aa911156	Flush all variables after storing through a variable pointer. Since we can't know which variable was modified, we therefore have to conservatively assume that any variable might have been modified.	2019-01-08 15:16:33 -06:00
Chip Davis	3bfb2f94d4	MSL: Support SPV_KHR_variable_pointers. This allows shaders to declare and use pointer-type variables. Pointers may be loaded and stored, be the result of an `OpSelect`, be passed to and returned from functions, and even be passed as inputs to the `OpPhi` instruction. All types of pointers may be used as variable pointers. Variable pointers to storage buffers and workgroup memory may even be loaded from and stored to, as though they were ordinary variables. In addition, this enables using an interior pointer to an array as though it were an array pointer itself using the `OpPtrAccessChain` instruction. This is a rather large and involved change, mostly because this is somewhat complicated with a lot of moving parts. It's a wonder SPIRV-Cross's output is largely unchanged. Indeed, many of these changes are to accomplish exactly that! Perhaps the largest source of changes was the violation of the assumption that, when emitting types, the pointer type didn't matter. One of the test cases added by the change doesn't optimize very well; the output of `spirv-opt` here is invalid SPIR-V. I need to file a bug with SPIRV-Tools about this. I wanted to test that variable pointers to images worked too, but I couldn't figure out how to propagate the access qualifier properly--in MSL, it's part of the type, so getting this right is important. I've punted on that for now.	2019-01-07 11:19:10 -06:00
Hans-Kristian Arntzen	66263d4569	Forward meta information in OpCompositeExtract. Just like OpAccessChain we need to make use of the meta information available to use from access_chain_internal as we can extract a packed vector or transposed vector from a composite, not just memory load.	2019-01-07 10:43:55 +01:00
Hans-Kristian Arntzen	9728f9c1b7	Use correct block-name / other-name aliasing rules. A block name cannot alias with any name in its own scope, and it cannot alias with any other "global" name. To solve this, we need to complicate the name cache updates a little bit where we have a "primary" namespace and "secondary" namespace.	2019-01-04 15:02:54 +01:00
Chip Davis	a5882da091	Test loading from and storing to packed vectors.	2018-11-14 10:47:20 -06:00
Chip Davis	bed4918cb5	MSL: Also pack 2- and 4- element vectors when necessary. This is also needed for `VK_KHR_relaxed_block_layout` support.	2018-11-13 17:31:47 -06:00
Chip Davis	e50eecfeeb	MSL: Also pack members at unaligned offsets. This is necessary to support `VK_KHR_relaxed_block_layout`.	2018-11-07 09:42:54 -06:00
Hans-Kristian Arntzen	6157bf3cae	Add Windows support in Travis CI. - Add new Windows support - Use CMake/CTest instead of Make + shell scripts - Use --parallel in CTest - Fix CTest on Windows - Cleanups in test_shaders.py - Force specific commit for SPIRV-Headers - Fix Inf/NaN odd-ball case by moving to ASM	2018-10-27 00:22:30 +02:00
Chip Davis	47089a48a0	Make the test case a lot simpler.	2018-10-04 11:26:46 -05:00
Chip Davis	9919fbbe0d	MSL: Handle OpImage on OpSampledImage expressions. I have seen this happen. The included test case is one such case.	2018-10-03 11:48:46 -05:00
Hans-Kristian Arntzen	af75ef005f	Update glslang and SPIRV-Tools. A lot of changes in spirv-opt output. Some new invalid SPIR-V was found but most of them were not significant for SPIRV-Cross, so just marked them as invalid.	2018-09-27 11:10:22 +02:00
Hans-Kristian Arntzen	a77880787d	Merge pull request #698 from KhronosGroup/fix-695 MSL: Support global I/O block and struct Input/Output usage.	2018-09-17 14:54:58 +02:00
Hans-Kristian Arntzen	1bbb4032c8	Merge pull request #693 from cdavis5e/msl-atomic-inc-dec MSL: Fix OpAtomicIIncrement and OpAtomicIDecrement.	2018-09-13 16:19:27 +02:00
Hans-Kristian Arntzen	d310060f92	MSL: Support global I/O block and struct Input/Output usage. Implement this by flattening outputs and unflattening inputs explicitly. This allows us to pass down a single struct instead of dealing with the insanity that would be passing down each flattened member separately. Remove stage_uniforms_var_id. Seems to be dead code. Naked uniforms do not exist in SPIR-V for Vulkan, which this seems to have been intended for. It was also unused elsewhere.	2018-09-13 16:04:24 +02:00
Chip Davis	986345c754	Fix tests for changes to my last patch.	2018-09-12 09:43:12 -05:00
Hans-Kristian Arntzen	38d19821d4	MSL: Support copying array of arrays.	2018-09-12 09:54:55 +02:00
Chip Davis	41eb5c43b5	MSL: Fix OpAtomicIIncrement and OpAtomicIDecrement. We were passing a constant '1' to `emit_atomic_func_op()`--which caused us to refer to SPIR-V value `%1`, which is almost certainly not what we want! What we really want is to add/subtract the literal constant '1' to/from the memory location.	2018-09-11 17:29:54 -05:00
Hans-Kristian Arntzen	403011e973	Merge pull request #684 from cdavis5e/msl-builtin-vector-cast MSL: Cast uses of builtin vectors to their declared SPIR-V type.	2018-09-11 19:59:58 +02:00
Chip Davis	6757ef8512	Use bitcast_to_builtin_load() instead of hacking to_expression(). This only affects the builtin when it is used, and not when it's passed to a function. It's a lot cleaner than the way I was doing it before. Remove the `to_expression()` hack.	2018-09-11 11:15:17 -05:00
Chip Davis	acb3fac747	Opt for a simple value cast in lieu of a bitcast.	2018-09-10 14:05:36 -05:00
Hans-Kristian Arntzen	b114889102	Only declare typed initializer list for non-array types. Also, cleanup now redundant constant_expression virtualization for MSL.	2018-09-10 10:04:17 +02:00
Chip Davis	f7dad9da66	MSL: Cast uses of builtin vectors to their declared SPIR-V type. In SPIR-V, builtin integral vectors can be either signed or unsigned, but in MSL they're always unsigned. Unfortunately, the MSL spec forbids implicit conversions between vector types--even if the corresponding scalar types would implicitly convert. If you try, the result is a cryptic error message such as: ``` program_source:37:60: error: cannot convert between vector values of different size ('int4' (aka 'vector_int4') and 'vector_uint4' (vector of 4 'unsigned int' values)) float4 r3 = as_type<float4>((as_type<int4>(r0) * gl_LocalInvocationID.xyyy) + as_type<int4>(r2)); ~~~~~~~~~~~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~ ``` Therefore, uses of these builtins must be explicitly cast, since the rest of the binary likely assumes that the builtin is of its declared type.	2018-09-08 21:17:54 -05:00
Chip Davis	4b99fdd5d0	MSL: Account for components when assigning locations to varyings. Two varyings (vertex outputs/fragment inputs) might have the same location but be in different components--e.g. the compiler may have packed what were two different varyings into a single varying vector. Giving both varyings the same `[[user]]` attribute won't work--it may yield unexpected results, or flat out fail to link. We could eventually pack such varyings into a single vector, but that would require us to handle the case where the varyings are different types--e.g. a `float` and a `uint` packed into the same vector. For now, it seems most prudent to give them unique `[[user]]` locations and let Apple's compiler work out the best way to pack them.	2018-09-06 13:52:33 -05:00
Chip Davis	9e6469bd40	MSL: Handle interpolation qualifiers.	2018-09-05 12:02:07 -05:00
Chip Davis	680ef9d773	MSL: Correct number of words to skip in OpImageWrite. The length field in `Instruction` doesn't include the initial opcode/length word. We only need to skip three words instead of four.	2018-09-05 10:02:25 -05:00
Chip Davis	9fbe39c9c0	MSL: Emit spvTexelBufferCoord() on ImageWrite to a Buffer as well. This is necessary to get the coordinates to give to the texture's `write()` method.	2018-09-04 12:14:34 -05:00
Chip Davis	27af716c3a	MSL: Emit F{Min,Max,Clamp} as fast:: and N{Min,Max,Clamp} as precise::. This roughly matches their semantics in SPIR-V and MSL. For `FMin`, `FMax`, and `FClamp`, and the Metal functions `fast::min()`, `fast::max()`, and `fast::clamp()`, the result is undefined if any operand is NaN. For the 'N' operations and their corresponding MSL `precise::` functions, the result is consistent with IEEE 754 (first non-NaN wins; result is NaN if all operands are NaN). We can only do this with 32-bit floats, though, because Metal only provides these variants for `float`. `half` only has one variant of these functions that is presumably consistent with IEEE 754. I guess that's OK; the SPIR-V spec only says that `F{Min,Max,Clamp}` are undefined for NaNs. Performance might suffer, though.	2018-09-01 23:01:46 -05:00

1 2

97 Commits