SPIRV-Cross

Author	SHA1	Message	Date
Mark Satterthwaite	69b703f1da	Add an option to SPIRV-Cross to enforce invariant floating point math to prevent different depth calculation between prepass & basepass when running on Metal 2.0 and earlier.	2019-09-12 08:35:15 -04:00
Mark Satterthwaite	842e8c48c6	Provide the Metal bindings as part of the options structure as that is more convenient.	2019-09-10 13:24:59 -04:00
Mark Satterthwaite	9e54a8dd7b	Slight modifications to IAB support for Metal output, so that the caller can specify an offset for the IAB start index, as for HLSL shaders UAVs need to occupy slots 0-7. The runtime support for SSBO robustness is also much simpler if the buffer size block is at index 0. Change made in two parts. 1. Allow the caller to specify the Metal translation should use argument buffers. 2. Move this to the front of IABs for convenience of the runtime.	2019-09-10 13:09:49 -04:00
Mark Satterthwaite	d9f3576305	Metal doesn't automatically enforce robust access to buffers unlike other APIs, so for storage-buffers, which become raw T* buffers in Metal, we need to fetch the buffer size and clamp the access to a valid index within the buffer ourselves. This is essential for shaders converted from HLSL which expects all resource access to be robust, though this implementation is technically different to the HLSL specification of return-0 for OOB reads, ignore OOB writes.	2019-09-10 12:32:32 -04:00
Mark Satterthwaite	9ce3158193	When compiling from HLSL which pads and aligns float[]/float2[] within structures to float4[] we need to unpack the original type in Metal from the float4.	2019-09-10 11:21:43 -04:00
Mark Satterthwaite	42b8a62870	Fixes to the generation of Metal tessellation shaders from SPIRV so that it works correctly in more complicated cases. First, when generating from HLSL before invoking the code that comes from the HLSL patch-function a control-flow and full memory-barrier are required to ensure that all the temporary values in thread-local storage for the patch are available. Second, the inputs to control and evaluation shaders must be properly forwarded from the global variables in SPIRV to the member variables in the relevant input structure. Finally when arrays of interpolators are used for input or output we need to add an extra level of array indirection because Metal works at a different granularity than SPIRV. Five parts. 1. Fix tessellation patch function processing. 2. Fix loads from tessellation control inputs not being forwarded to the gl_in structure array. 3. Fix loads from tessellation evaluation inputs not being forwarded to the stage_in structure array. 4. Workaround SPIRV losing an array indirection in tessellation shaders - not the best solution but enough to keep things progressing. 5. Apparently gl_TessLevelInner/Outer is special and needs to not be placed into the input array.	2019-09-10 10:37:07 -04:00
Mark Satterthwaite	de6441af88	Work-around HLSL using zero-based InstanceID and VertexID variables, but SPIRV, like Metal, includes BaseInstance & BaseVertex. Until this can be fixed in DXC, which is really the proper place to solve this, we can decrement InstanceID & VertexID when the source is HLSL. Made in two parts. 1. Handle HLSL-style 0-based vertex/instance index. 2. We zero-base the InstanceID & VertexID variables for HLSL emulation elsewhere, so don't do it twice.	2019-09-09 16:55:59 -04:00
Mark Satterthwaite	97a66ff906	On iOS sub-passes can be implemented using the frame-buffer fetch API which is much more efficient than binding the textures. Change was made in three parts. 1. Use Metal's native frame-buffer fetch API for subpass inputs. 2. Make sure that frame-buffer-fetch is only available on iOS. 3. Default to using Metal's native frame-buffer fetch for subpass inputs on iOS.	2019-09-09 15:02:11 -04:00
Mark Satterthwaite	32557e9093	SPIRV doesn't distinguish depth textures from regular textures, but Metal does, so if we've ever seen a depth comparison operation we must ensure that the texture is specified as a depth-texture.	2019-09-06 16:58:27 -04:00
Mark Satterthwaite	5e8590a23d	Emulate texture atomics in Metal by binding the underlying buffer that backs the resource to a separate binding point and using that for Metal's atomic operations. This will work with texture_buffer and texture2d created from an MTLBuffer, so is perfect for emulating HLSL atomics on RWBuffer and sufficient, but not ideal, for RWTexture2D with some restrictions (limited format support and can't be used for render-targets).	2019-09-05 15:13:28 -04:00
Mark Satterthwaite	239e04762b	Support Metal 2.1's texture_buffer type which is the equivalent to HLSL's Buffer/RWBuffer, so doesn't require modifying buffer sizes to match alignments.	2019-09-05 14:46:15 -04:00
Mark Satterthwaite	8596bf5ee2	In order to use Metal shader libraries properly you have to ensure that you have no duplicated global symbol names for different entities, otherwise 'metallib' won't be able to combine multiple shaders into a single library. This is broken into two parts. 1. Constant arrays of non-primitive types (i.e. matrices) won't link properly into Metal libraries. 2. Metal helper functions must be static force-inline otherwise they will cause problems when linked together in a single Metallib.	2019-09-05 14:39:06 -04:00
Mark Satterthwaite	d50659af92	Rework the way arrays are handled in Metal to remove the array copies as they are unnecessary from Metal 1.2. There were cases where copies were not being inserted and others appeared unncessary, using the template type should allow the 'metal' compiler to do the best possible optimisation. The changes are broken into three stages. 1. Allow Metal to use the array<T> template to make arrays a value type. 2. Force the use of C style array declaration for some cases which cannot be wrapped with a template. 3. Threadgroup arrays can't have a wrapper type. 4. Tweak the code to use unsafe_array in a few more places so that we can handle passing arrays of resources into the shader and then through shaders into sub-functions. 5. Handle packed matrix types inside arrays within structs. 6. Make sure that builtin arguments still retain their array qualifiers when used in leaf functions. 7. Fix declaration of array-of-array constants for Metal so we can use the array<T> template.	2019-09-05 12:39:44 -04:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Hans-Kristian Arntzen	3ccfbce264	Run format_all.sh.	2019-08-28 14:25:26 +02:00
Hans-Kristian Arntzen	9436cd3036	MSL: Deal with array copies from and to threadgroup.	2019-08-27 13:18:01 +02:00
Chip Davis	df18d98bea	MSL: Unify the get_*_address_space() methods. These methods have largely the same logic, with minor differences. That I felt compelled to duplicate the logic into another method was one of the things that bothered me about the variable pointers change. This cleans that part of the code up; now we don't have two places to change.	2019-07-26 09:43:28 -05:00
Chip Davis	fb5ee4cb5c	MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase(). This command allows the caller to set the base value of `BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal provides no direct support for this... but it does provide a builtin, `[[grid_origin]]`, normally used to pass the base values for the stage input region, which we will now abuse to pass the dispatch base and avoid burning a buffer binding. `[[grid_origin]]`, as part of Metal's support for compute stage input, requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer. (Curiously, this builtin was undocumented until the MSL 2.2 release. Go figure.)	2019-07-24 08:56:15 -05:00
Hans-Kristian Arntzen	5c1cb7accf	Recursively pack struct types when we find scalar packed structs.	2019-07-23 15:24:53 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	14afb968dd	Correctly unpack row-major matrices when storing to LHS.	2019-07-22 12:03:12 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	e90d816cdd	Deal with scalar layout of entire structs. Mark all candidate struct types.	2019-07-19 14:18:14 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	27b75c2c5a	Deal with all forms of matrix writes ...	2019-07-19 12:53:10 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	b09b8d3fa9	Deal more cleanly with matrices and row-major.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	c160d5227f	Reintroduce struct_member_* MSL queries. Need to remap to physical type + packed qualifier, and this is handy to do in a helper function.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	c7eda1bce9	Test glsl.std450 more exhaustively. Make sure to test everything with scalar as well to catch any weird edge cases. Not all opcodes are covered here, just the arithmetic ones. FP64 packing is also ignored.	2019-07-17 11:53:05 +02:00
Hans-Kristian Arntzen	33d2bbcf69	Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:31 +02:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	ca91fcfe5f	MSL: Support the SPV_AMD_shader_trinary_minmax extension. This requires MSL 2.1.	2019-07-13 16:43:57 -05:00
Chip Davis	058f1a0933	MSL: Handle coherent, volatile, and restrict. This maps them to their MSL equivalents. I've mapped `Coherent` to `volatile` since MSL doesn't have anything weaker than `volatile` but stronger than nothing. As part of this, I had to remove the implicit `volatile` added for atomic operation casts. If the buffer is already `coherent` or `volatile`, then we would add a second `volatile`, which would be redundant. I think this is OK even when the buffer doesn't have `coherent`: `T ` is implicitly convertible to `volatile T `, but not vice-versa. It seems to compile OK at any rate. (Note that the non-`volatile` overloads of the atomic functions documented in the spec aren't present in the MSL 2.2 stdlib headers.) `restrict` is tricky, because in MSL, as in C++, it needs to go after the asterisk or ampersand for the pointer type it's modifying. Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory models, `Restrict` is the default (i.e. does not need to be specified); but MSL likely follows the `OpenCL` model where `Aliased` is the default. We probably need to implicitly set either `Restrict` or `Aliased` depending on the module's declared memory model.	2019-07-11 10:22:30 -05:00
Chip Davis	28454facbb	MSL: Handle packed matrices. The old method of using a different unpacked matrix type doesn't work for scalar alignment. It certainly wouldn't have any effect for a square matrix, since the number of columns and rows are the same. So now we'll store them as arrays of packed vectors.	2019-07-10 18:37:31 -05:00
Chip Davis	e5fa7edfd6	MSL: Support scalar block layout. Relaxed block layout relaxed the restrictions on vector alignment, allowing them to be aligned on scalar boundaries. Scalar block layout relaxes this further, allowing any member to be aligned on a scalar boundary. The requirement that a vector not improperly straddle a 16-byte boundary is also relaxed. I've also added a test showing that `std430` layout works with UBOs. I'm troubled by the dual meaning of the `Packed` extended decoration. In some instances (struct, `float[]`, and `vec2[]` members), it actually means the exact opposite, that the member needs extra padding. This is especially problematic for `vec2[]`, because now we need to distinguish the two cases by checking the array stride. I wonder if this should actually be split into two decorations.	2019-07-09 20:59:32 -05:00
Hans-Kristian Arntzen	909040e2eb	MSVC 2013: Work around another compiler bug with array init.	2019-07-09 15:31:01 +02:00
Hans-Kristian Arntzen	041f103d44	MSL/HLSL: Support scalar reflect and refract.	2019-07-03 12:31:52 +02:00
Chip Davis	7eecf5a46b	MSL: Support SPV_KHR_multiview. This is needed to support `VK_KHR_multiview`, which is in turn needed for Vulkan 1.1 support. Unfortunately, Metal provides no native support for this, and Apple is once again less than forthcoming, so we have to implement it all ourselves. Tessellation and geometry shaders are deliberately unsupported for now. The problem is that the current implementation encodes the `ViewIndex` as part of the `InstanceIndex`, which in the SPIR-V environment at least only exists in the vertex shader. So we need to work out a way to pass the view index along to the later stages. This implementation runs vertex shaders for all views up to the highest bit set in the view mask, even those whose bits are clear. The fragments for the inactive views are then discarded. Avoiding this is difficult: calculating the view indices becomes far more complicated if we can only run for those views which are set in the mask.	2019-06-29 09:43:55 -05:00
Hans-Kristian Arntzen	c76b99b711	Handle more cases with FP16 and texture sampling.	2019-06-27 15:04:22 +02:00
Hans-Kristian Arntzen	45805857e5	MSL: De-virtualize get_declared_struct_member_size. It does not make sense to use a virtual call in the Compiler base class here. Make it clearer by renaming the MSL-specific version to _msl.	2019-06-26 19:11:38 +02:00
Hans-Kristian Arntzen	048f2380f3	MSL: Support custom bindings for argument buffer itself.	2019-06-24 11:10:20 +02:00
Hans-Kristian Arntzen	b4e0163749	Run format_all.sh.	2019-06-21 16:02:22 +02:00
Hans-Kristian Arntzen	3a4a9acac9	MSL: Add C API for querying automatic resource bindings.	2019-06-21 13:19:59 +02:00
Hans-Kristian Arntzen	e2c95bdcbc	MSL: Rewrite how resource indices are fallback-assigned. We used to use the Binding decoration for this, but this method is hopelessly broken. If no explicit MSL resource remapping exists, we remap automatically in a manner which should always "just work".	2019-06-21 12:54:08 +02:00
Hans-Kristian Arntzen	7fdb418f18	Merge pull request #1028 from KhronosGroup/fix-1010 MSL: Support barycentrics and PrimitiveID in fragment shaders	2019-06-19 15:29:14 +02:00
Hans-Kristian Arntzen	2e1cee5e1e	MSL: Support PrimitiveID in fragment and barycentrics.	2019-06-19 09:52:35 +02:00
Hans-Kristian Arntzen	f171d82590	MSL: Support MinLod operand.	2019-06-19 09:43:03 +02:00

1 2 3 4 5

238 Commits