SPIRV-Cross

Author	SHA1	Message	Date
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Hans-Kristian Arntzen	57c93d44ac	GLSL: Add option to force flattening IO blocks. It is not always desirable to use actual blocks. A prime example is the case where EXT_shader_io_blocks is not supported on the target implementation.	2020-07-28 15:16:06 +02:00
Tomek Ponitka	18f23c47d9	Enabling setting a fixed sampleMask in Metal fragment shaders. In Metal, render pipelines don't have an option to set a sampleMask parameter; the only way to get that functionality is to set the sample_mask output of the fragment shader to this value directly. We also need to take care to combine the fixed sample mask with the one that the shader might possibly output.	2020-07-24 11:19:46 +02:00
Chip Davis	688c5fcbda	MSL: Add support for processing more than one patch per workgroup. This should hopefully reduce underutilization of the GPU, especially on GPUs where the thread execution width is greater than the number of control points. This also simplifies initialization by reading the buffer directly instead of using Metal's vertex-attribute-in-compute support. It turns out the only way in which shader stages are allowed to differ in their interfaces is in the number of components per vector; the base type must be the same. Since we are using the raw buffer instead of attributes, we can now also emit arrays and matrices directly into the buffer, instead of flattening them and then unpacking them. Structs are still flattened, however; this is due to the need to handle vectors with fewer components than were output, and I think handling this while also directly emitting structs could get ugly. Another advantage of this scheme is that the extra invocations needed to read the attributes when there were more input than output points are now no more. The number of threads per workgroup is now lcm(SIMD-size, output control points). This should ensure we always process a whole number of patches per workgroup. To avoid complexity handling indices in the tessellation control shader, I've also changed the way vertex shaders for tessellation are handled. They are now compute kernels using Metal's support for vertex-style stage input. This lets us always emit vertices into the buffer in order of vertex shader execution. Now we no longer have to deal with indexing in the tessellation control shader. This also fixes a long-standing issue where if an index were greater than the number of vertices to draw, the vertex shader would wind up writing outside the buffer, and the vertex would be lost. This is a breaking change, and I know SPIRV-Cross has other clients, so I've hidden this behind an option for now. In the future, I want to remove this option and make it the default.	2020-07-23 17:59:54 -05:00
Hans-Kristian Arntzen	d573a95a9c	Run format_all.sh.	2020-07-01 11:42:58 +02:00
Chip Davis	5281d9997e	MSL: Fix up input variables' vector lengths in all stages. Metal is picky about interface matching. If the types don't match exactly, down to the number of vector components, Metal fails pipeline compilation. To support pipelines where the number of components consumed by the fragment shader is less than that produced by the vertex shader, we have to fix up the fragment shader to accept all the components produced.	2020-06-16 14:50:30 -05:00
Hans-Kristian Arntzen	2d5200650a	HLSL: Add native support for 16-bit types. Adds support for templated load/store in SM 6.2 to deal with small types.	2020-06-04 12:33:56 +02:00
Hans-Kristian Arntzen	165392a2b0	Document all CLI options.	2020-05-28 13:23:33 +02:00
Hans-Kristian Arntzen	ebf463674d	MSL: Allow removing clip distance user varyings. Only safe if user knows that subsequent shader stage will not read clip distance.	2020-04-20 09:58:40 +02:00
Chip Davis	b29f83c383	MSL: Add options to control emission of fragment outputs. Like with `point_size` when not rendering points, Metal complains when writing to a variable using the `[[depth]]` qualifier when no depth buffer be attached. In that case, we must avoid emitting `FragDepth`, just like with `PointSize`. I assume it will also complain if there be no stencil attachment and the shader write to `[[stencil]]`, or it write to `[[color(n)]]` but there be no color attachment at n.	2020-04-13 15:29:11 -05:00
Hanno	4560ee24fd	Improve compatibility with clang-cl	2020-04-09 17:30:20 +02:00
Hans-Kristian Arntzen	941cceedb4	Expose a query if samplers or images are comparison resources.	2020-04-03 17:43:42 +02:00
Hans-Kristian Arntzen	28bf9057df	HLSL: Add support for treating NonWritable UAV texture as SRV instead.	2020-04-03 11:50:50 +02:00
Hans-Kristian Arntzen	b8905bbd95	Add support for forcefully zero-initialized variables. Useful to better support certain platforms which require all variables to be initialized to something.	2020-03-26 13:38:27 +01:00
Hans-Kristian Arntzen	04e877df12	GLSL: Implement GL_EXT_shader_framebuffer_fetch.	2020-03-19 14:53:39 +01:00
Hans-Kristian Arntzen	c2655ab291	Run format_all.sh.	2020-03-19 14:22:49 +01:00
Hans-Kristian Arntzen	95cd20f1c7	Add test for disable-storage-image-qualifier-deduction.	2020-03-04 16:42:31 +01:00
Hans-Kristian Arntzen	c27e1efbf1	HLSL: Add option to always treat SSBO as UAV, even with readonly. This can make codegen more predictable since ByteAddressBuffer is SRV and not UAV.	2020-03-04 16:42:31 +01:00
Hans-Kristian Arntzen	3f2de0d5d3	Add -V alias for --vulkan-semantics.	2020-03-02 11:56:23 +01:00
Hans-Kristian Arntzen	c9d4f9cd74	MSL: Add a workaround path to force native arrays for everything.	2020-02-24 12:47:14 +01:00
Chip Davis	fedbc35315	MSL: Support inline uniform blocks in argument buffers. Here, the inline uniform block is explicit: we instantiate the buffer block itself in the argument buffer, instead of a pointer to the buffer. I just hope this will work with the `MTLArgumentDescriptor` API... Note that Metal recursively assigns individual members of embedded structs IDs. This means for automatic assignment that we have to calculate the binding stride for a given buffer block. For MoltenVK, we'll simply increment the ID by the size of the inline uniform block. Then the later IDs will never conflict with the inline uniform block. We can get away with this because Metal doesn't require that IDs be contiguous, only monotonically increasing.	2020-01-24 18:51:24 -06:00
Hans-Kristian Arntzen	f9818f0804	Update license headers to 2020.	2020-01-16 15:24:37 +01:00
Hans-Kristian Arntzen	7a411258af	Run format_all.sh.	2020-01-16 15:20:59 +01:00
Hans-Kristian Arntzen	c3bd136df1	MSL: Add support for force-activating IAB resources. Important for ABI compatibility on MSL in certain cases.	2020-01-16 11:12:06 +01:00
Akio Gaule	1280df6c7a	Added --msl-decoration-binding command line argument to enable binding decoration for Metal.	2019-11-27 20:49:08 -08:00
Hans-Kristian Arntzen	e38cbb9433	HLSL: Add CLI support for --hlsl-auto-binding.	2019-11-12 10:49:01 +01:00
Hans-Kristian Arntzen	8ad9584c2e	Fix formatting in main.cpp.	2019-10-24 10:56:36 +02:00
Lukas Hermanns	84351d3aed	Merge remote-tracking branch 'upstream/master'	2019-10-21 18:55:36 -04:00
Hans-Kristian Arntzen	4bb673a626	MSL: Add opt-in support for huge IABs. If there are enough members in an IAB, we cannot use the constant address space as MSL compiler complains about there being too many members. Support emitting the device address space instead.	2019-10-14 16:20:34 +02:00
Lukas Hermanns	ffbd801853	Added '--msl-invariant-float-math' option and new test case for it.	2019-10-09 14:03:06 -04:00
Lukas Hermanns	f3a6d28a1d	Further updates for pull request #1162; also added two test cases for spvCubemapTo2DArrayFace function and added '--msl-framebuffer-fetch' / '--msl-emulate-cube-array' compiler options.	2019-09-27 15:49:54 -04:00
Hans-Kristian Arntzen	333980ae91	Refactor into stronger types in public API. Some fallout where internal functions are using stronger types. Overkill to move everything over to strong types right now, but perhaps move over to it slowly over time.	2019-09-06 12:29:47 +02:00
Hans-Kristian Arntzen	afa5480210	Add dynamic offsets to C API.	2019-09-06 10:17:31 +02:00
Hans-Kristian Arntzen	1935f1a8e3	Fix some issues on certain compilers.	2019-09-06 10:11:18 +02:00
Chip Davis	cb35934248	MSL: Support dynamic offsets for buffers in argument buffers. Vulkan has two types of buffer descriptors, `VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC` and `VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC`, which allow the client to offset the buffers by an amount given when the descriptor set is bound to a pipeline. Metal provides no direct support for this when the buffer in question is in an argument buffer, so once again we're on our own. These offsets cannot be stored or associated in any way with the argument buffer itself, because they are set at bind time. Different pipelines may have different offsets set. Therefore, we must use a separate buffer, not in any argument buffer, to hold these offsets. Then the shader must manually offset the buffer pointer. This change fully supports arrays, including arrays of arrays, even though Vulkan forbids them. It does not, however, support runtime arrays. Perhaps later.	2019-09-05 23:29:00 -05:00
Hans-Kristian Arntzen	b97e9b0499	Fix severe performance issue with invariant expression invalidation. We were going down a tree of expressions multiple times and this caused an exponential explosion in time, which was not caught until recently. Fix this by blocking any traversal going through an ID more than one time. This fix overall improves performance by almost an order of magnitude on a particular test shader rather than slowing it down by ~75x.	2019-08-01 09:55:21 +02:00
Hans-Kristian Arntzen	12ca9d1982	Vulkan GLSL: Support disabling samplerless texture function EXT. Some platforms support Vulkan GLSL, but not this extension apparently ...	2019-07-25 11:07:14 +02:00
Chip Davis	fb5ee4cb5c	MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase(). This command allows the caller to set the base value of `BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal provides no direct support for this... but it does provide a builtin, `[[grid_origin]]`, normally used to pass the base values for the stage input region, which we will now abuse to pass the dispatch base and avoid burning a buffer binding. `[[grid_origin]]`, as part of Metal's support for compute stage input, requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer. (Curiously, this builtin was undocumented until the MSL 2.2 release. Go figure.)	2019-07-24 08:56:15 -05:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	7eecf5a46b	MSL: Support SPV_KHR_multiview. This is needed to support `VK_KHR_multiview`, which is in turn needed for Vulkan 1.1 support. Unfortunately, Metal provides no native support for this, and Apple is once again less than forthcoming, so we have to implement it all ourselves. Tessellation and geometry shaders are deliberately unsupported for now. The problem is that the current implementation encodes the `ViewIndex` as part of the `InstanceIndex`, which in the SPIR-V environment at least only exists in the vertex shader. So we need to work out a way to pass the view index along to the later stages. This implementation runs vertex shaders for all views up to the highest bit set in the view mask, even those whose bits are clear. The fragments for the inactive views are then discarded. Avoiding this is difficult: calculating the view indices becomes far more complicated if we can only run for those views which are set in the mask.	2019-06-29 09:43:55 -05:00
Hans-Kristian Arntzen	65af09d2d1	Support emitting OpLine directive. Facilitates easier mapping from source language to cross-compiled output in tooling.	2019-05-28 13:44:24 +02:00
Hans-Kristian Arntzen	0b9a884f3f	Add Git/timestamp --revision support.	2019-05-24 15:24:41 +02:00
Laszlo Agocs	7bc31491be	GLSL: Add option to disable buffer blocks regardless of version	2019-05-13 21:29:06 +02:00
Hans-Kristian Arntzen	fc4f39b11f	MSL: Support native texture_buffer type, throw error on atomics. Atomics are not supported on images or texture_buffers in MSL. Properly throw an error if OpImageTexelPointer is used (since it can only be used for atomic operations anyways).	2019-04-23 12:21:43 +02:00
Hans-Kristian Arntzen	3fe57d3798	Do not use SmallVector as input type in public interfaces. This is an API break, which we need to be careful with. Handing out SmallVectors is easier since the interface is basically the same.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	a489ba7fd1	Reduce pressure on global allocation. - Replace ostringstream with custom implementation. ~30% performance uplift on vector-shuffle-oom test. Allocations are measurably reduced in Valgrind. - Replace std::vector with SmallVector. Classic malloc optimization, small vectors are backed by inline data. ~ 7-8% gain on vector-shuffle-oom on GCC 8 on Linux. - Use an object pool for IVariant type. We generally allocate a lot of SPIR* objects. We can amortize these allocations neatly by pooling them. - ~15% overall uplift on ./test_shaders.py --iterations 10000 shaders/.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	c60b9a1e96	CLI: Make --iterations more useful. Add a basic benchmarking mode to test_shaders.py. We cannot safely just call compile() in a loop. Do the full pipeline for each iteration.	2019-04-09 15:09:16 +02:00
Hans-Kristian Arntzen	9b92e68d71	Add an option to override the namespace used for spirv_cross. This is a pragmatic trick to avoid symbol collision where a project links against SPIRV-Cross statically, while linking to other projects which also use SPIRV-Cross statically. We can end up with very awkward symbol collisions which can resolve themselves silently because SPIRV-Cross is pulled in as necessary. To fix this, we must use different symbols and embed two copies of SPIRV-Cross in this scenario, now with different namespaces, which in turn leads to different symbols.	2019-03-29 10:29:44 +01:00
Hans-Kristian Arntzen	88ce958a51	Add ray-tracing reflection to main.cpp and C API.	2019-03-27 10:21:30 +01:00
Hans-Kristian Arntzen	0474848d4a	GLSL: Support emitting push constant block as a plain UBO.	2019-03-19 10:58:52 +01:00
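
Commit 688c5fcbda above states that the number of threads per workgroup is lcm(SIMD-size, output control points), so that each workgroup processes a whole number of patches. The arithmetic can be sketched as follows. This is only an illustration of that formula, not code from SPIRV-Cross: the function names are hypothetical, and "SIMD-size" is assumed to mean the thread execution width mentioned in the same commit message.

```python
from math import gcd


def threads_per_workgroup(simd_width: int, output_control_points: int) -> int:
    """Least common multiple of the SIMD width and the patch size.

    Hypothetical helper illustrating the rule quoted in commit 688c5fcbda;
    it is not part of the SPIRV-Cross API.
    """
    if simd_width <= 0 or output_control_points <= 0:
        raise ValueError("both arguments must be positive")
    return simd_width * output_control_points // gcd(simd_width, output_control_points)


def patches_per_workgroup(simd_width: int, output_control_points: int) -> int:
    """Whole number of patches handled by one workgroup of that size."""
    total = threads_per_workgroup(simd_width, output_control_points)
    # The lcm is a multiple of the patch size, so this division is exact.
    return total // output_control_points


if __name__ == "__main__":
    # Assumed example values: a SIMD width of 32 with 3, 4 and 6 control points.
    for points in (3, 4, 6):
        print(points, threads_per_workgroup(32, points), patches_per_workgroup(32, points))
```

Because the workgroup size is a multiple of both values, no patch straddles two workgroups and no SIMD group is left partially filled by the workgroup size itself.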

151 Commits