SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	ffca8735ff	Merge pull request #1105 from cdavis5e/msl-unify-as MSL: Unify the get_*_address_space() methods.	2019-07-29 10:19:12 +02:00
Chip Davis	df18d98bea	MSL: Unify the get_*_address_space() methods. These methods have largely the same logic, with minor differences. That I felt compelled to duplicate the logic into another method was one of the things that bothered me about the variable pointers change. This cleans that part of the code up; now we don't have two places to change.	2019-07-26 09:43:28 -05:00
Hans-Kristian Arntzen	d378413040	Merge pull request #1103 from KhronosGroup/fix-1100 MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 14:35:18 +02:00
Hans-Kristian Arntzen	c3e8e728d8	MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 11:16:43 +02:00
Hans-Kristian Arntzen	abb345d0b3	MSL: Deal with Modf/Frexp where output is access chain to scalar. This is not allowed as we cannot take mutable reference to a vec.{x,y,z,w}. We only care about scalar since entire vectors are fine.	2019-07-26 11:02:38 +02:00
Hans-Kristian Arntzen	3c03b55c46	Workaround MSVC 2013 compiler issues.	2019-07-25 10:28:11 +02:00
Chip Davis	fb5ee4cb5c	MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase(). This command allows the caller to set the base value of `BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal provides no direct support for this... but it does provide a builtin, `[[grid_origin]]`, normally used to pass the base values for the stage input region, which we will now abuse to pass the dispatch base and avoid burning a buffer binding. `[[grid_origin]]`, as part of Metal's support for compute stage input, requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer. (Curiously, this builtin was undocumented until the MSL 2.2 release. Go figure.)	2019-07-24 08:56:15 -05:00
Hans-Kristian Arntzen	c62503bca7	Do not attempt to pack types which are already scalar.	2019-07-24 11:52:28 +02:00
Hans-Kristian Arntzen	646e04294a	Fix some warnings when building in MoltenVK.	2019-07-23 16:39:13 +02:00
Hans-Kristian Arntzen	5c1cb7accf	Recursively pack struct types when we find scalar packed structs.	2019-07-23 15:24:53 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	609d087f8f	Only transpose unpacked expressions.	2019-07-22 16:06:09 +02:00
Hans-Kristian Arntzen	6057ffcbb1	Deal correctly with complete stores to row_major matrices.	2019-07-22 15:49:17 +02:00
Hans-Kristian Arntzen	19f5cd3e90	Declare correct matrix type when unpacking.	2019-07-22 13:25:45 +02:00
Hans-Kristian Arntzen	f2d6a77c95	Don't forget to register a write to LHS expression in certain case.	2019-07-22 13:06:30 +02:00
Hans-Kristian Arntzen	745a2f7b0e	Deal with swizzled stores to std140 matrices.	2019-07-22 13:05:23 +02:00
Hans-Kristian Arntzen	180a6b38c5	Fix some row-major column store cases.	2019-07-22 12:56:14 +02:00
Hans-Kristian Arntzen	4ab2829cf6	Fix more stray parens.	2019-07-22 12:13:07 +02:00
Hans-Kristian Arntzen	d6004bfc97	Fixup stray parent in output.	2019-07-22 12:08:56 +02:00
Hans-Kristian Arntzen	14afb968dd	Correctly unpack row-major matrices when storing to LHS.	2019-07-22 12:03:12 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	b66a53a979	Traverse correct types when checking scalar layout.	2019-07-19 14:43:42 +02:00
Hans-Kristian Arntzen	e90d816cdd	Deal with scalar layout of entire structs. Mark all candidate struct types.	2019-07-19 14:18:14 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	27b75c2c5a	Deal with all forms of matrix writes ...	2019-07-19 12:53:10 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	dd7ebaf9f7	Start considering how to emit physical type ID.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	b09b8d3fa9	Deal more cleanly with matrices and row-major.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	c160d5227f	Reintroduce struct_member_* MSL queries. Need to remap to physical type + packed qualifier, and this is handy to do in a helper function.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Hans-Kristian Arntzen	c7eda1bce9	Test glsl.std450 more exhaustively. Make sure to test everything with scalar as well to catch any weird edge cases. Not all opcodes are covered here, just the arithmetic ones. FP64 packing is also ignored.	2019-07-17 11:53:05 +02:00
Chip Davis	bc646574a6	MSL: Support the SPV_INTEL_shader_integer_functions2 extension. This provides a few functions normally available in OpenCL to the SPIR-V shader environment. These functions happen to be available in Metal as well. No GLSL, unfortunately. Intel has yet to publish a `GL_INTEL_shader_integer_functions2` spec.	2019-07-15 09:42:36 -05:00
Hans-Kristian Arntzen	33d2bbcf69	Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:31 +02:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	ca91fcfe5f	MSL: Support the SPV_AMD_shader_trinary_minmax extension. This requires MSL 2.1.	2019-07-13 16:43:57 -05:00
Hans-Kristian Arntzen	92e5255570	Run format_all.sh.	2019-07-12 10:59:53 +02:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	19ebbd48c7	Merge pull request #1077 from cdavis5e/msl-spirv-qualifiers MSL: Handle coherent, volatile, and restrict.	2019-07-12 10:03:06 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00
Chip Davis	1df47db6ba	Support the SPV_KHR_post_depth_coverage extension. Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn` variable is to contain the computed coverage mask following the early fragment tests, which this mode requires and implicitly enables. Note that unlike Vulkan and OpenGL, Metal places this on the sample mask input itself, and furthermore does not implicitly enable early fragment testing. If it isn't enabled explicitly with an `[[early_fragment_tests]]` attribute, the compiler will error out. So we have to enable that mode explicitly if `PostDepthCoverage` is enabled but `EarlyFragmentTests` isn't. For Metal, only iOS supports this; for some reason, Apple has yet to implement it on macOS, even though many desktop cards support it.	2019-07-11 10:28:43 -05:00
Chip Davis	058f1a0933	MSL: Handle coherent, volatile, and restrict. This maps them to their MSL equivalents. I've mapped `Coherent` to `volatile` since MSL doesn't have anything weaker than `volatile` but stronger than nothing. As part of this, I had to remove the implicit `volatile` added for atomic operation casts. If the buffer is already `coherent` or `volatile`, then we would add a second `volatile`, which would be redundant. I think this is OK even when the buffer doesn't have `coherent`: `T ` is implicitly convertible to `volatile T `, but not vice-versa. It seems to compile OK at any rate. (Note that the non-`volatile` overloads of the atomic functions documented in the spec aren't present in the MSL 2.2 stdlib headers.) `restrict` is tricky, because in MSL, as in C++, it needs to go after the asterisk or ampersand for the pointer type it's modifying. Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory models, `Restrict` is the default (i.e. does not need to be specified); but MSL likely follows the `OpenCL` model where `Aliased` is the default. We probably need to implicitly set either `Restrict` or `Aliased` depending on the module's declared memory model.	2019-07-11 10:22:30 -05:00
Hans-Kristian Arntzen	1a592b7c0f	Merge pull request #1067 from cdavis5e/msl-scalar-block-layout MSL: Support scalar block layout.	2019-07-11 13:03:03 +02:00
Chip Davis	28454facbb	MSL: Handle packed matrices. The old method of using a different unpacked matrix type doesn't work for scalar alignment. It certainly wouldn't have any effect for a square matrix, since the number of columns and rows are the same. So now we'll store them as arrays of packed vectors.	2019-07-10 18:37:31 -05:00
Chip Davis	ea5c0ed82f	MSL: Fix alignment of packed types. Packed types have scalar alignment.	2019-07-10 11:57:04 -05:00
Hans-Kristian Arntzen	6b010e0cbc	Merge pull request #1069 from KhronosGroup/fix-1053 MSL: Re-roll array expressions in initializers.	2019-07-10 12:15:12 +02:00
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Chip Davis	e5fa7edfd6	MSL: Support scalar block layout. Relaxed block layout relaxed the restrictions on vector alignment, allowing them to be aligned on scalar boundaries. Scalar block layout relaxes this further, allowing any member to be aligned on a scalar boundary. The requirement that a vector not improperly straddle a 16-byte boundary is also relaxed. I've also added a test showing that `std430` layout works with UBOs. I'm troubled by the dual meaning of the `Packed` extended decoration. In some instances (struct, `float[]`, and `vec2[]` members), it actually means the exact opposite, that the member needs extra padding. This is especially problematic for `vec2[]`, because now we need to distinguish the two cases by checking the array stride. I wonder if this should actually be split into two decorations.	2019-07-09 20:59:32 -05:00
Hans-Kristian Arntzen	909040e2eb	MSVC 2013: Work around another compiler bug with array init.	2019-07-09 15:31:01 +02:00
Hans-Kristian Arntzen	4056d0b74e	Don't use scalar dot().	2019-07-03 14:32:06 +02:00
Hans-Kristian Arntzen	041f103d44	MSL/HLSL: Support scalar reflect and refract.	2019-07-03 12:31:52 +02:00
Chip Davis	31b6c93516	MSL: Support SubgroupLocalInvocationId and SubgroupSize in all stages. MSL prior to 2.2 doesn't support these natively in any stage but compute. But, we can (assuming no threads were terminated prematurely) get their values with some creative uses of the `simd_prefix_exclusive_sum()` and `simd_sum()` functions. Also, fix a missing `to_expression()` with `BuiltInSubgroupEqMask`. For KhronosGroup/MoltenVK#629.	2019-07-02 11:48:59 -05:00
Hans-Kristian Arntzen	f8b084de61	MSL/HLSL: Support OpOuterProduct.	2019-07-01 10:57:27 +02:00
Chip Davis	7eecf5a46b	MSL: Support SPV_KHR_multiview. This is needed to support `VK_KHR_multiview`, which is in turn needed for Vulkan 1.1 support. Unfortunately, Metal provides no native support for this, and Apple is once again less than forthcoming, so we have to implement it all ourselves. Tessellation and geometry shaders are deliberately unsupported for now. The problem is that the current implementation encodes the `ViewIndex` as part of the `InstanceIndex`, which in the SPIR-V environment at least only exists in the vertex shader. So we need to work out a way to pass the view index along to the later stages. This implementation runs vertex shaders for all views up to the highest bit set in the view mask, even those whose bits are clear. The fragments for the inactive views are then discarded. Avoiding this is difficult: calculating the view indices becomes far more complicated if we can only run for those views which are set in the mask.	2019-06-29 09:43:55 -05:00
Hans-Kristian Arntzen	ff87419607	Deal with scalar input values for distance/length/normalize. HLSL and MSL don't support it, so fall back to simpler intrinsics.	2019-06-28 11:20:14 +02:00
Hans-Kristian Arntzen	1543bdaf7b	Run format_all.sh.	2019-06-27 15:10:59 +02:00
Hans-Kristian Arntzen	c76b99b711	Handle more cases with FP16 and texture sampling.	2019-06-27 15:04:22 +02:00
Hans-Kristian Arntzen	45805857e5	MSL: De-virtualize get_declared_struct_member_size. It does not make sense to use a virtual call in the Compiler base class here. Make it clearer by renaming the MSL-specific version to _msl.	2019-06-26 19:11:38 +02:00
Hans-Kristian Arntzen	02b2a1015d	MSL: Fix minor XCode /analyze warning. Written variable, but never read.	2019-06-26 16:10:58 +02:00
Hans-Kristian Arntzen	8f6939cb0d	Merge pull request #1041 from KhronosGroup/fix-1011 MSL: Add support for SubgroupSize / SubgroupInvocationID in fragment.	2019-06-26 15:01:13 +02:00
Hans-Kristian Arntzen	ab3798fd91	MSL: Add support for SubgroupSize / SubgroupInvocationID in fragment.	2019-06-24 12:31:54 +02:00
Hans-Kristian Arntzen	048f2380f3	MSL: Support custom bindings for argument buffer itself.	2019-06-24 11:10:20 +02:00
Hans-Kristian Arntzen	b4e0163749	Run format_all.sh.	2019-06-21 16:02:22 +02:00
Hans-Kristian Arntzen	3a4a9acac9	MSL: Add C API for querying automatic resource bindings.	2019-06-21 13:19:59 +02:00
Hans-Kristian Arntzen	e2c95bdcbc	MSL: Rewrite how resource indices are fallback-assigned. We used to use the Binding decoration for this, but this method is hopelessly broken. If no explicit MSL resource remapping exists, we remap automatically in a manner which should always "just work".	2019-06-21 12:54:08 +02:00
Hans-Kristian Arntzen	a1f7c8dc8e	Merge pull request #1031 from KhronosGroup/fix-1009 MSL: Support 64-bit integers.	2019-06-19 15:29:27 +02:00
Hans-Kristian Arntzen	7fdb418f18	Merge pull request #1028 from KhronosGroup/fix-1010 MSL: Support barycentrics and PrimitiveID in fragment shaders	2019-06-19 15:29:14 +02:00
Hans-Kristian Arntzen	4c20c941f0	Merge pull request #1025 from KhronosGroup/fix-1013 MSL: Support OpImageQueryLod.	2019-06-19 14:07:39 +02:00
Hans-Kristian Arntzen	a6798d06a2	MSL: Error out on int64_t/uint64_t buffer members. Not supported for whatever reason.	2019-06-19 10:14:46 +02:00
Hans-Kristian Arntzen	a6b71ae999	MSL: Support 64-bit integers.	2019-06-19 09:55:00 +02:00
Hans-Kristian Arntzen	2e1cee5e1e	MSL: Support PrimitiveID in fragment and barycentrics.	2019-06-19 09:52:35 +02:00
Hans-Kristian Arntzen	0671b3c35b	MSL: Support OpImageQueryLod. Correctness is a bit unclear at the moment. The spec document for 2.2 is not updated for query-lod, but this is the best we can do anyways.	2019-06-19 09:51:56 +02:00
Hans-Kristian Arntzen	f171d82590	MSL: Support MinLod operand.	2019-06-19 09:43:03 +02:00
Hans-Kristian Arntzen	95053ea4bc	Merge pull request #1024 from KhronosGroup/fix-1016 GLSL/MSL: Support stencil export	2019-06-12 12:48:10 +02:00
Hans-Kristian Arntzen	14d0a1eb0c	MSL: Support stencil export.	2019-06-12 10:21:20 +02:00
Hans-Kristian Arntzen	a7b2ba28a0	MSL: Support Invariant qualifier on position.	2019-06-12 09:39:12 +02:00
Hans-Kristian Arntzen	30bb197a5d	MSL: Support remapping constexpr samplers by set/binding. Older API was oriented around IDs which are not available unless you're doing full reflection, which is awkward for certain use cases which know their set/bindings up front. Optimize resource bindings to be hashmap rather than doing linear seeks all the time.	2019-06-10 15:41:36 +02:00
Hans-Kristian Arntzen	314efdcc42	MSL: Fix declaration of unused input variables. In multiple-entry-point modules, we declared builtin inputs which were not supposed to be used for that entry point. Fix this, by being more strict when checking which builtins to emit.	2019-05-31 13:23:34 +02:00
Hans-Kristian Arntzen	b3094cd02a	Run format_all.sh.	2019-05-27 16:54:13 +02:00
Hans-Kristian Arntzen	fd0feb1ec1	MSL: Use correct address space when passing array-of-buffers. Need to check if the descriptor set is actually an argument buffer.	2019-05-27 16:53:30 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	96492648d4	MSL: Fix struct declaration order with complex type aliases. MSL generally emits the aliases, which means we cannot always place the master type first, unlike GLSL and HLSL. The logic fix is just to reorder after we have tagged types with packing information, rather than doing it in the parser fixup.	2019-05-23 14:54:04 +02:00
Hans-Kristian Arntzen	eaf7afed97	MSL: Support argument buffers and image swizzling. Change aux buffer to swizzle buffer. There is no good reason to expand the aux buffer, so name it appropriately. Make the code cleaner by emitting a straight pointer to uint rather than a dummy struct which only contains a single unsized array member anyways. This will also end up being very similar to how we implement swizzle buffers for argument buffers. Do not use implied binding if it overflows int32_t.	2019-05-18 10:30:06 +02:00
Chip Davis	8983920edf	Remove fallback for OpGroupNonUniformElect. It's not safe to enable subgroup support without this actually working correctly.	2019-05-16 13:42:09 -05:00
Chip Davis	9d9415754b	MSL: Add support for subgroup operations. Some support for subgroups is present starting in Metal 2.0 on both iOS and macOS. macOS gains more complete support in 10.14 (Metal 2.1). Some restrictions are present. On iOS and on macOS 10.13, the implementation of `OpGroupNonUniformElect` is incorrect: if thread 0 has already terminated or is not executing a conditional branch, the first thread that is will falsely believe itself not to be. Unfortunately, this operation is part of the "basic" feature set; without it, subgroups cannot be supported at all. The `SubgroupSize` and `SubgroupLocalInvocationId` builtins are only available in compute shaders (and, by extension, tessellation control shaders), despite SPIR-V making them available in all stages. This limits the usefulness of some of the subgroup operations in fragment shaders. Although Metal on macOS supports some clustered, inclusive, and exclusive operations, it does not support them all. In particular, inclusive and exclusive min, max, and, or, and xor; as well as cluster sizes other than 4 are not supported. If this becomes a problem, they could be emulated, but at a significant performance cost due to the need for non-uniform operations.	2019-05-15 17:40:04 -05:00
Hans-Kristian Arntzen	647ddaee42	HLSL/MSL: Deal correctly with nonuniformEXT qualifier. MSL does not seem to have a qualifier for this, but HLSL SM 5.1 does. glslangValidator for HLSL does not support this, so skip any validation, but it passes in FXC.	2019-05-13 14:58:27 +02:00
Hans-Kristian Arntzen	ad95173a48	Fix GCC 4.x warning.	2019-05-09 12:28:34 +02:00
Hans-Kristian Arntzen	97d39dc9d5	MSL: Deal with texture swizzle on arrays of images.	2019-05-09 11:25:45 +02:00
Hans-Kristian Arntzen	2cc374a0c8	GLSL: Implement GL_EXT_buffer_reference. Buffer objects can contain arbitrary pointers to blocks. We can also implement ConvertPtrToU and ConvertUToPtr. The latter can cast a uint64_t to any type as it pleases, so we will need to generate fake buffer reference blocks to be able to cast the type.	2019-04-26 11:43:51 +02:00
Hans-Kristian Arntzen	c2715c3908	MSL: Cast texture_buffer index to uint.	2019-04-23 12:46:48 +02:00
Hans-Kristian Arntzen	de1148b8ba	Run format_all.sh.	2019-04-23 12:21:53 +02:00
Hans-Kristian Arntzen	fc4f39b11f	MSL: Support native texture_buffer type, throw error on atomics. Atomics are not supported on images or texture_buffers in MSL. Properly throw an error if OpImageTexelPointer is used (since it can only be used for atomic operations anyways).	2019-04-23 12:21:43 +02:00
Michael Barriault	82b4ad8a30	Correct formatting.	2019-04-16 19:13:57 +01:00
Michael Barriault	105bfd368a	Only use MSL constant address space for tessellation control shader.	2019-04-16 17:56:02 +01:00
Michael Barriault	16911c5a4d	Merge remote-tracking branch 'origin/master' * origin/master: Support running {,update_}test_shader.sh with CMake builds. Don't apply vertex attribute remapping other non-vertex or non-input interface blocks Force complex loop in certain rare access chain scenarios. Fix guard around [[noreturn]]. Deal with mismatched signs in S/U/F conversion opcodes. Workaround lack of lvalue/rvalue operator overload on MSVC 2013. Support direct conversions to std::vector from SmallVector. Fix some minor copy constructor issues in Variant. Make sure ids_for_types are moved correctly in move operator. Run format_all.sh. Refactor out error handling and containers to new headers. Do not use SmallVector as input type in public interfaces. Fix various bugs found in testing. Explicitly implement move operators for ParsedIR. Try another MSVC 2013 workaround. Implement edge cases in insert/end and add a simple test case. Fix GCC 4.x warnings. Workaround lack of alignas on MSVC 2013. Reduce pressure on global allocation. CLI: Make --iterations more useful.	2019-04-13 18:06:29 +01:00
Michael Barriault	ca7df787b3	Use constant address space for SPIR-V parameters when generating tessellation control shader.	2019-04-09 19:41:31 +01:00
Hans-Kristian Arntzen	3fe57d3798	Do not use SmallVector as input type in public interfaces. This is an API break, which we need to be careful with. Handing out SmallVectors is easier since the interface is basically the same.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	a489ba7fd1	Reduce pressure on global allocation. - Replace ostringstream with custom implementation. ~30% performance uplift on vector-shuffle-oom test. Allocations are measurably reduced in Valgrind. - Replace std::vector with SmallVector. Classic malloc optimization, small vectors are backed by inline data. ~ 7-8% gain on vector-shuffle-oom on GCC 8 on Linux. - Use an object pool for IVariant type. We generally allocate a lot of SPIR* objects. We can amortize these allocations neatly by pooling them. - ~15% overall uplift on ./test_shaders.py --iterations 10000 shaders/.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	23db744e35	Deal with case where we need to emit SpvImplArrayCopy late. We cannot deduce if OpLoad needs ArrayCopy templates early since it's heavily context dependent, and we might only know on 3rd iteration of the compile loop.	2019-04-09 12:28:46 +02:00
Bill Hollings	efbe7ca16f	MSL: Fix infinite CAS loop on atomic_compare_exchange_weak_explicit().	2019-04-05 21:28:57 -04:00
Hans-Kristian Arntzen	317144a59c	Detect invalid DoWhileLoop early. We had a bug where error conditions in DoWhileLoop emit path would not detect that statements were being emitted due to the masking behavior which happens when force_recompile is true. Fix this. Also, refactor force_recompile into member functions so we can properly break on any situation where this is set, without having to rely on watchpoints in debuggers.	2019-04-05 12:19:32 +02:00
Hans-Kristian Arntzen	9b92e68d71	Add an option to override the namespace used for spirv_cross. This is a pragmatic trick to avoid symbol collision where a project links against SPIRV-Cross statically, while linking to other projects which also use SPIRV-Cross statically. We can end up with very awkward symbol collisions which can resolve themselves silently because SPIRV-Cross is pulled in as necessary. To fix this, we must use different symbols and embed two copies of SPIRV-Cross in this scenario, now with different namespaces, which in turn leads to different symbols.	2019-03-29 10:29:44 +01:00
Bill Hollings	c48702d8c2	Fix crash when backend.int16_t_literal_suffix set to null. The design of backend.int16_t_literal_suffix and backend.uint16_t_literal_suffix allows them to be set to null, but that was not always tested for. I have removed the expectation that they can be null and set backend.int16_t_literal_suffix to "" when no suffix is needed. That has the same effect, and seemed to be a more usable and defensive approach.	2019-03-28 14:23:32 -04:00
Hans-Kristian Arntzen	18d4f67a87	Merge pull request #919 from KhronosGroup/fix-915 MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]].	2019-03-28 14:00:49 +01:00
Hans-Kristian Arntzen	0909975655	MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]]. Avoids ugly warnings on nearly every compute shader. We could do analysis to detect whether we need to emit this constant, but it's a bit tedious to figure out if an OpConstantComponent is actually used by opcodes, so just make it simple.	2019-03-28 10:54:18 +01:00
Hans-Kristian Arntzen	c37f88fea6	MSL: Fix crash where variable storage buffer pointers are passed down. Only deal with readonly decoration for actual block types.	2019-03-28 10:16:46 +01:00
Hans-Kristian Arntzen	eeb3f24991	Properly deal with sign-dependent GLSL opcodes. The GLSLstd450 spec is very lax about input signs, so we need to do the bitcasting dance to implement it correctly.	2019-03-27 12:20:53 +01:00
Hans-Kristian Arntzen	e2aadf8995	Rename "push descriptor set" to "discrete descriptor set". Check for case where iOS doesn't support writable argument buffer textures.	2019-03-15 21:53:21 +01:00
Hans-Kristian Arntzen	b3380ec9dd	MSL: Support VK_KHR_push_descriptor. If we have argument buffers, we also need to support using plain descriptor sets for certain cases where API wants it.	2019-03-15 14:08:47 +01:00
Hans-Kristian Arntzen	c310b40fd3	MSL: Make sure get_buffer_block_flags is only used in right context.	2019-03-15 12:27:54 +01:00
Hans-Kristian Arntzen	bc21ccb7ce	MSL: Emit correct SSBO constness for argument buffers.	2019-03-15 12:05:35 +01:00
Hans-Kristian Arntzen	969566aff5	MSL: Fixup buffer array case issue on MSL 1.0.	2019-03-15 11:37:34 +01:00
Hans-Kristian Arntzen	af8a9ccdcb	MSL: Need to emit two layers of address space. When passing down arrays of buffer pointers, the array itself needs an address space.	2019-03-15 11:29:17 +01:00
Hans-Kristian Arntzen	e47a77d596	MSL: Implement Metal 2.0 indirect argument buffers.	2019-03-15 11:01:27 +01:00
Hans-Kristian Arntzen	e74c21a39b	Review fixups.	2019-03-04 10:08:31 +01:00
Hans-Kristian Arntzen	9bbdccddb7	Add a stable C API for SPIRV-Cross. This adds a new C API for SPIRV-Cross which is intended to be stable, both API and ABI wise. The C++ API has been refactored a bit to make the C wrapper easier and cleaner to write. Especially the vertex attribute / resource interfaces for MSL has been rewritten to avoid taking mutable pointers into the interface. This would be very annoying to wrap and it didn't fit well with the rest of the C++ API to begin with. While doing this, I went ahead and removed all the old deprecated interfaces. The CMake build system has also seen an overhaul. It is now possible to build static/shared/CLI separately with -D options. The shared library only exposes the C API, as it is the only ABI-stable API. pkg-configs as well as CMake modules are exported and installed for the shared library configuration.	2019-03-01 11:53:51 +01:00
Hans-Kristian Arntzen	825ff4af7e	Replace locale handling. We were using std::locale::global() to force a C locale which is not safe when SPIRV-Cross is used in a multi-threaded environment. To fix this, we could tap into various per-platform specific locale handling to get safe thread-local locales, but since locales only affect the decimal point in floats, we simply query the locale instead and do the necessary radix replacement ourselves, without touching the locale. This should be much safer and cleaner than the alternative.	2019-02-28 11:28:31 +01:00
Hans-Kristian Arntzen	ee395afa83	MSL: Emit proper name for optimized UBO/SSBO arrays.	2019-02-25 11:09:00 +01:00
Hans-Kristian Arntzen	ad6134262e	Merge pull request #877 from cdavis5e/msl-tesc-early-return MSL: Return early from helper tesc invocations.	2019-02-25 09:13:06 +01:00
Hans-Kristian Arntzen	7874f7fc49	Merge pull request #876 from cdavis5e/msl-tese-fixup-2 MSL: Make sure we fix up the output position.	2019-02-25 09:12:47 +01:00
Chip Davis	a43dcd7b99	MSL: Return early from helper tesc invocations. Return after loading the input control point array if there are more input points than output points, and this was one of the helper invocations spun off to load the input points. I was hesitant to do this initially, since the MSL spec has this to say about barriers: > The `threadgroup_barrier` (or `simdgroup_barrier`) function must be > encountered by all threads in a threadgroup (or SIMD-group) executing > the kernel. That is, if any thread executes the barrier, then all threads must execute it, or the barrier'd invocations will hang. But, the key words here seem to be "executing the kernel;" inactive invocations, those that have already returned, need not encounter the barrier to prevent hangs. Indeed, I've encountered no problems from doing this, at least on my hardware. This also fixes a few CTS tests that were failing due to execution ordering; apparently, my assumption that the later, invalid data written by the helpers would get overwritten was wrong.	2019-02-24 12:17:47 -06:00
Chip Davis	f3267db1d8	MSL: Make sure we fix up the output position. If a stage takes the position as both an input and an output (i.e. a tessellation shader or a geometry shader), then we could wind up fixing up the input position by mistake. Ensure that doesn't happen, by only setting the `qual_pos_var_name` variable from the output position.	2019-02-22 15:28:28 -06:00
Chip Davis	f3c0942d10	MSL: Use vectors for the tessellation level builtins in tese shaders. The tessellation levels in Metal are stored as a densely-packed array of half-precision floating point values. But, stage-in attributes in Metal have to have offsets and strides aligned to a multiple of four, so we can't add them individually. Luckily for us, the arrays have lengths less than 4. So, let's use vectors for them! Triangles get a single attribute with a `float4`, where the outer levels are in `.xyz` and the inner levels are in `.w`. The arrays are unpacked as though we had added the elements individually. Quads get two: a `float4` with the outer levels and a `float2` with the inner levels. Further, since vectors can be indexed as arrays, there's no need to unpack them in this case. This also saves on precious vertex attributes. Before, we were using up to 6 of them. Now we need two at most.	2019-02-22 12:18:51 -06:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Chip Davis	dae4a88b06	MSL: Don't do the fixup at all when capturing output.	2019-02-21 17:05:37 -06:00
Chip Davis	b34fd63c2d	MSL: Do position fixup for tessellation evaluation shaders, too.	2019-02-21 16:57:56 -06:00
Chip Davis	7042cb9bec	Quiesce truncation warnings.	2019-02-21 15:11:45 -06:00
Chip Davis	c756a91c3c	MSL: Fix a case I missed initializing vtx_attrs_by_builtin.	2019-02-21 13:14:03 -06:00
Chip Davis	9d8a5be725	MSL: Ignore duplicate builtin vertex attributes. These are often arrayed builtins, which MSL maps to more than one attribute. SPIRV-Cross automatically assigns succeeding addresses to arrayed attributes, so we really only need the first one. This of course assumes that the inputs are sorted by location.	2019-02-21 13:14:03 -06:00
Chip Davis	5069ec72bb	MSL: Set location of builtins based on client input. Builtin attributes in SPIR-V aren't linked by location, but by their built-in-ness. This poses a problem for MSL, since builtin inputs in the vertex pipeline are just regular attributes. We must then assign them locations so that they can be matched up to the attributes in the stage input descriptor--and also to avoid duplicate attribute numbers in tessellation evaluation shaders, where there are two different stage-in structs, so the member index therein is no longer unique!	2019-02-20 22:16:51 -06:00
Chip Davis	7a7e210515	MSL: Force unnamed array builtin attributes to have a name. That way, when we refer to them, they'll have the name that we're expecting.	2019-02-20 22:16:51 -06:00
Hans-Kristian Arntzen	ed7292fec4	Merge pull request #867 from cdavis5e/tese-shader-origin-2 MSL: Don't bother fixing up triangle tess coords.	2019-02-20 22:36:21 +01:00
Chip Davis	285ca4c2b1	MSL: Don't bother fixing up triangle tess coords. Instead, I'm going to have MoltenVK reverse the winding order in the lower-left case. This seems to be what the test suite expects to happen anyhow.	2019-02-20 14:30:44 -06:00
Hans-Kristian Arntzen	c1a93b8a71	Run format_all.sh. Missed some nits in earlier reviews.	2019-02-20 17:29:57 +01:00
Chip Davis	ba8593b112	Fix formatting.	2019-02-20 09:19:25 -06:00
Chip Davis	8095434dc4	MSL: Drop stores to nonexistent tess levels. In SPIR-V, there are always two inner levels and four outer levels, even if the input patch isn't a quad patch. But in MSL, due to requirements imposed by Metal, only one inner level and three outer levels exist when the input patch is a triangle patch. We must explicitly ignore any write to the nonexistent second inner and fourth outer levels in this case.	2019-02-20 09:11:24 -06:00
Chip Davis	c8ee9fbe76	MSL: Expand quad gl_TessCoord to a float3. This is the actual SPIR-V type of the builtin. We forced to a `float2` in the declaration because that's what Metal wants.	2019-02-20 09:11:24 -06:00
Hans-Kristian Arntzen	58f264c99d	Merge pull request #865 from KhronosGroup/fix-863 Always value-cast FP16 constants instead of using literals.	2019-02-20 14:58:44 +01:00
Hans-Kristian Arntzen	4ef51331b2	Always value-cast FP16 constants instead of using literals. GL_NV_gpu_shader5 doesn't support "hf", so to avoid lots of complicated workarounds, just value-cast the half literals.	2019-02-20 12:30:01 +01:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Chip Davis	41d9424233	MSL: Add an option to set the tessellation domain origin. This is intended to be used to support `VK_KHR_maintenance2`'s tessellation domain origin feature. If `tess_domain_origin_lower_left` is `true`, the `v` coordinate will be inverted with respect to the domain. Additionally, in `Triangles` mode, the `v` and `w` coordinates will be swapped. This is because the winding order is interpreted differently in lower-left mode.	2019-02-18 14:25:42 -06:00
Chip Davis	08863c1e28	Don't set any aliases or do any flattening for arrayed per-vertex I/O. We already handle all that specially.	2019-02-15 17:24:16 -06:00
Chip Davis	6b7988046d	Handle blocks of patch I/O. In this case, each member of the block will be decorated with `DecorationPatch`, rather than the block variable having the decoration.	2019-02-15 17:21:38 -06:00

1 2 3 4 5 ...

676 Commits