SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	d584d833fa	Test array of std140 vectors.	2019-07-23 10:38:32 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	82c819ee6c	Add test for CompositeExtract from row-major loaded vector.	2019-07-22 16:32:22 +02:00
Hans-Kristian Arntzen	d7a5303cf2	Add test for split access chain into row-major matrix.	2019-07-22 16:28:05 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	609d087f8f	Only transpose unpacked expressions.	2019-07-22 16:06:09 +02:00
Hans-Kristian Arntzen	6057ffcbb1	Deal correctly with complete stores to row_major matrices.	2019-07-22 15:49:17 +02:00
Hans-Kristian Arntzen	19f5cd3e90	Declare correct matrix type when unpacking.	2019-07-22 13:25:45 +02:00
Hans-Kristian Arntzen	745a2f7b0e	Deal with swizzled stores to std140 matrices.	2019-07-22 13:05:23 +02:00
Hans-Kristian Arntzen	180a6b38c5	Fix some row-major column store cases.	2019-07-22 12:56:14 +02:00
Hans-Kristian Arntzen	4ab2829cf6	Fix more stray parens.	2019-07-22 12:13:07 +02:00
Hans-Kristian Arntzen	d6004bfc97	Fixup stray paren in output.	2019-07-22 12:08:56 +02:00
Hans-Kristian Arntzen	14afb968dd	Correctly unpack row-major matrices when storing to LHS.	2019-07-22 12:03:12 +02:00
Hans-Kristian Arntzen	172185016f	MSL: Add std140 and scalar matrix layouts.	2019-07-22 11:30:03 +02:00
Hans-Kristian Arntzen	6471236652	MSL: Add std430 matrix access test.	2019-07-22 11:23:06 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	6c1f97b4a9	Fix unpacking of packed but not remapped types on load.	2019-07-19 14:50:35 +02:00
Hans-Kristian Arntzen	b66a53a979	Traverse correct types when checking scalar layout.	2019-07-19 14:43:42 +02:00
Hans-Kristian Arntzen	e90d816cdd	Deal with scalar layout of entire structs. Mark all candidate struct types.	2019-07-19 14:18:14 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Hans-Kristian Arntzen	c7eda1bce9	Test glsl.std450 more exhaustively. Make sure to test everything with scalar as well to catch any weird edge cases. Not all opcodes are covered here, just the arithmetic ones. FP64 packing is also ignored.	2019-07-17 11:53:05 +02:00
Chip Davis	bc646574a6	MSL: Support the SPV_INTEL_shader_integer_functions2 extension. This provides a few functions normally available in OpenCL to the SPIR-V shader environment. These functions happen to be available in Metal as well. No GLSL, unfortunately. Intel has yet to publish a `GL_INTEL_shader_integer_functions2` spec.	2019-07-15 09:42:36 -05:00
Hans-Kristian Arntzen	33d2bbcf69	Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:31 +02:00
Hans-Kristian Arntzen	4d7aae1050	Merge branch 'device-group' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:16 +02:00
Chip Davis	343c6f4ff4	Update external repos. Fix fallout from changes. There's a bug in glslang that prevents `float16_t`, `[u]int16_t`, and `[u]int8_t` constants from adding the corresponding SPIR-V capabilities. SPIRV-Tools, meanwhile, tightened validation so that these constants are only valid if the corresponding `Float16`, `Int16`, and `Int8` caps are on. This affects the `16bit-constants.frag` test for GLSL and MSL.	2019-07-13 16:50:21 -05:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	ca91fcfe5f	MSL: Support the SPV_AMD_shader_trinary_minmax extension. This requires MSL 2.1.	2019-07-13 16:43:57 -05:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	19ebbd48c7	Merge pull request #1077 from cdavis5e/msl-spirv-qualifiers MSL: Handle coherent, volatile, and restrict.	2019-07-12 10:03:06 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00
Chip Davis	1df47db6ba	Support the SPV_KHR_post_depth_coverage extension. Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn` variable is to contain the computed coverage mask following the early fragment tests, which this mode requires and implicitly enables. Note that unlike Vulkan and OpenGL, Metal places this on the sample mask input itself, and furthermore does not implicitly enable early fragment testing. If it isn't enabled explicitly with an `[[early_fragment_tests]]` attribute, the compiler will error out. So we have to enable that mode explicitly if `PostDepthCoverage` is enabled but `EarlyFragmentTests` isn't. For Metal, only iOS supports this; for some reason, Apple has yet to implement it on macOS, even though many desktop cards support it.	2019-07-11 10:28:43 -05:00
Chip Davis	058f1a0933	MSL: Handle coherent, volatile, and restrict. This maps them to their MSL equivalents. I've mapped `Coherent` to `volatile` since MSL doesn't have anything weaker than `volatile` but stronger than nothing. As part of this, I had to remove the implicit `volatile` added for atomic operation casts. If the buffer is already `coherent` or `volatile`, then we would add a second `volatile`, which would be redundant. I think this is OK even when the buffer doesn't have `coherent`: `T *` is implicitly convertible to `volatile T *`, but not vice-versa. It seems to compile OK at any rate. (Note that the non-`volatile` overloads of the atomic functions documented in the spec aren't present in the MSL 2.2 stdlib headers.) `restrict` is tricky, because in MSL, as in C++, it needs to go after the asterisk or ampersand for the pointer type it's modifying. Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory models, `Restrict` is the default (i.e. does not need to be specified); but MSL likely follows the `OpenCL` model where `Aliased` is the default. We probably need to implicitly set either `Restrict` or `Aliased` depending on the module's declared memory model.	2019-07-11 10:22:30 -05:00
Hans-Kristian Arntzen	1a592b7c0f	Merge pull request #1067 from cdavis5e/msl-scalar-block-layout MSL: Support scalar block layout.	2019-07-11 13:03:03 +02:00
Chip Davis	28454facbb	MSL: Handle packed matrices. The old method of using a different unpacked matrix type doesn't work for scalar alignment. It certainly wouldn't have any effect for a square matrix, since the number of columns and rows are the same. So now we'll store them as arrays of packed vectors.	2019-07-10 18:37:31 -05:00
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Chip Davis	e5fa7edfd6	MSL: Support scalar block layout. Relaxed block layout relaxed the restrictions on vector alignment, allowing them to be aligned on scalar boundaries. Scalar block layout relaxes this further, allowing any member to be aligned on a scalar boundary. The requirement that a vector not improperly straddle a 16-byte boundary is also relaxed. I've also added a test showing that `std430` layout works with UBOs. I'm troubled by the dual meaning of the `Packed` extended decoration. In some instances (struct, `float[]`, and `vec2[]` members), it actually means the exact opposite, that the member needs extra padding. This is especially problematic for `vec2[]`, because now we need to distinguish the two cases by checking the array stride. I wonder if this should actually be split into two decorations.	2019-07-09 20:59:32 -05:00
Hans-Kristian Arntzen	53ab2144b9	Merge pull request #1064 from KhronosGroup/fix-1062 Fall back to complex loop if non-trivial continue block is found.	2019-07-08 13:58:35 +02:00
Hans-Kristian Arntzen	50342966c0	Fall back to complex loop if non-trivial continue block is found. There is a case where we can deduce a for/while loop, but the continue block is actually very painful to deal with, so handle that case as well. Removes an exceptional case.	2019-07-08 11:54:29 +02:00
Hans-Kristian Arntzen	fa9af7223a	Add test shaders for NonUniformEXT propagation.	2019-07-08 11:32:09 +02:00
Hans-Kristian Arntzen	13378ad1ac	Add simple test for extended debug operations.	2019-07-05 10:44:30 +02:00
Hans-Kristian Arntzen	4056d0b74e	Don't use scalar dot().	2019-07-03 14:32:06 +02:00
Hans-Kristian Arntzen	041f103d44	MSL/HLSL: Support scalar reflect and refract.	2019-07-03 12:31:52 +02:00
Hans-Kristian Arntzen	fc9fe4e480	Fix variable scope when an if or else block dominates a variable. Just like loops, we need complicated hoisting again to make this work.	2019-07-03 11:18:50 +02:00
Chip Davis	31b6c93516	MSL: Support SubgroupLocalInvocationId and SubgroupSize in all stages. MSL prior to 2.2 doesn't support these natively in any stage but compute. But, we can (assuming no threads were terminated prematurely) get their values with some creative uses of the `simd_prefix_exclusive_sum()` and `simd_sum()` functions. Also, fix a missing `to_expression()` with `BuiltInSubgroupEqMask`. For KhronosGroup/MoltenVK#629.	2019-07-02 11:48:59 -05:00
Hans-Kristian Arntzen	f8b084de61	MSL/HLSL: Support OpOuterProduct.	2019-07-01 10:57:27 +02:00

739 Commits