SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	2d20b1ab93	Run format_all.sh.	2019-10-07 10:29:04 +02:00
Lukas Hermanns	f3a6d28a1d	Further updates for pull request #1162; also added two test cases for the spvCubemapTo2DArrayFace function and added the '--msl-framebuffer-fetch' and '--msl-emulate-cube-array' compiler options.	2019-09-27 15:49:54 -04:00
Lukas Hermanns	c3d6022956	Update for pull request #1162 rev. 1	2019-09-24 18:13:04 -04:00
Lukas Hermanns	7ad0a84778	Updates for pull request #1162	2019-09-24 14:35:25 -04:00
Lukas Hermanns	37df74035b	Merge branch 'ue4_dev'	2019-09-20 09:42:42 -04:00
Lukas Hermanns	9f9276f5ce	Fixed false-positive optimization of builtin variables (may happen when 'spvOut' is emitted).	2019-09-19 14:44:30 -04:00
Hans-Kristian Arntzen	3c11254ece	MSL: Fix 16-bit integer literals. There is no suffix, so bitcasts failed.	2019-09-19 10:19:51 +02:00
Lukas Hermanns	50ac6862ac	Rearranged all 'UE Change' comments to match the project's coding style.	2019-09-18 14:03:54 -04:00
Lukas Hermanns	137e9d6d98	Removed reference specifiers in 'spvFMul*' functions to avoid address specifiers.	2019-09-17 16:50:33 -04:00
Lukas Hermanns	51be601922	Avoid emitting 'spvUnsafeArray<>', 'spvFMul*', and 'spvFAdd' custom functions if they are not needed.	2019-09-17 15:10:39 -04:00
Lukas Hermanns	36eab88b23	Further adjustments to make Metal backend work again in UE4 on Mac.	2019-09-17 11:40:01 -04:00
Lukas Hermanns	7cf5d4f7a1	Added a new 'emulate_cube_array' option to SPIRV-Cross to cope with translating TextureCubeArray into texture2d_array for iOS where this type is not available. (Original Author: Mark Satterthwaite)	2019-09-13 17:24:27 -04:00
Lukas Hermanns	a9f3c981d9	Adjustments after rebase of ue4_dev branch.	2019-09-13 14:03:02 -04:00
Mark Satterthwaite	c4f9704af0	OpImageTexelPointer needs to use an int coordinate type for GLSL, but not for MSL.	2019-09-12 08:52:08 -04:00
Mark Satterthwaite	fdaf9b47bd	Remove obsolete memory barrier scope specification from Metal output, this API has been removed.	2019-09-12 08:35:28 -04:00
Mark Satterthwaite	69b703f1da	Add an option to SPIRV-Cross to enforce invariant floating point math to prevent different depth calculation between prepass & basepass when running on Metal 2.0 and earlier.	2019-09-12 08:35:15 -04:00
Mark Satterthwaite	e4c6388571	More fixes to handling packing & access elements in an array. Made in two parts. 1. Don't allow AccessChain operations to add duplicated swizzles when accessing packed arrays. 2. Only pack arrays when there is the proper amount of space between members in a struct, otherwise it will definitely be wrong.	2019-09-11 16:15:10 -04:00
Mark Satterthwaite	b491806b47	Fix texture swizzling.	2019-09-11 14:56:54 -04:00
Mark Satterthwaite	9e54a8dd7b	Slight modifications to IAB support for Metal output, so that the caller can specify an offset for the IAB start index, as for HLSL shaders UAVs need to occupy slots 0-7. The runtime support for SSBO robustness is also much simpler if the buffer size block is at index 0. Change made in two parts. 1. Allow the caller to specify the Metal translation should use argument buffers. 2. Move this to the front of IABs for convenience of the runtime.	2019-09-10 13:09:49 -04:00
Mark Satterthwaite	d9f3576305	Metal doesn't automatically enforce robust access to buffers unlike other APIs, so for storage-buffers, which become raw T* buffers in Metal, we need to fetch the buffer size and clamp the access to a valid index within the buffer ourselves. This is essential for shaders converted from HLSL which expects all resource access to be robust, though this implementation is technically different to the HLSL specification of return-0 for OOB reads, ignore OOB writes.	2019-09-10 12:32:32 -04:00
Mark Satterthwaite	0428faada3	HLSL makes position calculations invariant by default to eliminate problems with depth-precision, Apple added a similar qualifier for Metal 2.1 that can and should be used in Vertex & Domain/TessEval shaders for the same effect.	2019-09-10 11:47:40 -04:00
Mark Satterthwaite	9ce3158193	When compiling from HLSL which pads and aligns float[]/float2[] within structures to float4[] we need to unpack the original type in Metal from the float4.	2019-09-10 11:21:43 -04:00
Mark Satterthwaite	40a4456a54	Fix conversion of the SampleMask intrinsic from SPIRV, where it is an array to Metal where it isn't.	2019-09-10 10:46:42 -04:00
Mark Satterthwaite	42b8a62870	Fixes to the generation of Metal tessellation shaders from SPIRV so that it works correctly in more complicated cases. First, when generating from HLSL before invoking the code that comes from the HLSL patch-function a control-flow and full memory-barrier are required to ensure that all the temporary values in thread-local storage for the patch are available. Second, the inputs to control and evaluation shaders must be properly forwarded from the global variables in SPIRV to the member variables in the relevant input structure. Finally when arrays of interpolators are used for input or output we need to add an extra level of array indirection because Metal works at a different granularity than SPIRV. Five parts. 1. Fix tessellation patch function processing. 2. Fix loads from tessellation control inputs not being forwarded to the gl_in structure array. 3. Fix loads from tessellation evaluation inputs not being forwarded to the stage_in structure array. 4. Workaround SPIRV losing an array indirection in tessellation shaders - not the best solution but enough to keep things progressing. 5. Apparently gl_TessLevelInner/Outer is special and needs to not be placed into the input array.	2019-09-10 10:37:07 -04:00
Mark Satterthwaite	de6441af88	Work-around HLSL using zero-based InstanceID and VertexID variables, but SPIRV, like Metal, includes BaseInstance & BaseVertex. Until this can be fixed in DXC, which is really the proper place to solve this, we can decrement InstanceID & VertexID when the source is HLSL. Made in two parts. 1. Handle HLSL-style 0-based vertex/instance index. 2. We zero-base the InstanceID & VertexID variables for HLSL emulation elsewhere, so don't do it twice.	2019-09-09 16:55:59 -04:00
Mark Satterthwaite	97a66ff906	On iOS sub-passes can be implemented using the frame-buffer fetch API which is much more efficient than binding the textures. Change was made in three parts. 1. Use Metal's native frame-buffer fetch API for subpass inputs. 2. Make sure that frame-buffer-fetch is only available on iOS. 3. Default to using Metal's native frame-buffer fetch for subpass inputs on iOS.	2019-09-09 15:02:11 -04:00
Wade Brainerd	f2a1b4320f	MSL: Fix array copies to/from interpolators	2019-09-06 18:23:57 -07:00
Mark Satterthwaite	32557e9093	SPIRV doesn't distinguish depth textures from regular textures, but Metal does, so if we've ever seen a depth comparison operation we must ensure that the texture is specified as a depth-texture.	2019-09-06 16:58:27 -04:00
Hans-Kristian Arntzen	2082e7e801	Run format_all.sh.	2019-09-06 14:23:16 +02:00
Hans-Kristian Arntzen	333980ae91	Refactor into stronger types in public API. Some fallout where internal functions are using stronger types. Overkill to move everything over to strong types right now, but perhaps move over to it slowly over time.	2019-09-06 12:29:47 +02:00
Hans-Kristian Arntzen	1935f1a8e3	Fix some issues on certain compilers.	2019-09-06 10:11:18 +02:00
Chip Davis	cb35934248	MSL: Support dynamic offsets for buffers in argument buffers. Vulkan has two types of buffer descriptors, `VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC` and `VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC`, which allow the client to offset the buffers by an amount given when the descriptor set is bound to a pipeline. Metal provides no direct support for this when the buffer in question is in an argument buffer, so once again we're on our own. These offsets cannot be stored or associated in any way with the argument buffer itself, because they are set at bind time. Different pipelines may have different offsets set. Therefore, we must use a separate buffer, not in any argument buffer, to hold these offsets. Then the shader must manually offset the buffer pointer. This change fully supports arrays, including arrays of arrays, even though Vulkan forbids them. It does not, however, support runtime arrays. Perhaps later.	2019-09-05 23:29:00 -05:00
Mark Satterthwaite	5e8590a23d	Emulate texture atomics in Metal by binding the underlying buffer that backs the resource to a separate binding point and using that for Metal's atomic operations. This will work with texture_buffer and texture2d created from an MTLBuffer, so is perfect for emulating HLSL atomics on RWBuffer and sufficient, but not ideal, for RWTexture2D with some restrictions (limited format support and can't be used for render-targets).	2019-09-05 15:13:28 -04:00
Mark Satterthwaite	239e04762b	Support Metal 2.1's texture_buffer type which is the equivalent to HLSL's Buffer/RWBuffer, so doesn't require modifying buffer sizes to match alignments.	2019-09-05 14:46:15 -04:00
Mark Satterthwaite	8596bf5ee2	In order to use Metal shader libraries properly you have to ensure that you have no duplicated global symbol names for different entities, otherwise 'metallib' won't be able to combine multiple shaders into a single library. This is broken into two parts. 1. Constant arrays of non-primitive types (i.e. matrices) won't link properly into Metal libraries. 2. Metal helper functions must be static force-inline otherwise they will cause problems when linked together in a single Metallib.	2019-09-05 14:39:06 -04:00
Mark Satterthwaite	d50659af92	Rework the way arrays are handled in Metal to remove the array copies as they are unnecessary from Metal 1.2. There were cases where copies were not being inserted and others appeared unnecessary; using the template type should allow the 'metal' compiler to do the best possible optimisation. The changes are broken into seven stages. 1. Allow Metal to use the array<T> template to make arrays a value type. 2. Force the use of C style array declaration for some cases which cannot be wrapped with a template. 3. Threadgroup arrays can't have a wrapper type. 4. Tweak the code to use unsafe_array in a few more places so that we can handle passing arrays of resources into the shader and then through shaders into sub-functions. 5. Handle packed matrix types inside arrays within structs. 6. Make sure that builtin arguments still retain their array qualifiers when used in leaf functions. 7. Fix declaration of array-of-array constants for Metal so we can use the array<T> template.	2019-09-05 12:39:44 -04:00
Chip Davis	103817009c	MSL: Force storage images on iOS to use discrete descriptors. Writable textures cannot use argument buffers on iOS. They must be passed as arguments directly to the shader function. Since we won't know if a given storage image will have the `NonWritable` decoration at the time we encode the argument buffer, we must therefore pass all storage images as discrete arguments. Previously, we were throwing an error if we encountered an argument buffer with a writable texture in it on iOS.	2019-09-05 11:01:05 -05:00
Hans-Kristian Arntzen	261b46982a	Deal with complex interlock cases in GLSL.	2019-09-04 12:18:04 +02:00
Chip Davis	2eff420d9a	Support the SPV_EXT_fragment_shader_interlock extension. This was straightforward to implement in GLSL. The `ShadingRateInterlockOrderedEXT` and `ShadingRateInterlockUnorderedEXT` modes aren't implemented yet, because we don't support `SPV_NV_shading_rate` or `SPV_EXT_fragment_invocation_density` yet. HLSL and MSL were more interesting. They don't support this directly, but they do support marking resources as "rasterizer ordered," which does roughly the same thing. So this implementation scans all accesses inside the critical section and marks all storage resources found therein as rasterizer ordered. They also don't support the fine-grained controls on pixel- vs. sample-level interlock and disabling ordering guarantees that GLSL and SPIR-V do, but that's OK. "Unordered" here merely means the order is undefined; that it just so happens to be the same as rasterizer order is immaterial. As for pixel- vs. sample-level interlock, Vulkan explicitly states: "With sample shading enabled, [the `PixelInterlockOrderedEXT` and `PixelInterlockUnorderedEXT`] execution modes are treated like `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT` respectively." and: "If [the `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT`] execution modes are used in single-sample mode they are treated like `PixelInterlockOrderedEXT` or `PixelInterlockUnorderedEXT` respectively." So this will DTRT for MoltenVK and gfx-rs, at least. MSL additionally supports multiple raster order groups; resources that are not accessed together can be placed in different ROGs to allow them to be synchronized separately. A more sophisticated analysis might be able to place resources optimally, but that's outside the scope of this change. For now, we assign all resources to group 0, which should do for our purposes. `glslang` doesn't support the `RasterizerOrdered` UAVs this implementation produces for HLSL, so the test case needs `fxc.exe`. It also insists on GLSL 4.50 for `GL_ARB_fragment_shader_interlock`, even though the spec says it needs either 4.20 or `GL_ARB_shader_image_load_store`; and it doesn't support the `GL_NV_fragment_shader_interlock` extension at all. So I haven't been able to test those code paths. Fixes #1002.	2019-09-02 12:31:10 -05:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Hans-Kristian Arntzen	9b845a4788	Merge pull request #1141 from troughton/inline-everything MSL: Inline all non-entry-point functions	2019-08-30 11:05:04 +02:00
Thomas Roughton	6b5403206e	Clang-format changes	2019-08-30 20:25:40 +12:00
Hans-Kristian Arntzen	07c76f66b5	MSL: Add {Base,}{Vertex,Instance}Index to bitcast_from_builtin_load. Totally missed these, so float(index) would not work correctly for negative numbers.	2019-08-29 13:56:37 +02:00
Thomas Roughton	e5f9e2c203	Inline all non-entry-point functions	2019-08-29 17:07:57 +12:00
Thomas Roughton	6338f0aa0f	MSL: inline all emitted functions # Conflicts: # spirv_msl.cpp	2019-08-29 17:07:27 +12:00
Hans-Kristian Arntzen	3ccfbce264	Run format_all.sh.	2019-08-28 14:25:26 +02:00
Hans-Kristian Arntzen	9436cd3036	MSL: Deal with array copies from and to threadgroup.	2019-08-27 13:18:01 +02:00
Hans-Kristian Arntzen	b3305799a8	Deal correctly with sign on bitfield operations. Need a lot of special purpose implementation functions for these.	2019-08-26 11:36:36 +02:00
Hans-Kristian Arntzen	ffca8735ff	Merge pull request #1105 from cdavis5e/msl-unify-as MSL: Unify the get_*_address_space() methods.	2019-07-29 10:19:12 +02:00
Chip Davis	df18d98bea	MSL: Unify the get_*_address_space() methods. These methods have largely the same logic, with minor differences. That I felt compelled to duplicate the logic into another method was one of the things that bothered me about the variable pointers change. This cleans that part of the code up; now we don't have two places to change.	2019-07-26 09:43:28 -05:00
Hans-Kristian Arntzen	d378413040	Merge pull request #1103 from KhronosGroup/fix-1100 MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 14:35:18 +02:00
Hans-Kristian Arntzen	c3e8e728d8	MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 11:16:43 +02:00
Hans-Kristian Arntzen	abb345d0b3	MSL: Deal with Modf/Frexp where output is access chain to scalar. This is not allowed as we cannot take mutable reference to a vec.{x,y,z,w}. We only care about scalar since entire vectors are fine.	2019-07-26 11:02:38 +02:00
Hans-Kristian Arntzen	3c03b55c46	Workaround MSVC 2013 compiler issues.	2019-07-25 10:28:11 +02:00
Chip Davis	fb5ee4cb5c	MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase(). This command allows the caller to set the base value of `BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal provides no direct support for this... but it does provide a builtin, `[[grid_origin]]`, normally used to pass the base values for the stage input region, which we will now abuse to pass the dispatch base and avoid burning a buffer binding. `[[grid_origin]]`, as part of Metal's support for compute stage input, requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer. (Curiously, this builtin was undocumented until the MSL 2.2 release. Go figure.)	2019-07-24 08:56:15 -05:00
Hans-Kristian Arntzen	c62503bca7	Do not attempt to pack types which are already scalar.	2019-07-24 11:52:28 +02:00
Hans-Kristian Arntzen	646e04294a	Fix some warnings when building in MoltenVK.	2019-07-23 16:39:13 +02:00
Hans-Kristian Arntzen	5c1cb7accf	Recursively pack struct types when we find scalar packed structs.	2019-07-23 15:24:53 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	609d087f8f	Only transpose unpacked expressions.	2019-07-22 16:06:09 +02:00
Hans-Kristian Arntzen	6057ffcbb1	Deal correctly with complete stores to row_major matrices.	2019-07-22 15:49:17 +02:00
Hans-Kristian Arntzen	19f5cd3e90	Declare correct matrix type when unpacking.	2019-07-22 13:25:45 +02:00
Hans-Kristian Arntzen	f2d6a77c95	Don't forget to register a write to LHS expression in certain case.	2019-07-22 13:06:30 +02:00
Hans-Kristian Arntzen	745a2f7b0e	Deal with swizzled stores to std140 matrices.	2019-07-22 13:05:23 +02:00
Hans-Kristian Arntzen	180a6b38c5	Fix some row-major column store cases.	2019-07-22 12:56:14 +02:00
Hans-Kristian Arntzen	4ab2829cf6	Fix more stray parens.	2019-07-22 12:13:07 +02:00
Hans-Kristian Arntzen	d6004bfc97	Fixup stray paren in output.	2019-07-22 12:08:56 +02:00
Hans-Kristian Arntzen	14afb968dd	Correctly unpack row-major matrices when storing to LHS.	2019-07-22 12:03:12 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	b66a53a979	Traverse correct types when checking scalar layout.	2019-07-19 14:43:42 +02:00
Hans-Kristian Arntzen	e90d816cdd	Deal with scalar layout of entire structs. Mark all candidate struct types.	2019-07-19 14:18:14 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	27b75c2c5a	Deal with all forms of matrix writes ...	2019-07-19 12:53:10 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	dd7ebaf9f7	Start considering how to emit physical type ID.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	b09b8d3fa9	Deal more cleanly with matrices and row-major.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	c160d5227f	Reintroduce struct_member_* MSL queries. Need to remap to physical type + packed qualifier, and this is handy to do in a helper function.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Hans-Kristian Arntzen	c7eda1bce9	Test glsl.std450 more exhaustively. Make sure to test everything with scalar as well to catch any weird edge cases. Not all opcodes are covered here, just the arithmetic ones. FP64 packing is also ignored.	2019-07-17 11:53:05 +02:00
Chip Davis	bc646574a6	MSL: Support the SPV_INTEL_shader_integer_functions2 extension. This provides a few functions normally available in OpenCL to the SPIR-V shader environment. These functions happen to be available in Metal as well. No GLSL, unfortunately. Intel has yet to publish a `GL_INTEL_shader_integer_functions2` spec.	2019-07-15 09:42:36 -05:00
Hans-Kristian Arntzen	33d2bbcf69	Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:31 +02:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	ca91fcfe5f	MSL: Support the SPV_AMD_shader_trinary_minmax extension. This requires MSL 2.1.	2019-07-13 16:43:57 -05:00
Hans-Kristian Arntzen	92e5255570	Run format_all.sh.	2019-07-12 10:59:53 +02:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	19ebbd48c7	Merge pull request #1077 from cdavis5e/msl-spirv-qualifiers MSL: Handle coherent, volatile, and restrict.	2019-07-12 10:03:06 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00
Chip Davis	1df47db6ba	Support the SPV_KHR_post_depth_coverage extension. Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn` variable is to contain the computed coverage mask following the early fragment tests, which this mode requires and implicitly enables. Note that unlike Vulkan and OpenGL, Metal places this on the sample mask input itself, and furthermore does not implicitly enable early fragment testing. If it isn't enabled explicitly with an `[[early_fragment_tests]]` attribute, the compiler will error out. So we have to enable that mode explicitly if `PostDepthCoverage` is enabled but `EarlyFragmentTests` isn't. For Metal, only iOS supports this; for some reason, Apple has yet to implement it on macOS, even though many desktop cards support it.	2019-07-11 10:28:43 -05:00
Chip Davis	058f1a0933	MSL: Handle coherent, volatile, and restrict. This maps them to their MSL equivalents. I've mapped `Coherent` to `volatile` since MSL doesn't have anything weaker than `volatile` but stronger than nothing. As part of this, I had to remove the implicit `volatile` added for atomic operation casts. If the buffer is already `coherent` or `volatile`, then we would add a second `volatile`, which would be redundant. I think this is OK even when the buffer doesn't have `coherent`: `T *` is implicitly convertible to `volatile T *`, but not vice-versa. It seems to compile OK at any rate. (Note that the non-`volatile` overloads of the atomic functions documented in the spec aren't present in the MSL 2.2 stdlib headers.) `restrict` is tricky, because in MSL, as in C++, it needs to go after the asterisk or ampersand for the pointer type it's modifying. Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory models, `Restrict` is the default (i.e. does not need to be specified); but MSL likely follows the `OpenCL` model where `Aliased` is the default. We probably need to implicitly set either `Restrict` or `Aliased` depending on the module's declared memory model.	2019-07-11 10:22:30 -05:00
Hans-Kristian Arntzen	1a592b7c0f	Merge pull request #1067 from cdavis5e/msl-scalar-block-layout MSL: Support scalar block layout.	2019-07-11 13:03:03 +02:00
Chip Davis	28454facbb	MSL: Handle packed matrices. The old method of using a different unpacked matrix type doesn't work for scalar alignment. It certainly wouldn't have any effect for a square matrix, since the number of columns and rows are the same. So now we'll store them as arrays of packed vectors.	2019-07-10 18:37:31 -05:00
Chip Davis	ea5c0ed82f	MSL: Fix alignment of packed types. Packed types have scalar alignment.	2019-07-10 11:57:04 -05:00
Hans-Kristian Arntzen	6b010e0cbc	Merge pull request #1069 from KhronosGroup/fix-1053 MSL: Re-roll array expressions in initializers.	2019-07-10 12:15:12 +02:00
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Chip Davis	e5fa7edfd6	MSL: Support scalar block layout. Relaxed block layout relaxed the restrictions on vector alignment, allowing them to be aligned on scalar boundaries. Scalar block layout relaxes this further, allowing any member to be aligned on a scalar boundary. The requirement that a vector not improperly straddle a 16-byte boundary is also relaxed. I've also added a test showing that `std430` layout works with UBOs. I'm troubled by the dual meaning of the `Packed` extended decoration. In some instances (struct, `float[]`, and `vec2[]` members), it actually means the exact opposite, that the member needs extra padding. This is especially problematic for `vec2[]`, because now we need to distinguish the two cases by checking the array stride. I wonder if this should actually be split into two decorations.	2019-07-09 20:59:32 -05:00
Hans-Kristian Arntzen	909040e2eb	MSVC 2013: Work around another compiler bug with array init.	2019-07-09 15:31:01 +02:00
Hans-Kristian Arntzen	4056d0b74e	Don't use scalar dot().	2019-07-03 14:32:06 +02:00
Hans-Kristian Arntzen	041f103d44	MSL/HLSL: Support scalar reflect and refract.	2019-07-03 12:31:52 +02:00
Chip Davis	31b6c93516	MSL: Support SubgroupLocalInvocationId and SubgroupSize in all stages. MSL prior to 2.2 doesn't support these natively in any stage but compute. But, we can (assuming no threads were terminated prematurely) get their values with some creative uses of the `simd_prefix_exclusive_sum()` and `simd_sum()` functions. Also, fix a missing `to_expression()` with `BuiltInSubgroupEqMask`. For KhronosGroup/MoltenVK#629.	2019-07-02 11:48:59 -05:00
Hans-Kristian Arntzen	f8b084de61	MSL/HLSL: Support OpOuterProduct.	2019-07-01 10:57:27 +02:00
Chip Davis	7eecf5a46b	MSL: Support SPV_KHR_multiview. This is needed to support `VK_KHR_multiview`, which is in turn needed for Vulkan 1.1 support. Unfortunately, Metal provides no native support for this, and Apple is once again less than forthcoming, so we have to implement it all ourselves. Tessellation and geometry shaders are deliberately unsupported for now. The problem is that the current implementation encodes the `ViewIndex` as part of the `InstanceIndex`, which in the SPIR-V environment at least only exists in the vertex shader. So we need to work out a way to pass the view index along to the later stages. This implementation runs vertex shaders for all views up to the highest bit set in the view mask, even those whose bits are clear. The fragments for the inactive views are then discarded. Avoiding this is difficult: calculating the view indices becomes far more complicated if we can only run for those views which are set in the mask.	2019-06-29 09:43:55 -05:00
Hans-Kristian Arntzen	ff87419607	Deal with scalar input values for distance/length/normalize. HLSL and MSL don't support it, so fall back to simpler intrinsics.	2019-06-28 11:20:14 +02:00
Hans-Kristian Arntzen	1543bdaf7b	Run format_all.sh.	2019-06-27 15:10:59 +02:00
Hans-Kristian Arntzen	c76b99b711	Handle more cases with FP16 and texture sampling.	2019-06-27 15:04:22 +02:00
Hans-Kristian Arntzen	45805857e5	MSL: De-virtualize get_declared_struct_member_size. It does not make sense to use a virtual call in the Compiler base class here. Make it clearer by renaming the MSL-specific version to _msl.	2019-06-26 19:11:38 +02:00
Hans-Kristian Arntzen	02b2a1015d	MSL: Fix minor XCode /analyze warning. Written variable, but never read.	2019-06-26 16:10:58 +02:00
Hans-Kristian Arntzen	8f6939cb0d	Merge pull request #1041 from KhronosGroup/fix-1011 MSL: Add support for SubgroupSize / SubgroupInvocationID in fragment.	2019-06-26 15:01:13 +02:00
Hans-Kristian Arntzen	ab3798fd91	MSL: Add support for SubgroupSize / SubgroupInvocationID in fragment.	2019-06-24 12:31:54 +02:00
Hans-Kristian Arntzen	048f2380f3	MSL: Support custom bindings for argument buffer itself.	2019-06-24 11:10:20 +02:00
Hans-Kristian Arntzen	b4e0163749	Run format_all.sh.	2019-06-21 16:02:22 +02:00
Hans-Kristian Arntzen	3a4a9acac9	MSL: Add C API for querying automatic resource bindings.	2019-06-21 13:19:59 +02:00
Hans-Kristian Arntzen	e2c95bdcbc	MSL: Rewrite how resource indices are fallback-assigned. We used to use the Binding decoration for this, but this method is hopelessly broken. If no explicit MSL resource remapping exists, we remap automatically in a manner which should always "just work".	2019-06-21 12:54:08 +02:00
Hans-Kristian Arntzen	a1f7c8dc8e	Merge pull request #1031 from KhronosGroup/fix-1009 MSL: Support 64-bit integers.	2019-06-19 15:29:27 +02:00
Hans-Kristian Arntzen	7fdb418f18	Merge pull request #1028 from KhronosGroup/fix-1010 MSL: Support barycentrics and PrimitiveID in fragment shaders	2019-06-19 15:29:14 +02:00
Hans-Kristian Arntzen	4c20c941f0	Merge pull request #1025 from KhronosGroup/fix-1013 MSL: Support OpImageQueryLod.	2019-06-19 14:07:39 +02:00
Hans-Kristian Arntzen	a6798d06a2	MSL: Error out on int64_t/uint64_t buffer members. Not supported for whatever reason.	2019-06-19 10:14:46 +02:00
Hans-Kristian Arntzen	a6b71ae999	MSL: Support 64-bit integers.	2019-06-19 09:55:00 +02:00
Hans-Kristian Arntzen	2e1cee5e1e	MSL: Support PrimitiveID in fragment and barycentrics.	2019-06-19 09:52:35 +02:00
Hans-Kristian Arntzen	0671b3c35b	MSL: Support OpImageQueryLod. Correctness is a bit unclear at the moment. The spec document for 2.2 is not updated for query-lod, but this is the best we can do anyways.	2019-06-19 09:51:56 +02:00
Hans-Kristian Arntzen	f171d82590	MSL: Support MinLod operand.	2019-06-19 09:43:03 +02:00
Hans-Kristian Arntzen	95053ea4bc	Merge pull request #1024 from KhronosGroup/fix-1016 GLSL/MSL: Support stencil export	2019-06-12 12:48:10 +02:00
Hans-Kristian Arntzen	14d0a1eb0c	MSL: Support stencil export.	2019-06-12 10:21:20 +02:00
Hans-Kristian Arntzen	a7b2ba28a0	MSL: Support Invariant qualifier on position.	2019-06-12 09:39:12 +02:00
Hans-Kristian Arntzen	30bb197a5d	MSL: Support remapping constexpr samplers by set/binding. Older API was oriented around IDs which are not available unless you're doing full reflection, which is awkward for certain use cases which know their set/bindings up front. Optimize resource bindings to use a hashmap rather than doing linear seeks all the time.	2019-06-10 15:41:36 +02:00
Hans-Kristian Arntzen	314efdcc42	MSL: Fix declaration of unused input variables. In multiple-entry-point modules, we declared builtin inputs which were not supposed to be used for that entry point. Fix this, by being more strict when checking which builtins to emit.	2019-05-31 13:23:34 +02:00
Hans-Kristian Arntzen	b3094cd02a	Run format_all.sh.	2019-05-27 16:54:13 +02:00
Hans-Kristian Arntzen	fd0feb1ec1	MSL: Use correct address space when passing array-of-buffers. Need to check if the descriptor set is actually an argument buffer.	2019-05-27 16:53:30 +02:00
Hans-Kristian Arntzen	7b9e0fb428	MSL: Implement OpArrayLength. This gets rather complicated because MSL does not support OpArrayLength natively. We need to pass down a buffer which contains buffer sizes, and we compute the array length on-demand. Support both discrete descriptors as well as argument buffers.	2019-05-27 16:13:09 +02:00
Hans-Kristian Arntzen	96492648d4	MSL: Fix struct declaration order with complex type aliases. MSL generally emits the aliases, which means we cannot always place the master type first, unlike GLSL and HLSL. The logic fix is just to reorder after we have tagged types with packing information, rather than doing it in the parser fixup.	2019-05-23 14:54:04 +02:00
Hans-Kristian Arntzen	eaf7afed97	MSL: Support argument buffers and image swizzling. Change aux buffer to swizzle buffer. There is no good reason to expand the aux buffer, so name it appropriately. Make the code cleaner by emitting a straight pointer to uint rather than a dummy struct which only contains a single unsized array member anyways. This will also end up being very similar to how we implement swizzle buffers for argument buffers. Do not use implied binding if it overflows int32_t.	2019-05-18 10:30:06 +02:00
Chip Davis	8983920edf	Remove fallback for OpGroupNonUniformElect. It's not safe to enable subgroup support without this actually working correctly.	2019-05-16 13:42:09 -05:00
Chip Davis	9d9415754b	MSL: Add support for subgroup operations. Some support for subgroups is present starting in Metal 2.0 on both iOS and macOS. macOS gains more complete support in 10.14 (Metal 2.1). Some restrictions are present. On iOS and on macOS 10.13, the implementation of `OpGroupNonUniformElect` is incorrect: if thread 0 has already terminated or is not executing a conditional branch, the first thread that is will falsely believe itself not to be. Unfortunately, this operation is part of the "basic" feature set; without it, subgroups cannot be supported at all. The `SubgroupSize` and `SubgroupLocalInvocationId` builtins are only available in compute shaders (and, by extension, tessellation control shaders), despite SPIR-V making them available in all stages. This limits the usefulness of some of the subgroup operations in fragment shaders. Although Metal on macOS supports some clustered, inclusive, and exclusive operations, it does not support them all. In particular, inclusive and exclusive min, max, and, or, and xor; as well as cluster sizes other than 4 are not supported. If this becomes a problem, they could be emulated, but at a significant performance cost due to the need for non-uniform operations.	2019-05-15 17:40:04 -05:00
Hans-Kristian Arntzen	647ddaee42	HLSL/MSL: Deal correctly with nonuniformEXT qualifier. MSL does not seem to have a qualifier for this, but HLSL SM 5.1 does. glslangValidator for HLSL does not support this, so skip any validation, but it passes in FXC.	2019-05-13 14:58:27 +02:00
Hans-Kristian Arntzen	ad95173a48	Fix GCC 4.x warning.	2019-05-09 12:28:34 +02:00
Hans-Kristian Arntzen	97d39dc9d5	MSL: Deal with texture swizzle on arrays of images.	2019-05-09 11:25:45 +02:00
Hans-Kristian Arntzen	2cc374a0c8	GLSL: Implement GL_EXT_buffer_reference. Buffer objects can contain arbitrary pointers to blocks. We can also implement ConvertPtrToU and ConvertUToPtr. The latter can cast a uint64_t to any type as it pleases, so we will need to generate fake buffer reference blocks to be able to cast the type.	2019-04-26 11:43:51 +02:00
Hans-Kristian Arntzen	c2715c3908	MSL: Cast texture_buffer index to uint.	2019-04-23 12:46:48 +02:00
Hans-Kristian Arntzen	de1148b8ba	Run format_all.sh.	2019-04-23 12:21:53 +02:00
Hans-Kristian Arntzen	fc4f39b11f	MSL: Support native texture_buffer type, throw error on atomics. Atomics are not supported on images or texture_buffers in MSL. Properly throw an error if OpImageTexelPointer is used (since it can only be used for atomic operations anyways).	2019-04-23 12:21:43 +02:00
Michael Barriault	82b4ad8a30	Correct formatting.	2019-04-16 19:13:57 +01:00
Michael Barriault	105bfd368a	Only use MSL constant address space for tessellation control shader.	2019-04-16 17:56:02 +01:00
Michael Barriault	16911c5a4d	Merge remote-tracking branch 'origin/master' * origin/master: Support running {,update_}test_shader.sh with CMake builds. Don't apply vertex attribute remapping other non-vertex or non-input interface blocks. Force complex loop in certain rare access chain scenarios. Fix guard around [[noreturn]]. Deal with mismatched signs in S/U/F conversion opcodes. Workaround lack of lvalue/rvalue operator overload on MSVC 2013. Support direct conversions to std::vector from SmallVector. Fix some minor copy constructor issues in Variant. Make sure ids_for_types are moved correctly in move operator. Run format_all.sh. Refactor out error handling and containers to new headers. Do not use SmallVector as input type in public interfaces. Fix various bugs found in testing. Explicitly implement move operators for ParsedIR. Try another MSVC 2013 workaround. Implement edge cases in insert/end and add a simple test case. Fix GCC 4.x warnings. Workaround lack of alignas on MSVC 2013. Reduce pressure on global allocation. CLI: Make --iterations more useful.	2019-04-13 18:06:29 +01:00
Michael Barriault	ca7df787b3	Use constant address space for SPIR-V parameters when generating tessellation control shader.	2019-04-09 19:41:31 +01:00
Hans-Kristian Arntzen	3fe57d3798	Do not use SmallVector as input type in public interfaces. This is an API break, which we need to be careful with. Handing out SmallVectors is easier since the interface is basically the same.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	a489ba7fd1	Reduce pressure on global allocation. - Replace ostringstream with custom implementation. ~30% performance uplift on vector-shuffle-oom test. Allocations are measurably reduced in Valgrind. - Replace std::vector with SmallVector. Classic malloc optimization, small vectors are backed by inline data. ~ 7-8% gain on vector-shuffle-oom on GCC 8 on Linux. - Use an object pool for IVariant type. We generally allocate a lot of SPIR* objects. We can amortize these allocations neatly by pooling them. - ~15% overall uplift on ./test_shaders.py --iterations 10000 shaders/.	2019-04-09 15:09:44 +02:00
Hans-Kristian Arntzen	23db744e35	Deal with case where we need to emit SpvImplArrayCopy late. We cannot deduce if OpLoad needs ArrayCopy templates early since it's heavily context dependent, and we might only know on 3rd iteration of the compile loop.	2019-04-09 12:28:46 +02:00
Bill Hollings	efbe7ca16f	MSL: Fix infinite CAS loop on atomic_compare_exchange_weak_explicit().	2019-04-05 21:28:57 -04:00
Hans-Kristian Arntzen	317144a59c	Detect invalid DoWhileLoop early. We had a bug where error conditions in DoWhileLoop emit path would not detect that statements were being emitted due to the masking behavior which happens when force_recompile is true. Fix this. Also, refactor force_recompile into member functions so we can properly break on any situation where this is set, without having to rely on watchpoints in debuggers.	2019-04-05 12:19:32 +02:00
Hans-Kristian Arntzen	9b92e68d71	Add an option to override the namespace used for spirv_cross. This is a pragmatic trick to avoid symbol collision where a project links against SPIRV-Cross statically, while linking to other projects which also use SPIRV-Cross statically. We can end up with very awkward symbol collisions which can resolve themselves silently because SPIRV-Cross is pulled in as necessary. To fix this, we must use different symbols and embed two copies of SPIRV-Cross in this scenario, now with different namespaces, which in turn leads to different symbols.	2019-03-29 10:29:44 +01:00
Bill Hollings	c48702d8c2	Fix crash when backend.int16_t_literal_suffix set to null. The design of backend.int16_t_literal_suffix and backend.uint16_t_literal_suffix allows them to be set to null, but that was not always tested for. I have removed the expectation that they can be null and set backend.int16_t_literal_suffix to "" when no suffix is needed. That has the same effect, and seemed to be a more usable and defensive approach.	2019-03-28 14:23:32 -04:00
Hans-Kristian Arntzen	18d4f67a87	Merge pull request #919 from KhronosGroup/fix-915 MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]].	2019-03-28 14:00:49 +01:00
Hans-Kristian Arntzen	0909975655	MSL: Declare gl_WorkGroupSize constant with [[maybe_unused]]. Avoids ugly warnings on nearly every compute shader. We could do analysis to detect whether we need to emit this constant, but it's a bit tedious to figure out if an OpConstantComponent is actually used by opcodes, so just make it simple.	2019-03-28 10:54:18 +01:00
Hans-Kristian Arntzen	c37f88fea6	MSL: Fix crash where variable storage buffer pointers are passed down. Only deal with readonly decoration for actual block types.	2019-03-28 10:16:46 +01:00
Hans-Kristian Arntzen	eeb3f24991	Properly deal with sign-dependent GLSL opcodes. The GLSLstd450 spec is very lax about input signs, so we need to do the bitcasting dance to implement it correctly.	2019-03-27 12:20:53 +01:00
Hans-Kristian Arntzen	e2aadf8995	Rename "push descriptor set" to "discrete descriptor set". Check for case where iOS doesn't support writable argument buffer textures.	2019-03-15 21:53:21 +01:00
Hans-Kristian Arntzen	b3380ec9dd	MSL: Support VK_KHR_push_descriptor. If we have argument buffers, we also need to support using plain descriptor sets for certain cases where API wants it.	2019-03-15 14:08:47 +01:00
Hans-Kristian Arntzen	c310b40fd3	MSL: Make sure get_buffer_block_flags is only used in right context.	2019-03-15 12:27:54 +01:00
Hans-Kristian Arntzen	bc21ccb7ce	MSL: Emit correct SSBO constness for argument buffers.	2019-03-15 12:05:35 +01:00
Hans-Kristian Arntzen	969566aff5	MSL: Fixup buffer array case issue on MSL 1.0.	2019-03-15 11:37:34 +01:00
Hans-Kristian Arntzen	af8a9ccdcb	MSL: Need to emit two layers of address space. When passing down arrays of buffer pointers, the array itself needs an address space.	2019-03-15 11:29:17 +01:00
Hans-Kristian Arntzen	e47a77d596	MSL: Implement Metal 2.0 indirect argument buffers.	2019-03-15 11:01:27 +01:00
Hans-Kristian Arntzen	e74c21a39b	Review fixups.	2019-03-04 10:08:31 +01:00
Hans-Kristian Arntzen	9bbdccddb7	Add a stable C API for SPIRV-Cross. This adds a new C API for SPIRV-Cross which is intended to be stable, both API and ABI wise. The C++ API has been refactored a bit to make the C wrapper easier and cleaner to write. Especially the vertex attribute / resource interfaces for MSL has been rewritten to avoid taking mutable pointers into the interface. This would be very annoying to wrap and it didn't fit well with the rest of the C++ API to begin with. While doing this, I went ahead and removed all the old deprecated interfaces. The CMake build system has also seen an overhaul. It is now possible to build static/shared/CLI separately with -D options. The shared library only exposes the C API, as it is the only ABI-stable API. pkg-configs as well as CMake modules are exported and installed for the shared library configuration.	2019-03-01 11:53:51 +01:00
Hans-Kristian Arntzen	825ff4af7e	Replace locale handling. We were using std::locale::global() to force a C locale which is not safe when SPIRV-Cross is used in a multi-threaded environment. To fix this, we could tap into various per-platform specific locale handling to get safe thread-local locales, but since locales only affect the decimal point in floats, we simply query the locale instead and do the necessary radix replacement ourselves, without touching the locale. This should be much safer and cleaner than the alternative.	2019-02-28 11:28:31 +01:00
Hans-Kristian Arntzen	ee395afa83	MSL: Emit proper name for optimized UBO/SSBO arrays.	2019-02-25 11:09:00 +01:00
Hans-Kristian Arntzen	ad6134262e	Merge pull request #877 from cdavis5e/msl-tesc-early-return MSL: Return early from helper tesc invocations.	2019-02-25 09:13:06 +01:00
Hans-Kristian Arntzen	7874f7fc49	Merge pull request #876 from cdavis5e/msl-tese-fixup-2 MSL: Make sure we fix up the output position.	2019-02-25 09:12:47 +01:00
Chip Davis	a43dcd7b99	MSL: Return early from helper tesc invocations. Return after loading the input control point array if there are more input points than output points, and this was one of the helper invocations spun off to load the input points. I was hesitant to do this initially, since the MSL spec has this to say about barriers: "The `threadgroup_barrier` (or `simdgroup_barrier`) function must be encountered by all threads in a threadgroup (or SIMD-group) executing the kernel." That is, if any thread executes the barrier, then all threads must execute it, or the barrier'd invocations will hang. But, the key words here seem to be "executing the kernel"; inactive invocations, those that have already returned, need not encounter the barrier to prevent hangs. Indeed, I've encountered no problems from doing this, at least on my hardware. This also fixes a few CTS tests that were failing due to execution ordering; apparently, my assumption that the later, invalid data written by the helpers would get overwritten was wrong.	2019-02-24 12:17:47 -06:00
Chip Davis	f3267db1d8	MSL: Make sure we fix up the output position. If a stage takes the position as both an input and an output (i.e. a tessellation shader or a geometry shader), then we could wind up fixing up the input position by mistake. Ensure that doesn't happen, by only setting the `qual_pos_var_name` variable from the output position.	2019-02-22 15:28:28 -06:00
Chip Davis	f3c0942d10	MSL: Use vectors for the tessellation level builtins in tese shaders. The tessellation levels in Metal are stored as a densely-packed array of half-precision floating point values. But, stage-in attributes in Metal have to have offsets and strides aligned to a multiple of four, so we can't add them individually. Luckily for us, the arrays have lengths less than 4. So, let's use vectors for them! Triangles get a single attribute with a `float4`, where the outer levels are in `.xyz` and the inner levels are in `.w`. The arrays are unpacked as though we had added the elements individually. Quads get two: a `float4` with the outer levels and a `float2` with the inner levels. Further, since vectors can be indexed as arrays, there's no need to unpack them in this case. This also saves on precious vertex attributes. Before, we were using up to 6 of them. Now we need two at most.	2019-02-22 12:18:51 -06:00
Hans-Kristian Arntzen	a4ac27546a	MSL: Fix textures which are sampled and compared against. depth2d in MSL only returns float, not float4, even for normal sampling. We need to conditionally remap-swizzle back to float4.	2019-02-22 12:27:40 +01:00
Chip Davis	dae4a88b06	MSL: Don't do the fixup at all when capturing output.	2019-02-21 17:05:37 -06:00
Chip Davis	b34fd63c2d	MSL: Do position fixup for tessellation evaluation shaders, too.	2019-02-21 16:57:56 -06:00
Chip Davis	7042cb9bec	Quiesce truncation warnings.	2019-02-21 15:11:45 -06:00
Chip Davis	c756a91c3c	MSL: Fix a case I missed initializing vtx_attrs_by_builtin.	2019-02-21 13:14:03 -06:00
Chip Davis	9d8a5be725	MSL: Ignore duplicate builtin vertex attributes. These are often arrayed builtins, which MSL maps to more than one attribute. SPIRV-Cross automatically assigns succeeding addresses to arrayed attributes, so we really only need the first one. This of course assumes that the inputs are sorted by location.	2019-02-21 13:14:03 -06:00
Chip Davis	5069ec72bb	MSL: Set location of builtins based on client input. Builtin attributes in SPIR-V aren't linked by location, but by their built-in-ness. This poses a problem for MSL, since builtin inputs in the vertex pipeline are just regular attributes. We must then assign them locations so that they can be matched up to the attributes in the stage input descriptor--and also to avoid duplicate attribute numbers in tessellation evaluation shaders, where there are two different stage-in structs, so the member index therein is no longer unique!	2019-02-20 22:16:51 -06:00
Chip Davis	7a7e210515	MSL: Force unnamed array builtin attributes to have a name. That way, when we refer to them, they'll have the name that we're expecting.	2019-02-20 22:16:51 -06:00
Hans-Kristian Arntzen	ed7292fec4	Merge pull request #867 from cdavis5e/tese-shader-origin-2 MSL: Don't bother fixing up triangle tess coords.	2019-02-20 22:36:21 +01:00
Chip Davis	285ca4c2b1	MSL: Don't bother fixing up triangle tess coords. Instead, I'm going to have MoltenVK reverse the winding order in the lower-left case. This seems to be what the test suite expects to happen anyhow.	2019-02-20 14:30:44 -06:00
Hans-Kristian Arntzen	c1a93b8a71	Run format_all.sh. Missed some nits in earlier reviews.	2019-02-20 17:29:57 +01:00
Chip Davis	ba8593b112	Fix formatting.	2019-02-20 09:19:25 -06:00
Chip Davis	8095434dc4	MSL: Drop stores to nonexistent tess levels. In SPIR-V, there are always two inner levels and four outer levels, even if the input patch isn't a quad patch. But in MSL, due to requirements imposed by Metal, only one inner level and three outer levels exist when the input patch is a triangle patch. We must explicitly ignore any write to the nonexistent second inner and fourth outer levels in this case.	2019-02-20 09:11:24 -06:00
Chip Davis	c8ee9fbe76	MSL: Expand quad gl_TessCoord to a float3. This is the actual SPIR-V type of the builtin. We forced it to a `float2` in the declaration because that's what Metal wants.	2019-02-20 09:11:24 -06:00
Hans-Kristian Arntzen	58f264c99d	Merge pull request #865 from KhronosGroup/fix-863 Always value-cast FP16 constants instead of using literals.	2019-02-20 14:58:44 +01:00
Hans-Kristian Arntzen	4ef51331b2	Always value-cast FP16 constants instead of using literals. GL_NV_gpu_shader5 doesn't support "hf", so to avoid lots of complicated workarounds, just value-cast the half literals.	2019-02-20 12:30:01 +01:00
Hans-Kristian Arntzen	056a0ba27e	Fix case where a struct is loaded which contains a row-major matrix.	2019-02-20 12:19:00 +01:00
Chip Davis	41d9424233	MSL: Add an option to set the tessellation domain origin. This is intended to be used to support `VK_KHR_maintenance2`'s tessellation domain origin feature. If `tess_domain_origin_lower_left` is `true`, the `v` coordinate will be inverted with respect to the domain. Additionally, in `Triangles` mode, the `v` and `w` coordinates will be swapped. This is because the winding order is interpreted differently in lower-left mode.	2019-02-18 14:25:42 -06:00
Chip Davis	08863c1e28	Don't set any aliases or do any flattening for arrayed per-vertex I/O. We already handle all that specially.	2019-02-15 17:24:16 -06:00
Chip Davis	6b7988046d	Handle blocks of patch I/O. In this case, each member of the block will be decorated with `DecorationPatch`, rather than the block variable having the decoration.	2019-02-15 17:21:38 -06:00
Chip Davis	e75add42c9	MSL: Add support for tessellation evaluation shaders. These are mapped to Metal's post-tessellation vertex functions. The semantic difference is much less here, so this change should be simpler than the previous one. There are still some hairy parts, though. In MSL, the array of control point data is represented by a special type, `patch_control_point<T>`, where `T` is a valid stage-input type. This object must be embedded inside the patch-level stage input. For this reason, I've added a new type to the type system to represent this. On Mac, the number of input control points to the function must be specified in the `patch()` attribute. This is optional on iOS. SPIRV-Cross takes this from the `OutputVertices` execution mode; the intent is that if it's not set in the shader itself, MoltenVK will set it from the tessellation control shader. If you're translating these offline, you'll have to update the control point count manually, since this number must match the number that is passed to the `drawPatches:...` family of methods. Fixes #120.	2019-02-14 10:00:08 -06:00
Hans-Kristian Arntzen	cbd76e7c3b	Run format_all.sh.	2019-02-14 09:28:46 +01:00
Hans-Kristian Arntzen	878c502f96	MSL: Hoist out complicated tesc workaround code.	2019-02-14 09:28:17 +01:00
Chip Davis	13df78bebf	Unflatten inputs when copying to outputs. This should fix a whole host of issues related to structs in the `Input` class in a tessellation control shader. Also, use pointer arithmetic instead of dereferencing the `ops` array. This is critical in case we wind up stepping beyond the bounds of the array.	2019-02-13 12:37:24 -06:00
Chip Davis	83b7e66218	Throw an error if the shader specifies isoline tessellation.	2019-02-11 17:21:36 -06:00
Chip Davis	0bb6bbda22	Never flatten outputs when capturing them. There's no need to do so, since these are not stage-out structs being returned, but regular structures being written to a buffer. This also neatly avoids issues writing to composite (e.g. arrayed) per-patch outputs from a tessellation control shader.	2019-02-11 17:18:54 -06:00
Chip Davis	8860a97d4a	Fix formatting of uint32_t casts.	2019-02-11 16:14:00 -06:00
Chip Davis	1919eb1b46	Pass the original pointer type to ensure_correct_attribute_type(). This prevents us from overwriting the variable's type with a non-pointer type.	2019-02-11 16:07:43 -06:00
Chip Davis	eb89c3a428	MSL: Add support for tessellation control shaders. These are transpiled to kernel functions that write the output of the shader to three buffers: one for per-vertex varyings, one for per-patch varyings, and one for the tessellation levels. This structure is mandated by the way Metal works, where the tessellation factors are supplied to the draw method in their own buffer, while the per-patch and per-vertex varyings are supplied as though they were vertex attributes; since they have different step rates, they must be in separate buffers. The kernel is expected to be run in a workgroup whose size is the greater of the number of input or output control points. It uses Metal's support for vertex-style stage input to a compute shader to get the input values; therefore, at least one instance must run per input point. Meanwhile, Vulkan mandates that it run at least once per output point. Overrunning the output array is a concern, but any values written should either be discarded or overwritten by subsequent patches. I'm probably going to put some slop space in the buffer when I integrate this into MoltenVK to be on the safe side.	2019-02-07 08:51:22 -06:00
Hans-Kristian Arntzen	d9ed3dcc7a	Merge pull request #848 from cdavis5e/capture-output-buffer MSL: Add a setting to capture vertex shader output to a buffer.	2019-02-07 15:11:41 +01:00
Chip Davis	056c0e207d	Take the vertex count from any indirect parameters passed. This is necessary to deal with indirect draws, where the draw parameters are given in a buffer instead of passed by the CPU. For normal draws, the draw parameters are set with Metal's `setVertexBytes:` method. This undoes the change to add the vertex count to the aux buffer, rendering that entire discussion largely moot. Oh well. It was a discussion that needed to happen anyway.	2019-02-06 15:17:14 -06:00
Chip Davis	f55253dc1b	On second thought, don't use a feature struct for the aux buffer.	2019-02-06 14:45:26 -06:00
Chip Davis	d86adbe550	Add a structure to hold optional members of the aux buffer. Programs can query the version to know what features are present, and turn them on and off at will.	2019-02-06 14:26:06 -06:00
Hans-Kristian Arntzen	d5385190ff	Merge pull request #850 from KhronosGroup/fix-846 Support LUTs in single-function CFGs on Private storage class.	2019-02-06 11:34:30 +01:00
Hans-Kristian Arntzen	3e584f2c3f	Support LUTs in single-function CFGs on Private storage class. Fairly common pattern in unoptimized SPIR-V. Support this case as well.	2019-02-06 10:38:59 +01:00
Chip Davis	0757fae511	MSL: Stop passing the aux buffer around. Since we pass the component swizzle around now, there's no need to pass it to every function that takes a sampled image.	2019-02-05 20:04:32 -06:00
Chip Davis	c51e5b7911	MSL: Add a setting to capture vertex shader output to a buffer. This will be necessary to support transform feedback, as well as tessellation shaders.	2019-02-05 20:00:10 -06:00
Chip Davis	ef0b1fc841	Move assertions after the check for equal types. `bitcast_glsl_op()` is sometimes called for `Boolean` types, e.g. for specialization constants. We don't want the assert to trip if this is going to be a no-op anyway.	2019-01-31 14:28:21 -06:00
Hans-Kristian Arntzen	2ed171e525	GLSL/MSL: Implement 8-bit part of VK_KHR_shader_float16_int8. Storage was in place already, so mostly just dealing with bitcasts and constants. Simplifies some of the bitcasting logic, and this exposed some bugs in the implementation. Refactor to use correct width integers with explicit bitcast opcodes.	2019-01-30 15:45:24 +01:00
Hans-Kristian Arntzen	2edee351f0	Run format_all.sh.	2019-01-30 13:42:50 +01:00
Hans-Kristian Arntzen	4e7777c443	Update to latest glslang/SPIRV-Tools. Fix various bugs along the way.	2019-01-30 13:41:57 +01:00
Hans-Kristian Arntzen	3e09879131	Support initializers on StorageClassOutput.	2019-01-30 10:29:08 +01:00
Hans-Kristian Arntzen	5ff12d780b	Run format_all.sh.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	912fde95f1	MSL: Use correct size for structs. Need to align the size of structs to the natural alignment.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	217eb5b5f9	MSL: Add a preliminary check for bad arrays of structs. ArrayStride can be larger than the declared struct size. We have no obvious solution for now, but warn about it in the MSL output for the time being.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	8c632da461	MSL: Use correct alignment rule for whole structs. Structs are aligned as you would expect in MSL (maximum member alignment), and it is not minimum 16 bytes like in std140. Also rename the dummy "pad" members to a reserved naming scheme.	2019-01-28 15:20:30 +01:00
Hans-Kristian Arntzen	18a4accd2f	HLSL/MSL: Fix texture projection with Dref. We need to divide the Dref by q.	2019-01-28 10:25:13 +01:00
Hans-Kristian Arntzen	437fc87a89	MSL: Deal with resource name aliasing. Apparently we didn't use those yet. MSL seems to be able to alias struct types and variable types to a degree, so that's why it has escaped testing until now.	2019-01-18 16:27:57 +01:00
Hans-Kristian Arntzen	1040cf6cc1	Merge pull request #831 from cdavis5e/force-recompile-hooks MSL: Hoist fixup hooks in entry_point_args() out of the compile loop.	2019-01-17 19:42:05 +01:00
Chip Davis	f500d2f70c	MSL: Hoist fixup hooks in entry_point_args() out of the compile loop. Otherwise, in the event of a forced recompile, we could end up adding them twice.	2019-01-17 10:18:38 -06:00
Hans-Kristian Arntzen	3aa08f764e	MSL: Fix image load/store for short vectors. Same fixes as for GLSL.	2019-01-17 14:54:29 +01:00
Hans-Kristian Arntzen	522c4eea97	Merge pull request #832 from KhronosGroup/fix-828 MSL: Support std140 packing rules for float[] and float2[]	2019-01-17 14:30:06 +01:00
Hans-Kristian Arntzen	73d9da7070	Avoid unintentional name conflict with HLSL backend.	2019-01-17 12:21:16 +01:00
Hans-Kristian Arntzen	76bf6d0c34	Fixup some MSL comments.	2019-01-17 11:47:37 +01:00
Hans-Kristian Arntzen	432aaed737	Need to know the original packed type when unpacking loads.	2019-01-17 11:39:46 +01:00
Hans-Kristian Arntzen	40e7723051	Run format_all.sh.	2019-01-17 11:29:50 +01:00
Hans-Kristian Arntzen	de7e5ccd8b	Refactor out packed expressions to extended decorations. Can't safely just cast to the original enum without lots of hacks.	2019-01-17 11:28:51 +01:00
Hans-Kristian Arntzen	72377366d3	Replace custom use of DecorationCPacked with an explicit one. Will need to use more variants of this decoration, so might as well make it clearer what is going on with CPacked.	2019-01-17 10:36:56 +01:00
Hans-Kristian Arntzen	15b52bee48	Deal with packing/unpacking on store. Still a bit buggy, since we cannot deduce between float2[] and packed_float2. Need a deeper refactor to plumb this through ...	2019-01-17 10:06:23 +01:00
Chip Davis	1d7d910765	MSL: Fix some types I missed when implementing variable pointers.	2019-01-16 16:15:57 -06:00
Hans-Kristian Arntzen	64ca1ec677	MSL: Start considering float[] and float2[] in std140 layout.	2019-01-16 16:16:39 +01:00
Hans-Kristian Arntzen	9e3a41ad00	Merge pull request #821 from cdavis5e/pass-sampled-images MSL: Fix passing a sampled image to a function.	2019-01-15 09:05:54 +01:00
Chip Davis	664df22d12	MSL: Fix passing a sampled image to a function. In the past, SPIRV-Cross threw an error in this case because it couldn't work out which swizzle from the auxiliary buffer needs to be passed. Now, we pass the swizzle around with the texture object, like a combined image-sampler and its associated sampler.	2019-01-14 09:29:31 -06:00
Hans-Kristian Arntzen	b8033d7525	MSL: Add option to pad fragment outputs. If not enough components are provided in the shader, the shader MSL compiler throws an error rather than make components undefined. This hurts portability, so we need to add explicit padding here.	2019-01-14 15:11:52 +01:00
Hans-Kristian Arntzen	7ee04936ac	MSL: Fix case where we pass arrays to functions by value. MSL does not support value semantics for arrays (sigh), so we need to force constant references and deal with copies if we have a different address space than what we end up guessing.	2019-01-14 11:00:14 +01:00
Chip Davis	c4b08bd770	MSL: Add more illegal identifiers. Add most macros from the Metal standard library headers that aren't in the reserved namespace (i.e. those that don't start with `_`).	2019-01-14 00:08:09 -06:00
Hans-Kristian Arntzen	6e1c3ccb72	Run format_all.sh.	2019-01-11 12:56:00 +01:00
Hans-Kristian Arntzen	2fb9aa251e	Workaround bugs on MSVC. Bug: https://developercommunity.visualstudio.com/content/problem/303996/c-error-c2668-ambiguous-overloaded-in-lambda-with.html	2019-01-11 09:29:28 +01:00
Hans-Kristian Arntzen	b629878f45	Make meta a hashmap. A flat array was consuming way too much memory and was far too slow to initialize properly with a very large ID bound (8 million IDs, showed up as #1 hotspot in perf). Meta struct does not have to be in-order as we never iterate over it in a meaningful way, so using a hashmap here is reasonable. Very few IDs should need decorations or meta-data, so this should also be a quite decent memory save. For the pathological case, a 6x uplift was observed.	2019-01-10 14:04:01 +01:00
Hans-Kristian Arntzen	d92de00cc1	Rewrite how IDs are iterated over. This is a fairly fundamental change on how IDs are handled. It serves many purposes: - Improve performance. We only need to iterate over IDs which are relevant at any one time. - Makes sure we iterate through IDs in SPIR-V module declaration order rather than ID space. IDs don't have to be monotonically increasing, which was an assumption SPIRV-Cross used to have. It has apparently never been a problem until now. - Support LUTs of structs. We do this by interleaving declaration of constants and struct types in SPIR-V module order. To support this, the ParsedIR interface needed to change slightly. Before setting any ID with variant_set<T> we let ParsedIR know that an ID with a specific type has been added. The surface for change should be minimal. ParsedIR will maintain a per-type list of IDs which the cross-compiler will need to consider for later. Instead of looping over ir.ids[] (which can be extremely large), we loop over types now, using: ir.for_each_typed_id<SPIRVariable>([&](uint32_t id, SPIRVariable &var) { handle_variable(var); }); Now we make sure that we're never looking at irrelevant types.	2019-01-10 12:52:56 +01:00
Hans-Kristian Arntzen	5345756cab	MSL: Support composites inside I/O blocks. I had to refactor the existing add_interface_block as it was getting extremely large. Now it's all split up into different readable functions.	2019-01-09 09:33:10 +01:00
Hans-Kristian Arntzen	9c47b2837e	Merge pull request #807 from cdavis5e/variable-pointers MSL: Support SPV_KHR_variable_pointers.	2019-01-09 09:07:43 +01:00
Chip Davis	fc02b3d656	Rename get_non_pointer_type() methods. This better reflects their purpose now.	2019-01-08 12:55:22 -06:00
Chip Davis	a046f7a878	Add missing break.	2019-01-08 12:55:22 -06:00
Chip Davis	3394f53734	MSL: Fix mapping of identity-swizzled components. Before, if any component was not identity-mapped, those components that were still identity-mapped were set to 0. Now we properly leave them alone.	2019-01-07 11:20:13 -06:00
Chip Davis	3bfb2f94d4	MSL: Support SPV_KHR_variable_pointers. This allows shaders to declare and use pointer-type variables. Pointers may be loaded and stored, be the result of an `OpSelect`, be passed to and returned from functions, and even be passed as inputs to the `OpPhi` instruction. All types of pointers may be used as variable pointers. Variable pointers to storage buffers and workgroup memory may even be loaded from and stored to, as though they were ordinary variables. In addition, this enables using an interior pointer to an array as though it were an array pointer itself using the `OpPtrAccessChain` instruction. This is a rather large and involved change, mostly because this is somewhat complicated with a lot of moving parts. It's a wonder SPIRV-Cross's output is largely unchanged. Indeed, many of these changes are to accomplish exactly that! Perhaps the largest source of changes was the violation of the assumption that, when emitting types, the pointer type didn't matter. One of the test cases added by the change doesn't optimize very well; the output of `spirv-opt` here is invalid SPIR-V. I need to file a bug with SPIRV-Tools about this. I wanted to test that variable pointers to images worked too, but I couldn't figure out how to propagate the access qualifier properly--in MSL, it's part of the type, so getting this right is important. I've punted on that for now.	2019-01-07 11:19:10 -06:00
Hans-Kristian Arntzen	5b8762223d	Run format_all.sh.	2019-01-07 10:01:28 +01:00
Hans-Kristian Arntzen	649ce3c7bb	MSL: Workaround missing gradient2d() for sampler_compare.	2019-01-07 10:01:00 +01:00
Hans-Kristian Arntzen	acae607703	Register implied expression reads in OpLoad/OpAccessChain. This is required to avoid relying on complex sub-expression elimination in compilers, and generates cleaner code. The problem case is if a complex expression is used in an access chain, like: Composite comp = buffer[texture(...)]; vec4 a = comp.a + comp.b + comp.c; Before, we did not have common subexpression tracking for OpLoad/OpAccessChain, so we easily ended up with code like: vec4 a = buffer[texture(...)].a + buffer[texture(...)].b + buffer[texture(...)].c; A good compiler will optimize this, but we should not rely on it, and forcing texture(...) to a temporary also looks better. The solution is to add a vector "implied_expression_reads", which works similarly to expression_dependencies. We also need an extra mechanism in to_expression which lets us skip expression read checking and do it later. E.g. for expr -> access chain -> load, we should only trigger a read of expr when using the loaded expression.	2019-01-04 14:56:12 +01:00
Hans-Kristian Arntzen	318c17cbb2	Nonfunctional: Update copyright headers for 2019.	2019-01-04 12:38:35 +01:00
Bill Hollings	ab329a7906	MSL: Don't emit `memory_scope` after MSL 2.0.	2018-12-11 16:28:29 -05:00
Chip Davis	6db79b80c1	MSL: Use an enum instead of two mutually exclusive booleans. NFCI.	2018-12-04 13:54:29 -06:00
Bill Hollings	2cd54e4e6d	Merge pull request #779 from cdavis5e/force-signedness MSL: Force signedness of shader vertex attributes to match the host.	2018-12-04 09:37:55 -05:00
Chip Davis	06d483459b	MSL: Force signedness of shader vertex attributes to match the host. Based on a patch by Stefan Dösinger. Metal cannot do signedness conversion on vertex attributes, and for good reason. Putting a `uint4` into an `int4`, or a `char4` into a `uint4`, would lose those values that are outside the range of the target type. But putting a `uchar4` into a `short4` or an `int4`, or a `ushort4` into an `int4`, should work. In that case, force the signedness in the shader to match the declared type of the host. Unfortunately, I don't really know how to automatically test this. This remapping is done based on input parameters normally supplied by MoltenVK. I'm not sure how we'd set this up for the command-line `spirv-cross` tool.	2018-11-28 17:53:56 -06:00
Hans-Kristian Arntzen	61f1d8b2cf	Support gl_HelperInvocation on GLSL and MSL. There is no obvious builtin for this on HLSL.	2018-11-28 15:18:43 +01:00
Hans-Kristian Arntzen	510e1475c6	Merge pull request #756 from cdavis5e/relaxed-block-layout-2 MSL: Also pack 2- and 4- element vectors when necessary.	2018-11-15 10:09:09 +01:00
Chip Davis	6d675ae6a2	Correct carry/borrow bit checks. Don't use `addsat()`/`subsat()`; that'll erroneously flag cases where the sum is exactly the maximum integer value, or the difference is exactly 0. Also, correct the condition for the `select()` function; it's basically `mix()` with a boolean factor. (What was I thinking?)	2018-11-14 10:13:56 -06:00
Chip Davis	cf2a890e4f	MSL: Support extended arithmetic opcodes.	2018-11-13 17:33:03 -06:00
Chip Davis	bed4918cb5	MSL: Also pack 2- and 4- element vectors when necessary. This is also needed for `VK_KHR_relaxed_block_layout` support.	2018-11-13 17:31:47 -06:00
Connor McLaughlin	1dd676c1de	MSL: Emit wrapper for SSign (sign() for int types). Metal does not define the sign() function for integer types, only floating-point types.	2018-11-08 13:08:34 +10:00
Hans-Kristian Arntzen	cf5e1c2801	Merge pull request #743 from cdavis5e/relaxed-block-layout MSL: Also pack members at unaligned offsets.	2018-11-07 19:38:56 +01:00
Chip Davis	e50eecfeeb	MSL: Also pack members at unaligned offsets. This is necessary to support `VK_KHR_relaxed_block_layout`.	2018-11-07 09:42:54 -06:00
Connor McLaughlin	801431b45b	MSL: Print early_fragment_tests specifier before fragment. The compiler in 10.14 reports an error that the attribute cannot be applied to types if the specifier is printed before fragment.	2018-11-07 21:54:19 +10:00
Chip Davis	0d949e11ff	Support bitcasts of 16-bit types.	2018-11-05 14:56:36 -06:00
Chip Davis	ca4744ab72	Support constants of 16-bit integral type in GLSL and MSL. Constants of 8-bit type aren't supported in GLSL, since there's no extension letting you use them.	2018-11-02 14:39:55 -05:00
Chip Davis	117ccf407c	Use specific base types for 8- and 16-bit integers.	2018-11-01 17:45:10 -05:00
Chip Davis	1fb27b4cda	Add support for 8- and 16-bit types to GLSL and MSL. In GLSL, 8-bit types require GL_EXT_shader_8bit_storage. 16-bit types can use either GL_AMD_gpu_shader_int16/GL_AMD_gpu_shader_half_float or GL_EXT_shader_16bit_storage.	2018-11-01 10:20:57 -05:00
Hans-Kristian Arntzen	480acdad18	Deal with OpSpecConstantOp used as array size. When trying to validate buffer sizes, we usually need to bail out when using SpecConstantOps, but for some very specific cases where we allow unsized arrays currently, we can safely allow "unknown" sized arrays as well. This is probably the best we can do; when we have even more difficult cases than this, we throw a more sensible error message.	2018-11-01 14:58:02 +01:00
Hans-Kristian Arntzen	6e99fcf695	Run format_all.sh.	2018-11-01 11:23:48 +01:00
Hans-Kristian Arntzen	62db535b3f	Update tests.	2018-11-01 11:23:48 +01:00
Hans-Kristian Arntzen	5bcf02f7c9	Hoist out parsing module from spirv_cross::Compiler. This is a large refactor which splits out the SPIR-V parser from Compiler and moves it into its more appropriately named Parser module. The Parser is responsible for building a ParsedIR structure which is then consumed by one or more compilers. Compiler can take a ParsedIR by value or move reference. This should allow for the optimal case in both multiple-compilation and single-compilation scenarios.	2018-10-19 12:01:31 +02:00
Hans-Kristian Arntzen	a697299bc1	Refactor MSL to use SPIRCombinedImageSampler. Avoids special "meta" data to express this type. Makes MSL implementation in line with HLSL.	2018-10-05 09:49:57 +02:00
Hans-Kristian Arntzen	519565b030	Merge pull request #718 from cdavis5e/op-image-sampled-image MSL: Handle OpImage on OpSampledImage expressions.	2018-10-04 21:30:08 +02:00
Chip Davis	9919fbbe0d	MSL: Handle OpImage on OpSampledImage expressions. I have seen this happen. The included test case is one such case.	2018-10-03 11:48:46 -05:00
Chip Davis	010fecc466	MSL: Swizzle gathers on depth textures as well. Might as well.	2018-10-03 11:47:13 -05:00
Chip Davis	b7433c01ee	Minor cleanups. Throw an error for cases we don't support. Add a blank line after each local array declaration.	2018-09-27 11:01:46 -05:00
Chip Davis	2506046cb4	Merge remote-tracking branch 'origin' into resource-arrays-msl	2018-09-27 10:50:16 -05:00
Hans-Kristian Arntzen	c07c303999	Use GL_EXT_samplerless_texture_functions in Vulkan GLSL.	2018-09-27 13:36:38 +02:00
Chip Davis	3a9af9681c	MSL: Expand arrays of buffers passed as input. Even as of Metal 2.1, MSL still doesn't support arrays of buffers directly. Therefore, we must manually expand them. In the prologue, we define arrays holding the argument pointers; these arrays are what the transpiled code ends up referencing. We might be able to do similar things for textures and samplers prior to MSL 2.0. Speaking of which, also enable texture arrays on iOS MSL 1.2.	2018-09-26 20:48:09 -05:00
Hans-Kristian Arntzen	69b034f26e	Merge pull request #706 from cdavis5e/component-swizzle MSL: Add an option to insert texture swizzles into generated shaders.	2018-09-25 10:06:03 +02:00
Chip Davis	7107f40f99	Provide feedback on whether or not the auxiliary buffer is needed.	2018-09-24 13:38:27 -05:00
Chip Davis	7956b002eb	Give up on non-aliased sampled image parameters. This needs extra work to map them back to the original resource.	2018-09-24 12:42:39 -05:00
Chip Davis	db7a40ce77	Use is_sampled_image_type() elsewhere.	2018-09-24 12:33:11 -05:00
Chip Davis	8855ea0a3e	Move is_sampled_image_type() onto the Compiler class. While I'm at it, don't use a bitwise op with a `bool` variable. Apparently, MSVC doesn't like that.	2018-09-24 12:24:58 -05:00
Chip Davis	c11374c3cf	Don't override Compiler::analyze_image_and_sampler_usage(). Just add our own separate function for analyzing sampled image usage.	2018-09-24 12:10:27 -05:00
Hans-Kristian Arntzen	34014886e3	Merge pull request #710 from cdavis5e/buffer-image-reads MSL: Add spvTexelBufferCoord for buffer image reads, too.	2018-09-24 10:22:15 +02:00
Chip Davis	7cb817e40e	Add spvTexelBufferCoord for buffer image reads, too. I should've caught this when I fixed this for writes.	2018-09-23 14:37:03 -05:00
Chip Davis	4302c5abfb	Pass the swizzle constants as a buffer. It'll be useful to have an "auxiliary buffer" for other builtins--e.g. `DrawIndex` (which should be easier to implement now), or `ViewIndex` when someone gets around to implementing multiview. Pass this buffer to leaf functions as well. Test that we handle this for integer textures as well.	2018-09-22 19:36:11 -05:00
Chip Davis	c793868417	Pack texture component swizzles by bytes.	2018-09-22 19:15:15 -05:00
Chip Davis	7fff65a811	Remove extraneous space in enum class decl.	2018-09-21 13:52:20 -05:00
Bill Hollings	daa831f59d	Fix integer precision warnings on assignments.	2018-09-20 16:10:42 -04:00
Chip Davis	2583321657	MSL: Add an option to insert texture swizzles into generated shaders. It's intended to be used with MoltenVK to support arbitrary `VkComponentMapping` settings. The idea is that MoltenVK will pass a buffer (which it set to some buffer index that isn't being used) containing packed versions of the `VkComponentMapping` struct, one for each sampled image. Yes, this is horribly ugly. It is unfortunately necessary. Much of the ugliness is to support swizzling gather operations, where we need to alter the component that the gather operates on--something complicated by the `gather()` method requiring the passed-in component to be a constant expression. It doesn't even support swizzling gathers on depth textures, though I could add that if it turns out we need it.	2018-09-19 22:32:24 -05:00
Chip Davis	ec857f6778	Cast uses of Layer and ViewportIndex to the expected type.	2018-09-19 09:13:30 -05:00
Chip Davis	0e9ad14ba6	MSL: Handle the ViewportIndex builtin. This requires MSL 2.0+. Also, force `ViewportIndex` and `Layer` to be defined as the correct type, which is always `uint` in MSL. Since Metal doesn't yet have geometry shaders, the vertex shader (or tessellation evaluation shader == "post-tessellation vertex shader" in Metal jargon) is the only kind of shader that can set this output. This currently requires an extension to Vulkan, which causes validation of the SPIR-V binaries for the test cases to fail. Therefore, the test cases are marked "invalid", even though they're actually perfectly valid SPIR-V--they just won't work without the `SPV_EXT_shader_viewport_index_layer` extension.	2018-09-18 09:52:30 -05:00
Chip Davis	7dcfed888a	Use a hook to emit a local for the sample position. That way, we don't have to handle it specially when constructing a call.	2018-09-17 11:51:09 -05:00
Chip Davis	72fc1cce53	Merge remote-tracking branch 'origin' into msl-sample-pos	2018-09-17 11:20:34 -05:00
Hans-Kristian Arntzen	a77880787d	Merge pull request #698 from KhronosGroup/fix-695 MSL: Support global I/O block and struct Input/Output usage.	2018-09-17 14:54:58 +02:00
Hans-Kristian Arntzen	340957a3ab	Make fixup_hooks more flexible. No reason why it needs to return a string. Callbacks can just do one or more statements themselves.	2018-09-17 14:06:44 +02:00
Hans-Kristian Arntzen	4aead55ca6	Remove dead comment.	2018-09-17 13:58:48 +02:00
Hans-Kristian Arntzen	49ac538a64	Remove maybe_assign_input_struct. This is obsolete and wrong since we already unflatten I/O structs.	2018-09-17 13:51:02 +02:00
Chip Davis	39bc101e82	MSL: Handle the SamplePosition builtin. This is somewhat tricky, because in MSL this value is obtained through a function, `get_sample_position()`. Since the call expression is an rvalue, it can't be passed by reference, so functions get a copy instead. This was the last piece preventing us from turning on sample-rate shading support in MoltenVK.	2018-09-13 09:34:28 -05:00
Hans-Kristian Arntzen	1bbb4032c8	Merge pull request #693 from cdavis5e/msl-atomic-inc-dec MSL: Fix OpAtomicIIncrement and OpAtomicIDecrement.	2018-09-13 16:19:27 +02:00
Hans-Kristian Arntzen	d310060f92	MSL: Support global I/O block and struct Input/Output usage. Implement this by flattening outputs and unflattening inputs explicitly. This allows us to pass down a single struct instead of dealing with the insanity that would be passing down each flattened member separately. Remove stage_uniforms_var_id. Seems to be dead code. Naked uniforms do not exist in SPIR-V for Vulkan, which this seems to have been intended for. It was also unused elsewhere.	2018-09-13 16:04:24 +02:00
Chip Davis	06edf804ac	Clarify name of this parameter.	2018-09-13 08:56:23 -05:00
Hans-Kristian Arntzen	71bb7785ac	MSL: textureQueryLod() is not supported. Don't bother with hacky workaround unless required.	2018-09-13 13:44:46 +02:00
Hans-Kristian Arntzen	89e3b8ff0d	Run format_all.sh.	2018-09-12 10:53:50 +02:00
Hans-Kristian Arntzen	2f65a1583e	MSL: Support array-of-arrays composite construction.	2018-09-12 10:25:51 +02:00
Hans-Kristian Arntzen	38d19821d4	MSL: Support copying array of arrays.	2018-09-12 09:54:55 +02:00
Chip Davis	41eb5c43b5	MSL: Fix OpAtomicIIncrement and OpAtomicIDecrement. We were passing a constant '1' to `emit_atomic_func_op()`--which caused us to refer to SPIR-V value `%1`, which is almost certainly not what we want! What we really want is to add/subtract the literal constant '1' to/from the memory location.	2018-09-11 17:29:54 -05:00
Hans-Kristian Arntzen	403011e973	Merge pull request #684 from cdavis5e/msl-builtin-vector-cast MSL: Cast uses of builtin vectors to their declared SPIR-V type.	2018-09-11 19:59:58 +02:00
Chip Davis	6757ef8512	Use bitcast_to_builtin_load() instead of hacking to_expression(). This only affects the builtin when it is used, and not when it's passed to a function. It's a lot cleaner than the way I was doing it before. Remove the `to_expression()` hack.	2018-09-11 11:15:17 -05:00
Chip Davis	acb3fac747	Opt for a simple value cast in lieu of a bitcast.	2018-09-10 14:05:36 -05:00
Hans-Kristian Arntzen	b114889102	Only declare typed initializer list for non-array types. Also, cleanup now redundant constant_expression virtualization for MSL.	2018-09-10 10:04:17 +02:00
Chip Davis	f7dad9da66	MSL: Cast uses of builtin vectors to their declared SPIR-V type. In SPIR-V, builtin integral vectors can be either signed or unsigned, but in MSL they're always unsigned. Unfortunately, the MSL spec forbids implicit conversions between vector types--even if the corresponding scalar types would implicitly convert. If you try, the result is a cryptic error message such as: ``` program_source:37:60: error: cannot convert between vector values of different size ('int4' (aka 'vector_int4') and 'vector_uint4' (vector of 4 'unsigned int' values)) float4 r3 = as_type<float4>((as_type<int4>(r0) * gl_LocalInvocationID.xyyy) + as_type<int4>(r2)); ~~~~~~~~~~~~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~ ``` Therefore, uses of these builtins must be explicitly cast, since the rest of the binary likely assumes that the builtin is of its declared type.	2018-09-08 21:17:54 -05:00
Hans-Kristian Arntzen	9ffd4172b4	Merge pull request #680 from cdavis5e/msl-varying-components MSL: Account for components when assigning locations to varyings.	2018-09-07 16:01:53 +02:00
Hans-Kristian Arntzen	32823b0838	MSL: Do not emit function constants for version < 1.2.	2018-09-07 09:33:34 +02:00
Chip Davis	4b99fdd5d0	MSL: Account for components when assigning locations to varyings. Two varyings (vertex outputs/fragment inputs) might have the same location but be in different components--e.g. the compiler may have packed what were two different varyings into a single varying vector. Giving both varyings the same `[[user]]` attribute won't work--it may yield unexpected results, or flat out fail to link. We could eventually pack such varyings into a single vector, but that would require us to handle the case where the varyings are different types--e.g. a `float` and a `uint` packed into the same vector. For now, it seems most prudent to give them unique `[[user]]` locations and let Apple's compiler work out the best way to pack them.	2018-09-06 13:52:33 -05:00
Chip Davis	674f97a40e	Handle interpolation qualifiers on the entire struct, too.	2018-09-06 12:29:42 -05:00
Chip Davis	9e6469bd40	MSL: Handle interpolation qualifiers.	2018-09-05 12:02:07 -05:00
Chip Davis	680ef9d773	MSL: Correct number of words to skip in OpImageWrite. The length field in `Instruction` doesn't include the initial opcode/length word. We only need to skip three words instead of four.	2018-09-05 10:02:25 -05:00
Chip Davis	9fbe39c9c0	MSL: Emit spvTexelBufferCoord() on ImageWrite to a Buffer as well. This is necessary to get the coordinates to give to the texture's `write()` method.	2018-09-04 12:14:34 -05:00
Hans-Kristian Arntzen	917ca818ed	Merge pull request #673 from cdavis5e/min-max-clamp MSL: Emit F{Min,Max,Clamp} as fast:: and N{Min,Max,Clamp} as precise::.	2018-09-04 15:15:18 +02:00
Hans-Kristian Arntzen	0c1d4d8b6a	MSL: Support texture2d_ms_array.	2018-09-03 11:02:31 +02:00
Hans-Kristian Arntzen	778f998cd2	MSL: Throw error on multisampled array textures.	2018-09-03 10:21:59 +02:00
Chip Davis	27af716c3a	MSL: Emit F{Min,Max,Clamp} as fast:: and N{Min,Max,Clamp} as precise::. This roughly matches their semantics in SPIR-V and MSL. For `FMin`, `FMax`, and `FClamp`, and the Metal functions `fast::min()`, `fast::max()`, and `fast::clamp()`, the result is undefined if any operand is NaN. For the 'N' operations and their corresponding MSL `precise::` functions, the result is consistent with IEEE 754 (first non-NaN wins; result is NaN if all operands are NaN). We can only do this with 32-bit floats, though, because Metal only provides these variants for `float`. `half` only has one variant of these functions that is presumably consistent with IEEE 754. I guess that's OK; the SPIR-V spec only says that `F{Min,Max,Clamp}` are undefined for NaNs. Performance might suffer, though.	2018-09-01 23:01:46 -05:00
Chip Davis	d3233690cb	MSL: Support unordered relational operators. The SPIR-V spec says that these check if the operands either are unordered or satisfy the given condition. So that's just what we'll do, using Metal's `isunordered()` stdlib function. Apple's optimizers ought to be able to collapse that to a single unordered compare.	2018-08-31 13:54:42 -05:00
Chip Davis	2ee8ebbc62	Throw an exception anytime we try to compile DrawIndex to MSL.	2018-08-29 12:05:33 -05:00
Chip Davis	97d01b6450	Punt on DrawIndex in MSL for now. Metal doesn't properly support this.	2018-08-29 10:21:42 -05:00
Chip Davis	fcad019e11	Support the shader_draw_parameters extension.	2018-08-29 10:07:21 -05:00
Chip Davis	1fd8cd9468	[MSL] Give the FragDepth builtin a type of float.	2018-08-28 13:47:50 -05:00
Hans-Kristian Arntzen	87de951105	MSL: Fix naming issue of aliased global variables. When the name of an aliased global variable collides with a global declaration, MSL would emit inconsistent names, sometimes with the naming fix, sometimes without, because names were being tracked in two separate meta blocks. Fix this by always redirecting parameter naming to the original base variable as necessary.	2018-08-27 09:59:55 +02:00
Hans-Kristian Arntzen	ffb753ff66	MSL: Fix segfault when trying to store to an array inside struct.	2018-08-08 16:48:22 +02:00
Hans-Kristian Arntzen	981d7c1d85	Need to make sure the fetch expression is uint.	2018-08-07 16:02:17 +02:00
Hans-Kristian Arntzen	eee290a029	MSL: Fix support for texelFetchOffset. Just apply the offset directly, MSL has no immediate offset parameter.	2018-08-07 15:28:04 +02:00
Hans-Kristian Arntzen	361fe52c9d	MSL: Properly support passing parameters by value. MSL would force thread const& which would not work if the input argument came from a different storage class. Emit proper non-reference arguments for such values.	2018-08-06 15:43:51 +02:00
Bill Hollings	c3d74e1e14	CompilerMSL disable rasterization on buffer writes in vertex shader.	2018-07-27 16:53:36 -04:00
Bill Hollings	0d6202e770	Add CompilerMSL::get_is_rasterization_disabled() to manage rasterization status.	2018-07-26 16:40:32 -04:00
Bill Hollings	ac238b858b	CompilerMSL vertex entry point return void when rasterization disabled. Add CompilerMSL::Options::disable_rasterization input/output API flag. Disable rasterization via API flag or when writing to textures. Disable rasterization when shader declares no output. Add test shaders for vertex no output and write texture forcing void output.	2018-07-26 00:50:33 -04:00
Hans-Kristian Arntzen	2bf57d6dff	Deal with composite constants in variable initializer.	2018-07-05 15:29:49 +02:00
Hans-Kristian Arntzen	af290ede87	Remove some redundant spvArrayCopy declarations.	2018-07-05 14:43:12 +02:00
Hans-Kristian Arntzen	d29f48ef06	Deduce constant LUTs from read-write variables.	2018-07-05 13:25:57 +02:00
Hans-Kristian Arntzen	b5ed706860	Hoist out variable scope analysis.	2018-07-05 10:42:05 +02:00
Hans-Kristian Arntzen	c26c41b26b	Make the CFGs for all active functions available. Will make writing other CFG-dependent stuff easier.	2018-07-04 17:26:53 +02:00
Hans-Kristian Arntzen	e044732896	Support OpTypeImage with depth == 2 (unknown) properly. Track which OpSampledImages are ever used with Dref opcodes.	2018-07-04 14:26:23 +02:00
Hans-Kristian Arntzen	9ddbd5aff6	Run format_all.sh.	2018-06-28 23:00:26 +02:00
Hans-Kristian Arntzen	f1752e58e1	Add basic namespace to internal macros. Some projects build SPIRV-Cross as a single translation unit and this causes a lot of warnings because the same macro is redeclared multiple times in the different backends. This makes sure that each backend has its own namespace for internal macros.	2018-06-28 22:57:52 +02:00
Bill Hollings	9bf226cb05	Fixes for code review of PR 626.	2018-06-27 10:34:15 -04:00
Bill Hollings	4c5142b9d3	CompilerMSL support larger texel buffers by using 2D Metal textures. Add CompilerMSL::Options::texture_width_max. Emit and use spvTexelBufferCoord() function to convert 1D texel buffer coordinates to 2D Metal texture coordinates.	2018-06-26 17:30:21 -04:00
Bill Hollings	4beefe756c	Fixes from PR 621 code review.	2018-06-25 11:40:20 -04:00
Bill Hollings	f66507a701	Merge branch 'master' of https://github.com/KhronosGroup/SPIRV-Cross	2018-06-25 10:52:15 -04:00
Bill Hollings	e091031613	CompilerMSL pass builtin struct members into functions. Add and use Compiler::get_non_pointer_type() convenience functions.	2018-06-24 15:06:12 -04:00
Hans-Kristian Arntzen	d94d20f4f3	Deal with some builtins being declared with wrong signedness.	2018-06-22 11:30:56 +02:00
Bill Hollings	ab2ea93e35	Merge branch 'master' of https://github.com/KhronosGroup/SPIRV-Cross	2018-06-12 11:42:56 -04:00
Bill Hollings	9b4defe202	CompilerMSL support matrices & arrays in stage-in & stage-out. Support flattening StorageOutput & StorageInput matrices and arrays. No longer move matrix & array inputs to separate buffer. Add separate SPIRFunction::fixup_statements_in & SPIRFunction::fixup_statements_out instead of just SPIRFunction::fixup_statements. Emit SPIRFunction::fixup_statements at beginning of functions. CompilerMSL track vars_needing_early_declaration. Pass global output variables as variables to functions that access them. Sort input structs by location, same as output structs. Emit struct declarations in order output, input, uniforms. Regenerate reference shaders to new formats defined by above.	2018-06-12 11:41:35 -04:00
Hans-Kristian Arntzen	58fab58e5e	Do not unpack transposed matrices.	2018-06-12 09:43:47 +02:00
Hans-Kristian Arntzen	04b149feb0	Fix image load/store on cube arrays in MSL.	2018-05-25 12:43:25 +02:00
Hans-Kristian Arntzen	6b3da831be	Declare read-only SSBOs as const device in MSL.	2018-05-25 10:14:05 +02:00
Hans-Kristian Arntzen	bcaae84c76	Deal with scoping for Private variables.	2018-05-16 10:49:30 +02:00
Hans-Kristian Arntzen	26b887ec99	Fix atomic_compare_exchange_weak_explicit. Need to emit a CAS loop. Fix shared memory declaration. Declare atomic ops with correct memory scope.	2018-05-15 16:04:21 +02:00
Hans-Kristian Arntzen	fb7181bff1	Run format_all.sh.	2018-05-15 14:24:59 +02:00
Hans-Kristian Arntzen	991b655c72	Declare OpSpecConstantOp up-front on relevant targets. Required, since spec constants can include results from constant ops.	2018-05-15 14:20:16 +02:00
Hans-Kristian Arntzen	d2df067dd4	Force recompile if we add row-major transpose functions in MSL.	2018-05-04 09:43:34 +02:00
Hans-Kristian Arntzen	7b95168c3d	Do not clear spv_function_implementations on MSL. Will fail when recompiles are necessary.	2018-05-02 21:37:36 +02:00
Bill Hollings	57213cb7ca	Compiler MSL default gather offset when component specified.	2018-04-30 16:30:29 -04:00
Hans-Kristian Arntzen	d93807a625	Deal with OpImageFetch without explicit LOD.	2018-04-30 10:54:44 +02:00
Hans-Kristian Arntzen	e351e5c565	Use convert_to_string for lod clamp.	2018-04-18 16:31:08 +02:00
Hans-Kristian Arntzen	e30a94225f	Complete MSL constexpr samplers. Deal with defaults and avoid verbose declarations.	2018-04-18 16:19:55 +02:00
Hans-Kristian Arntzen	64f9461d72	Check for array of samplers.	2018-04-17 17:47:15 +02:00
Hans-Kristian Arntzen	df58debf7a	Add support for constexpr samplers in MSL.	2018-04-17 17:43:32 +02:00
Hans-Kristian Arntzen	9c2761f69a	Run format_all.sh.	2018-04-10 12:32:14 +02:00
Hans-Kristian Arntzen	8175e2e200	Fix depth compare textures when used in functions without argument.	2018-04-10 12:31:13 +02:00
Hans-Kristian Arntzen	ac81a0ce68	Use declared binding in SPIR-V as a fallback for explicit MSL binds.	2018-04-04 12:25:11 +02:00
Hans-Kristian Arntzen	e8ca39b7b5	Add test for sampler image arrays.	2018-04-04 09:41:20 +02:00
Hans-Kristian Arntzen	382101bd05	Run format_all.sh.	2018-04-04 09:26:53 +02:00
Hans-Kristian Arntzen	5827dd54ea	Support array of images and samplers in MSL.	2018-04-04 09:26:53 +02:00
Hans-Kristian Arntzen	81eb72a9a0	Ignore LOD when sampling 1D textures in MSL. Not supported.	2018-04-04 09:26:53 +02:00
Hans-Kristian Arntzen	65be63fd04	Merge pull request #521 from KhronosGroup/fix-516 Support dual-source blending on GLSL and MSL.	2018-04-03 16:54:32 +02:00
Hans-Kristian Arntzen	a6e211e00b	Support dual-source blending on GLSL and MSL.	2018-04-03 16:04:49 +02:00
Hans-Kristian Arntzen	3229e6efb6	Add more illegal name replacement in MSL.	2018-04-03 15:36:35 +02:00
Hans-Kristian Arntzen	719cf9d42f	Run format_all.sh.	2018-03-13 14:05:33 +01:00
Hans-Kristian Arntzen	8e90382675	Properly flatten MRT outputs in MSL.	2018-03-13 14:03:35 +01:00
Hans-Kristian Arntzen	6e6ca0b237	Attempt MRT-as-array in MSL.	2018-03-13 13:17:17 +01:00
Hans-Kristian Arntzen	4979d10b54	Implement packHalf2x16/unpackHalf2x16 on MSL.	2018-03-12 17:51:14 +01:00
Hans-Kristian Arntzen	938c7debed	Handle control-dependent temporaries. Derivatives, subgroup and implicit-lod instructions all need to happen in the block they were created.	2018-03-12 17:34:54 +01:00
Hans-Kristian Arntzen	e8e58844d4	Rewrite everything to use Bitset rather than uint64_t.	2018-03-12 13:24:14 +01:00
Hans-Kristian Arntzen	a803e5ae38	Deprecate set_options()/get_options() interface, replace it. Replace with common/hlsl/msl instead. The old interface had some bad interaction with overloading which meant you had to up-cast to base class to be able to use set_options, which was awkward.	2018-03-09 15:25:25 +01:00
Hans-Kristian Arntzen	ac0e93f392	Run format_all.sh.	2018-03-07 10:29:20 +01:00
Hans-Kristian Arntzen	18ad1be3c3	Add FP16 test for MSL as well.	2018-03-07 10:29:11 +01:00
Hans-Kristian Arntzen	47d94ff8d9	Add FP16 to HLSL. Cannot be used in buffer types, similar to mediump in GLSL. half is useless, because it's 32-bit in FXC.	2018-03-07 10:21:25 +01:00
Hans-Kristian Arntzen	d9da2db442	Some compat fixes for MSL and Half.	2018-03-06 17:09:18 +01:00
Hans-Kristian Arntzen	294259e2f1	Fix type aliasing on MSL. Be careful about who gets to be the alias master, and don't alias types when we have packed types in play.	2018-03-05 16:27:04 +01:00
Hans-Kristian Arntzen	6a12ff7fb7	Fix multiple declaration of spvDet2x2 on MSL.	2018-02-23 16:52:11 +01:00
Hans-Kristian Arntzen	dd603eab58	Support spec constant array size in blocks. Won't really be correct if the spec constant is changed outside SPIRV-Cross, but nothing we can do about that, really.	2018-02-23 15:11:45 +01:00
Hans-Kristian Arntzen	a04bdcc7f7	Handle overloaded functions which share the same OpName. Awkward, but legal SPIR-V.	2018-02-23 14:15:51 +01:00
Bill Hollings	50ef6cd95f	CompilerMSL remove incorrect packing of non-interface type-aliased structs.	2018-02-21 17:52:03 -05:00
Hans-Kristian Arntzen	54a065bb5f	Run format_all.sh.	2018-02-15 13:32:49 +01:00
Hans-Kristian Arntzen	3fa6cc8f2c	Implement FRem.	2018-02-15 13:31:29 +01:00
Bill Hollings	2964e328e6	CompilerMSL support gl_SampleMask and convert it to scalar uint from array.	2018-02-13 14:44:40 -05:00
Bill Hollings	b453348370	Merge branch 'master' of https://github.com/billhollings/SPIRV-Cross	2018-02-11 16:54:25 -05:00
Bill Hollings	607b0d6d42	CompilerMSL support smaller offsets for 3-row row-major matrices. Support MSL typedefs to declare 3-row row-major matrices as 3-column matrices. Allow those matrices to be decorated as packed. Support transposing those matrices when used. Modify how member alignments are calculated.	2018-02-11 16:52:57 -05:00
Hans-Kristian Arntzen	a3104e98f9	Also check that type we load is an image.	2018-02-10 11:12:05 +01:00
Hans-Kristian Arntzen	a3ae861844	Fix depth image usage in MSL for separate image/samplers.	2018-02-10 10:55:10 +01:00
Hans-Kristian Arntzen	702e08671b	Support passing implicit frag_coord arguments down to functions.	2018-02-10 10:55:09 +01:00
Hans-Kristian Arntzen	0912427046	Begin implementing subpassLoad in MSL.	2018-02-10 10:54:56 +01:00
Hans-Kristian Arntzen	c9db3e5521	Overload on constant storage.	2018-02-08 17:58:46 +01:00
Hans-Kristian Arntzen	b2c9487b0f	Attempt to deduce constant/thread storage.	2018-02-08 17:07:50 +01:00
Hans-Kristian Arntzen	1a9c960058	MSL cannot declare inline arrays except in certain cases.	2018-02-08 13:06:29 +01:00
Hans-Kristian Arntzen	156dd905fd	Implicit return value takes thread storage.	2018-02-08 12:22:08 +01:00
Hans-Kristian Arntzen	d89b79025b	Fix wrong function declaration in MSL.	2018-02-08 12:22:08 +01:00
Hans-Kristian Arntzen	00ccd590ee	Return arrays in HLSL/MSL by writing to an output variable instead.	2018-02-08 12:22:08 +01:00
Hans-Kristian Arntzen	9fa91f7e1c	Support returning arrays from functions in GLSL/MSL. Not possible in HLSL apparently, need workaround ...	2018-02-08 12:22:08 +01:00
msiglreith	d096f5cafe	hlsl: Support custom root constant layout	2018-02-07 15:21:52 +01:00
Hans-Kristian Arntzen	6ca408aac2	Merge pull request #420 from billhollings/master Update copyright dates to 2018 in main files.	2018-02-01 09:02:11 +01:00
Hans-Kristian Arntzen	4c1e57ee03	Merge pull request #413 from zeux/master MSL: Order resources by type and binding index in the output	2018-02-01 09:01:34 +01:00
Bill Hollings	1c94715350	Update copyright dates to 2018 in main files.	2018-01-31 17:08:43 -05:00
Arseny Kapoulkine	7c8db865c4	Format spirv_msl.cpp	2018-01-29 06:42:34 -08:00
Arseny Kapoulkine	050361422c	MSL: Order resources by type and binding index in the output We've hit a bizarre bug on NVidia / macOS 10.13 where if two subsequent draw calls use two different shaders that both have VS use buffers 0 & 1, but one declares them in the increasing binding order and another one declares them in the decreasing binding order, then the second draw call (with the decreasing order) doesn't get correct data in some cases. This has been reported to Apple and they will probably fix it at some point; to work around that it's sufficient to sort resources by their binding index. For consistency we also sort by type to get a stable order, and output builtins after that to prevent random bugs like this from happening.	2018-01-29 06:35:41 -08:00
Hans-Kristian Arntzen	38b8f733d1	Fix passing arrays of arrays to functions in MSL.	2018-01-29 10:57:52 +01:00
Bill Hollings	e43f244399	Merge branch 'master' of https://github.com/KhronosGroup/SPIRV-Cross	2018-01-24 17:34:50 -05:00
Bill Hollings	fe3683eefa	CompilerMSL declare threadgroup variables accessed in called functions.	2018-01-24 15:38:17 -05:00
Hans-Kristian Arntzen	09f550f718	Handle exponential explosion of code-gen during first phase of compile. Certain patterns with OpVectorShuffle (and probably others) will cascade to so large, that they can cause OOM. After we have observed force_recompile, don't spend unnecessary memory emitting code which will never be used.	2018-01-24 18:12:41 +01:00
Hans-Kristian Arntzen	06041985d0	Fix HLSL regression with struct declaration. It actually worked surprisingly. Fix it properly.	2018-01-23 16:36:20 +01:00
Hans-Kristian Arntzen	7d223b8987	Fix CFG for forwarded temporaries. Forwarded temporaries would never declare a temporary. Figure out all result types ahead of time so we can deal with those temporaries as well.	2018-01-18 12:11:33 +01:00
Bill Hollings	ba1e415a9c	Use initializer list for composite initializations if backend.use_initializer_list is on.	2018-01-12 17:19:24 -05:00
Hans-Kristian Arntzen	f708b497a4	Opt in to gl_in/gl_out handling rather than other way around.	2018-01-09 09:16:33 +01:00
Bill Hollings	27d4af75a0	Revert to not forcing gl_in/gl_out block for MSL, and add MSL gl_ClipDistance tests.	2018-01-08 16:18:34 -05:00
Bill Hollings	6371d9e43a	CompilerMSL emit no-warning pragma when emitting spvConvertFromRowMajorCxR functions.	2018-01-06 00:51:25 -05:00
Bill Hollings	5ee6b46087	Fixes from review of PR #373. Code fixes from review. Refactor MSL tests back to using the SPIRV-Tools and glslang loaded by checkout_glslang_spirv_tools.sh.	2018-01-05 23:22:36 -05:00
Bill Hollings	3a7e8a1035	CompilerMSL fix bad cast error on result type derivation.	2018-01-04 21:13:38 -05:00
Bill Hollings	8890578d2a	CompilerMSL support conversion of non-square row-major matrices.	2018-01-04 16:33:45 -05:00
Bill Hollings	a68b32733a	CompilerMSL enhancements to nested function use of globals. Allow function calls to include globals as arguments. Allow function calls to include built-ins as arguments. Include all meta info when creating function args from globals. Do not manufacture a sampler for Buffer-type sampled images. Add code option to test_shaders.py to preserve SPIR-V code for interactive debugging.	2017-12-26 16:32:45 -05:00
Bill Hollings	3fcdce08ab	CompilerMSL support platform semantics. Support customizing MSL based on iOS or macOS platform. Support SPIR-V containing multiple memory semantics.	2017-12-26 13:39:07 -05:00
Vadim Shcherbakov	3376198740	and a bit better case placement	2017-12-13 13:03:31 +03:00
Vadim Shcherbakov	db402236a7	move BuiltInLayer to vertex out function block	2017-12-13 13:02:03 +03:00
Vadim Shcherbakov	717d9fefd8	another formatting fix and a comment	2017-12-11 21:02:13 +03:00
Vadim Shcherbakov	6c41f9e9da	MSL improvements: - pack/unpack nested constant buffer structs - support for write-only textures (only global ones for now) - better rt index support for msl generator	2017-12-06 09:52:07 -08:00
Hans-Kristian Arntzen	aa2557c7df	Fixups for PR #353.	2017-12-05 09:58:12 +01:00
Bill Hollings	c93d44ba3c	For MSL, use {} instead of constructors to init OpUndef values.	2017-11-30 15:03:27 -05:00
Hans-Kristian Arntzen	ce18d4ce74	Run format_all.sh.	2017-11-17 13:38:29 +01:00
Bill Hollings	e83e2b2217	CompilerMSL support and tests for OpUndef.	2017-11-15 22:44:42 -05:00
Hans-Kristian Arntzen	4427cb993d	Add support for renaming entry points.	2017-11-13 13:50:37 +01:00
Hans-Kristian Arntzen	f486142e36	Run format_all.sh.	2017-11-13 09:52:35 +01:00

1024 Commits