SPIRV-Cross

Author	SHA1	Message	Date
Hans-Kristian Arntzen	a59e25db18	MSL: Small refactors.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	74b2acab9b	MSL: Always emit block variable for block types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ae7bb41ef4	MSL: Test that we can mask location writes in TESC.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	3255d6cef0	MSL: Explicitly only consider masked variables to be thread-group-like.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ba93b6518d	MSL: Fix masking of vertex block outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	a393de31e6	MSL: Refactor out variable/block member masking.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e7b37392bf	MSL: Emit correct address space for masked arguments.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	65b5ff7ece	MSL: Don't emit weird reference type for spvUnsafeArray types.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	e7824c8b6e	MSL: Handle masked outputs in extract_global_variables. Need to conditionally add gl_in or gl_out.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	50a6bc058a	MSL: Force builtin arrays for builtin array types. Handles argument_decl() correctly.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f2b5fb3f45	MSL: Emit threadgroup storage class for masked control point outputs. Shader can still rely on writes to threadgroup memory to be visible.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	ea91579a7f	MSL: Do not redirect tess access chains on masked outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	f682e89188	MSL: Correctly emit array type for masked outputs.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	2a2d57df13	MSL: Sketch out API to aid LTO-style optimization.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	9a144bb2b9	Clean up member sorting.	2021-04-19 12:10:49 +02:00
Hans-Kristian Arntzen	0997e81118	MSL: Sort builtin IO block members by builtin type. Ensures consistent block matching.	2021-04-19 12:10:49 +02:00
Bill Hollings	b3bfe22eaa	MSL: Fixes to support padding Metal argument buffer entries based on argument index. For buffers, support all MSLResourceBinding::basetype pointers, not just void*. Rename MSLResourceBinding::base_type to basetype for consistent use in other structs.	2021-04-18 17:34:55 -04:00
Bill Hollings	daba0dfba6	MSL: Fixes to support padding Metal argument buffer entries based on argument index. For completeness, add [[id(N)]] qualifier to padding struct members. Run clang-format.	2021-04-17 15:20:53 -04:00
Bill Hollings	9060e5a13c	MSL: Fixes to support padding Metal argument buffer entries based on argument index. Use separate lookups for texture and sampler members when padding for SamplerImages. Remove unreachable code following SPIRV_CROSS_THROW.	2021-04-16 15:00:59 -04:00
Bill Hollings	9866cf4496	MSL: Fixes to support padding Metal argument buffer entries based on argument index. Add lookup from argument buffer argument index to resource binding for efficiency. Fix error in advancing padding counts with combined image samplers. Run clang-format.	2021-04-16 09:05:15 -04:00
Bill Hollings	17dab614dc	MSL: Support padding Metal argument buffer entries based on argument index. If CompilerMSL::Options::pad_argument_buffer_resources enabled, Metal argument buffer struct members are positionally aligned to their argument indexes by adding synthetic padding members when needed. The types and sizes of these synthetic members are identified in the resource_bindings vector provided through the API. Add CompilerMSL::Options::pad_argument_buffer_resources to enable padding Metal argument buffer structs to positionally match members to argument indexes. Add MSLResourceBinding::base_type to identify resource type through API.	2021-04-13 19:01:20 -04:00
Mehmet Oguz Derin	e2f7a753d2	Move condition to default block	2021-03-26 16:29:44 +03:00
Mehmet Oguz Derin	0a0c9db9be	MSL: Support long ulong types in buffers in 2.3+. [Metal Shading Language Specification](https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf) states that: > Note: As of Metal 2.3, Metal supports buffers that contain long or ulong data types. This PR aims to support long and ulong data types in buffers for Metal 2.3+.	2021-03-26 02:41:26 +03:00
Hans-Kristian Arntzen	97796e0609	MSL: Deal with pointer-to-pointer qualifier ordering.	2021-02-26 13:37:14 +01:00
Hans-Kristian Arntzen	621884d709	Merge pull request #1622 from KhronosGroup/fix-1619 MSL: Handle load and store to TessLevel array in TESC.	2021-02-17 20:46:06 +01:00
Hans-Kristian Arntzen	85704f70bc	MSL: Handle load and store to TessLevel array in TESC. More edge cases ... :(	2021-02-17 13:26:08 +01:00
Hans-Kristian Arntzen	ce552f4f91	MSL: Gracefully assign automatic input locations to builtin attributes.	2021-02-17 12:29:19 +01:00
Hans-Kristian Arntzen	aa271c1460	MSL: Refactor out location consumption count computation.	2021-02-17 11:29:33 +01:00
Hans-Kristian Arntzen	bae17e8204	Merge pull request #1617 from KhronosGroup/fix-1608 MSL: Fixup type when using tessellation levels in TESC functions.	2021-02-16 11:10:07 +01:00
Hans-Kristian Arntzen	daddbd4078	MSL: Fixup type when using tessellation levels in TESC functions. Need to rewrite array size depending on execution mode.	2021-02-15 13:28:11 +01:00
Hans-Kristian Arntzen	0ad12a0036	MSL: Always return [[position]] when required.	2021-02-15 12:57:37 +01:00
Hans-Kristian Arntzen	4704482bbc	meta: Update copyright headers to 2021.	2021-01-14 16:07:49 +01:00
Hans-Kristian Arntzen	893a011299	MSL: Fix various bugs with framebuffer fetch on macOS and argument buffers. Introduce a helper to make it clearer if a resource can be considered for argument buffers or not.	2021-01-08 10:19:18 +01:00
Hans-Kristian Arntzen	3136e34215	MSL: Always use input_attachment_index for framebuffer fetch binding. --msl-decoration-binding would end up overriding the input attachment index to binding which is very unexpected and broken.	2021-01-08 10:17:42 +01:00
Hans-Kristian Arntzen	fa76d01203	MSL: Only consider builtin variables if they are part of IO interface.	2021-01-07 10:50:29 +01:00
Hans-Kristian Arntzen	c8837d7d80	MSL: Very slight refactor.	2021-01-07 10:27:14 +01:00
Hans-Kristian Arntzen	efed4c9738	MSL: Fix initializer for tess level outputs. It's an array, not vector.	2021-01-06 10:39:39 +01:00
Hans-Kristian Arntzen	ab9200ffdf	MSL: Don't flatten builtin arrays unless they're part of IO interface.	2021-01-06 10:33:17 +01:00
Hans-Kristian Arntzen	df4f8ef8fe	MSL: Emit correct initializer for tessellation control points.	2021-01-05 15:16:49 +01:00
Hans-Kristian Arntzen	ad3e1584f9	MSL: Handle initializers for tess levels.	2021-01-05 13:25:50 +01:00
Hans-Kristian Arntzen	1a38fec382	Minor redundant nit.	2021-01-04 19:12:46 +01:00
Hans-Kristian Arntzen	a1c784f002	More robust handling of initialized output builtin variables.	2021-01-04 19:12:43 +01:00
Hans-Kristian Arntzen	c4ff129fe3	MSL: Handle reserved identifiers for entry point. We only considered invalid names, and overwrote the alias for the function. The correct fix is to replace illegal names early, do the reserved fixup, then copy back alias to entry point name.	2021-01-04 09:40:11 +01:00
Hans-Kristian Arntzen	6d10da0224	Merge pull request #1553 from comex/no-subgroups-in-vertex-shaders msl: Don't try to use [[thread_index_in_simdgroup]] in vertex shaders.	2020-12-02 19:29:19 +01:00
Hans-Kristian Arntzen	cf1e9e0643	Add MIT dual license for the SPIRV-Cross API.	2020-12-01 16:47:08 +01:00
Jan Sikorski	c09a65c12d	MSL: Added fmin3 and fmax3 library functions to the illegal name list.	2020-11-27 15:25:34 +01:00
comex	5a85fa9400	msl: Don't try to use [[thread_index_in_simdgroup]] in vertex shaders. According to the Metal Shading Language Specification, it's not supported for vertex functions in any Metal version, only fragment and kernel functions.	2020-11-26 20:19:08 -05:00
comex	c80cbde7aa	spirv_msl: Don't add fixup hooks for builtin variables if they're unused. This is necessary to avoid invalid output because of how implicit dependencies on builtins work. For example, the fixup for `BuiltInSubgroupEqMask` initializes the variable based on `builtin_subgroup_invocation_id_id`, a field storing the ID for a variable with decoration `BuiltInSubgroupLocalInvocationId`. This could be either a variable that already exists in the input (spirv_msl.cpp:300) or, if necessary, a newly created one (spirv_msl.cpp:621). In both cases, though, `builtin_subgroup_invocation_id_id` is only set under the condition `need_subgroup_mask || needs_subgroup_invocation_id`. `need_subgroup_mask` is true if any of the `BuiltInSubgroupXXMask` are set in `active_input_builtins`. Normally, if the program contains `BuiltInSubgroupEqMask`, `Compiler::ActiveBuiltinHandler` will set it in `active_input_builtins`. But this only happens if the variable is actually used, whereas `fix_up_shader_inputs_outputs` loops over all variables in the program regardless of whether they're used. If `BuiltInSubgroupEqMask` is not used, `builtin_subgroup_invocation_id_id` is never set, but before this patch the fixup hook would try to use it anyway, producing MSL that references a nonexistent variable named `_0`. Avoid this by changing `fix_up_shader_inputs_outputs` to skip builtins which are not set in `active_input_builtins` or `active_output_builtins`. And add a test case.	2020-11-25 13:41:12 -05:00
Chip Davis	1e67b21ee9	MSL: Don't mask off inactive bits in ballot masks. This was based on my misreading the spec. The Vulkan CTS expects the bits to be set, even if the invocations corresponding to them are inactive.	2020-11-25 09:29:51 -06:00
Chip Davis	fd738e3387	MSL: Adjust FragCoord for sample-rate shading. In Metal, the `[[position]]` input to a fragment shader remains at fragment center, even at sample rate, like OpenGL and Direct3D. In Vulkan, however, when the fragment shader runs at sample rate, the `FragCoord` builtin moves to the sample position in the framebuffer, instead of the fragment center. To account for this difference, adjust the `FragCoord`, if present, by the sample position. The -0.5 offset is because the fragment center is at (0.5, 0.5). Also, add an option to force sample-rate shading in a fragment shader. Since Metal has no explicit control for this, this is done by adding a dummy `[[sample_id]]` which is otherwise unused, if none is already present. This is intended to be used from e.g. MoltenVK when a pipeline's `minSampleShading` value is nonzero. Instead of checking if any `Input` variables have `Sample` interpolation, I've elected to check that the `SampleRateShading` capability is present. Since `SampleId`, `SamplePosition`, and the `Sample` interpolation decoration require this cap, this should be equivalent for any valid SPIR-V module. If this isn't acceptable, let me know.	2020-11-23 10:30:24 -06:00
Hans-Kristian Arntzen	6a614cc7f7	Normalize all internal workaround methods to use spv prefix. We have been interchanging spv and SPIRV_Cross_ for a while, which causes weirdness since we don't explicitly ban SPIRV_Cross identifiers, as these identifiers are generally used for interface variable workarounds.	2020-11-23 15:42:27 +01:00
Chip Davis	68908355a9	MSL: Expand subgroup support. Add support for declaring a fixed subgroup size. Metal, like Vulkan with `VK_EXT_subgroup_size_control`, allows the thread execution width to vary depending on factors such as register usage. Unfortunately, this breaks several tests that depend on the subgroup size being what the device says it is. So we'll fix the subgroup size at the size the device declares. The extra invocations in the subgroup will appear to be inactive. Because of this, the ballot mask builtins are now ANDed with the active subgroup mask. Add support for emulating a subgroup of size 1. This is intended to be used by Vulkan Portability implementations (e.g. MoltenVK) when the hardware/software combo provides insufficient support for subgroups. Luckily for us, Vulkan 1.1 only requires that the subgroup size be at least 1. Add support for quadgroup and SIMD-group functions which were added to iOS in Metal 2.2 and 2.3. This will allow clients to take advantage of expanded quadgroup and SIMD-group support in recent Metal versions and on recent Apple GPUs (families 6 and 7). Gut emulation of subgroup builtins in fragment shaders. It turns out codegen for the SIMD-group functions in fragment wasn't implemented for AMD on Mojave; it's a safe bet that it wasn't implemented for the other drivers either. Subgroup support in fragment shaders now requires Metal 2.2.	2020-11-20 15:55:49 -06:00
Hans-Kristian Arntzen	b3c59263a0	Merge pull request #1541 from cdavis5e/msl-ios-features MSL: Expose some more features on iOS.	2020-11-16 09:43:40 +01:00
Chip Davis	88e25e60ec	MSL: Expose some more features on iOS. `min_lod_clamp()` was actually added in MSL 2.2 on iOS 13. The restriction was based on the beta versions which didn't have it. Since the beta versions didn't support family 6, this leads me to suspect that the reason they lacked `min_lod_clamp()` is that it requires family 6. This does not seem to be documented anywhere. `simd_is_helper_thread()` was added in MSL 2.3 to iOS. I neglected to update this when I finished up `SPV_EXT_demote_to_helper_invocation`. `barycentric_coord` and `primitive_id` were added in MSL 2.3 on iOS 14. They are only supported on family 7.	2020-11-14 01:57:11 -06:00
Hans-Kristian Arntzen	ef0256c23e	Fix switch fallthrough	2020-11-11 16:21:18 +01:00
Jan Sikorski	f0239bce05	MSL: extract global variables from subgroup ballot operations. Fixes #1513.	2020-11-09 11:23:01 +01:00
Hans-Kristian Arntzen	6fc2a0581a	Run format_all.sh.	2020-11-08 13:59:52 +01:00
Hans-Kristian Arntzen	46bf1e99d6	Merge pull request #1525 from cdavis5e/msl-interpolation-functions MSL: Support pull-model interpolation on MSL 2.3+.	2020-11-07 17:04:56 +01:00
Hans-Kristian Arntzen	db13762297	MSL: Fix regression in image gather handling. It was not always possible to get backing variable for a late-combined image sampler.	2020-11-06 16:21:30 +01:00
Chip Davis	aca9b6879a	MSL: Support pull-model interpolation on MSL 2.3+. New in MSL 2.3 is a template that can be used in the place of a scalar type in a stage-in struct. This template has methods which interpolate the varying at the given points. Curiously, you can't set interpolation attributes on such a varying; perspective-correctness is encoded in the type, while interpolation must be done using one of the methods. This makes using this somewhat awkward from SPIRV-Cross, requiring us to jump through a bunch of hoops to make this all work. Using varyings from functions in particular is a pain point, requiring us to pass the stage-in struct itself around. An alternative is to pass references to the interpolants; except this will fall over badly with composite types, which naturally must be flattened. As with tessellation, dynamic indexing isn't supported with pull-model interpolation. This is because of the need to reference the original struct member in order to call one of the pull-model interpolation methods on it. Also, this is done at the variable level; this means that if one varying in a struct is used with the pull-model functions, then the entire struct is emitted as pull-model interpolants. For some reason, this was not documented in the MSL spec, though there is a property on `MTLDevice`, `supportsPullModelInterpolation`, indicating support for this, which is documented. This does not appear to be implemented yet for AMD: it returns `NO` from `supportsPullModelInterpolation`, and pipelines with shaders using the templates fail to compile. It is implemented for Intel. It's probably also implemented for Apple GPUs: on Apple Silicon, OpenGL calls down to Metal, and it wouldn't be possible to use the interpolation functions without this implemented in Metal. Based on my testing, where SPIR-V and GLSL have the offset relative to the pixel center, in Metal it appears to be relative to the pixel's upper-left corner, as in HLSL. Therefore, I've added an offset 0.4375, i.e. one half minus one sixteenth, to all arguments to `interpolate_at_offset()`. This also fixes a long-standing bug: if a pull-model interpolation function is used on a varying, make sure that varying is declared. We were already doing this only for the AMD pull-model function, `interpolateAtVertexAMD()`; for reasons which are completely beyond me, we weren't doing this for the base interpolation functions. I also note that there are no tests for the interpolation functions for GLSL or HLSL.	2020-11-05 11:57:45 -06:00
Hans-Kristian Arntzen	244839d350	Merge pull request #1516 from billhollings/VK_EXT_descriptor_indexing MSL: Support run-time sized image and sampler arrays	2020-11-03 10:15:36 +01:00
Bill Hollings	4bdd49df3f	Syntax and format updates from code review.	2020-11-02 22:15:20 -05:00
Bill Hollings	7f67abe0fe	Minor format and typo updates from code review.	2020-10-30 16:05:44 -04:00
Chip Davis	547c29f7bb	MSL: Allow Bias and Grad arguments with comparison on Mac in MSL 2.3. I kept the code to replace constant zero arguments, because `Bias` and `Grad` still have some problems on desktop GPUs. `Bias` works on AMD GPUs. `Grad` does not. Both work on Intel. Still needs testing on NV. It will definitely work with Apple GPUs.	2020-10-30 11:14:59 -05:00
Hans-Kristian Arntzen	541a801fed	Merge pull request #1514 from cdavis5e/msl-mac-framebuffer-fetch MSL: Allow framebuffer fetch on Mac in MSL 2.3.	2020-10-30 08:09:41 +01:00
Yuwen Wu	c8a43876c7	added metal keyword: "level" (#1501) * added metal keyword: "level" * added more metal keywords * updated test case.	2020-10-30 08:07:25 +01:00
Bill Hollings	b7b0e804e5	MSL: Support run-time sized image and sampler arrays (GL_EXT_nonuniform_qualifier/SPV_EXT_descriptor_indexing). MSLResourceBinding includes array size through API, and substitutes in that size if the image or sampler array is not explicitly sized. OpCopyObject supports SPIRCombinedImageSampler type in MSL.	2020-10-29 18:50:42 -04:00
Chip Davis	c20d5945a2	MSL: Allow framebuffer fetch on Mac in MSL 2.3. Another Apple GPU feature that will now be supported on Apple Silicon Macs.	2020-10-29 10:50:59 -05:00
Chip Davis	d48d2a95c7	MSL: Allow post-depth coverage on Mac in MSL 2.3. It's still only supported on Apple GPUs, but Macs will have those soon.	2020-10-27 22:07:01 -05:00
Chip Davis	1b6b9705e6	MSL: For 2.1+, don't disable rasterization for vertex writes. Writing to buffers actually works starting in MSL 2.1 (macOS 10.14, iOS 12). Writing to textures works starting in MSL 2.2 (macOS 10.15, iOS 13). No tests unfortunately, because the MSL 2.2 compiler and above produce a warning that cannot be disabled, because it has no associated option.	2020-10-26 11:43:13 -05:00
Hans-Kristian Arntzen	f65f259ab7	MSL: Do not use component::x gather for depth2d textures.	2020-10-26 10:18:17 +01:00
Chip Davis	1264e2705e	MSL: Cast broadcast booleans to ushort. Metal doesn't support broadcasting or shuffling boolean values, but we can work around that by casting it to `ushort`, then casting it back to `bool`. I used `ushort` instead of `uint` because 16-bit values give better throughput on Apple GPUs.	2020-10-23 21:55:46 -05:00
Chip Davis	065b5bda3c	MSL: Mask ballots passed to Ballot bit ops. Only the least n bits are significant, where n is the subgroup size. The Vulkan CTS actually checks this. The `FindLSB` tests weren't actually failing, but I masked that anyway, in case there's some corner case the CTS is missing.	2020-10-23 21:55:46 -05:00
Chip Davis	781367d083	MSL: Support vectors with OpGroupNonUniformAllEqual. This was not tested here in SPIRV-Cross. Predictably, it broke when I tried it in the CTS.	2020-10-23 21:55:46 -05:00
Chip Davis	6ccb902462	MSL: Correct definitions of subgroup ballot mask variables. `SubgroupEqMask` had a fencepost error that gave wrong values for invocation ID 32. For `SubgroupGeMask` and `SubgroupGtMask`, I forgot to shift the values from `extract_bits()` up so that the mask is in the correct position. Using `insert_bits()` instead should fold these two operations into one. `SubgroupLtMask` and `SubgroupLeMask` were already correct.	2020-10-23 21:54:55 -05:00
Chip Davis	064ed448b9	MSL: Don't remove periods from swizzle buffer index exprs.	2020-10-20 17:47:40 -05:00
Chip Davis	5845e009ea	MSL: Handle Offset and Grad operands for 1D-as-2D textures.	2020-10-15 12:51:00 -05:00
Hans-Kristian Arntzen	9c220a8247	Merge pull request #1490 from KhronosGroup/fix-1488 MSL: Support querying and modifying generated combined sampler suffix.	2020-10-15 10:52:28 +02:00
Chip Davis	3e6010d8c5	MSL: Don't use a bitcast for tessellation levels in tesc shaders. `half` cannot be bitcasted to `float`, because the two types are not the same size. Use an expanding cast instead. We were already doing this for stores to the tessellation levels; why I didn't also do this for loads is beyond me.	2020-10-14 18:35:59 -05:00
Hans-Kristian Arntzen	bd1ee4344e	MSL: Support querying and modifying generated combined sampler suffix.	2020-10-14 14:52:18 +02:00
Chip Davis	21d38f74ce	MSL: Fix calculation of atomic image buffer address. Fix reversed coordinates: `y` should be used to calculate the row address. Align row address to the row stride. I've made the row alignment a function constant; this makes it possible to override it at pipeline compile time. Honestly, I don't know how this worked at all for Epic. It definitely didn't work in the CTS prior to this.	2020-10-13 20:51:56 -05:00
Chip Davis	7a5d0d6b29	MSL: Add missing interlock handling to atomic image buffers.	2020-10-13 11:44:17 -05:00
Hans-Kristian Arntzen	fab6ad234e	Merge pull request #1486 from cdavis5e/atomic-image-argument-buffer MSL: Support atomic access to images from argument buffers.	2020-10-13 12:55:43 +02:00
Chip Davis	9cafea6cf8	MSL: Support atomic access to images from argument buffers. This was not added when Epic contributed atomic image support. Fixes #1484.	2020-10-13 02:37:18 -05:00
Chip Davis	2219c4a392	MSL: Support SPV_EXT_demote_to_helper_invocation for MSL 2.3. MSL 2.3 has everything needed to support this extension on all platforms. The existing `discard_fragment()` function was given demote semantics, similar to Direct3D, and the `simd_is_helper_thread()` function was finally added to iOS. I've left the old test alone. Should I remove it in favor of these?	2020-10-13 00:25:32 -05:00
Hans-Kristian Arntzen	5ea576ece2	Allow flip_vert_y in all relevant stages.	2020-09-28 14:10:08 +02:00
Hans-Kristian Arntzen	66afe8c499	Implement a simple evaluator of specialization constants. In some cases, we need to get a literal value from a spec constant op. Mostly relevant when emitting buffers, so implement a 32-bit integer scalar subset of the evaluator. Can be extended as needed to support evaluating any specialization constant operation.	2020-09-14 11:45:59 +02:00
Hans-Kristian Arntzen	bdbef7b1f3	Merge pull request #1461 from Kangz/fix-warnings Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 12:59:22 +02:00
Corentin Wallez	bcd71536e2	Fix -Wduplicate-enum and -Wrange-for-analysis.	2020-09-04 11:13:21 +02:00
Hans-Kristian Arntzen	18d03b3ea6	Handle OpUndef %void. Apparently this is legal and spirv-opt is starting to emit this kind of code now.	2020-09-04 09:29:44 +02:00
Chip Davis	4cf840ee7b	MSL: Support layered input attachments. These need to use arrayed texture types, or Metal will complain when binding the resource. The target layer is addressed relative to the Layer output by the vertex pipeline, or to the ViewIndex if in a multiview pipeline. Unlike with the s/t coordinates, Vulkan does not forbid non-zero layer coordinates here, though this cannot be expressed in Vulkan GLSL. Supporting 3D textures will require additional work. Part of the problem is that Metal does not allow texture views to subset a 3D texture, so we need some way to pass the base depth to the shader.	2020-09-02 09:18:25 -05:00
Chip Davis	cab7335e64	MSL: Don't set the layer for multiview if the device doesn't support it. Some older iOS devices don't support layered rendering. In that case, don't set `[[render_target_array_index]]`, because the compiler will reject the shader in that case. The client will then have to unroll the render pass manually.	2020-09-01 19:30:28 -05:00
Chip Davis	53080ecca8	MSL: Fix multiview view index calculation with a non-zero base instance. Account for a non-zero base instance when calculating the view index and the "real" instance index. Before, it was likely broken with a non-zero base instance, since the calculated instance index could be less than the base instance.	2020-08-31 20:33:44 -05:00
Hans-Kristian Arntzen	95993f78af	Run format_all.sh.	2020-08-24 11:02:49 +02:00
Hans-Kristian Arntzen	a07441568e	Overhaul how we deal with reserved identifiers. - Do not silently drop reserved identifiers in the parser. This makes it possible to reflect identifiers which are reserved by the cross-compiler module. - Instead of dropping the name, emit _RESERVED_IDENTIFIER_FIXUP in the source to make it clear that a name has been rewritten. - Document what is reserved and not.	2020-08-21 16:33:27 +02:00
Le Hoang Quyen	ab8eb70af1	Fix #1445 : MSL: Enclose args when converting distance(a,b) to abs(a-b)	2020-08-13 21:16:08 +08:00
Chip Davis	3347b1076d	MSL: Fix handling of matrices and structs in the output control point array. Prior to this point, we were treating them as flattened, as they are in old-style tessellation control shaders, and still are for structs in new-style shaders. This is not true for outputs; output composites are not flattened at all. This semantic mismatch broke a Vulkan CTS test. It should now pass.	2020-08-03 17:18:18 -05:00
Tomek Ponitka	18f23c47d9	Enabling setting a fixed sampleMask in Metal fragment shaders. In Metal, render pipelines don't have an option to set a sampleMask parameter; the only way to get that functionality is to set the sample_mask output of the fragment shader to this value directly. We also need to take care to combine the fixed sample mask with the one that the shader might possibly output.	2020-07-24 11:19:46 +02:00
Chip Davis	688c5fcbda	MSL: Add support for processing more than one patch per workgroup. This should hopefully reduce underutilization of the GPU, especially on GPUs where the thread execution width is greater than the number of control points. This also simplifies initialization by reading the buffer directly instead of using Metal's vertex-attribute-in-compute support. It turns out the only way in which shader stages are allowed to differ in their interfaces is in the number of components per vector; the base type must be the same. Since we are using the raw buffer instead of attributes, we can now also emit arrays and matrices directly into the buffer, instead of flattening them and then unpacking them. Structs are still flattened, however; this is due to the need to handle vectors with fewer components than were output, and I think handling this while also directly emitting structs could get ugly. Another advantage of this scheme is that the extra invocations needed to read the attributes when there were more input than output points are now no more. The number of threads per workgroup is now lcm(SIMD-size, output control points). This should ensure we always process a whole number of patches per workgroup. To avoid complexity handling indices in the tessellation control shader, I've also changed the way vertex shaders for tessellation are handled. They are now compute kernels using Metal's support for vertex-style stage input. This lets us always emit vertices into the buffer in order of vertex shader execution. Now we no longer have to deal with indexing in the tessellation control shader. This also fixes a long-standing issue where if an index were greater than the number of vertices to draw, the vertex shader would wind up writing outside the buffer, and the vertex would be lost. This is a breaking change, and I know SPIRV-Cross has other clients, so I've hidden this behind an option for now. In the future, I want to remove this option and make it the default.	2020-07-23 17:59:54 -05:00
Chip Davis	884bc6df65	MSL: Factor creating a uint type into its own method. This is so common for artificially created variables that it's worth it to create it once and save it for later use.	2020-07-22 16:25:14 -05:00
Chip Davis	5e13f7fdf2	MSL: Factor a really gnarly condition into its own method. That branch has become nigh unreadable. This new method should make it readable again.	2020-07-22 16:25:10 -05:00
Thomas Roughton	b74a84e4cb	MSL: Ensure OpStore source operands are marked for inclusion in function arguments. Without this change, code such as `OpStore %param_var_mipLevelSizes_0 %heightmapMipSizes` within a function that then forwards the value `%param_var_mipLevelSizes_0` to another function will not have `%heightmapMipSizes` registered as an argument to the function.	2020-07-07 17:10:36 +12:00
Hans-Kristian Arntzen	711300baed	MSL: Do not emit swizzled writes in packing fixups. Similar to scalar access chain fix, this causes a read-modify-write on memory we're not supposed to write to.	2020-07-06 10:03:46 +02:00
Hans-Kristian Arntzen	fa5b206d97	MSL: Workaround broken vector -> scalar access chain in MSL. On MSL, the compiler refuses to allow access chains into a normal vector type. What happens in practice instead is a read-modify-write where a vector type is loaded, modified and written back. The workaround is to convert a vector into a pointer-to-scalar before the access chain continues to add the scalar index.	2020-07-06 10:03:44 +02:00
Hans-Kristian Arntzen	e1600d4df8	MSL: Use input attachment index directly for resource index fallback.	2020-07-06 09:49:46 +02:00
Hans-Kristian Arntzen	d573a95a9c	Run format_all.sh.	2020-07-01 11:42:58 +02:00
Hans-Kristian Arntzen	f9da366ae6	MSL: Remove the old VertexAttr API. Too many issues with deprecated declarations on various compilers, just get rid of it.	2020-06-22 11:14:24 +02:00
Hans-Kristian Arntzen	7314f51a32	MSL: Deal with loading non-value-type arrays.	2020-06-18 12:46:39 +02:00
Hans-Kristian Arntzen	03d4bcea68	MSL: Improve handling of array types in buffer objects. When loading and storing array types which belong to buffer objects, we need to treat these values as not being value types. Also, need to handle array load/store from/to more address space combinations.	2020-06-18 11:49:03 +02:00
Chip Davis	5281d9997e	MSL: Fix up input variables' vector lengths in all stages. Metal is picky about interface matching. If the types don't match exactly, down to the number of vector components, Metal fails pipeline compilation. To support pipelines where the number of components consumed by the fragment shader is less than that produced by the vertex shader, we have to fix up the fragment shader to accept all the components produced.	2020-06-16 14:50:30 -05:00
Hans-Kristian Arntzen	553a7f959b	Merge pull request #1385 from KhronosGroup/fix-1237 GLSL: Implement sparse feedback.	2020-06-08 11:12:00 +02:00
Hans-Kristian Arntzen	cbe0cca73b	Refactor texture fetch function generation. Use structs instead of a million bool/uint32_t arguments passed on stack.	2020-06-08 10:17:40 +02:00
Alexander Meißner	1f5875e23e	Fix missing switch cases in Y'CbCr conversion. Otherwise the following lines will never be reached for the other two valid ycbcr_models (RGB_IDENTITY and YCBCR_IDENTITY) as they would cause a SPIRV_CROSS_THROW.	2020-06-06 22:01:05 +02:00
Hans-Kristian Arntzen	275974e062	GLSL: Implement sparse feedback.	2020-06-04 15:50:28 +02:00
Hans-Kristian Arntzen	107ab7c2b7	MSL: Avoid packed arrays in more cases. Extend the array stride relaxation to non-packed arrays as well, as long as the array in question contains a single array element.	2020-05-06 10:27:12 +02:00
Le Hoang Quyen	9ddfe6db6d	Fix #1359 : MSL: If the packed type is scalar, don't emit "pack_" prefix. Scalar type is already packed in metal.	2020-05-06 00:43:34 +08:00
Hans-Kristian Arntzen	0ebb88cc39	MSL: Redirect member indices when buffer has been sorted by Offset. If a buffer rewrites its Offsets, all member references to that struct are invalidated, and must be redirected, do so in to_member_reference, but there might be other places where this is needed. Fix as required. SPIR-V code relying on this is somewhat questionable, but seems to be in-spec.	2020-04-30 11:48:53 +02:00
Hans-Kristian Arntzen	d7d630a0b7	Merge pull request #1347 from KhronosGroup/fix-1343 Implement OpAtomicLoad/OpAtomicStore.	2020-04-27 15:29:21 +02:00
Hans-Kristian Arntzen	9b7140e2ba	Implement OpAtomicLoad/OpAtomicStore. Need some emulation on GLSL/HLSL, fix bug with atomic store on MSL.	2020-04-27 12:11:46 +02:00
Hans-Kristian Arntzen	6ef47d6657	MSL: Fix case where subpassInput is passed to leaf functions.	2020-04-27 11:29:21 +02:00
Hans-Kristian Arntzen	5e5d1c27ce	GLSL: Support f16x2 <-> f32 bitcast. There is no native formulation, so introduce a concept of a "complex" bitcast to handle odd-ball cases which have no native unary operation.	2020-04-21 23:27:33 +02:00
Hans-Kristian Arntzen	17ad62eea4	MSL: Support edge case with DX layout in scalar block layout. DX may emit ArrayStride and MatrixStride of 16, but the size of the object does not align with that and expect to pack other members inside its last member. The workaround is to emit array size/col/row one less than we expect and rely on padding to carve out a "dead zone" for the last member.	2020-04-20 15:29:24 +02:00
Hans-Kristian Arntzen	ebf463674d	MSL: Allow removing clip distance user varyings. Only safe if user knows that subsequent shader stage will not read clip distance.	2020-04-20 09:58:40 +02:00
Chip Davis	96f7008aa8	MSL: Force disabled fragment builtins to have the right name. DXVK emits SPIR-V where fragment shader builtins have names derived from DXBC assembly, e.g. `oDepth` for `FragDepth`. When we declared the disabled output, we used this name, but when referencing it, we continued to use the GLSL name. This breaks compilation.	2020-04-15 19:25:18 -05:00
Chip Davis	495e48de44	MSL: Only disable output variables in fragment shaders. Forgot to do this in #1319. Fixes #1322.	2020-04-15 12:14:57 -05:00
Chip Davis	b29f83c383	MSL: Add options to control emission of fragment outputs. Like with `point_size` when not rendering points, Metal complains when writing to a variable using the `[[depth]]` qualifier when no depth buffer be attached. In that case, we must avoid emitting `FragDepth`, just like with `PointSize`. I assume it will also complain if there be no stencil attachment and the shader write to `[[stencil]]`, or it write to `[[color(n)]]` but there be no color attachment at n.	2020-04-13 15:29:11 -05:00
Hans-Kristian Arntzen	d9d3359ffb	MSL: Deal with cases where builtin is implicitly needed, declared, but unused. We need to make sure any builtins which are declared and unused are emitted as active variables.	2020-04-03 12:50:21 +02:00
Thomas Roughton	4cf736d753	MSL: mark BuiltInFragCoord as implicitly used for subpass reads. Without this patch, `gl_FragCoord` is not output for subpass reads in certain cases where `gl_FragCoord` is not used elsewhere in the shader. I'm not sure why this isn't caught by existing tests (e.g. [input-attachment.frag](shaders-msl/frag/input-attachment.frag)), but I encountered this issue in code generated by DXC and passed through spirv-opt.	2020-04-03 14:32:31 +13:00
Hans-Kristian Arntzen	3cb6aeb480	MSL: Fix access chain for deep struct hierarchy on array of buffers.	2020-03-31 14:17:29 +02:00
Hans-Kristian Arntzen	b8905bbd95	Add support for forcefully zero-initialized variables. Useful to better support certain platforms which require all variables to be initialized to something.	2020-03-26 13:38:27 +01:00
Hans-Kristian Arntzen	30343f3e95	MSL: Reintroduce workaround for constant arrays being passed by value.	2020-02-24 13:22:52 +01:00
Hans-Kristian Arntzen	20b28f72fa	MSL: Reinstate workaround for returning arrays.	2020-02-24 13:04:10 +01:00
Hans-Kristian Arntzen	c9d4f9cd74	MSL: Add a workaround path to force native arrays for everything.	2020-02-24 12:47:14 +01:00
Hans-Kristian Arntzen	cec0502ba5	Remove old memory_scope flag from iOS barriers. I cannot find any reference to this flag ever having existed in older MSL spec documents, and it breaks compilation on any recent SDK for any iOS/macOS Metal version. Just remove it.	2020-02-06 10:18:00 +01:00
Chip Davis	ae6c05f6f4	MSL: Move inline uniform blocks to the end of the argument buffer. Limit inline blocks to one per descriptor set. This should avoid the need for complicated code to calculate the argument buffer ID stride of an inline uniform block. If there's demand for more inline blocks, we can revisit this.	2020-01-25 13:40:51 -06:00
Chip Davis	fedbc35315	MSL: Support inline uniform blocks in argument buffers. Here, the inline uniform block is explicit: we instantiate the buffer block itself in the argument buffer, instead of a pointer to the buffer. I just hope this will work with the `MTLArgumentDescriptor` API... Note that Metal recursively assigns individual members of embedded structs IDs. This means for automatic assignment that we have to calculate the binding stride for a given buffer block. For MoltenVK, we'll simply increment the ID by the size of the inline uniform block. Then the later IDs will never conflict with the inline uniform block. We can get away with this because Metal doesn't require that IDs be contiguous, only monotonically increasing.	2020-01-24 18:51:24 -06:00
Hans-Kristian Arntzen	f9818f0804	Update license headers to 2020.	2020-01-16 15:24:37 +01:00
Hans-Kristian Arntzen	7a411258af	Run format_all.sh.	2020-01-16 15:20:59 +01:00
Hans-Kristian Arntzen	af787a8a79	Merge pull request #1264 from KhronosGroup/msl-argument-buffer-persist MSL: Add support for force-activating IAB resources.	2020-01-16 14:44:23 +01:00
Hans-Kristian Arntzen	c3bd136df1	MSL: Add support for force-activating IAB resources. Important for ABI compatibility on MSL in certain cases.	2020-01-16 11:12:06 +01:00
Hans-Kristian Arntzen	f79c1e2fed	Deal with illegal names in types as well. - Fixes issue with clip_distance flattening in MSL where member to flatten from would come from to_member_name, where it should have used the builtin name directly. This member name was modified by this patch and broke clip distance test shaders. - Some cleanups with ir.meta, use ir.find_meta instead to not create unnecessary hashmap nodes.	2020-01-16 10:34:49 +01:00
Hans-Kristian Arntzen	172e39f039	Merge pull request #1257 from KhronosGroup/fix-1236 Deal with bitcasting for subgroup Min/Max operations	2020-01-09 15:35:43 +01:00
Hans-Kristian Arntzen	cc153f8d7f	HLSL: Add a resource remapping API similar to MSL. Allows more flexibility of how resources are assigned without having to remap decorations.	2020-01-09 12:41:06 +01:00
Hans-Kristian Arntzen	2bbb012e9c	MSL: Deal with sign on wave min/max.	2020-01-09 12:35:18 +01:00
Hans-Kristian Arntzen	b522b409ae	Run format_all.sh.	2020-01-08 10:48:30 +01:00
Hans-Kristian Arntzen	c024e24d45	MSL: Deal with padded fragment output + Component decoration.	2020-01-07 17:02:12 +01:00
Hans-Kristian Arntzen	8871502a20	MSL: Explicitly don't support component packing for tessellation.	2020-01-07 16:49:19 +01:00
Hans-Kristian Arntzen	ecdfd3eb66	MSL: Don't set OrigID when emitting component packed vectors. There is no unique OrigID, so we shouldn't ever need to look at this value.	2020-01-07 14:57:19 +01:00
Hans-Kristian Arntzen	93f3265fe0	MSL: Deal with packing vectors for vertex input/fragment output.	2020-01-07 14:14:31 +01:00
Hans-Kristian Arntzen	7a69d764b0	MSL: Add trivial tests for Component decoration. Verifies that Component decoration is honored for vertex outputs and fragment inputs.	2020-01-07 11:36:51 +01:00
Hans-Kristian Arntzen	00d5c78447	Remove obsolete use of AtomicCounterMemoryMask.	2019-12-04 15:30:07 +01:00
Hans-Kristian Arntzen	99084dcee4	Fix uninitialized memory issue. Seen on Travis with MSVC Win32 Release builds only.	2019-12-02 15:54:46 +01:00
Hans-Kristian Arntzen	a3fe9756d2	MSL: Support ClipDistance as an input stage variable. MSL does not support this, so we have to emulate it by passing it around as a varying between stages. We use a special "user(clipN)" attribute for this rather than locN which is used for user varyings.	2019-12-02 13:19:42 +01:00
Hans-Kristian Arntzen	b85ab5f5ff	MSL: Fix automatic binding allocation for image atomic buffers. The Primary decoration was used by the atomic buffer, causing the texture binding to be potentially overlapping with other resources.	2019-11-28 11:07:44 +01:00
Hans-Kristian Arntzen	0bbe041411	MSL: Remove dubious workaround code in unpack_expression. From UE4 review, does not cause any changes in test output, and should only change output if we were unpacking arrays or something like that, which we don't support.	2019-11-07 11:35:07 +01:00
Hans-Kristian Arntzen	d9afa9e238	MSL: Fix unpack_expression from column of padded matrix.	2019-11-07 11:35:07 +01:00
Hans-Kristian Arntzen	40cc0a00df	MSL: Avoid some fallthrough warnings.	2019-10-28 15:20:38 +01:00
Hans-Kristian Arntzen	123fa906ec	Merge branch 'ue4-merge'	2019-10-28 15:14:42 +01:00
Hans-Kristian Arntzen	39bd5f1834	Run format_all.sh.	2019-10-28 12:55:14 +01:00
Hans-Kristian Arntzen	1d5eba67f8	MSL: Remove hacky workaround for patch constant passing. There was a hack to work around a bug in DXC where control point -> patch constant phase was passed in Function storage, but we have to use Workgroup here. We will not support these kinds of hacks for invalid SPIR-V, so hack the reference files to use the "proper" fix and remove the hack for the time being.	2019-10-28 12:52:28 +01:00
Hans-Kristian Arntzen	8f13a3f4b1	MSL: Remove workaround for passing constant arrays to functions. Arrays are value-types now, so remove the old workaround.	2019-10-28 12:14:43 +01:00
Hans-Kristian Arntzen	ac8d335480	MSL: Fix integer cast.	2019-10-26 20:30:24 +02:00
Hans-Kristian Arntzen	fa011f8547	MSL: Declare arrays with proper type wrapper. Need to construct with value type spvUnsafeArray<T, N>({ elem0, elem1 }) to make array initialization work in complex scenarios.	2019-10-26 17:57:34 +02:00
Hans-Kristian Arntzen	2745959646	MSL: Deal with chained access chains for tessellation IO variables. Need to specially deal with this since we might have to split up access chain and deal with indexing into IB struct.	2019-10-26 17:26:56 +02:00
Hans-Kristian Arntzen	e1acbd3dcf	MSL: Declare struct type explicitly. Disambiguates initializer list.	2019-10-26 16:21:46 +02:00
Hans-Kristian Arntzen	1fc3347873	MSL: Fix array of array declaration. Arrays-of-arrays were declared in wrong order.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	27d6d45671	MSL: Rewrite tessellation_access_chain. To support loading array of array properly in tessellation, we need a rewrite of how tessellation access chains are handled. The major change is to remove the implicit unflatten step inside access_chain which does not take into account the case where you load directly from a control point array variable. We defer unflatten step until OpLoad time instead. This fixes cases where we load array of {array,matrix,struct}. Removes the hacky path for MSL access chain index workaround.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	bac2c34590	MSL: Slight cleanup in emit_tessellation_access_chain. Hoist out some conditionals and make it clear that we go into this path if strip_array is used when declaring resources, i.e. there was no explicit unflatten step.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	d0953f6e4c	MSL: Do not declare variables which will not be unflattened. Non-patch arrays of IO variables in tesc/tese have their array index stripped, and access chains are specially handled, we shouldn't attempt to create "normal" arrays of these.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	028ffc26dd	MSL: Report tess input array failures more accurately. We cannot dynamically index into an array of inputs at the moment. Do not fail with opaque "bad cast" error.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	f2eb940536	MSL: Revert hack with kBufferSizeBufferBinding	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	62c4eaa7bf	MSL: Remove stale code for TextureSwizzle. Was probably added mistakenly during an old rebase.	2019-10-26 16:10:12 +02:00
Hans-Kristian Arntzen	3b5c4c7316	Implement constant empty struct correctly on all backends. MSL actually supports empty structs, so enable that path as well.	2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen	8066d13599	MSL: Rewrite propagated depth comparison state handling. Far cleaner, and more correct to run the traversal twice. Fixes a case where we propagate depth state through multiple functions.	2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen	efc978177e	MSL: Remove stray allow_id_rewrite(). Unclear why it's there, and tests do not break when removed.	2019-10-26 16:10:11 +02:00
Hans-Kristian Arntzen	2767257adc	MSL: Do not declare array of UBO/SSBO as spvUnsafeArray<T>. There is no need for these to be copied, and cuts down on template stamping bloat.	2019-10-26 16:10:08 +02:00
Bill Hollings	53d0dd6741	MSL: Support option for treating 1D textures as 2D textures of height 1. Fixes from feedback. Change vertical sampling of 1D texture from 0.0 to 0.5 to avoid edge conditions.	2019-10-24 14:48:21 -04:00
Bill Hollings	4b5c6c188c	MSL: Support option for treating 1D textures as 2D textures of height 1. Add CompilerMSL::Options::texture_1D_as_2D. Metal imposes significant restrictions on 1D textures, including not being renderable, clearable, or permitting mipmaps. This option allows SPIR-V 1D textures to be treated as 2D textures to permit this additional behaviour. App must of course supply the textures to Metal as 2D textures.	2019-10-24 12:31:20 -04:00
Hans-Kristian Arntzen	a82ecbeba9	MSL: Enable proper value types for return and value-passing of arrays. Now that we have spvUnsafeArray<T> there is no need to deal with these special purpose cases.	2019-10-24 13:03:38 +02:00
Hans-Kristian Arntzen	4ac12594c9	MSL: Be a little clearer how needs_base_vertex_idx is implemented. There is an implicit tristate with {-1, 0, +1} values, but it was not obvious how this was supposed to work before studying the implementation, so refactor into a tristate enum class.	2019-10-24 12:41:37 +02:00
Hans-Kristian Arntzen	6ca3fe22c3	MSL: Remove some dead code w.r.t. vertex/instance_idx.	2019-10-24 12:36:01 +02:00
Hans-Kristian Arntzen	d1479f871a	MSL: Do not generate UnsafeArray<> for any array inside buffer objects. This avoids a lot of huge code changes. Arrays generally cannot be copied in and out of buffers, at least no compiler frontend seems to do it. Also avoids a lot of issues surrounding packed vectors and matrices.	2019-10-24 12:22:30 +02:00
Hans-Kristian Arntzen	b433836b08	MSL: Simplify framebuffer fetch implementation.	2019-10-24 11:48:29 +02:00
Hans-Kristian Arntzen	2ca725a78d	MSL: Fall back to GLSL path for non-invariant matrix multiply as well.	2019-10-24 11:37:22 +02:00
Hans-Kristian Arntzen	ba4b31e46f	MSL: Fall back to GLSL path for non-invariant FP implementation.	2019-10-24 11:34:52 +02:00
Hans-Kristian Arntzen	6edbf0c9e9	MSL: Minor cleanups for texture atomic emulation. Storing pointers to internal objects is generally not done, IDs are preferred.	2019-10-24 11:30:20 +02:00
Hans-Kristian Arntzen	43842cefb3	MSL: Cleanup decoration forwarding for SampleMask. Don't want to create Meta instances unless we have to.	2019-10-24 11:15:35 +02:00
Hans-Kristian Arntzen	db55d474f9	MSL: Do not declare complex composite array in main for non-inlined. Need to consider that complex composite arrays may be used in leaf functions, and avoid the MSL library link fix unless everything is nicely inlined.	2019-10-24 11:12:01 +02:00
Lukas Hermanns	b0d616aa6d	Removed 'argument_buffer_offset' and fixed packed matrix Metal output.	2019-10-23 16:28:32 -04:00
Lukas Hermanns	6673a675ba	Simplified overriding of 'access_chain_internal' function in CompilerMSL.	2019-10-22 11:06:16 -04:00
Lukas Hermanns	84351d3aed	Merge remote-tracking branch 'upstream/master'	2019-10-21 18:55:36 -04:00
Lukas Hermanns	e1b161b54b	Removed bounds checks in favor of SPIRV-Tools pass '--graphics-robust-access'	2019-10-21 16:39:53 -04:00
Hans-Kristian Arntzen	4bb673a626	MSL: Add opt-in support for huge IABs. If there are enough members in an IAB, we cannot use the constant address space as MSL compiler complains about there being too many members. Support emitting the device address space instead.	2019-10-14 16:20:34 +02:00
Lukas Hermanns	0853bcaee1	Disabled spvUnsafeArray<> type for packed vectors and added test cases for those arrays.	2019-10-09 17:59:47 -04:00
Lukas Hermanns	ffbd801853	Added '--msl-invariant-float-math' option and new test case for it.	2019-10-09 14:03:06 -04:00
Hans-Kristian Arntzen	2d20b1ab93	Run format_all.sh.	2019-10-07 10:29:04 +02:00
Lukas Hermanns	f3a6d28a1d	Further updates for pull request #1162 ; also added two test cases for spvCubemapTo2DArrayFace function and added '--msl-framebuffer-fetch'/ '--msl-emulate-cube-array' compiler options.	2019-09-27 15:49:54 -04:00
Lukas Hermanns	c3d6022956	Update for pull request #1162 rev. 1	2019-09-24 18:13:04 -04:00
Lukas Hermanns	7ad0a84778	Updates for pull request #1162	2019-09-24 14:35:25 -04:00
Lukas Hermanns	37df74035b	Merge branch 'ue4_dev'	2019-09-20 09:42:42 -04:00
Lukas Hermanns	9f9276f5ce	Fixed false-positive optimization of builtin variables (may happen when 'spvOut' is emitted).	2019-09-19 14:44:30 -04:00
Hans-Kristian Arntzen	3c11254ece	MSL: Fix 16-bit integer literals. There is no suffix, so bitcasts failed.	2019-09-19 10:19:51 +02:00
Lukas Hermanns	50ac6862ac	Rearranged all 'UE Change' comments to match to project's coding style.	2019-09-18 14:03:54 -04:00
Lukas Hermanns	137e9d6d98	Removed reference specifiers in 'spvFMul*' functions to avoid address specifiers.	2019-09-17 16:50:33 -04:00
Lukas Hermanns	51be601922	Avoid emitting 'spvUnsafeArray<>', 'spvFMul*', and 'spvFAdd' custom functions if they are not needed.	2019-09-17 15:10:39 -04:00
Lukas Hermanns	36eab88b23	Further adjustments to make Metal backend work again in UE4 on Mac.	2019-09-17 11:40:01 -04:00
Lukas Hermanns	7cf5d4f7a1	Added a new 'emulate_cube_array' option to SPIRV-Cross to cope with translating TextureCubeArray into texture2d_array for iOS where this type is not available. (Original Author: Mark Satterthwaite)	2019-09-13 17:24:27 -04:00
Lukas Hermanns	a9f3c981d9	Adjustments after rebase of ue4_dev branch.	2019-09-13 14:03:02 -04:00
Mark Satterthwaite	c4f9704af0	OpImageTexelPointer needs to use an int coordinate type for GLSL, but not for MSL.	2019-09-12 08:52:08 -04:00
Mark Satterthwaite	fdaf9b47bd	Remove obsolete memory barrier scope specification from Metal output, this API has been removed.	2019-09-12 08:35:28 -04:00
Mark Satterthwaite	69b703f1da	Add an option to SPIRV-Cross to enforce invariant floating point math to prevent different depth calculation between prepass & basepass when running on Metal 2.0 and earlier.	2019-09-12 08:35:15 -04:00
Mark Satterthwaite	e4c6388571	More fixes to handling packing & access elements in an array. Made in two parts. 1. Don't allow AccessChain operations to add duplicated swizzles when accessing packed arrays. 2. Only pack arrays when there is the proper amount of space between members in a struct, otherwise it will definitely be wrong.	2019-09-11 16:15:10 -04:00
Mark Satterthwaite	b491806b47	Fix texture swizzling.	2019-09-11 14:56:54 -04:00
Mark Satterthwaite	9e54a8dd7b	Slight modifications to IAB support for Metal output, so that the caller can specify an offset for the IAB start index, as for HLSL shaders UAVs need to occupy slots 0-7. The runtime support for SSBO robustness is also much simpler if the buffer size block is at index 0. Change made in two parts. 1. Allow the caller to specify the Metal translation should use argument buffers. 2. Move this to the front of IABs for convenience of the runtime.	2019-09-10 13:09:49 -04:00
Mark Satterthwaite	d9f3576305	Metal doesn't automatically enforce robust access to buffers unlike other APIs, so for storage-buffers, which become raw T* buffers in Metal, we need to fetch the buffer size and clamp the access to a valid index within the buffer ourselves. This is essential for shaders converted from HLSL which expects all resource access to be robust, though this implementation is technically different to the HLSL specification of return-0 for OOB reads, ignore OOB writes.	2019-09-10 12:32:32 -04:00
Mark Satterthwaite	0428faada3	HLSL makes position calculations invariant by default to eliminate problems with depth-precision, Apple added a similar qualifier for Metal 2.1 that can and should be used in Vertex & Domain/TessEval shaders for the same effect.	2019-09-10 11:47:40 -04:00
Mark Satterthwaite	9ce3158193	When compiling from HLSL which pads and aligns float[]/float2[] within structures to float4[] we need to unpack the original type in Metal from the float4.	2019-09-10 11:21:43 -04:00
Mark Satterthwaite	40a4456a54	Fix conversion of the SampleMask intrinsic from SPIRV, where it is an array to Metal where it isn't.	2019-09-10 10:46:42 -04:00
Mark Satterthwaite	42b8a62870	Fixes to the generation of Metal tessellation shaders from SPIRV so that it works correctly in more complicated cases. First, when generating from HLSL before invoking the code that comes from the HLSL patch-function a control-flow and full memory-barrier are required to ensure that all the temporary values in thread-local storage for the patch are available. Second, the inputs to control and evaluation shaders must be properly forwarded from the global variables in SPIRV to the member variables in the relevant input structure. Finally when arrays of interpolators are used for input or output we need to add an extra level of array indirection because Metal works at a different granularity than SPIRV. Five parts. 1. Fix tessellation patch function processing. 2. Fix loads from tessellation control inputs not being forwarded to the gl_in structure array. 3. Fix loads from tessellation evaluation inputs not being forwarded to the stage_in structure array. 4. Workaround SPIRV losing an array indirection in tessellation shaders - not the best solution but enough to keep things progressing. 5. Apparently gl_TessLevelInner/Outer is special and needs to not be placed into the input array.	2019-09-10 10:37:07 -04:00
Mark Satterthwaite	de6441af88	Work-around HLSL using zero-based InstanceID and VertexID variables, but SPIRV, like Metal, includes BaseInstance & BaseVertex. Until this can be fixed in DXC, which is really the proper place to solve this, we can decrement InstanceID & VertexID when the source is HLSL. Made in two parts. 1. Handle HLSL-style 0-based vertex/instance index. 2. We zero-base the InstanceID & VertexID variables for HLSL emulation elsewhere, so don't do it twice.	2019-09-09 16:55:59 -04:00
Mark Satterthwaite	97a66ff906	On iOS sub-passes can be implemented using the frame-buffer fetch API which is much more efficient than binding the textures. Change was made in three parts. 1. Use Metal's native frame-buffer fetch API for subpass inputs. 2. Make sure that frame-buffer-fetch is only available on iOS. 3. Default to using Metal's native frame-buffer fetch for subpass inputs on iOS.	2019-09-09 15:02:11 -04:00
Wade Brainerd	f2a1b4320f	MSL: Fix array copies to/from interpolators	2019-09-06 18:23:57 -07:00
Mark Satterthwaite	32557e9093	SPIRV doesn't distinguish depth textures from regular textures, but Metal does, so if we've ever seen a depth comparison operation we must ensure that the texture is specified as a depth-texture.	2019-09-06 16:58:27 -04:00
Hans-Kristian Arntzen	2082e7e801	Run format_all.sh.	2019-09-06 14:23:16 +02:00
Hans-Kristian Arntzen	333980ae91	Refactor into stronger types in public API. Some fallout where internal functions are using stronger types. Overkill to move everything over to strong types right now, but perhaps move over to it slowly over time.	2019-09-06 12:29:47 +02:00
Hans-Kristian Arntzen	1935f1a8e3	Fix some issues on certain compilers.	2019-09-06 10:11:18 +02:00
Chip Davis	cb35934248	MSL: Support dynamic offsets for buffers in argument buffers. Vulkan has two types of buffer descriptors, `VK_DESCRIPTOR_TYPE_UNIFORM_BUFFER_DYNAMIC` and `VK_DESCRIPTOR_TYPE_STORAGE_BUFFER_DYNAMIC`, which allow the client to offset the buffers by an amount given when the descriptor set is bound to a pipeline. Metal provides no direct support for this when the buffer in question is in an argument buffer, so once again we're on our own. These offsets cannot be stored or associated in any way with the argument buffer itself, because they are set at bind time. Different pipelines may have different offsets set. Therefore, we must use a separate buffer, not in any argument buffer, to hold these offsets. Then the shader must manually offset the buffer pointer. This change fully supports arrays, including arrays of arrays, even though Vulkan forbids them. It does not, however, support runtime arrays. Perhaps later.	2019-09-05 23:29:00 -05:00
Mark Satterthwaite	5e8590a23d	Emulate texture atomics in Metal by binding the underlying buffer that backs the resource to a separate binding point and using that for Metal's atomic operations. This will work with texture_buffer and texture2d created from an MTLBuffer, so is perfect for emulating HLSL atomics on RWBuffer and sufficient, but not ideal, for RWTexture2D with some restrictions (limited format support and can't be used for render-targets).	2019-09-05 15:13:28 -04:00
Mark Satterthwaite	239e04762b	Support Metal 2.1's texture_buffer type which is the equivalent to HLSL's Buffer/RWBuffer, so doesn't require modifying buffer sizes to match alignments.	2019-09-05 14:46:15 -04:00
Mark Satterthwaite	8596bf5ee2	In order to use Metal shader libraries properly you have to ensure that you have no duplicated global symbol names for different entities, otherwise 'metallib' won't be able to combine multiple shaders into a single library. This is broken into two parts. 1. Constant arrays of non-primitive types (i.e. matrices) won't link properly into Metal libraries. 2. Metal helper functions must be static force-inline otherwise they will cause problems when linked together in a single Metallib.	2019-09-05 14:39:06 -04:00
Mark Satterthwaite	d50659af92	Rework the way arrays are handled in Metal to remove the array copies as they are unnecessary from Metal 1.2. There were cases where copies were not being inserted and others appeared unnecessary, using the template type should allow the 'metal' compiler to do the best possible optimisation. The changes are broken into seven stages. 1. Allow Metal to use the array<T> template to make arrays a value type. 2. Force the use of C style array declaration for some cases which cannot be wrapped with a template. 3. Threadgroup arrays can't have a wrapper type. 4. Tweak the code to use unsafe_array in a few more places so that we can handle passing arrays of resources into the shader and then through shaders into sub-functions. 5. Handle packed matrix types inside arrays within structs. 6. Make sure that builtin arguments still retain their array qualifiers when used in leaf functions. 7. Fix declaration of array-of-array constants for Metal so we can use the array<T> template.	2019-09-05 12:39:44 -04:00
Chip Davis	103817009c	MSL: Force storage images on iOS to use discrete descriptors. Writable textures cannot use argument buffers on iOS. They must be passed as arguments directly to the shader function. Since we won't know if a given storage image will have the `NonWritable` decoration at the time we encode the argument buffer, we must therefore pass all storage images as discrete arguments. Previously, we were throwing an error if we encountered an argument buffer with a writable texture in it on iOS.	2019-09-05 11:01:05 -05:00
Hans-Kristian Arntzen	261b46982a	Deal with complex interlock cases in GLSL.	2019-09-04 12:18:04 +02:00
Chip Davis	2eff420d9a	Support the SPV_EXT_fragment_shader_interlock extension. This was straightforward to implement in GLSL. The `ShadingRateInterlockOrderedEXT` and `ShadingRateInterlockUnorderedEXT` modes aren't implemented yet, because we don't support `SPV_NV_shading_rate` or `SPV_EXT_fragment_invocation_density` yet. HLSL and MSL were more interesting. They don't support this directly, but they do support marking resources as "rasterizer ordered," which does roughly the same thing. So this implementation scans all accesses inside the critical section and marks all storage resources found therein as rasterizer ordered. They also don't support the fine-grained controls on pixel- vs. sample-level interlock and disabling ordering guarantees that GLSL and SPIR-V do, but that's OK. "Unordered" here merely means the order is undefined; that it just so happens to be the same as rasterizer order is immaterial. As for pixel- vs. sample-level interlock, Vulkan explicitly states: "With sample shading enabled, [the `PixelInterlockOrderedEXT` and `PixelInterlockUnorderedEXT`] execution modes are treated like `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT` respectively." and: "If [the `SampleInterlockOrderedEXT` or `SampleInterlockUnorderedEXT`] execution modes are used in single-sample mode they are treated like `PixelInterlockOrderedEXT` or `PixelInterlockUnorderedEXT` respectively." So this will DTRT for MoltenVK and gfx-rs, at least. MSL additionally supports multiple raster order groups; resources that are not accessed together can be placed in different ROGs to allow them to be synchronized separately. A more sophisticated analysis might be able to place resources optimally, but that's outside the scope of this change. For now, we assign all resources to group 0, which should do for our purposes. `glslang` doesn't support the `RasterizerOrdered` UAVs this implementation produces for HLSL, so the test case needs `fxc.exe`. It also insists on GLSL 4.50 for `GL_ARB_fragment_shader_interlock`, even though the spec says it needs either 4.20 or `GL_ARB_shader_image_load_store`; and it doesn't support the `GL_NV_fragment_shader_interlock` extension at all. So I haven't been able to test those code paths. Fixes #1002.	2019-09-02 12:31:10 -05:00
Chip Davis	39dce88d3b	MSL: Add support for sampler Y'CbCr conversion. This change introduces functions and in one case, a class, to support the `VK_KHR_sampler_ycbcr_conversion` extension. Except in the case of GBGR8 and BGRG8 formats, for which Metal natively supports implicit chroma reconstruction, we're on our own here. We have to do everything ourselves. Much of the complexity comes from the need to support multiple planes, which must now be passed to functions that use the corresponding combined image-samplers. The rest is from the actual Y'CbCr conversion itself, which requires additional post-processing of the sample retrieved from the image. Passing sampled images to a function was a particular problem. To support this, I've added a new class which is emitted to MSL shaders that pass sampled images with Y'CbCr conversions attached around. It can handle sampled images with or without Y'CbCr conversion. This is an awful abomination that should not exist, but I'm worried that there's some shader out there which does this. This support requires Metal 2.0 to work properly, because it uses default-constructed texture objects, which were only added in MSL 2. I'm not even going to get into arrays of combined image-samplers--that's a whole other can of worms. They are deliberately unsupported in this change. I've taken the liberty of refactoring the support for texture swizzling while I'm at it. It's now treated as a post-processing step similar to Y'CbCr conversion. I'd like to think this is cleaner than having everything in `to_function_name()`/`to_function_args()`. It still looks really hairy, though. I did, however, get rid of the explicit type arguments to `spvGatherSwizzle()`/`spvGatherCompareSwizzle()`. Update the C API. In addition to supporting this new functionality, add some compiler options that I added in previous changes, but for which I neglected to update the C API.	2019-09-01 18:35:53 -05:00
Hans-Kristian Arntzen	9b845a4788	Merge pull request #1141 from troughton/inline-everything MSL: Inline all non-entry-point functions	2019-08-30 11:05:04 +02:00
Thomas Roughton	6b5403206e	Clang-format changes	2019-08-30 20:25:40 +12:00
Hans-Kristian Arntzen	07c76f66b5	MSL: Add {Base,}{Vertex,Instance}Index to bitcast_from_builtin_load. Totally missed these, so float(index) would not work correctly for negative numbers.	2019-08-29 13:56:37 +02:00
Thomas Roughton	e5f9e2c203	Inline all non-entry-point functions	2019-08-29 17:07:57 +12:00
Thomas Roughton	6338f0aa0f	MSL: inline all emitted functions # Conflicts: # spirv_msl.cpp	2019-08-29 17:07:27 +12:00
Hans-Kristian Arntzen	3ccfbce264	Run format_all.sh.	2019-08-28 14:25:26 +02:00
Hans-Kristian Arntzen	9436cd3036	MSL: Deal with array copies from and to threadgroup.	2019-08-27 13:18:01 +02:00
Hans-Kristian Arntzen	b3305799a8	Deal correctly with sign on bitfield operations. Need a lot of special purpose implementation functions for these.	2019-08-26 11:36:36 +02:00
Hans-Kristian Arntzen	ffca8735ff	Merge pull request #1105 from cdavis5e/msl-unify-as MSL: Unify the get_*_address_space() methods.	2019-07-29 10:19:12 +02:00
Chip Davis	df18d98bea	MSL: Unify the get_*_address_space() methods. These methods have largely the same logic, with minor differences. That I felt compelled to duplicate the logic into another method was one of the things that bothered me about the variable pointers change. This cleans that part of the code up; now we don't have two places to change.	2019-07-26 09:43:28 -05:00
Hans-Kristian Arntzen	d378413040	Merge pull request #1103 from KhronosGroup/fix-1100 MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 14:35:18 +02:00
Hans-Kristian Arntzen	c3e8e728d8	MSL: Cleanup temporary use with emit_uninitialized_temporary.	2019-07-26 11:16:43 +02:00
Hans-Kristian Arntzen	abb345d0b3	MSL: Deal with Modf/Frexp where output is access chain to scalar. This is not allowed as we cannot take mutable reference to a vec.{x,y,z,w}. We only care about scalar since entire vectors are fine.	2019-07-26 11:02:38 +02:00
Hans-Kristian Arntzen	3c03b55c46	Workaround MSVC 2013 compiler issues.	2019-07-25 10:28:11 +02:00
Chip Davis	fb5ee4cb5c	MSL: Adjust BuiltInWorkgroupId for vkCmdDispatchBase(). This command allows the caller to set the base value of `BuiltInWorkgroupId`, and thus of `BuiltInGlobalInvocationId`. Metal provides no direct support for this... but it does provide a builtin, `[[grid_origin]]`, normally used to pass the base values for the stage input region, which we will now abuse to pass the dispatch base and avoid burning a buffer binding. `[[grid_origin]]`, as part of Metal's support for compute stage input, requires MSL 1.2. For 1.0 and 1.1, we're forced to provide a buffer. (Curiously, this builtin was undocumented until the MSL 2.2 release. Go figure.)	2019-07-24 08:56:15 -05:00
Hans-Kristian Arntzen	c62503bca7	Do not attempt to pack types which are already scalar.	2019-07-24 11:52:28 +02:00
Hans-Kristian Arntzen	646e04294a	Fix some warnings when building in MoltenVK.	2019-07-23 16:39:13 +02:00
Hans-Kristian Arntzen	5c1cb7accf	Recursively pack struct types when we find scalar packed structs.	2019-07-23 15:24:53 +02:00
Hans-Kristian Arntzen	3fa2b14634	Run format_all.sh.	2019-07-23 12:23:41 +02:00
Hans-Kristian Arntzen	7277c7ac46	Use to_unpacked_row_major_expression to unify row-major in MSL/GLSL.	2019-07-23 11:36:54 +02:00
Hans-Kristian Arntzen	47a18b9f1b	Simplify row-major matrix/vector multiplies.	2019-07-23 10:56:57 +02:00
Hans-Kristian Arntzen	6224199c76	Add struct size padding tests.	2019-07-23 10:30:37 +02:00
Hans-Kristian Arntzen	2172b19be2	Remove obsolete matrix workaround code.	2019-07-22 16:27:47 +02:00
Hans-Kristian Arntzen	609d087f8f	Only transpose unpacked expressions.	2019-07-22 16:06:09 +02:00
Hans-Kristian Arntzen	6057ffcbb1	Deal correctly with complete stores to row_major matrices.	2019-07-22 15:49:17 +02:00
Hans-Kristian Arntzen	19f5cd3e90	Declare correct matrix type when unpacking.	2019-07-22 13:25:45 +02:00
Hans-Kristian Arntzen	f2d6a77c95	Don't forget to register a write to LHS expression in certain case.	2019-07-22 13:06:30 +02:00
Hans-Kristian Arntzen	745a2f7b0e	Deal with swizzled stores to std140 matrices.	2019-07-22 13:05:23 +02:00
Hans-Kristian Arntzen	180a6b38c5	Fix some row-major column store cases.	2019-07-22 12:56:14 +02:00
Hans-Kristian Arntzen	4ab2829cf6	Fix more stray parens.	2019-07-22 12:13:07 +02:00
Hans-Kristian Arntzen	d6004bfc97	Fixup stray paren in output.	2019-07-22 12:08:56 +02:00
Hans-Kristian Arntzen	14afb968dd	Correctly unpack row-major matrices when storing to LHS.	2019-07-22 12:03:12 +02:00
Hans-Kristian Arntzen	249f8e5180	MSL: Support storing to row-major column. Defer transposes to actual Load or Store.	2019-07-22 11:13:44 +02:00
Hans-Kristian Arntzen	be2fccd837	Tests run clean.	2019-07-22 10:23:39 +02:00
Hans-Kristian Arntzen	b66a53a979	Traverse correct types when checking scalar layout.	2019-07-19 14:43:42 +02:00
Hans-Kristian Arntzen	e90d816cdd	Deal with scalar layout of entire structs. Mark all candidate struct types.	2019-07-19 14:18:14 +02:00
Hans-Kristian Arntzen	12c5020854	Pass down row-major state to unpacking functions.	2019-07-19 13:03:08 +02:00
Hans-Kristian Arntzen	27b75c2c5a	Deal with all forms of matrix writes ...	2019-07-19 12:53:10 +02:00
Hans-Kristian Arntzen	f6251e4699	Can deal with std140 matrices now. Refactor is coming together.	2019-07-19 11:21:02 +02:00
Hans-Kristian Arntzen	dd7ebaf9f7	Start considering how to emit physical type ID.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	b09b8d3fa9	Deal more cleanly with matrices and row-major.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	c160d5227f	Reintroduce struct_member_* MSL queries. Need to remap to physical type + packed qualifier, and this is handy to do in a helper function.	2019-07-19 10:06:19 +02:00
Hans-Kristian Arntzen	a86308bce1	MSL: Begin rewrite of buffer packing logic.	2019-07-19 10:06:19 +02:00
Chip Davis	12a8654784	Don't forward uses of an OpIsHelperInvocationEXT op. If this is computed before a `demote`, but used after, forwarding it will produce the wrong value. This does make for uglier shaders, but it's necessary right now to ensure correctness. I needed to use an assembly shader to produce the test for this. `spirv-opt` is not smart enough (or too smart?) to eliminate the variable that would be used in GLSL to express this.	2019-07-18 17:32:35 -05:00
Chip Davis	50dce10c5d	Support the SPV_EXT_demote_to_helper_invocation extension. This extension provides a new operation which causes a fragment to be discarded without terminating the fragment shader invocation. The invocation for the discarded fragment becomes a helper invocation, so that derivatives will remain defined. The old `HelperInvocation` builtin becomes undefined when this occurs, so a second new instruction queries the current helper invocation status. This is only fully supported for GLSL. HLSL doesn't support the `IsHelperInvocation` operation and MSL doesn't support the `DemoteToHelperInvocation` op. Fixes #1052.	2019-07-17 09:12:22 -05:00
Hans-Kristian Arntzen	c7eda1bce9	Test glsl.std450 more exhaustively. Make sure to test everything with scalar as well to catch any weird edge cases. Not all opcodes are covered here, just the arithmetic ones. FP64 packing is also ignored.	2019-07-17 11:53:05 +02:00
Chip Davis	bc646574a6	MSL: Support the SPV_INTEL_shader_integer_functions2 extension. This provides a few functions normally available in OpenCL to the SPIR-V shader environment. These functions happen to be available in Metal as well. No GLSL, unfortunately. Intel has yet to publish a `GL_INTEL_shader_integer_functions2` spec.	2019-07-15 09:42:36 -05:00
Hans-Kristian Arntzen	33d2bbcf69	Merge branch 'msl-amd-trinary-functions' of git://github.com/cdavis5e/SPIRV-Cross	2019-07-15 09:46:31 +02:00
Chip Davis	6a58554568	Support the SPV_KHR_device_group extension. The only piece added by this extension is the `DeviceIndex` builtin, which tells the shader which device in a grouped logical device it is running on. Metal's pipeline state objects are owned by the `MTLDevice` that created them. Since Metal doesn't support logical grouping of devices the way Vulkan does, we'll thus have to create a pipeline state for each device in a grouped logical device. The upcoming peer group support in Metal 3 will not change this. For this reason, for Metal, the device index is supplied as a constant at pipeline compile time. There's an interaction between `VK_KHR_device_group` and `VK_KHR_multiview` in the `VK_PIPELINE_CREATE_VIEW_INDEX_FROM_DEVICE_INDEX_BIT`, which defines the view index to be the same as the device index. The new `view_index_from_device_index` MSL option supports this functionality.	2019-07-13 16:45:54 -05:00
Chip Davis	ca91fcfe5f	MSL: Support the SPV_AMD_shader_trinary_minmax extension. This requires MSL 2.1.	2019-07-13 16:43:57 -05:00
Hans-Kristian Arntzen	92e5255570	Run format_all.sh.	2019-07-12 10:59:53 +02:00
Hans-Kristian Arntzen	932ee0e328	Deal correctly with return sign of bitscan operations.	2019-07-12 10:57:56 +02:00
Hans-Kristian Arntzen	19ebbd48c7	Merge pull request #1077 from cdavis5e/msl-spirv-qualifiers MSL: Handle coherent, volatile, and restrict.	2019-07-12 10:03:06 +02:00
Hans-Kristian Arntzen	ad5eae46ed	Merge pull request #1078 from cdavis5e/post-depth-coverage Support the SPV_KHR_post_depth_coverage extension.	2019-07-12 09:56:26 +02:00
Chip Davis	6628ea6e48	MSL: Use the select() function for OpSelect. This significantly improves codegen for vector `OpSelect` in MSL.	2019-07-11 10:30:37 -05:00
Chip Davis	1df47db6ba	Support the SPV_KHR_post_depth_coverage extension. Using the `PostDepthCoverage` mode specifies that the `gl_SampleMaskIn` variable is to contain the computed coverage mask following the early fragment tests, which this mode requires and implicitly enables. Note that unlike Vulkan and OpenGL, Metal places this on the sample mask input itself, and furthermore does not implicitly enable early fragment testing. If it isn't enabled explicitly with an `[[early_fragment_tests]]` attribute, the compiler will error out. So we have to enable that mode explicitly if `PostDepthCoverage` is enabled but `EarlyFragmentTests` isn't. For Metal, only iOS supports this; for some reason, Apple has yet to implement it on macOS, even though many desktop cards support it.	2019-07-11 10:28:43 -05:00
Chip Davis	058f1a0933	MSL: Handle coherent, volatile, and restrict. This maps them to their MSL equivalents. I've mapped `Coherent` to `volatile` since MSL doesn't have anything weaker than `volatile` but stronger than nothing. As part of this, I had to remove the implicit `volatile` added for atomic operation casts. If the buffer is already `coherent` or `volatile`, then we would add a second `volatile`, which would be redundant. I think this is OK even when the buffer doesn't have `coherent`: `T *` is implicitly convertible to `volatile T *`, but not vice-versa. It seems to compile OK at any rate. (Note that the non-`volatile` overloads of the atomic functions documented in the spec aren't present in the MSL 2.2 stdlib headers.) `restrict` is tricky, because in MSL, as in C++, it needs to go after the asterisk or ampersand for the pointer type it's modifying. Another issue is that, in the `Simple`, `GLSL450`, and `Vulkan` memory models, `Restrict` is the default (i.e. does not need to be specified); but MSL likely follows the `OpenCL` model where `Aliased` is the default. We probably need to implicitly set either `Restrict` or `Aliased` depending on the module's declared memory model.	2019-07-11 10:22:30 -05:00
Hans-Kristian Arntzen	1a592b7c0f	Merge pull request #1067 from cdavis5e/msl-scalar-block-layout MSL: Support scalar block layout.	2019-07-11 13:03:03 +02:00
Chip Davis	28454facbb	MSL: Handle packed matrices. The old method of using a different unpacked matrix type doesn't work for scalar alignment. It certainly wouldn't have any effect for a square matrix, since the number of columns and rows are the same. So now we'll store them as arrays of packed vectors.	2019-07-10 18:37:31 -05:00
Chip Davis	ea5c0ed82f	MSL: Fix alignment of packed types. Packed types have scalar alignment.	2019-07-10 11:57:04 -05:00
Hans-Kristian Arntzen	6b010e0cbc	Merge pull request #1069 from KhronosGroup/fix-1053 MSL: Re-roll array expressions in initializers.	2019-07-10 12:15:12 +02:00
Hans-Kristian Arntzen	f6f849397e	MSL: Re-roll array expressions in initializers. We cannot rely on copy path when using an array as part of a struct initializer, so reroll such expressions to an initializer list again.	2019-07-10 11:19:33 +02:00
Chip Davis	e5fa7edfd6	MSL: Support scalar block layout. Relaxed block layout relaxed the restrictions on vector alignment, allowing them to be aligned on scalar boundaries. Scalar block layout relaxes this further, allowing any member to be aligned on a scalar boundary. The requirement that a vector not improperly straddle a 16-byte boundary is also relaxed. I've also added a test showing that `std430` layout works with UBOs. I'm troubled by the dual meaning of the `Packed` extended decoration. In some instances (struct, `float[]`, and `vec2[]` members), it actually means the exact opposite, that the member needs extra padding. This is especially problematic for `vec2[]`, because now we need to distinguish the two cases by checking the array stride. I wonder if this should actually be split into two decorations.	2019-07-09 20:59:32 -05:00
Hans-Kristian Arntzen	909040e2eb	MSVC 2013: Work around another compiler bug with array init.	2019-07-09 15:31:01 +02:00
Hans-Kristian Arntzen	4056d0b74e	Don't use scalar dot().	2019-07-03 14:32:06 +02:00

1069 Commits