skia2

Author	SHA1	Message	Date
Michael Ludwig	767586b330	Update Sk4px to use skvx instead of SkNx Adds a saturated_add function that was on SkNx and used in SkXfermode_opts, but hadn't been ported to skvx yet. Removes the Sk4px_opts variants and simplifies some of its functions; many were already defined skvx. The largest change is that Sk4px does not extend skvx::byte16, since it used to extend Sk16b. Now it just has a vector as a data type. This was necessary so that we could define operators that were typed for Sk4px and Wide w/o conflicting with the free operators that were defined for the base skvx types. Change-Id: I8c667ba86f662ccf07ad85aa32e78abfc0a8c7ae Reviewed-on: https://skia-review.googlesource.com/c/skia/+/542645 Reviewed-by: Herb Derby <herb@google.com> Commit-Queue: Michael Ludwig <michaelludwig@google.com>	2022-05-23 17:41:53 +00:00
Michael Ludwig	4621ef2a8a	Improve skvx::any() and all() intrinsics Removes specializations for all() on AVX2 and SSE 4.1, which give the wrong results if the ints didn't have all bits set (inconsistent with other platforms and non-SIMD). Added a unit test that checks this case. The mirror specializations for any() on AVX2 and SSE 4.1 are actually valid, so added those, and added a 2 instruction specialization for SSE for any() and all(). This is what clang-trunk produces on -O3, but ToT clang struggles to vectorize it. Also adds specializations for NEON for any() and all(), since even clang-trunk was struggling to vectorize it automatically. In particular, this will help skgpu::graphite::Rect's implementations of intersect and contains, which use any/all to get a final boolean value. In the Instruments app, I had see Rect's intersection as a hotspot on the Mac M1, and this vectorization helps a bit. Also takes the opportunity to remove fake C++14 constexpr for a real constexpr. Change-Id: Ib142e305ae5615056a777424e379b6da82d44f0c Reviewed-on: https://skia-review.googlesource.com/c/skia/+/542296 Commit-Queue: Michael Ludwig <michaelludwig@google.com> Reviewed-by: Herb Derby <herb@google.com>	2022-05-20 00:50:59 +00:00
Michael Ludwig	5c08e3c357	Standardize on skvx aliases, plus clean-up This adds aliases like skvx::float2, float4, etc. to SkVx.h and goes through existing usages of SkVx to standardize on those aliases, or refer to the full name directly. In particular, this lets us clean up the equivalent aliases in src/gpu/tessellate, src/gpu/graphite/VectorTypes and src/gpu/ganesh/GrVx Where possible, I switched to using skvx::Foo directly and leveraged auto to make it less redundant. Headers always used the full type except for PatchWriter.h and Rect.h because of the number of their usages. In this case, the alias is scoped to private so it can't leak. This is prep to migrate older code that is still using SkNx and its aliases like Sk4f to SkVx as well. Change-Id: I9dd104e83cf17c2b88995a047cfd2e2b0fe6fac2 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/541058 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Michael Ludwig <michaelludwig@google.com>	2022-05-17 18:04:55 +00:00
Herb Derby	73065f325f	Reland "add a scaled uint32x4_t divided by uint32_t to SkVx" This is a reland of `35a74eab5d` Added guard for SKNX_NO_SIMD. I guess they don't want speedy goodness. Original change's description: > add a scaled uint32x4_t divided by uint32_t to SkVx > > This extracts the divide used in SkImageBlurFilter.cpp, and > encapsulates it into ScaledDividerU32. It generates results that > are with in +/- 1 of the rounded answer generated by doubles. > > I have added hand coded implementations for sse and for neon to > hopefully to avoid code generation problems. > > Bug: skia:12522 > > Change-Id: Ia7372d45895c799f69f8c0fd9fdea5efac321139 > Reviewed-on: https://skia-review.googlesource.com/c/skia/+/458216 > Reviewed-by: Brian Osman <brianosman@google.com> > Commit-Queue: Herb Derby <herb@google.com> Bug: skia:12522 Change-Id: I9833a98f159827f483147c8155f1b92b7a7130ed Reviewed-on: https://skia-review.googlesource.com/c/skia/+/458716 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Herb Derby <herb@google.com>	2021-10-12 20:02:01 +00:00
Herb Derby	e4ac6eabe8	Revert "add a scaled uint32x4_t divided by uint32_t to SkVx" This reverts commit `35a74eab5d`. Reason for revert: Breaks Google3 Original change's description: > add a scaled uint32x4_t divided by uint32_t to SkVx > > This extracts the divide used in SkImageBlurFilter.cpp, and > encapsulates it into ScaledDividerU32. It generates results that > are with in +/- 1 of the rounded answer generated by doubles. > > I have added hand coded implementations for sse and for neon to > hopefully to avoid code generation problems. > > Bug: skia:12522 > > Change-Id: Ia7372d45895c799f69f8c0fd9fdea5efac321139 > Reviewed-on: https://skia-review.googlesource.com/c/skia/+/458216 > Reviewed-by: Brian Osman <brianosman@google.com> > Commit-Queue: Herb Derby <herb@google.com> Bug: skia:12522 Change-Id: Id5d6968c813322dfc68e549e2f3afea7da9a0e18 No-Presubmit: true No-Tree-Checks: true No-Try: true Reviewed-on: https://skia-review.googlesource.com/c/skia/+/458258 Auto-Submit: Herb Derby <herb@google.com> Commit-Queue: Rubber Stamper <rubber-stamper@appspot.gserviceaccount.com> Bot-Commit: Rubber Stamper <rubber-stamper@appspot.gserviceaccount.com>	2021-10-12 18:14:18 +00:00
Herb Derby	35a74eab5d	add a scaled uint32x4_t divided by uint32_t to SkVx This extracts the divide used in SkImageBlurFilter.cpp, and encapsulates it into ScaledDividerU32. It generates results that are with in +/- 1 of the rounded answer generated by doubles. I have added hand coded implementations for sse and for neon to hopefully to avoid code generation problems. Bug: skia:12522 Change-Id: Ia7372d45895c799f69f8c0fd9fdea5efac321139 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/458216 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Herb Derby <herb@google.com>	2021-10-12 15:56:03 +00:00
Chris Dalton	90a66821f0	Add convenient "xyzw" accessors and swizzles to skvx (take 2) Bug: skia:12515 Change-Id: I8db3501c129d93fc1eb822c90840119a7a7f2b4b Reviewed-on: https://skia-review.googlesource.com/c/skia/+/457478 Reviewed-by: Michael Ludwig <michaelludwig@google.com> Commit-Queue: Chris Dalton <csmartdalton@google.com>	2021-10-11 18:59:47 +00:00
Chris Dalton	c63e913f57	Revert "Add convenient "xyzw" accessors and swizzles to skvx" This reverts commit `01b02956c7`. Reason for revert: Codegen regressions Original change's description: > Add convenient "xyzw" accessors and swizzles to skvx > > Change-Id: Ic300285d10679a4e34190ab7b6b08bd1f6d80330 > Reviewed-on: https://skia-review.googlesource.com/c/skia/+/454309 > Reviewed-by: Michael Ludwig <michaelludwig@google.com> > Commit-Queue: Chris Dalton <csmartdalton@google.com> Bug: skia:12515 Change-Id: Id853e4d9e25c6d2ae622668ef064e1b2b078b824 No-Presubmit: true No-Tree-Checks: true No-Try: true Reviewed-on: https://skia-review.googlesource.com/c/skia/+/457476 Auto-Submit: Chris Dalton <csmartdalton@google.com> Reviewed-by: Chris Dalton <csmartdalton@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com> Commit-Queue: Michael Ludwig <michaelludwig@google.com>	2021-10-08 23:55:11 +00:00
Chris Dalton	8fa6dbffff	Move approx_acos and strided loads from GrVx to SkVx Change-Id: Icf2d589b7a748f98cfa1be77217f5a21aed0a1b2 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/457187 Commit-Queue: Chris Dalton <csmartdalton@google.com> Reviewed-by: Brian Salomon <bsalomon@google.com>	2021-10-08 17:33:06 +00:00
Chris Dalton	01b02956c7	Add convenient "xyzw" accessors and swizzles to skvx Change-Id: Ic300285d10679a4e34190ab7b6b08bd1f6d80330 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/454309 Reviewed-by: Michael Ludwig <michaelludwig@google.com> Commit-Queue: Chris Dalton <csmartdalton@google.com>	2021-10-08 00:33:24 +00:00
Mike Klein	a221f1c36d	remove skvx::{rsqrt,rcp} These don't return reliable portable results, so I don't want to promote them as good ideas to use. You can get at least 5 different results from these across the four main architectures we support, and they've been the root cause of bugs uncovered only in production on undertested platforms. Luckily, unused outside of tests. Change-Id: I532731fe4cddf127253341e5ace8d9c5c9ebb0f1 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/326108 Reviewed-by: Herb Derby <herb@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2020-10-13 15:52:56 +00:00
Elliot Evans	6b2602ed2d	Remove remaining usages of skvx::mad This CL attempts to remove the remaining subset of skvx::mad usages. https://skia-review.googlesource.com/c/skia/+/304853 removes all usages of skvx::mad but causes small differences in rendering, so it is not suitable for landing. https://skia-review.googlesource.com/c/skia/+/306702/ removes all non-nested usages of skvx::mad Change-Id: Iab5d4cfd0feb856c38b3ebbfe3bf3ed5aad20fe6 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/306722 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com> Commit-Queue: Elliot Evans <elliotevans@google.com>	2020-07-30 20:41:09 +00:00
Mike Klein	4d680cdf07	a bunch of half-related stuff - add f32<->f16 functions to skvx - add f32<->f16 x86 instructions to skvm::Assembler - add f32<->f16 ops to skvm, using the skvx functions in the interpreter Still TODO: use the new x86 instructions in the JIT (For now like in many other ways, the aarch64 JIT continues to languish. Will pick that back up one day.) Change-Id: Ib8dc1ccdc75ecb23769ea4947d66d3ab22520f23 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/302942 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2020-07-15 20:47:31 +00:00
Mike Klein	c0bd9f9fe5	rewrite includes to not need so much -Ifoo Current strategy: everything from the top Things to look at first are the manual changes: - added tools/rewrite_includes.py - removed -Idirectives from BUILD.gn - various compile.sh simplifications - tweak tools/embed_resources.py - update gn/find_headers.py to write paths from the top - update gn/gn_to_bp.py SkUserConfig.h layout so that #include "include/config/SkUserConfig.h" always gets the header we want. No-Presubmit: true Change-Id: I73a4b181654e0e38d229bc456c0d0854bae3363e Reviewed-on: https://skia-review.googlesource.com/c/skia/+/209706 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Hal Canary <halcanary@google.com> Reviewed-by: Brian Osman <brianosman@google.com> Reviewed-by: Florin Malita <fmalita@chromium.org>	2019-04-24 16:27:11 +00:00
Mike Klein	9a885b27f3	pass SkVx::Vec arguments as const& Yet another surprising finding when looking at ARM code generation is that passing these values to functions by const& does make a difference, even when fully inlined. I can only guess that the compiler's somehow more sure that way that the values won't change? Anyway, convert all skvx functions that take Vec arguments to take const Vec& instead. This tweak is enough to let the natural implementation of mull() actually produce good code generation, so I've promoted that to SkVx.h and added a unit test. Notice in the NEON case we've got a base case at N=8 and two recursive cases, one down to 8 as usual when N > 8, but also one up to 8 when N < 8. This also is another big speedup for ARMv7 NEON, bringing it to nearly the same speed as ARMv8 NEON on the same device. Bug: chromium:952502 Change-Id: I0f19bab45cf02222ccc8090053ea2a4a380f1dfe Reviewed-on: https://skia-review.googlesource.com/c/skia/+/208582 Commit-Queue: Michael Ludwig <michaelludwig@google.com> Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com>	2019-04-16 19:24:50 +00:00
Mike Klein	1dcc55b372	rewrite new SkVx unit test - be more explicit about casting - general rewrite for clarity Cq-Include-Trybots: skia.primary:Test-Win2016-MSVC-GCE-CPU-AVX2-x86_64-Debug-All-MSRTC Change-Id: I924d6d247e6b9afcefb27c690715fdad84635a5d Reviewed-on: https://skia-review.googlesource.com/c/skia/+/207721 Reviewed-by: Greg Daniel <egdaniel@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-04-11 19:14:04 +00:00
Mike Klein	4b44a0d01a	add SkVx helpers for working with unorm8 These replicate the base logic of Sk4px::Wide::div255() and Sk4px::approxMulDiv255(), and will come in handy replacing them. No platform specializations yet... want to remind myself what codegen they get from these vanilla versions first, and then I'll fill in the platform specific stuff as needed. The tests should cover everything pretty exhaustively. Change-Id: I5854d1bc0902a85cbb2351f669c4da7cc31a8775 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/207683 Commit-Queue: Mike Klein <mtklein@google.com> Commit-Queue: Michael Ludwig <michaelludwig@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com> Auto-Submit: Mike Klein <mtklein@google.com>	2019-04-11 17:54:23 +00:00
Mike Klein	da7b053527	tweak SkVx to play nicely with others Was starting to use this and ran into a few problems with clashing symbols, namely SI and cast(). Seemed simple enough to not use SI, and to move all the free-standing types into skvx: skvx::cast, skvx::shuffle, etc. Change-Id: Ia5d8ef6d0ae5375bf80d76be88d16f0c9cde56e7 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/207340 Commit-Queue: Mike Klein <mtklein@google.com> Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com>	2019-04-10 19:40:05 +00:00
Mike Klein	f4438d56e9	skvx: allow more implicit conversions Guarding the implict constructors and scalar/vector operations with std::is_convertible ought to make SkVx types feel more like normal C types, allowing implicit conversions exactly when the scalar equivalents would. This shouldn't change the behavior of any code, or make anything new possible... just nicer to read and write. Change-Id: Iff4b89012c5b8c7f7933e6841c925b81186bc614 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/201402 Commit-Queue: Mike Klein <mtklein@google.com> Commit-Queue: Michael Ludwig <michaelludwig@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com> Auto-Submit: Mike Klein <mtklein@google.com>	2019-03-14 19:11:21 +00:00
Mike Klein	41b995c4fb	specialize if_then_else(int4,float4,float4) Add SSE, SSE4.1, and NEON specializations. The if_then_else() unit tests in SkVxTest.cpp should cover this. I had to give up on my dream of not using Skia headers for now. There's really no good way of knowing whether we've got SSE4.1 support in MSVC except when we explicitly define SK_CPU_SSE_LEVEL=SK_CPU_SSE_LEVEL_SSE41. This refactor to use SK_CPU_SSE_LEVEL let MSVC point out a slight ordering problem that would cause an infinite loop calling any of the specializions like sqrt(float2). I believe moving them after the float4 specializations will fix that. Change-Id: I83639f378a182716d1b37e92b6d725472698f874 Reviewed-on: https://skia-review.googlesource.com/c/195920 Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Michael Ludwig <michaelludwig@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-02-27 20:12:20 +00:00
Mike Klein	dcfc3ef110	skvx wip - remove ALWAYS_INLINE until we find we need it - make bit_puns explicit - implement everything recursively so, e.g. sqrt(float8) picks up sqrt(float4) when not otherwise specialized. - implement SSE specializations: of the operations I tested, only sqrt, rcp, and rsqrt needed any help. The others look good as-is. Change-Id: I1b679c7bd9a99f952272b118d7ade2469b55d604 Reviewed-on: https://skia-review.googlesource.com/c/190222 Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-02-07 20:06:46 +00:00
Mike Klein	53a5298a2f	add mad() and shuffle() to SkVx Change-Id: Ie3e5b353f84e74d398a5350dc0baff5541789119 Reviewed-on: https://skia-review.googlesource.com/c/189982 Commit-Queue: Mike Klein <mtklein@google.com> Commit-Queue: Herb Derby <herb@google.com> Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-02-06 21:12:48 +00:00
Mike Klein	429251513f	fill in most remaining skvx operations Obviously lots of these new operations like sqrt() will want platform specialization. That'll come later. Change-Id: Ia0758425d4ec5911968a3d0ad63fa387b9b4cb39 Reviewed-on: https://skia-review.googlesource.com/c/189848 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com> Auto-Submit: Mike Klein <mtklein@google.com>	2019-02-06 20:03:24 +00:00
Mike Klein	455c74797b	sketch SkVx Change-Id: I1cb8113af243ed6327179d295835295834a752aa Reviewed-on: https://skia-review.googlesource.com/c/189581 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com> Auto-Submit: Mike Klein <mtklein@google.com>	2019-02-06 16:06:32 +00:00

24 Commits