skia2

Author	SHA1	Message	Date
Florin Malita	feacb0fb34	[skottie] Add support for multiple range selectors Text animators can have more than one range selector. (depends on https://github.com/bodymovin/bodymovin-extension/pull/21) TBR= Change-Id: Id7f73386853f0e0f9e3c0f15d5a87ec1653ba873 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/234319 Reviewed-by: Florin Malita <fmalita@chromium.org> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-08-14 17:34:02 +00:00
Mike Klein	f996311003	extend lifetimes for hoisted used in loop This makes the register recycling checks a bit more precise. At head we never recycle a register that's holding a hoisted value, which is overly conservative. We really should never recycle a register that's still needed. By extending the lifetime of any hoisted value that's used in the loop, we prevent that, while still allowing hoisted values that are only used in hoisted computation to be reused. This takes just a small tweak in the JIT code (removing the !hoisted({x,y,z}) checks), and a somewhat larger refactoring in the interpreter, making both hoisted and non-hoisted code go through the same recycling register assignment flow. There's one diff in the existing cases where we now reuse a hoisted register, and I've added a second test just to make sure it's covered explicitly. Change-Id: I25b37ab1f1fea3042d7fd167529abc8fed1dddff Reviewed-on: https://skia-review.googlesource.com/c/skia/+/233239 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-08-13 02:08:16 +00:00
Florin Malita	17b9d1d1de	[skottie] Initial Hue/Saturate effect support Due to limitations in BodyMovin/AE JSX, full effect data is not available (specifically the "channel range" property). We only support static master hue, static master saturation and static master lightness at this point. This CL also introduces a new animation builder pattern: DiscardableAdapterBase and attachDiscardableAdapter(). The former is a base class for adapters with full animator ownership. This enables a) capturing raw adapter pointers in animator lambdas and b) syncing to SG only once, after all local animators are updated). The latter is a helper for managing adapter creation and optional destruction (when all adapter properties are static we can discard it). Change-Id: Iecc4b78830e5464e7958cb12cdfd75a61010aa25 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/231956 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-08-02 18:04:14 +00:00
Mike Klein	8ac9f4e5b2	flesh out SkVM ops a bit more Add missing comparison and selection ops, bit casts, 16-bit memory operations, gathers, uniform loads, and fill in math holes where reasonable. Update some names to be a bit more regular. I think all instructions are implemented in the interpreter, and many tested. More testing and JITs to follow. Change-Id: I8cf377e8b72a86ac950e020892ce82b39e9d7277 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/229893 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-07-29 20:43:10 +00:00
Brian Osman	e59acb79b8	Particles: Merge spawn & update into one code string with two functions Change-Id: If57fb79db8f8c5fd185fefaa202167c8082dd846 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/229921 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Brian Osman <brianosman@google.com>	2019-07-25 23:51:07 +00:00
Brian Osman	d6108add51	Particles: Use list of lines for multi-line string serialization Change-Id: Ic81b3433b485ca9ce0e60bd10ec12706e673ee89 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/229917 Commit-Queue: Brian Osman <brianosman@google.com> Commit-Queue: Mike Klein <mtklein@google.com> Auto-Submit: Brian Osman <brianosman@google.com> Reviewed-by: Mike Reed <reed@google.com> Reviewed-by: Mike Klein <mtklein@google.com>	2019-07-25 20:55:43 +00:00
Brian Osman	fe49163cd1	Major rewrite of the particle system based on the SkSL interpreter This removes all of the fixed-function particle affector classes. Instead, each particle effect just has two SkSL snippets, one for spawn logic, and one for update logic. Each one gets an inout copy of the particle struct. Ultimately, this makes the effects much simpler and smaller, while also being far more flexible (you can do whatever you want with any values you want). Finally, because the interpreter is vectorized and a particular effect's scripts are usually tuned to the specific behaviors desired, it's faster on basically every effect I compared. I re-created all of the old effects in the new system. Many just use pure SkSL (no curves or anything). Some of the old curve and path/text stuff was very handy, though - so those are now exposed as external values in the interpreter. Basically, an effect can have any number of named "bindings" that are a callable thing. This can be a path, text (shortcut for making fancy paths), curve, or color curve. The path ones return a float4 with position and normal, the curves return one or four floats. ... and this transposes all of the particle data storage into SoA form, so that it can use the much faster interpreter entry point. Change-Id: Iebe711c45994c4201041b12d171af976bc5e758e Reviewed-on: https://skia-review.googlesource.com/c/skia/+/222057 Commit-Queue: Brian Osman <brianosman@google.com> Reviewed-by: Mike Reed <reed@google.com>	2019-07-25 19:59:03 +00:00
Mike Klein	5e533c9e1f	move hoist analysis back into Builder Even if a JIT ultimately doesn't end up hoisting any values, it's going to want this information while it decides. Writing it in one place also ensures we only get it wrong in one place... I'm no_ extending the lifetime of hoisted instructions here in Builder. That's something to leave to the backend so they have the flexibility of which of these values to hoist, if any. If they don't hoist, they'll need to know when the value dies. Moving this information back here lets the test expectation goldens reflect the hoist bit again too. Kind of nice. Change-Id: Ib165ca898a97c1d822cb28fe24f15bae4d570a17 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/229024 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-07-22 19:34:06 +00:00
Mike Klein	c2fb3b4b72	split deaths() out of other analysis I'm slowly refactoring my way to where hoisting and register assignment are done in backend-specific ways, but this liveness analysis is always going to be useful for each backend. Use deaths() to restore friendly ☠️ dead code markers in test dumps. Change-Id: I3ab94665bbbbf0788b0b27e00d644eba927dff47 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/228113 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Mike Klein <mtklein@google.com>	2019-07-17 18:11:10 +00:00
Florin Malita	5f1108ce46	[skottie] Motion blur support Unlike all other Skottie effects, motion blur requires sampling at multiple points on the timeline. To support this: 1) Introduce MotionBlurEffect - a custom SG render node which can drive the timeline of its subtree using an sksg::Animator. 2) Introduce MotionBlurController to swap for a regular LayerController when needed. MotionBlurController dispatches time ticks to MotionBlurEffect instead of directly to the layer animators. The actual motion blur impl is based on https://skia-review.googlesource.com/c/skia/+/221416. Motion blur requires Lottie files exported with this BodyMovin patch: https://github.com/bodymovin/bodymovin-extension/pull/15 Change-Id: I075e101ea91ec9aa300bac35ee810fd539f1aced Reviewed-on: https://skia-review.googlesource.com/c/skia/+/225416 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-07-09 13:02:17 +00:00
Florin Malita	97054c421e	[skottie] Add forgotten linear-wipe test TBR= Change-Id: I643fbe9491d2e134f631435444ec220af9250fc1 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/225423 Reviewed-by: Florin Malita <fmalita@chromium.org> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-07-03 15:06:42 +00:00
Mike Klein	aab45b5638	add misc. value programs to SkVMTest.expected Noticed we were only dumping the final register programs for the integer code. Might as well also track the value programs. Change-Id: I417c5c655b632691557bbbb136dcbd3f3167af9a Reviewed-on: https://skia-review.googlesource.com/c/skia/+/225324 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-07-02 23:13:06 +00:00
Florin Malita	afd2c10c98	[skottie] Use hybrid bounds for custom Shaper VAlign modes We used to rely solely on visual bounds for vertical alignment. That had the downside of leading/trailing empty lines being ignored. Then https://skia-review.googlesource.com/c/skia/+/220916 switched to using typographical bounds. This approach produces results in line with AE, but allows some glyphs to overflow the alignment boundary. This CL introduces a hybrid approach: 1) for standard AE text alignment, continue to use typographical bounds 2) for Skottie VAlign extensions (sk_vj), use the union of typographical and visual bounds - this should mitigate both issues mentioned above Change-Id: Ifd3ccae3d721728ce67942206160ebe92056d3a2 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/224188 Reviewed-by: Ben Wagner <bungeman@google.com> Reviewed-by: Avinash Parchuri <aparchur@google.com>	2019-06-28 11:35:09 +00:00
Florin Malita	b0944553df	[skottie] Venetian Blinds effect Change-Id: I50e133dea448e044fef45379490cb85b39eea3bc Reviewed-on: https://skia-review.googlesource.com/c/skia/+/223856 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-26 13:13:10 +00:00
Mike Klein	2b7b2a2331	add bit_clear I was just reading the ARM docs and realized that their BIC ("BIt Clear") is the same as SSE's ANDN ("AND Not") instruction. It's kind of a neat little tool to have laying around... comes up more than you'd think, and it's sometimes the clearest way to express what you're doing, as in the changed program here where the comment is "mask away the low bits". That's a bit_clear with a mask for what you want to clear away! And the real reason to write this up is that I want to have a CL to point to that shows how to add an instruction top to bottom. Change-Id: I99690ed9c1009427b3986955e7ae6264de4d215c Reviewed-on: https://skia-review.googlesource.com/c/skia/+/223120 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com> Reviewed-by: Mike Reed <reed@google.com>	2019-06-24 16:31:15 +00:00
Mike Klein	a1167abcae	split out Analysis struct from Instruction Instruction is the fundamental data, and Analysis derived from it. The fields in Analysis are only* needed in Builder::done(), and this split seems to help clarify what done() can tweak (Analysis) and what it cannot (fProgram, Instructions). done() is now const. No speed change as far as I can tell. * As you may notice looking at the test expectations, making analysis ephemeral means that dump() can no longer print the skull for dead code or the arrow for hoisted. The register program that's also in the expectation file still reflects both of these optimizations, so we're not really losing any information. Just maybe less demo-friendly. Change-Id: I79feb57558525591baf3faadeb59c418c12793f3 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/223119 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-24 16:00:41 +00:00
Mike Klein	0c3346643a	refactor to remove the need for death schedule This cuts the overhead bench from about 19µs to about 15µs. The key insight here is that the only registers that might become available after any given instruction are the ones that hold that instruction's inputs. We can check when they become available directly from the original Builder::Program, without needing a side death schedule data structure. Marking hoisted instructions as having life == program size helps make this logic a little simpler to reason through. Change-Id: Ifb9957f2d0e323e0e5d07996a2cc988f7c8b4c3f Reviewed-on: https://skia-review.googlesource.com/c/skia/+/223117 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-24 15:44:10 +00:00
Florin Malita	d7b321afa2	[skottie] Radial swipe effect Implement radial wipe with a sweep gradient shader mask filter. The implementation is slightly convoluted because edge feathering requires a real blur, which in turn requires content layer isolation. So there are two distinct operation modes: - no feather -> draw the content directly into the dest buffer, with the mask filter deferred in SG context - feather -> draw the content into a separate layer, then blend (dstOut) the composed blur+shader mask on top Change-Id: I253701aff42db8010ce463762252c262e2c5d92b Reviewed-on: https://skia-review.googlesource.com/c/skia/+/222596 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-21 14:03:45 +00:00
Mike Klein	397fc88fc0	first VEX ymm vector ops - 32x8 i32 add,sub,mul - add I32_Naive bench/test builder to get better i32 mul coverage - minor refactoring all over Change-Id: I13cc19ff37a2da0bcff289ba51baac08f456d6c5 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/222485 Reviewed-by: Herb Derby <herb@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-20 18:20:00 +00:00
Florin Malita	60e60dfc50	[skottie] Add support for motion tile phase The motion tile phase is a one-dimensional shift, applied to every other row or column (based on a selector property). Implement using a masking shader (covering the static rows/cols), and blend mode shader composition (srcIn for static/pass-through rows/cols, and srcOut for phased rows/cols). TBR= Change-Id: I336c150e5d4900962dc2de801a4e1572cf4b5d59 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/221339 Reviewed-by: Florin Malita <fmalita@chromium.org> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-18 14:01:22 +00:00
Florin Malita	b97824d4d1	[skottie] Motion tile effect Implement support for AE's Motion Tile effect [1]. This is the first effect which needs layer size information, so the CL includes related plumbing. Limitations: no phase support at this point. [1] https://helpx.adobe.com/after-effects/using/stylize-effects.html#motion_tile_effect Change-Id: I023bf8a9d3e3d2a48458fa94218f143e6aac4c9f Reviewed-on: https://skia-review.googlesource.com/c/skia/+/221244 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-17 18:20:15 +00:00
Mike Klein	342b1b2753	proposed: add bytes() op I'm staring at this assembly, vmovups (%rsi), %ymm3 vpsrld $24, %ymm3, %ymm4 vpslld $16, %ymm4, %ymm15 vorps %ymm4, %ymm15, %ymm4 vpsubw %ymm4, %ymm0, %ymm4 Just knowing that could be vmovups (%rsi), %ymm3 vpshufb 0x??(%rip), %ymm3, %ymm4 vpsubw %ymm4, %ymm0, %ymm4 That is, instead of shifting, shifting, and bit-oring to create the 0a0a scale factor from ymm3, we could just byte shuffle directly using some pre-baked control pattern (stored at the end of the program like other constants) pshufb lets you arbitrarily remix bytes from its argument and zero bytes, and NEON has a similar family of vtbl instructions, even including that same feature of injecting zeroes. I think I've got this working, and the speedup is great, from 0.19 to 0.16 ns/px for I32_SWAR, and from 0.43 to 0.38 ns/px for I32. Change-Id: Iab850275e826b4187f0efc9495a4b9eab4402c38 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220871 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-17 15:29:34 +00:00
Florin Malita	5fe7429bab	[skottie] Fix layer transform vs. effects interactions Turns out, in addition to solid layers, pre-comp and image layer effects are also subject to layer transforms. TBR= No-Try: true Change-Id: Ie235ff19374b8e0246eeec8e08079a2340e2a92a Reviewed-on: https://skia-review.googlesource.com/c/skia/+/221177 Commit-Queue: Florin Malita <fmalita@chromium.org> Reviewed-by: Florin Malita <fmalita@chromium.org>	2019-06-17 12:26:13 +00:00
Florin Malita	e47d8afabd	[skottie] Add support for Transform distort effect Yet another way to transform a layer, disguised as a distort effect. TBR= Change-Id: Ic2d5479fa6ae27b460de60875924f73f77fc7f71 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/221001 Reviewed-by: Florin Malita <fmalita@chromium.org> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-14 16:58:31 +00:00
Mike Klein	4c4945a252	trim another instruction of I32_SWAR Now that we've got shr_16x2, extract(..., 8, splat(0x00ff00ff)) is better done as shr_16x2(..., 8). This swaps a 16-bit shift in for the 32-bit shift, a wash, but lets us drop the bit_and at the end, saving one whole instruction. This places I32_SWAR a tiny little bit faster than the code in Opts, like .19 ns/px vs .20 ns/px for Opts. Change-Id: I4160dc03ecc8b855c0773a927f1510ad5cbb4b87 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220856 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-13 21:58:05 +00:00
Mike Klein	7f061fb53b	streamline srcover math in I32_SWAR This is the final bunny I've got in my hat, I think... Remembering that none of the s += d*invA adds can overflow, we can use a single 32-bit add to add them all at once. This means we don't have to unpack the src pixel into rb/ga halves. We need only extract the alpha for invA. This brings I32_SWAR even with the Opts code! curr/maxrss loops min median mean max stddev samples config bench 36/36 MB 133 0.206ns 0.211ns 0.208ns 0.211ns 1% ▁▇▁█▁▇▁▇▁▇ nonrendering SkVM_4096_I32_SWAR 37/37 MB 152 0.432ns 0.432ns 0.434ns 0.444ns 1% ▃▁▁▁▁▃▁▁█▁ nonrendering SkVM_4096_I32 37/37 MB 50 0.781ns 0.794ns 0.815ns 0.895ns 5% ▆▂█▃▅▂▂▁▂▁ nonrendering SkVM_4096_F32 37/37 MB 76 0.773ns 0.78ns 0.804ns 0.907ns 6% ▄█▅▁▁▁▁▂▁▁ nonrendering SkVM_4096_RP 37/37 MB 268 0.201ns 0.203ns 0.203ns 0.204ns 0% █▇▆▆▆▆▁▆▆▆ nonrendering SkVM_4096_Opts Change-Id: Ibf0a9c5d90b35f1e9cf7265868bd18b7e0a76c43 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220805 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-13 21:32:45 +00:00
Mike Klein	57cb5ba122	i16x2 sub/shr More i16x2 ops, as seemed immediately useful in I32_SWAR. I32_SWAR: 0.27 ns/px --> 0.25 ns/px I32: 0.43 ns/px F32: 0.76 ns/px RP: 0.8 ns/px Opts: 0.2 ns/px Change-Id: I04fed0d1ed1c4218d0cafb45fd0ee6d68880de80 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220801 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-13 19:17:34 +00:00
Mike Klein	3538908983	baby steps into 16-bit ops I figure the easiest way to expose 16-bit operations is to expose 16x2 pair operations... this means we can continue to always work with the same size vector. Switching from 32-bit multiplies to 16-bit multiplies is going to deliver the most oomph... they cost roughly half what 32-bit multiplies do on x86. Speed now: I32_SWAR: 0.27 ns/px I32: 0.43 ns/px F32: 0.76 ns/px RP: 0.8 ns/px Opts: 0.2 ns/px Change-Id: I8350c71722a9bde714ba18f97b8687fe35cc749f Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220709 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-13 18:44:44 +00:00
Mike Klein	821f5e8dfe	remove mul_unorm8/mad_unorm8 I just kind of remembered that if we're doing (xy+x)/256 and x is a destination channel and y is 255-sa, then you can get the +x for free by multiplying by 256-sa instead. (d * (255-sa) + d) (d * (255-sa + 1)) (d * (256-sa) ) Duh. This is a trick we play in a lot of legacy code and I've just now realized it's exactly equivalent to the trick I want to play here... sigh. Folding this math in kind of makes mul/mad_unorm8 moot. Speed's getting good: I32_SWAR: 0.3 ns/px I32 : 0.55 ns/px F32 : 0.8 ns/px RP : 0.8 ns/px Opts : 0.2 ns/px Change-Id: I4d10db51ea80a3258c36e97b6b334ad253804613 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220708 Reviewed-by: Herb Derby <herb@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-13 18:21:44 +00:00
Mike Klein	3f593799da	expand unit tests, fix extract The mask-only special case for extract is wrong... it never looked it its input! This not only makes things correct-er, but oddly it also makes them faster by breaking inter-loop data dependencies. Disable tests for _I32... they're actually still broken because of a much more systemic flaw in how I've evaluated programs. The _F32 and _I32_SWAR JIT code and all interpreted code is just getting lucky. o_O While here, update the I32_SWAR code to use the same math as I32, (x*y+x)/256 for unorm8 mul. This just helps keep me sane. Change-Id: I1acc09adb84c426fca4b2be5ca8c2d46d9678dd8 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220577 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Herb Derby <herb@google.com>	2019-06-12 18:58:56 +00:00
Florin Malita	920f6a9bdb	[skottie] Animated text tracking support Add logic to adjust glyph positions based on animated tracking properties. This adjustment is applied post-shaping (it doesn't observe the text box), and requires line re-alignment - thus it is being processed per-line. Change-Id: Id44a295032a48c7216f126cb02dd2d2d5cc18ae3 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/220076 Reviewed-by: Mike Reed <reed@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-11 13:41:28 +00:00
Florin Malita	e695e53f1c	[skottie] Add support for range selector domains Range selector's "Based On" property controls how range indices map to glyphs: characters, characters-excluding-spaces, words, lines. To support this feature: - update SkottieShaper to track domain-relevant info per fragment (fLineIndex, fIsWhitespace) - update TextAdapter to build domain maps (domain index -> fragment span) - update RangeSelector to run its range indices through a domain map, if present. Change-Id: I80e713f6beaa2578aa0eae1d1ddae8e1e47d8d10 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/219859 Commit-Queue: Florin Malita <fmalita@chromium.org> Reviewed-by: Ben Wagner <bungeman@google.com>	2019-06-10 17:49:08 +00:00
Mike Klein	22ea7e994b	add Builder::dump() I used to have a dump of the value program before it was translated to registers, but it went away a while ago. This restores it. Change-Id: I9b8bfcb124843cad4b0dc44bdf0a03e95a0c83d8 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/219757 Commit-Queue: Mike Klein <mtklein@google.com> Commit-Queue: Brian Osman <brianosman@google.com> Auto-Submit: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-10 17:43:58 +00:00
Mike Klein	a630732c36	revise extract instruction Convert extract(x,bits,z) to be (x >> bits) & z, now a more explicit parallel to pack(). This lets us eliminate the funky bit counting required from the old instruction, but more saliently it makes it more likely that the masks we AND with will be the same value. Ultimately down at the x86 or ARM ISA level, the AND instructions don't really benefit from having an immediate argument (while the shifts do). We might as well treat the mask as a normal value, letting it get commoned with identical values, loop hoisted, etc. Change-Id: I48a38468b46f2c730574c025f412262296472447 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/219597 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-10 15:08:19 +00:00
Florin Malita	54c8411c27	[skottie] Fix zero-overlap selector handling The current implementation applies constant coverage (outside selector range) based on computed integral edges. But the integral range is clamped to the valid index domain and its extremes are always assumed to have partial coverage - so we never get to constant-blit the full buffer when the interval is outside, which can yield incorrect coverage for the first/last fragments. Update the constant coverage logic to operate in full domain coordinates. Change-Id: I23902674fe5e822081fb8262167511df1cc3463e Reviewed-on: https://skia-review.googlesource.com/c/skia/+/219206 Reviewed-by: Ben Wagner <bungeman@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-06-06 16:09:39 +00:00
Florin Malita	711f39fb45	[skottie] Add support for RangeSelector shapes Introduce square/ramp/triangle/round/smooth shape generators, and use them to seed the range selector coverage pipeline. Change-Id: Ib7b94ceecd2ccf66820f4dd2443fdd62e2ac6a1b Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218828 Commit-Queue: Florin Malita <fmalita@chromium.org> Reviewed-by: Ben Wagner <bungeman@google.com>	2019-06-06 13:51:11 +00:00
Mike Klein	6b43c30cd7	add mad_unorm8 instruction At some point adding more and more complex instructions reduces to the absurdity of SolveTheWholeProblem-The-Instruction, but I think this one will come up often enough to still make sense. mad() makes sense for unorm8 just about everywhere mad() makes sense for f32. This instruction won't matter to a JIT, but helps the interpreter. Change-Id: Iace92296cffbb6fbc3acd1f853cb01c51792f796 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218716 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-05 20:46:02 +00:00
Mike Klein	754bad3f38	hoist loop-invariant code out of the loop I'm of two minds about this... it adds register pressure and really only tends to hoist few instructions that are fairly cheap anway. On the other hand, it's neat, it's easy to turn off (just set the initial hoist value to false in Builder::push()) and it does deliver a noticeable though slight performance improvement in the interpreter. I think the final decision will probably come down to what we think about maintainability? Change-Id: Idd6346f70f03188917918406731154246a7c6fcb Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218584 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-05 16:31:41 +00:00
Mike Klein	c2d9a3145b	Revert "print 1/K floats as fractions" This reverts commit `346f82c1c3`. Reason for revert: *SAN bots Original change's description: > print 1/K floats as fractions > > Change-Id: Id00cbd0950e77debb5ab5d45541dc0f8d13a3c42 > Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218338 > Reviewed-by: Brian Osman <brianosman@google.com> > Commit-Queue: Mike Klein <mtklein@google.com> TBR=mtklein@google.com,brianosman@google.com Change-Id: Ic35cec97d2dc2c1e19dbdf8ea7b505ad75072da1 No-Presubmit: true No-Tree-Checks: true No-Try: true Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218529 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-05 14:12:26 +00:00
Mike Klein	346f82c1c3	print 1/K floats as fractions Change-Id: Id00cbd0950e77debb5ab5d45541dc0f8d13a3c42 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218338 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-04 21:11:22 +00:00
Florin Malita	1c44db15d1	[skottie] Initial text range selector support Each animator can have multiple range selectors, whose combined "coverage" modulates how the animator props compose with other/initial props. Since there can be multiple animators with different/arbitrary selectors, we compute independent property values for each fragment. Supported features: - start, end, offset, amount - units: percentage, index - based-on: characters-only for now - mode: add-only for now - shape: square-only for now Change-Id: If7fee46ffb29e1f92542822481ed699fd0b0b521 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218076 Commit-Queue: Florin Malita <fmalita@chromium.org> Reviewed-by: Ben Wagner <bungeman@google.com>	2019-06-04 20:50:32 +00:00
Mike Klein	3c00d3d379	dump register and instruction count Change-Id: Ib0d4f354787e413749fdda8b59ccc2f94472b0ce Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218243 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-04 17:27:29 +00:00
Mike Klein	9656dcedc2	add extract instruction Kind of the flip side of pack. Made slightly awkward by instructions having only one immediate... calling _BitScanForward / __builtin_ctz() at runtime seems to work fine, but it really could have been done at compile time. Change-Id: Ic83fe8e0a1603fb9189598dcc26c842cc797bf45 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218241 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-04 17:23:49 +00:00
Mike Klein	1665aaa5fd	add pack instruction This instruction can lower to some useful SSE/NEON instructions, and even if not, is a handy way to express the frequent paring of << and \|. I32_SWAR: 2.3 -> 1.9 I32: 2.6 -> 2.4 F32: 5.1 -> 4.7 Change-Id: Ia169ad40f0aaef32417e05d9bf91c2d2542e7b5f Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218238 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-04 17:14:49 +00:00
Mike Klein	d7d04dc5eb	add mul_unorm8 instruction Another way for an interpreter to go faster is to provide better instructions. mul_unorm8 is one we use all the time. Drops _I32 bench from ~3.6ns/px to ~2.6ns/px. Change-Id: I9d08914c114048b79075796af9ec802236b35706 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218236 Commit-Queue: Mike Klein <mtklein@google.com> Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-04 16:59:49 +00:00
Mike Klein	7b7077cc36	centralize test/bench SkVM builders Eliminate the duplicate functionality, and better testing for the bench builders. Change-Id: If20e52107738903f854aec431416e573d7a7d640 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218041 Reviewed-by: Mike Klein <mtklein@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-04 16:55:59 +00:00
Mike Klein	12818515b5	add a xy+0 -> xy peephole This is mostly to test how easy rebaselining SkVMTest is. Change-Id: I27ab6f6bb8b7e126327374009783afd86d416f55 Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218039 Reviewed-by: Brian Osman <brianosman@google.com>	2019-06-03 22:25:27 +00:00
Mike Klein	267f50773c	streamline SkVM test rebaselining - keep expectations in resources/ - overwrite automatically if needed so we can see the diff in Git Change-Id: I2486b127ebcc7f40332fd0462e38b1af04d3e32b Reviewed-on: https://skia-review.googlesource.com/c/skia/+/218038 Reviewed-by: Brian Osman <brianosman@google.com> Commit-Queue: Mike Klein <mtklein@google.com>	2019-06-03 22:24:50 +00:00
Florin Malita	1d89b0d7e1	[skottie] Add support for scale and rotation text animators The interesting bit here is a change in glyph positioning: AE text animator transforms are to be applied relative to the glyph position. To support this behavior, update Shaper to externalize glyph positioning when in fragmented mode. I.e. instead of baking glyph positions in blobs, apply them at the scene graph transform level (such that they compose with animated transforms correctly). Change-Id: I9aeb5e6f8c1ec1a2c8b5351e8fc2a73d4bdf5cad Reviewed-on: https://skia-review.googlesource.com/c/skia/+/217556 Reviewed-by: Ben Wagner <bungeman@google.com> Commit-Queue: Florin Malita <fmalita@chromium.org>	2019-05-31 20:41:19 +00:00
Florin Malita	e57546bb78	[skottie] Add support for opacity text animators Change-Id: I76da8745ccd714cb8457000d9cd208e62f1058ba Reviewed-on: https://skia-review.googlesource.com/c/skia/+/217536 Auto-Submit: Florin Malita <fmalita@chromium.org> Commit-Queue: Ben Wagner <bungeman@google.com> Reviewed-by: Ben Wagner <bungeman@google.com>	2019-05-31 19:50:59 +00:00

1 2 3 4 5 ...

257 Commits