skia2/src/sksl
John Stiles 5c7bb326a7 Reland "Create a basic IRNode pooling system."
This is a reland of e16eca95f5

This fixes the no-op (iOS) implementation of CreatePoolOnThread.

Original change's description:
> Create a basic IRNode pooling system.
>
> Allocations are redirected by overriding `operator new` and `operator
> delete` on the IRNode class. This allows us to use our existing
> `unique_ptr` and `make_unique` calls as-is. The Pool class is simple;
> it holds a fixed number of nodes and recycles them as they are returned.
>
> A fixed pool size of 2000 nodes was chosen. That is large enough to hold
> the contents of `sksl_large` during compilation, but it can be
> overflowed by very large shaders, or if multiple programs are converted
> at the same time. Exhausting the pool is not a problem; if this happens,
> additional nodes will be allocated via the system allocator as usual.
> More elaborate schemes are possible but might not add a lot of value.
>
> Thread safety is accomplished by placing the pool in a `thread_local`
> static during a Program's creation and destruction; the pool is freed
> when the program is destroyed. One important consequence of this
> strategy is that a program must free every node that it allocated during
> its creation, or else the node will be leaked. In debug, leaking a node
> will be detected and causes a DEBUGFAIL. In release, the pool will be
> freed despite having a live node in it, and if that node is later freed,
> that pointer will be passed to the system `free` (which is likely to
> cause a crash).
>
> In this CL, iOS does not support pooling, since support for
> `thread_local` was only added on iOS 9. This is fixed in the followup
> CL, http://review.skia.org/328837, which uses pthread keys on iOS.
>
> Nanobench shows ~15% improvement:
>   (last week) http://screen/5CNBhTaZApcDA8h
>       (today) http://screen/8ti5Rymvf6LUs8i
>
> Change-Id: I559de73606ee1be54e5eae7f82129dc928a63e3c
> Reviewed-on: https://skia-review.googlesource.com/c/skia/+/326876
> Commit-Queue: John Stiles <johnstiles@google.com>
> Reviewed-by: Ethan Nicholas <ethannicholas@google.com>
> Auto-Submit: John Stiles <johnstiles@google.com>

Change-Id: I8623a574a7e92332ff00b83982497863c8953929
Reviewed-on: https://skia-review.googlesource.com/c/skia/+/329171
Commit-Queue: John Stiles <johnstiles@google.com>
Commit-Queue: Brian Osman <brianosman@google.com>
Auto-Submit: John Stiles <johnstiles@google.com>
Reviewed-by: Brian Osman <brianosman@google.com>
2020-10-22 15:56:00 +00:00
..
generated Declare sk_InvocationID programmatically 2020-10-13 16:41:41 +00:00
ir Reland "Create a basic IRNode pooling system." 2020-10-22 15:56:00 +00:00
lex Reland "Added support for the 'inline' hint on SkSL functions" 2020-08-20 13:42:02 +00:00
GLSL.std.450.h Remove trailing whitespace. 2017-10-09 21:20:34 +00:00
README Reland "Added support for the 'inline' hint on SkSL functions" 2020-08-20 13:42:02 +00:00
sksl_fp_raw.sksl Remove sk_ClipDistance. 2020-09-25 20:29:10 +00:00
sksl_frag.sksl Remove sk_ClipDistance. 2020-09-25 20:29:10 +00:00
sksl_geom.sksl Declare sk_InvocationID programmatically 2020-10-13 16:41:41 +00:00
sksl_gpu.sksl Reland "Add sk_Caps.builtinDeterminantSupport and use it in cross()." 2020-10-09 14:45:23 +00:00
sksl_interp.sksl SkSL include files are now stored in a binary format 2020-07-31 13:48:25 +00:00
sksl_pipeline.sksl SkSL include files are now stored in a binary format 2020-07-31 13:48:25 +00:00
sksl_vert.sksl Remove sk_ClipDistance. 2020-09-25 20:29:10 +00:00
SkSLAnalysis.cpp Directly compute call counts, rather than mutating state 2020-10-19 16:08:39 +00:00
SkSLAnalysis.h Directly compute call counts, rather than mutating state 2020-10-19 16:08:39 +00:00
SkSLASTFile.h Enable ClangTidy check llvm-namespace-comment. 2020-08-06 19:07:52 +00:00
SkSLASTNode.cpp Disambiguate '.' from '::' in the SkSL parser 2020-09-11 13:48:54 +00:00
SkSLASTNode.h Add missing include to fix generated files bot 2020-09-24 20:03:57 +00:00
SkSLByteCode.cpp move scalar functions out of SkVx.h 2020-10-14 14:24:46 +00:00
SkSLByteCode.h moved SkSL FunctionDeclaration data into IRNode 2020-10-08 17:10:12 +00:00
SkSLByteCodeGenerator.cpp allow early returns 2020-10-20 14:33:31 +00:00
SkSLByteCodeGenerator.h Only include one variable per declaration statement 2020-10-07 17:27:01 +00:00
SkSLCFGGenerator.cpp moved SkSL FunctionDefinition data into IRNode 2020-10-14 18:53:41 +00:00
SkSLCFGGenerator.h Use SkSTArray to track CFG exits instead of std<set>. 2020-10-08 20:08:12 +00:00
SkSLCodeGenerator.h SkSL optimization now happens in convertProgram rather than being a 2020-09-14 16:34:47 +00:00
SkSLCompiler.cpp Reland "Create a basic IRNode pooling system." 2020-10-22 15:56:00 +00:00
SkSLCompiler.h Reorganization of IR generator's API and interaction with compiler 2020-10-09 20:29:03 +00:00
SkSLContext.h Moved SkSL type into IRNode, now accessed via a method 2020-09-11 17:08:04 +00:00
SkSLCPP.h header self-sufficiency and idempotency 2019-06-26 19:12:43 +00:00
SkSLCPPCodeGenerator.cpp moved SkSL FunctionDefinition data into IRNode 2020-10-14 18:53:41 +00:00
SkSLCPPCodeGenerator.h Fix checks of GR_TEST_UTILS. 2020-09-25 15:58:53 +00:00
SkSLCPPUniformCTypes.cpp Fix checks of GR_TEST_UTILS. 2020-09-25 15:58:53 +00:00
SkSLCPPUniformCTypes.h moved SkSL Variable data into IRNode 2020-10-07 21:29:12 +00:00
SkSLDefines.h Let skslc use more of Skia's core 2020-10-02 20:58:54 +00:00
SkSLDehydrator.cpp Add builtin flag to SymbolTable. 2020-10-19 14:35:59 +00:00
SkSLDehydrator.h Add line breaks to make dehydrated files more merge-friendly. 2020-10-08 18:55:02 +00:00
SkSLErrorReporter.h Enable ClangTidy check llvm-namespace-comment. 2020-08-06 19:07:52 +00:00
SkSLExternalValue.h Moved SkSL VariableReference data into IRNode 2020-10-08 15:59:08 +00:00
SkSLFileOutputStream.h Replace 'typedef xxxxx INHERITED' with 'using INHERITED = xxxx;'. 2020-09-03 03:41:26 +00:00
SkSLGLSLCodeGenerator.cpp Always declare sk_FragColor in GLSL, even if unused 2020-10-20 21:13:31 +00:00
SkSLGLSLCodeGenerator.h Only include one variable per declaration statement 2020-10-07 17:27:01 +00:00
SkSLHCodeGenerator.cpp Clear the builtin status of SkSL nodes when they are cloned. 2020-10-20 15:53:41 +00:00
SkSLHCodeGenerator.h Fix checks of GR_TEST_UTILS. 2020-09-25 15:58:53 +00:00
SkSLInliner.cpp Wrap built-in symbol tables during inlining. 2020-10-20 19:34:01 +00:00
SkSLInliner.h Revert "Clean up SkSL inliner and allow it to be disabled." 2020-10-13 15:13:26 +00:00
SkSLIRGenerator.cpp Pop the symbol table if compilation fails. 2020-10-21 20:44:53 +00:00
SkSLIRGenerator.h Convert statement arrays from std::vector to SkSTArray. 2020-10-13 17:06:01 +00:00
SkSLLexer.cpp Reland "Added support for the 'inline' hint on SkSL functions" 2020-08-20 13:42:02 +00:00
SkSLLexer.h Reland "Added support for the 'inline' hint on SkSL functions" 2020-08-20 13:42:02 +00:00
SkSLMain.cpp Add test for disabling the inliner. 2020-10-12 17:14:07 +00:00
SkSLMemoryLayout.h Revert "Revert "moved SkSL Field data into IRNode"" 2020-10-05 17:29:18 +00:00
SkSLMetalCodeGenerator.cpp Fix frexp support in Metal. 2020-10-20 15:31:31 +00:00
SkSLMetalCodeGenerator.h Convert statement arrays from std::vector to SkSTArray. 2020-10-13 17:06:01 +00:00
SkSLModifiersPool.h In SPIRVCodeGenerator, use the Program's ModifiersPool 2020-10-09 18:45:43 +00:00
SkSLOutputStream.cpp Add line breaks to make dehydrated files more merge-friendly. 2020-10-08 18:55:02 +00:00
SkSLOutputStream.h Add line breaks to make dehydrated files more merge-friendly. 2020-10-08 18:55:02 +00:00
SkSLParser.cpp Reject struct vardecls with modifiers. 2020-10-09 22:34:23 +00:00
SkSLParser.h Enable ClangTidy check llvm-namespace-comment. 2020-08-06 19:07:52 +00:00
SkSLPipelineStageCodeGenerator.cpp moved SkSL FunctionDefinition data into IRNode 2020-10-14 18:53:41 +00:00
SkSLPipelineStageCodeGenerator.h Replace 'typedef xxxxx INHERITED' with 'using INHERITED = xxxx;'. 2020-09-03 03:41:26 +00:00
SkSLPool.cpp Reland "Create a basic IRNode pooling system." 2020-10-22 15:56:00 +00:00
SkSLPool.h Reland "Create a basic IRNode pooling system." 2020-10-22 15:56:00 +00:00
SkSLPosition.h Enable ClangTidy check llvm-namespace-comment. 2020-08-06 19:07:52 +00:00
SkSLRehydrator.cpp Clear the builtin status of SkSL nodes when they are cloned. 2020-10-20 15:53:41 +00:00
SkSLRehydrator.h Add builtin flag to SymbolTable. 2020-10-19 14:35:59 +00:00
SkSLSampleUsage.cpp Only include one variable per declaration statement 2020-10-07 17:27:01 +00:00
SkSLSectionAndParameterHelper.cpp moved SkSL VarDeclarations data into IRNode 2020-10-14 17:37:11 +00:00
SkSLSectionAndParameterHelper.h moved SkSL Variable data into IRNode 2020-10-07 21:29:12 +00:00
SkSLSPIRVCodeGenerator.cpp Fix storage class issues for push constant variables. 2020-10-22 13:17:45 +00:00
SkSLSPIRVCodeGenerator.h Fix storage class issues for push constant variables. 2020-10-22 13:17:45 +00:00
SkSLSPIRVtoHLSL.cpp Add GrD3DPipelineStateBuilder class. 2020-04-23 15:01:58 +00:00
SkSLSPIRVtoHLSL.h Enable ClangTidy check llvm-namespace-comment. 2020-08-06 19:07:52 +00:00
SkSLString.cpp Revert "Revert "moved SkSL Field data into IRNode"" 2020-10-05 17:29:18 +00:00
SkSLString.h Improve performance of SymbolTable lookups. 2020-10-09 17:39:34 +00:00
SkSLStringStream.h Add line breaks to make dehydrated files more merge-friendly. 2020-10-08 18:55:02 +00:00
SkSLUtil.cpp Add support for ShaderCapsFactory to skslc. 2020-09-16 15:11:07 +00:00
SkSLUtil.h Reland "Add sk_Caps.builtinDeterminantSupport and use it in cross()." 2020-10-09 14:45:23 +00:00
spirv.h Remove trailing whitespace. 2017-10-09 21:20:34 +00:00

Overview
========

SkSL ("Skia Shading Language") is a variant of GLSL which is used as Skia's
internal shading language. SkSL is, at its heart, a single standardized version
of GLSL which avoids all of the various version and dialect differences found
in GLSL "in the wild", but it does bring a few of its own changes to the table.

Skia uses the SkSL compiler to convert SkSL code to GLSL, GLSL ES, or SPIR-V
before handing it over to the graphics driver.


Differences from GLSL
=====================

* Precision modifiers are not used. 'float', 'int', and 'uint' are always high
  precision. New types 'half', 'short', and 'ushort' are medium precision (we
  do not use low precision).
* Vector types are named <base type><columns>, so float2 instead of vec2 and
  bool4 instead of bvec4
* Matrix types are named <base type><columns>x<rows>, so float2x3 instead of
  mat2x3 and double4x4 instead of dmat4
* "@if" and "@switch" are static versions of if and switch. They behave exactly
  the same as if and switch in all respects other than it being a compile-time
  error to use a non-constant expression as a test.
* GLSL caps can be referenced via the syntax 'sk_Caps.<name>', e.g.
  sk_Caps.canUseAnyFunctionInShader. The value will be a constant boolean or int,
  as appropriate. As SkSL supports constant folding and branch elimination, this
  means that an 'if' statement which statically queries a cap will collapse down
  to the chosen branch, meaning that:

    if (sk_Caps.externalTextureSupport)
        do_something();
    else
        do_something_else();

  will compile as if you had written either 'do_something();' or
  'do_something_else();', depending on whether that cap is enabled or not.
* no #version statement is required, and it will be ignored if present
* the output color is sk_FragColor (do not declare it)
* use sk_Position instead of gl_Position. sk_Position is in device coordinates
  rather than normalized coordinates.
* use sk_PointSize instead of gl_PointSize
* use sk_VertexID instead of gl_VertexID
* use sk_InstanceID instead of gl_InstanceID
* the fragment coordinate is sk_FragCoord, and is always relative to the upper
  left.
* use sk_Clockwise instead of gl_FrontFacing. This is always relative to an
  upper left origin.
* you do not need to include ".0" to make a number a float (meaning that
  "float2(x, y) * 4" is perfectly legal in SkSL, unlike GLSL where it would
  often have to be expressed "float2(x, y) * 4.0". There is no performance
  penalty for this, as the number is converted to a float at compile time)
* type suffixes on numbers (1.0f, 0xFFu) are both unnecessary and unsupported
* creating a smaller vector from a larger vector (e.g. float2(float3(1))) is
  intentionally disallowed, as it is just a wordier way of performing a swizzle.
  Use swizzles instead.
* Swizzle components, in addition to the normal rgba / xyzw components, can also
  be LTRB (meaning "left/top/right/bottom", for when we store rectangles in
  vectors), and may also be the constants '0' or '1' to produce a constant 0 or
  1 in that channel instead of selecting anything from the source vector.
  foo.rgb1 is equivalent to float4(foo.rgb, 1).
* All texture functions are named "sample", e.g. sample(sampler2D, float3) is
  equivalent to GLSL's textureProj(sampler2D, float3).
* Render target width and height are available via sk_Width and sk_Height
* Functions support the 'inline' modifier, which causes the compiler to ignore
  its normal inlining heuristics and inline the function if at all possible
* some built-in functions and one or two rarely-used language features are not
  yet supported (sorry!)

SkSL is still under development, and is expected to diverge further from GLSL
over time.


SkSL Fragment Processors
========================

********************************************************************************
*** IMPORTANT: You must set gn arg "skia_compile_processors = true" to cause ***
*** .fp files to be recompiled! In order for compilation to succeed, you     ***
*** must run bin/fetch-clang-format (once) to install our blessed version.   ***
********************************************************************************

An extension of SkSL allows for the creation of fragment processors in pure
SkSL. The program defines its inputs similarly to a normal SkSL program (with
'in' and 'uniform' variables), but the 'main()' function represents only this
fragment processor's portion of the overall fragment shader. The 'main' function
can optionally have a 'float2' parameter that holds the sample coordinates the
processor is evaluated at.

Processors do not need to declare this parameter if they do not rely on local
coordinates or only invoke child processors using the no-arg and matrix-based
sample() functions.

Within an '.fp' fragment processor file:

* C++ code can be embedded in sections of the form:

  @section_name { <arbitrary C++ code> }

  Supported section are:
    @header            (in the .h file, outside the class declaration)
    @headerEnd         (at the end of the .h file)
    @class             (in the .h file, inside the class declaration)
    @cpp               (in the .cpp file)
    @cppEnd            (at the end of the .cpp file)
    @constructorParams (extra parameters to the constructor, comma-separated)
    @constructor       (replaces the default constructor)
    @initializers      (constructor initializer list, comma-separated)
    @emitCode          (extra code for the emitCode function)
    @fields            (extra private fields, each terminated with a semicolon)
    @make              (replaces the default Make function)
    @clone             (replaces the default clone() function)
    @dumpInfo          (replaces the default dumpInfo() function)
    @setData(<pdman>)  (extra code for the setData function, where <pdman> is
                        the name of the GrGLSLProgramDataManager)
    @test(<testData>)  (the body of the TestCreate function, where <testData> is
                        the name of the GrProcessorTestData* parameter)
    @samplerParams(<sampler>)
                       (the sampler params to attach to the named sampler2D)
* global 'in' variables represent data passed to the fragment processor at
  construction time. These variables become constructor parameters and are
  stored in fragment processor fields. By default float2/half2 maps to SkPoints,
  and float4/half4 maps to SkRects (in x, y, width, height) order. Similarly,
  int2/short2 maps to SkIPoint and int4/half4 maps to SkIRect. Use ctype
  (below) to override this default mapping.
* global variables support an additional 'ctype' layout key, providing the type
  they should be represented as from within the C++ code. For instance, you can
  use 'layout(ctype=SkPMColor4f) in half4 color;' to create a variable that looks
  like a half4 on the SkSL side of things, and a SkPMColor4f on the C++ side of
  things.
* 'uniform' variables become, as one would expect, top-level uniforms. By
  default they do not have any data provided to them; you will need to provide
  them with data via the @setData section.
* 'in uniform' variables are uniforms that are automatically wired up to
  fragment processor constructor parameters. The fragment processor will accept
  a parameter representing the uniform's value, and automatically plumb it
  through to the uniform's value in its generated setData() function.
* 'in uniform' variables support a 'tracked' flag in the layout that will
  have the generated code automatically implement state tracking on the uniform
  value to minimize GPU calls.
* Uniform variables support an additional 'when' layout key.
  'layout(when=foo) uniform int x;' means that this uniform will only be
  emitted when the 'foo' expression is true.
* 'in' variables support an additional 'key' layout key.
  'layout(key) in uniform int x;' means that this uniform should be included in
  the program's key. Matrix variables additionally support 'key=identity',
  which causes the key to consider only whether or not the matrix is an
  identity matrix.
* child processors can be declared with 'in fragmentProcessor <name>;', and can
  be invoked by calling 'sample(<name>)' or 'sample(<name>, <inputColor>)'.
  The first variant emits the child with a solid white input color. The second
  variant emits the child with the result of the 2nd argument's expression,
  which must evaluate to a half4. The process function returns a half4.
* The 'fragmentProcessor' type cannot hold a null. Nullable fragment processors
  should use the 'fragmentProcessor?' type: 'in fragmentProcessor? <name>'. You
  can check for null fragment processors by comparing them against 'null', as 
  in: 'if (inputFP != null) { ... }'. Invoking 'sample()' on a null fragment
  processor will return the inputColor unchanged (this defaults to solid white
  if not explicitly specified).


Creating a new .fp file
=======================

1. Ensure that you have set gn arg "skia_compile_processors = true"
2. Create your new .fp file, generally under src/gpu/effects.
3. Add the .fp file to sksl.gni.
4. Build Skia. This will cause the .fp file to be compiled, resulting in a new
   .cpp and .h file for the fragment processor.
5. Add the .cpp and .h files to gpu.gni.
6. Add the new processor's ClassID (k<ProcessorName>_ClassID) to
   GrProcessor::ClassID.
7. At this point you can reference the new fragment processor from within Skia.

Once you have done this initial setup, simply re-build Skia to pick up any
changes to the .fp file.