It didn't bring any noticeable benefits and it isn't compatible with the
way we intend to do colorstate support.
And nobody seems to want to spend time on it, so let's get rid of it.
We can bring it back later if someone wants to work on it.
I was watching the log in my terminal and nothing happened.
And I wasn't sure if that was because nothing was printed or because the
same thing was printed every few seconds.
Fix that by printing a timestamp, so that every few seconds something
different gets printed.
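A minimal sketch of the idea (the function and message format here are
made up for illustration, not the actual code):

  #include <glib.h>

  static void
  print_cache_status (gsize dead_pixels)
  {
    /* the timestamp makes successive identical messages distinguishable */
    g_print ("%" G_GINT64_FORMAT ": %" G_GSIZE_FORMAT " dead pixels\n",
             g_get_monotonic_time (), dead_pixels);
  }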
Previously we tracked the dead pixels, but that meant we didn't know the
alive pixels (because there are also unused pixels that are never
accounted for).
And due to that, we would free the current atlas more or less randomly.
Now we track if any pixels are alive, and if so, we never gc the current
atlas.
After 60s, we gc the atlas, too. This ensures that after that time, we
free all cache resources, so if an application gets moved to the
background, it will no longer use GPU resources. (Well, at least the
cache won't.)
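A sketch of that logic, with illustrative names (the real atlas struct
looks different):

  #include <glib.h>

  #define ATLAS_TIMEOUT (60 * G_USEC_PER_SEC)

  typedef struct {
    gsize alive_pixels;    /* pixels still referenced by cache items */
    gint64 last_use_time;  /* monotonic time of last use */
  } Atlas;

  /* GC the atlas only once nothing in it is alive anymore, or once it
   * has been unused for 60s, so backgrounded apps release the cache. */
  static gboolean
  atlas_should_gc (const Atlas *atlas, gint64 now)
  {
    return atlas->alive_pixels == 0 ||
           now - atlas->last_use_time >= ATLAS_TIMEOUT;
  }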
Only if a non-stale item is in the cache do we consider the cache not
empty.
Once the cache is empty, the device frees it and stops running the
periodic GC.
This is for 3 reasons:
1. Separation of concerns
The device is meant to manage the Vulkan/GL device and check stuff
like image sizes.
Caching is not part of that.
2. Refcounting
Images etc want to reference the device, but the cache wants to
reference images. If the cache is the device, that's a refcycle.
3. Flexibility
It's now easier to implement >1 cache, say one per depth or one per
color state.
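The emptiness check and GC shutdown described above can be sketched
like this (the Cache type and cache_* helpers are hypothetical):

  #include <glib.h>

  typedef struct _Cache Cache;

  extern void     cache_gc       (Cache *cache);
  extern gboolean cache_is_empty (Cache *cache); /* non-stale items only */
  extern void     cache_free     (Cache *cache);

  static gboolean
  gc_timeout_cb (gpointer user_data)
  {
    Cache *cache = user_data;

    cache_gc (cache);

    if (cache_is_empty (cache))
      {
        cache_free (cache);
        return G_SOURCE_REMOVE;   /* stop the periodic GC */
      }

    return G_SOURCE_CONTINUE;
  }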
Keeping the GdkRGBA requires converting it later, which isn't
necessary if we just keep the already converted float[4].
It also prepares for future color states, where the color will need to
be converted using the colorstate.
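Roughly this, as a sketch (the function name is illustrative):

  #include <gdk/gdk.h>

  /* convert once up front instead of keeping the GdkRGBA around */
  static void
  rgba_to_float (const GdkRGBA *rgba, float values[4])
  {
    values[0] = rgba->red;
    values[1] = rgba->green;
    values[2] = rgba->blue;
    values[3] = rgba->alpha;
  }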
This way, we can simply duplicate the keys as separate pointers to store
the corresponding Vulkan handles so that we can safely hash them, as
Vulkan handles may or may not be pointers depending on the target
platform.
This will fix builds on 32-bit Windows at least.
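As a sketch of the approach (names are illustrative): non-dispatchable
handles are pointers on 64-bit targets but uint64_t on 32-bit ones, so
we copy the handle into an allocated guint64 and hash that:

  #include <glib.h>
  #include <vulkan/vulkan.h>

  static GHashTable *
  handle_cache_new (void)
  {
    return g_hash_table_new_full (g_int64_hash, g_int64_equal,
                                  g_free, NULL);
  }

  static void
  handle_cache_insert (GHashTable *cache, VkSampler sampler, gpointer value)
  {
    guint64 *key = g_new (guint64, 1);

    *key = (guint64) sampler;  /* works whether handles are pointers or not */
    g_hash_table_insert (cache, key, value);
  }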
This function does not use the standard __cdecl calling convention on
Windows, meaning that using g_clear_pointer() on it directly will cause
crashes on 32-bit Windows. Just call the function directly if the
GLsync exists.
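Assuming the function in question is glDeleteSync() (the GLsync context
suggests so), the safe pattern looks like:

  #include <epoxy/gl.h>

  static void
  clear_gl_sync (GLsync *sync)
  {
    if (*sync != NULL)
      {
        glDeleteSync (*sync);   /* direct call: correct calling convention */
        *sync = NULL;
      }
  }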
On Windows, gsize is an unsigned long long. The compiler complains about
that.
Use G_GSIZE_FORMAT, which translates to %llu on Windows, %lu on most
platforms, and just %u in rare cases.
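For example:

  #include <glib.h>

  static void
  print_buffer_size (gsize size)
  {
    /* expands to the right conversion for the platform's gsize */
    g_print ("buffer size: %" G_GSIZE_FORMAT " bytes\n", size);
  }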
Due to rounding errors, it is possible after intersecting a lot of
rectangles to end up with a tiny size for an offscreen. And because we
allow an epsilon before ceil()ing to an integer (see commit afc7b46264
for details) it is now possible that we end up with a size of 0.
Avoid that by always enforcing a minimum size of 1px.
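A sketch of the clamp (the epsilon value here is illustrative; see the
referenced commit for the real one):

  #include <math.h>
  #include <glib.h>

  static int
  offscreen_size (float size)
  {
    /* allow an epsilon before ceil()ing, but never go below 1px */
    return MAX (1, (int) ceilf (size - 0.001f));
  }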
Test included.
The test uses a different codepath to arrive at the same problem - it
specifies the small size directly instead of triggering it via rounding
errors and clipping like the original bug (which is most likely the more
common way to encounter this problem).
Fixes #6656
The VK_IMAGE_LAYOUT_UNDEFINED layout means that the data held by the
texture can be discarded, and we don't want to discard it. Because the
Vulkan spec is unclear (see [1] for a discussion), err on the side of
caution and use VK_IMAGE_LAYOUT_GENERAL.
Fixes import failures with WebKit.
[1] https://github.com/ValveSoftware/gamescope/issues/356
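The shape of the change, sketched as the barrier for an imported
texture (all fields besides the layouts are elided):

  #include <vulkan/vulkan.h>

  static const VkImageMemoryBarrier import_barrier = {
    .sType = VK_STRUCTURE_TYPE_IMAGE_MEMORY_BARRIER,
    /* GENERAL instead of UNDEFINED: UNDEFINED allows the driver to
     * discard the texture's existing contents */
    .oldLayout = VK_IMAGE_LAYOUT_GENERAL,
    .newLayout = VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL,
    /* image, access masks and subresource range omitted */
  };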
The goal is to generate an offscreen at 1x scale.
When not ceil()ing the numbers, the offscreen code would do it *and*
adjust the scale accordingly, so we'd end up with something like a
1.01x scale.
And that would cause the code to reenter this codepath with the goal to
generate an offscreen at 1x scale.
And indeed, this would lead to infinite recursion.
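The fix, sketched with illustrative names:

  #include <math.h>

  static void
  offscreen_size_at_1x (float width, float height, int *out_w, int *out_h)
  {
    /* ceil() here so the offscreen code doesn't round for us and
     * adjust the scale to 1.01x, re-triggering this codepath */
    *out_w = (int) ceilf (width);
    *out_h = (int) ceilf (height);
  }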
Tests included.
Fixes#6553
It turns out that we mispositioned glyphs with some cff fonts
when metrics hinting is off and hinting is on. Since we don't
fully understand the interactions of these settings at this point,
let's preserve metrics hinting as it was on the font we got.
This at least gives folks a workaround for when they experience
clipped rendering with cff fonts: Turn on hint-metrics.
We forced hint metrics off here because it made Pango do some
creative wfh for hex boxes at small sizes, but I've dropped that
on the Pango side.
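A sketch of preserving the setting with cairo's font options API (the
surrounding code is hypothetical):

  #include <cairo.h>

  static void
  preserve_hint_metrics (cairo_font_options_t       *options,
                         const cairo_font_options_t *font_options)
  {
    /* copy hint-metrics from the font we got instead of forcing it off */
    cairo_font_options_set_hint_metrics (options,
        cairo_font_options_get_hint_metrics (font_options));
  }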
In a very particular situation, it could happen that our renderpass
reordering did not work out.
Consider this nesting of renderpasses (indentation indicates subpasses):
pass A
  subpass of A
pass B
  subpass of B
Our reordering code would reorder this as:
subpass of B
subpass of A
pass A
pass B
Which doesn't sound too bad - the subpasses happen before the passes,
after all.
However, a subpass might be a pass that converts the image for a texture
stored in the texture cache and then updates the cached image.
If "subpass of A" is such a pass *and* if "subpass of B" then renders
with exactly this texture, then "subpass of B" will use the result of
"subpass of A" as a source.
The fix is to ensure that subpasses stay ordered, too.
The new order moves subpasses right before their parent pass, so the
order of the example now looks like:
subpass of A
pass A
subpass of B
pass B
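Conceptually, the fixed ordering is a depth-first emit where each
pass's subpasses are recorded immediately before it. A sketch, with
illustrative types and helpers:

  #include <glib.h>

  typedef struct _Pass Pass;
  struct _Pass {
    GList *subpasses;   /* list of Pass* */
  };

  extern void record_pass (Pass *pass);   /* hypothetical */

  static void
  emit_pass (Pass *pass)
  {
    for (GList *l = pass->subpasses; l; l = l->next)
      emit_pass (l->data);   /* subpasses first, recursively... */

    record_pass (pass);      /* ...then the pass itself */
  }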
The place where this would happen most commonly was when drawing thumbnail
images in Nautilus, the GTK filechooser or Fractal.
Those images are usually PNG files, which are straight alpha. They are then
drawn with a drop shadow, which requires an offscreen for drawing as
well as those images as premultiplied sources, so lots of subpasses happen.
If there is then a redraw with a somewhat tricky subregion, the
region slicing code could end up generating 2 passes that each draw
half of the thumbnail image - the first pass drawing the top half and the
second pass drawing the bottom half.
And due to the bug the bottom half would then be drawn from the
offscreen before the actual contents of the offscreen would be drawn,
leading to a corrupt bottom part of the image.
Test included.
Fixes: #6318
We write the buffers in small chunks, and we sometimes even read them. So
prefer cached memory when it's available.
Speeds up the text benchmarks by a factor of 3 on my dedicated GPU.
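A sketch of such a memory-type choice on the Vulkan side (the function
name and fallback handling are illustrative):

  #include <vulkan/vulkan.h>

  static uint32_t
  find_host_cached_memory (const VkPhysicalDeviceMemoryProperties *props,
                           uint32_t                                allowed_types)
  {
    const VkMemoryPropertyFlags wanted = VK_MEMORY_PROPERTY_HOST_VISIBLE_BIT |
                                         VK_MEMORY_PROPERTY_HOST_CACHED_BIT;

    for (uint32_t i = 0; i < props->memoryTypeCount; i++)
      {
        if ((allowed_types & (1u << i)) &&
            (props->memoryTypes[i].propertyFlags & wanted) == wanted)
          return i;
      }

    return UINT32_MAX;   /* fall back to plain HOST_VISIBLE memory */
  }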
If glBufferStorage() is available, we can replace our usage of
glBufferSubData() with persistently mapped storage via
glMapBufferRange().
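A sketch of the persistent-mapping setup (buffer lifetime management
and the fallback path are omitted):

  #include <epoxy/gl.h>

  static void *
  map_buffer_persistently (GLsizeiptr size)
  {
    const GLbitfield flags = GL_MAP_WRITE_BIT |
                             GL_MAP_PERSISTENT_BIT |
                             GL_MAP_COHERENT_BIT;
    GLuint buffer;

    glGenBuffers (1, &buffer);
    glBindBuffer (GL_ARRAY_BUFFER, buffer);

    /* immutable storage, allocated once... */
    glBufferStorage (GL_ARRAY_BUFFER, size, NULL, flags);

    /* ...and mapped for the buffer's whole lifetime */
    return glMapBufferRange (GL_ARRAY_BUFFER, 0, size, flags);
  }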
This has 1 disadvantage:
1. It's not supported everywhere: it requires GL 4.4 or
GL_EXT_buffer_storage. But every GPU of the last 10 years should
implement it. So we check for it and keep the old code.
The old code can also be forced via GDK_GL_DISABLE=buffer-storage.
But it has 2 advantages:
1. It is what Vulkan does, so it unifies the two renderers' buffer
handling.
2. It is a significant performance boost in use cases with large vertex
buffers. Those are pretty rare, but do happen with lots of text at a
small font size. An example would be a small font in a maximized VTE
terminal or the overview in gnome-text-editor.
A custom benchmark tailored for this problem can be created with:
tests/rendernode-create-tests 1000000 text.node
This creates a node file called "text.node" that draws 1 million text
nodes.
(Creating that test takes a minute or so. A smaller number may be useful
on less powerful hardware than my Intel Tigerlake laptop.)
The difference can then be compared via:
tools/gtk4-rendernode-tool benchmark --runs=20 text.node
and
GDK_GL_DISABLE=buffer-storage tools/gtk4-rendernode-tool benchmark --runs=20 text.node
For my laptop, the difference is:
before: 1.1s
after: 0.8s
Related: !7021
It's not just unused, it's also wrong.
We read from the buffer when reallocating the vertex buffer and
memcpy()ing the old contents into the new buffer.
When ops get allocated that use the same state as the last op, put them
into the same ShaderOp. This reduces the number of ShaderOps we need to
record, which has 3 benefits:
1. It's less work when iterating over all the ops.
This isn't a big win, but it makes submit() and print() run a bit
faster.
2. We don't need to manage data per-op.
This is a large win because we don't need to ref/unref descriptors
as much anymore, and refcounting is visible on profiles.
3. We save memory.
This is a pretty big win because we iterate over ops a lot, and when
the array is large enough (I've managed to write testcases that make
it grow to over 4GB) it kills all the caches and that's bad.
The main beneficiaries of all this are glyphs, which used to emit 1 ShaderOp
per glyph and can now end up with 1 ShaderOp for multiple text nodes,
even if those text nodes use different fonts or colors - because they
can all share the same ColorizeOp.
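A sketch of the merge logic (the ShaderOp struct here is drastically
simplified):

  #include <glib.h>

  typedef struct {
    guint shader;   /* stands in for the full shader state */
    gsize n_ops;    /* ops collected into this ShaderOp */
  } ShaderOp;

  static void
  record_op (GArray *shader_ops, guint shader)
  {
    if (shader_ops->len > 0)
      {
        ShaderOp *last = &g_array_index (shader_ops, ShaderOp,
                                         shader_ops->len - 1);

        if (last->shader == shader)
          {
            last->n_ops++;   /* same state: merge into the last ShaderOp */
            return;
          }
      }

    ShaderOp op = { shader, 1 };
    g_array_append_val (shader_ops, op);
  }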
With potentially multiple ops per ShaderOp, we may encounter situations
where 1 ShaderOp contains more ops than we want to merge. (With
GSK_GPU_SKIP=merge, we don't want to merge at all.)
So we still merge the ShaderOps (now unconditionally), but we then run
a loop that potentially splits the merged ops again - exactly at the
point we want to.
This way we can merge ops inside of ShaderOps and merge ShaderOps, but
still have the draw calls contain the exact number of ops we want.
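Reusing the simplified ShaderOp from the sketch above, the split step
looks roughly like:

  static void
  split_merged_ops (GArray *shader_ops, gsize max_merge)
  {
    for (guint i = 0; i < shader_ops->len; i++)
      {
        ShaderOp *op = &g_array_index (shader_ops, ShaderOp, i);

        if (op->n_ops > max_merge)
          {
            ShaderOp rest = { op->shader, op->n_ops - max_merge };

            op->n_ops = max_merge;
            /* the remainder is visited (and split further) next iteration */
            g_array_insert_val (shader_ops, i + 1, rest);
          }
      }
  }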
This just introduces the variable and sets it to 1 everywhere.
The ultimate goal is to allow one ShaderOp to collect multiple ops into
one, thereby saving memory in the ops array and leading to faster
performance.