internal/driver/glfw: clean the cache periodically. #5112

knusbaum · 2024-09-04T16:10:24Z

Description:

The cache is supposed to be cleaned periodically, but the glfw driver does not do this as the mobile driver does.

This commit adds similar logic to the glfw driver as exists in the mobile driver, to periodically clean the cache on paint events.

Fixes #4903

Notes to Reviewers:

I'm calling cache.Clean in a similar way to how it's done in the mobile driver:

fyne/internal/driver/mobile/driver.go

Lines 280 to 293 in 5fb3d75

    
           canvasNeedRefresh := c.FreeDirtyTextures() > 0 || c.CheckDirtyAndClear() 
        
           if canvasNeedRefresh { 
        
           	newSize := fyne.NewSize(float32(d.currentSize.WidthPx)/c.scale, float32(d.currentSize.HeightPx)/c.scale) 
        
           	if c.EnsureMinSize() { 
        
           		c.sizeContent(newSize) // force resize of content 
        
           	} else { // if screen changed 
        
           		w.Resize(newSize) 
        
           	} 
        
           	d.paintWindow(w, newSize) 
        
           	d.app.Publish() 
        
           } 
        
           cache.Clean(canvasNeedRefresh)

I don't fully understand what the boolean value passed to clean represents, or if what I'm doing makes sense. I'm just mimicking the code from mobile.

I have not written regression tests for this, as I'm not familiar enough with the code to confidently do so.

Checklist:

Tests included.
Lint and formatter run with no errors.
Tests all pass.

Not all the tests pass on the current develop branch, but this PR does not cause new test failures.

The cache is supposed to be cleaned periodically, but the glfw driver does not do this as the mobile driver does. This commit adds similar logic to the glfw driver as exists in the mobile driver, to periodically clean the cache on paint events. Fixes fyne-io#4903

coveralls · 2024-09-04T16:32:15Z

coverage: 66.059% (+0.006%) from 66.053%
when pulling 7b2a4c1 on knusbaum:fix-cache-clean
into 5fb3d75 on fyne-io:develop.

andydotxyz

This looks like a good fix. Are you happy we land this before looking at the refactor we discussed @dweymouth ?

dweymouth · 2024-09-07T20:26:51Z

I suppose, though I suspect we are now doing double cache cleaning work on each clean, since it is calling both cache.Clean and cache.CleanCanvases, which duplicate most of the cleanup tasks. I had thought perhaps cache.CleanCanvases was introduced because cache.Clean couldn't be used in the desktop driver for whatever reason, but if we are calling cache.Clean now, it is very likely CleanCanvases is totally redundant and we should evaluate and remove it

dweymouth · 2024-09-07T20:41:38Z

From a quick glance it seems the only clean task missing from CleanCanvases is the call to destroyExpiredFontMetrics, so another fix could be just adding that call to CleanCanvases. I would really love if someone who was involved in the initial caching design could review these two funcs more deeply and figure out if we still need the separation between the two

knusbaum · 2024-09-09T15:38:34Z

From a quick glance it seems the only clean task missing from CleanCanvases is the call to destroyExpiredFontMetrics, so another fix could be just adding that call to CleanCanvases. I would really love if someone who was involved in the initial caching design could review these two funcs more deeply and figure out if we still need the separation between the two

CleanCanvases is already called in the frame draw code:

fyne/internal/driver/glfw/loop.go

Lines 78 to 98 in 7d81356

    
           func (d *gLDriver) drawSingleFrame() { 
        
           	for _, win := range d.windowList() { 
        
           		w := win.(*window) 
        
           		w.viewLock.RLock() 
        
           		canvas := w.canvas 
        
           		closing := w.closing 
        
           		visible := w.visible 
        
           		w.viewLock.RUnlock() 
        
           		// CheckDirtyAndClear must be checked after visibility, 
        
           		// because when a window becomes visible, it could be 
        
           		// showing old content without a dirty flag set to true. 
        
           		// Do the clear if and only if the window is visible. 
        
           		if closing || !visible || !canvas.CheckDirtyAndClear() { 
        
           			continue 
        
           		} 
        
           		d.repaintWindow(w) 
        
           		refreshingCanvases = append(refreshingCanvases, canvas) 
        
           	} 
        
           	cache.CleanCanvases(refreshingCanvases)

The issue is that CleanCanvases does not run the cache clean for renderers (and maybe other things I haven't checked), but just deletes renderers for canvases that have expired or are refreshed (gathered into deletingObjs):

fyne/internal/cache/base.go

Lines 132 to 145 in 7d81356

    
           renderersLock.Lock() 
        
           for _, dobj := range deletingObjs { 
        
           	wid, ok := dobj.(fyne.Widget) 
        
           	if !ok { 
        
           		continue 
        
           	} 
        
           	rinfo, ok := renderers[wid] 
        
           	if !ok || !rinfo.isExpired(now) { 
        
           		continue 
        
           	} 
        
           	rinfo.renderer.Destroy() 
        
           	overrides.Delete(wid) 
        
           	delete(renderers, wid) 
        
           }

If we want CleanCanvases to clean stale renderers from the cache, it needs to call destroyExpiredRenderers as cache.Clean does:

fyne/internal/cache/base.go

Line 51 in 7d81356

destroyExpiredRenderers(now)

dweymouth · 2024-09-09T15:43:29Z

I really hope @andydotxyz or someone else with deeper knowledge into the initial design of the cache and clean tasks can take another look at this, since I don't understand the design well enough to recommend between merging CleanCanvases and Clean into one func, or adding the missed destroyExpiredRenderers and destroyExpiredFontMetrics to CleanCanvases and continuing to not invoke Clean in the glfw driver.

knusbaum · 2024-09-10T15:02:58Z

In my fork I've removed the renderer cache entirely since it was causing issues, and I can't see, from a technical perspective, what benefit it brings that outweighs the downsides.

Issues included:

Some canvas objects create sub-objects and then discard them regularly, and their renderers get stuck in the cache until they expire. This means we hang onto garbage for quite a while. This isn't usually a big issue, but can inflate the heap for windows with a large number of objects.
In applications with multiple windows, interacting with one window will cause the cache to periodically be cleaned (at least, once this PR goes in), but since the second window is never interacted with, it will not be redrawn, and its renderers will expire. It looks like this should not be an issue and that new renderers will be generated on demand. However, this is not happening and the window just freezes. I didn't bother to debug this.

There are two potential benefits to the render cache that I see, listed here along with counter-points:

Renderers for objects that are not being drawn can be discarded eventually, reducing memory cost
- However: this is a small amount of memory, as renderers are usually not large objects. IMO, it is not worth it, especially since this doesn't appear to actually work today, and when renderers are discarded while objects are still active, things seem to break.
We know exactly when renderers are discarded (i.e. when they are evicted from the cache) so we can run their Destroy methods.
- However: Destroy is really only used in a couple of places, mostly to stop and unregister animations. This can be accomplished with an object finalizer. The current implementation of the cache looks like it has some issues around this, specifically it calls Destroy on the renderers before it removes them from the cache, meaning there is a race, and code can be handed a renderer to use that has already been destroyed. These can be fixed of course, but these kinds of race conditions are easy to miss, and it is probably better to just let the runtime handle it.

In the process of these changes, I also discovered a number of bugs, mostly related to widgets failing to call ExtendBaseWidget, since removing the render cache requires widgets to call this, or else the code fails. I see this as a plus, since it can be easily tested for. Any widget that does not call ExtendBaseWidget when using the base widget will cause a crash, making it difficult to commit code with widgets broken in this way.

If the project is interested in my modifications, I will clean it up and put up another PR for review. Based on my profiling, this improves performance in a number of cases, and probably improves correctness.

dweymouth · 2024-09-10T15:23:27Z

By getting rid of the render cache, do you mean in your fork the widget gets a new renderer created every time it is rendered? I imagine that would bring its own set of memory and performance issues, I think we'd be more interested in fixing the incorrectness with the current render cache than removing it entirely

knusbaum · 2024-09-10T15:34:36Z

No, each widget keeps a reference to its renderer. In practice, this is handled by the base widgets. I added a Renderer() method on the Widget interface, and implemented it in the base widget. The implementation is basically:

func (w *BaseWidget) Renderer() WidgetRenderer {
    if w.renderer == nil {
        w.renderer = w.super().CreateRenderer()
    }
    return w.renderer
}

And then all calls to r := cache.Renderer(widget) turn into r := widget.Renderer().

This, of course, does change the API, and so would probably require a major version bump. But there are ways around that if you want to keep the existing API.

dweymouth · 2024-09-10T15:40:47Z

Ah yes, that would be a breaking public API change, as existing widgets would now have to add this new method to continue to function, but could be an option for Fyne 3.x if we ever have the need to introduce other breaking changes (it is not planned anytime soon). In the meantime, fixing the render cache mechanism is something we will need to pursue. I haven't encountered any issues other than the memory leak you've identified with the current system though.

knusbaum · 2024-09-10T15:42:48Z

When I have more free time to work on this, I'll try to produce a sample application that demonstrates the other issues. The race condition will probably be hard to trigger, but I should be able to trigger the freeze since that was happening very regularly.

dweymouth · 2024-09-10T15:43:51Z

Yes, when you have the time please file an issue report for the freezing you're seeing along with a toy app to demonstrate it - as I don't believe any contributors have encountered anything like that and it's an unknown issue right now

dweymouth

Just commenting to put a block on merging so we don't accidentally merge this and introduce the window-freezing bug @knusbaum described

dweymouth · 2024-09-10T16:59:50Z

@knusbaum Is your change in your fork to remove the renderer cache on a public branch somewhere? I'd be curious to take a look at it (and maybe considering using it in my fork for my app as well if it's a simple diff)

knusbaum · 2024-09-10T19:25:43Z

@dweymouth It's not public at the moment. It's mixed in with a bunch of other changes I've made testing out optimizations. Some have worked and others have not been valuable.

I will pull that change out and put it on a public branch. When it's done I'll let you know here.

knusbaum · 2024-09-10T23:20:31Z

@dweymouth here's my branch with the cache removed, based on the develop branch:
https://github.com/fyne-io/fyne/compare/develop...knusbaum:fyne:remove-renderer-cache?expand=1

I'm not sure everything's fixed, and I haven't updated the tests at all.

knusbaum mentioned this pull request Sep 4, 2024

Memory leak in widget.Table due to constantly creating cells to cache.Renderer(impl) while table.Refresh() #4903

Open

2 tasks

dweymouth marked this pull request as ready for review September 4, 2024 16:24

andydotxyz approved these changes Sep 7, 2024

View reviewed changes

dweymouth mentioned this pull request Sep 10, 2024

[3.0] Consider removing render cache and make widgets maintain a reference to their renderer #5130

Open

2 tasks

dweymouth requested changes Sep 10, 2024

View reviewed changes

knusbaum mentioned this pull request Sep 10, 2024

Render cache cleaning causes freeze in inactive window. #5131

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

internal/driver/glfw: clean the cache periodically. #5112

internal/driver/glfw: clean the cache periodically. #5112

knusbaum commented Sep 4, 2024

coveralls commented Sep 4, 2024

andydotxyz left a comment

dweymouth commented Sep 7, 2024

dweymouth commented Sep 7, 2024

knusbaum commented Sep 9, 2024 •

edited

Loading

dweymouth commented Sep 9, 2024

knusbaum commented Sep 10, 2024

dweymouth commented Sep 10, 2024

knusbaum commented Sep 10, 2024 •

edited

Loading

dweymouth commented Sep 10, 2024 •

edited

Loading

knusbaum commented Sep 10, 2024

dweymouth commented Sep 10, 2024

dweymouth left a comment

dweymouth commented Sep 10, 2024

knusbaum commented Sep 10, 2024

knusbaum commented Sep 10, 2024

	canvasNeedRefresh := c.FreeDirtyTextures() > 0 \|\| c.CheckDirtyAndClear()
	if canvasNeedRefresh {
	newSize := fyne.NewSize(float32(d.currentSize.WidthPx)/c.scale, float32(d.currentSize.HeightPx)/c.scale)

	if c.EnsureMinSize() {
	c.sizeContent(newSize) // force resize of content
	} else { // if screen changed
	w.Resize(newSize)
	}

	d.paintWindow(w, newSize)
	d.app.Publish()
	}
	cache.Clean(canvasNeedRefresh)

internal/driver/glfw: clean the cache periodically. #5112

Are you sure you want to change the base?

internal/driver/glfw: clean the cache periodically. #5112

Conversation

knusbaum commented Sep 4, 2024

Description:

Notes to Reviewers:

Checklist:

coveralls commented Sep 4, 2024

andydotxyz left a comment

Choose a reason for hiding this comment

dweymouth commented Sep 7, 2024

dweymouth commented Sep 7, 2024

knusbaum commented Sep 9, 2024 • edited Loading

dweymouth commented Sep 9, 2024

knusbaum commented Sep 10, 2024

dweymouth commented Sep 10, 2024

knusbaum commented Sep 10, 2024 • edited Loading

dweymouth commented Sep 10, 2024 • edited Loading

knusbaum commented Sep 10, 2024

dweymouth commented Sep 10, 2024

dweymouth left a comment

Choose a reason for hiding this comment

dweymouth commented Sep 10, 2024

knusbaum commented Sep 10, 2024

knusbaum commented Sep 10, 2024

knusbaum commented Sep 9, 2024 •

edited

Loading

knusbaum commented Sep 10, 2024 •

edited

Loading

dweymouth commented Sep 10, 2024 •

edited

Loading