
vulkan: Replace fence with semaphore when acquiring surfaces #4967

Merged (2 commits into gfx-rs:trunk, Jan 21, 2024)

Conversation

@udoprog (Contributor) commented Jan 3, 2024

Connections
A different way of solving #4946, using internal wait semaphores.

Related: #4689, #4775, #4919

Description
This removes the use of a fence in favor of internally using and keeping track of one wait semaphore per swapchain image.

It roughly uses the approach initially attempted at https://github.com/cwfitzgerald/wgpu/tree/vulkan-timing-fixes.

I've defined a generic set in A::SubmitSurfaceTextureSet which can be used to collect whatever information the backend needs from a surface texture; for backends that do not use it, this is defined as a dummy implementation.

Multiple calls to get_current_texture

There is a tricky situation which arises if a user were to perform multiple calls to get_current_texture. The current implementation naively just rotates surface_semaphores. So if we call this function images.len() + 1 times in a row, we will exhaust the semaphores and they will be assigned to a random set of swapchain images by the presentation engine.

This wasn't a problem in #4946, since the user is responsible for ensuring that the number of pending get_current_texture calls matches the number of semaphores they have (e.g. exactly one per image), but here we have to deal with this scenario somehow.

I think the "correct" way to deal with this is to somehow free the semaphore after a successful call to acquire_next_image has returned a previously used index again. But I'm looking for input.
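To make the exhaustion scenario above concrete, here is a minimal, self-contained sketch (not the actual wgpu code; `Semaphore`, `Swapchain`, and `acquire_semaphore` are hypothetical stand-ins for the real types) of what naively rotating `surface_semaphores` does: after `images.len()` pending acquires without a present in between, the rotation wraps around and hands out a semaphore that may still be associated with another image by the presentation engine.

```rust
// Hypothetical model of the naive rotation described above.
// `Semaphore` stands in for a VkSemaphore handle; no Vulkan calls are made.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Semaphore(usize);

struct Swapchain {
    /// One acquire semaphore per swapchain image.
    surface_semaphores: Vec<Semaphore>,
    /// Index of the next semaphore to hand out; rotated on each acquire.
    next_semaphore: usize,
}

impl Swapchain {
    fn new(image_count: usize) -> Self {
        Self {
            surface_semaphores: (0..image_count).map(Semaphore).collect(),
            next_semaphore: 0,
        }
    }

    /// Naive rotation: the (image_count + 1)-th pending acquire reuses
    /// the first semaphore, even though the presentation engine may still
    /// associate it with a different swapchain image.
    fn acquire_semaphore(&mut self) -> Semaphore {
        let sem = self.surface_semaphores[self.next_semaphore];
        self.next_semaphore = (self.next_semaphore + 1) % self.surface_semaphores.len();
        sem
    }
}

fn main() {
    let mut sc = Swapchain::new(3);
    let first = sc.acquire_semaphore();
    sc.acquire_semaphore();
    sc.acquire_semaphore();
    // A fourth acquire without any present in between wraps around and
    // reuses the first semaphore -- the exhaustion scenario discussed here.
    let fourth = sc.acquire_semaphore();
    assert_eq!(first, fourth);
    println!("semaphore reused after image_count acquires: {:?}", fourth);
}
```

The sketch only models the bookkeeping; the real fix has to interact with vkAcquireNextImageKHR, which is why freeing a semaphore once its image index is returned again is the direction suggested above.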

Testing
Using the vulkan backend for a project I'm working on and dealing with any issues that show up.

Checklist

  • Run cargo fmt.
  • Run cargo clippy.
  • Run cargo xtask test to run tests (still doesn't work).
  • Add change to CHANGELOG.md. See simple instructions inside file.

@ids1024 (Contributor) commented Jan 3, 2024

I think this should also potentially perform better in some circumstances, by not unnecessarily blocking the CPU on a fence and instead just marking this as a dependency of the queue submission. Right? So this seems better in general.

@udoprog (Contributor, Author) commented Jan 4, 2024

Looks like a legit validation failure; I'll try to suss it out before taking this out of draft:

[2024-01-04T08:20:08Z ERROR wgpu_test::expectations] Unexpected failure due to: ValidationError(Some("Validation Error: [ VUID-VkSubmitInfo-pSignalSemaphores-03242 ] Object 0: handle = 0xf56c9b0000000004, type = VK_OBJECT_TYPE_SEMAPHORE; Object 1: handle = 0x7f09e27c8fd0, type = VK_OBJECT_TYPE_QUEUE; | MessageID = 0xdb30ee87 | vkQueueSubmit(): pSubmits[0].pSignalSemaphores[0] signal value (0xffffffffffffffff) in VkQueue 0x7f09e27c8fd0[] must be greater than pending signal timeline semaphore VkSemaphore 0xf56c9b0000000004[] value (0xffffffffffffffff). The Vulkan spec states: For each element of pSignalSemaphores created with a VkSemaphoreType of VK_SEMAPHORE_TYPE_TIMELINE the corresponding element of VkTimelineSemaphoreSubmitInfo::pSignalSemaphoreValues must have a value greater than the current value of the semaphore when the semaphore signal operation is executed (https://vulkan.lunarg.com/doc/view/1.3.268.0/linux/1.3-extensions/vkspec.html#VUID-VkSubmitInfo-pSignalSemaphores-03242)"))

Seems like I wasn't careful when refactoring the timeline semaphores.

@udoprog udoprog marked this pull request as ready for review January 4, 2024 15:07
@valaphee (Contributor) commented Jan 4, 2024

Doesn't fix #4919, but the water example runs ~600 fps faster (previously ~1500 fps, now ~2100 fps).

@udoprog (Contributor, Author) commented Jan 4, 2024

It's unfortunate that it doesn't address your problem, but the fps boost is neat. Thanks for trying it out! Personally I can't compare, since unpatched wgpu with the fence doesn't work at all for me right now.

@cwfitzgerald (Member) commented

@udoprog Haven't had a chance to fully look over the code, but going to send it out to a few people for testing in various environments - could you just resolve the merge conflicts with trunk?

@udoprog (Contributor, Author) commented Jan 9, 2024

@cwfitzgerald All right, that should be it. Tell me if there's anything else.

Resolved review threads (now outdated): wgpu-hal/src/vulkan/mod.rs, wgpu-core/src/device/queue.rs
@cwfitzgerald (Member) commented
Hey! Sorry, I forgot to follow up on this - I sent it out to a bunch of people and things look good! Everything seems to work as expected on all platforms, with either equivalent or significantly higher performance.

Going to do another proper review in the next day or so and we'll get this in!

@udoprog udoprog force-pushed the vulkan-wait-semaphores branch 2 times, most recently from fd24c7b to 57a95d0 on January 17, 2024 00:46
@cwfitzgerald cwfitzgerald added the "PR: needs back-porting" label (a fix that needs to land on crates) Jan 21, 2024
@cwfitzgerald (Member) left a review:


Looks a lot better and performs well based on testing, let's get this out there!

@cwfitzgerald cwfitzgerald merged commit e5c62fb into gfx-rs:trunk Jan 21, 2024
27 checks passed
@cwfitzgerald cwfitzgerald removed the "PR: needs back-porting" label Jan 21, 2024
@Friz64 Friz64 mentioned this pull request Jan 22, 2024
Friz64 pushed a commit to Friz64/wgpu that referenced this pull request Apr 24, 2024
@emilk (Contributor) commented Sep 9, 2024

@EmbersArc reports that this PR causes 100% CPU usage.

@cwfitzgerald (Member) commented

This got entirely rewritten in the 0.20.1 update (wgpu-hal 0.21), so it's unlikely to be this PR.

Replying upstream

5 participants