DoF optimisation #8057

kaychang-unity · 2024-04-08T00:55:06Z

Purpose of this PR

The main optimisation is to dynamically select the number of samples to generate the bokeh, depending on the max amount of bokeh around neighbouring pixels.
This makes the performance of the effect scales depending on the number of blur on screen. The performance is improved by 40% for usual DoF setup when using KERNEL_VERY_LARGE option (4 rings, 71 samples).
This requires an extra step to downsample the CoC texture a few mip levels. The cost of the extra step is compensated by the performance gain when generating the bokeh.

The shader code was also further optimised:

moved some calculations outside the loop generating the bokeh samples as uniform, moved some temp variables to half, replaced some divisions by multiplications: 5% improvement.
Manual unrolling of the loop generating the bokeh samples: 10% improvement

Testing status

DoF output is mostly changed.
A slight difference is the blending between foreground out-of-focus pixels into in-focus pixels. The original code uses a solution which generate different results depending on the number of kernel samples. As the number of kernel samples is now dynamic, I implemented a slightly different blending strategy but which gives very close results.

Note that the original solution to decide the blending is flawed, so there was no possible clean fix for it.

TODO: test on many platforms

Comments to reviewers

…ach.

BenGraterUnity · 2024-04-11T06:27:51Z

Thanks for this. Could you please provide some more detail on what testing has been done and on what plaforms?

kaychang-unity · 2024-04-12T02:14:43Z

I'm still experimenting with various optimisations before testing more platforms. Once ready, I will likely test on Android (GLES - Vulkan), iOS (Metal, iPhone 8), macOS, Windows (D3D11, D3D12, Vulkan, GL), PS5, Switch. Unfortunately I don't have access to XR devices, so will do on emulator instead (virtual HMD).

BenGraterUnity · 2024-04-12T07:31:00Z

That sounds great, thank you! 👍

…er).

The original blending formula between DoF texture and source image is flawed but this fix tries to keep the new results as similar as possible.

kaychang-unity added 2 commits April 5, 2024 17:01

First pass DoF optimisation.

5e4faa5

Update to get same result with unified and original brute-force appro…

8a0758f

…ach.

kaychang-unity requested a review from BenGraterUnity as a code owner April 8, 2024 00:55

Added UNITY_NEAR_CLIP_VALUE.

418ad53

Added static tile version.

69f1e7c

kaychang-unity marked this pull request as draft April 12, 2024 02:08

kaychang-unity self-assigned this Apr 12, 2024

kaychang-unity added 4 commits April 15, 2024 23:02

Shader code optimisation (7% faster), manual loop unrolling (15% fast…

d9da953

…er).

Removed tiling shaders, cleaned up code.

8707263

Removed unecessary code.

bafc28d

Renaming.

e44d18b

kaychang-unity changed the title ~~First pass DoF optimisation.~~ DoF optimisation. Apr 19, 2024

kaychang-unity changed the title ~~DoF optimisation.~~ DoF optimisation Apr 19, 2024

kaychang-unity added 5 commits April 18, 2024 21:50

More clean up.

1588873

Fix for artifact.

b008456

The original blending formula between DoF texture and source image is flawed but this fix tries to keep the new results as similar as possible.

Small improvement.

b3c0689

Clamp max kernel size.

6e51207

Added GatherRed support.

79e0a2f

kaychang-unity requested a review from sebastienlagarde May 27, 2024 23:40

sebastienlagarde approved these changes May 28, 2024

View reviewed changes

memphis88 force-pushed the master branch from 7a67a14 to 74326b1 Compare August 13, 2024 18:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DoF optimisation #8057

DoF optimisation #8057

kaychang-unity commented Apr 8, 2024 •

edited

Loading

BenGraterUnity commented Apr 11, 2024

kaychang-unity commented Apr 12, 2024

BenGraterUnity commented Apr 12, 2024

DoF optimisation #8057

Are you sure you want to change the base?

DoF optimisation #8057

Conversation

kaychang-unity commented Apr 8, 2024 • edited Loading

Purpose of this PR

Testing status

Comments to reviewers

BenGraterUnity commented Apr 11, 2024

kaychang-unity commented Apr 12, 2024

BenGraterUnity commented Apr 12, 2024

kaychang-unity commented Apr 8, 2024 •

edited

Loading