v3d backports from 6.13 #6620

6by9 · 2025-01-21T16:40:11Z

@roliver-rpi for testing.
I have compile tested it, but not booted it.

popcornmix · 2025-01-22T11:11:24Z

Need to confirm this does not introduce the NULL job issue in 6.13 reported here: #6624

pelwell · 2025-01-22T11:20:37Z

Either way, the result would be a useful datapoint.

roliver-rpi · 2025-01-27T11:30:13Z

This PR works to gather v3d gpu counters using perfetto. Also, it doesn't appear to introduce the issue noted in #6624 (no freezing up playing video in both firefox / chromium).

pelwell

Approval in principal (after rebasing...)

Commit be431df upstream Create a function `drm_gem_shmem_create_with_mnt()`, similar to `drm_gem_shmem_create()`, that has a mountpoint as a argument. This function will create a shmem GEM object in a given tmpfs mountpoint. This function will be useful for drivers that have a special mountpoint with flags enabled. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Tvrtko Ursulin <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 0992b25 upstream For some applications, such as applications that uses huge pages, we might want to have a different mountpoint, for which we pass mount flags that better match our usecase. Therefore, create a new function `drm_gem_object_init_with_mnt()` that allow us to define the tmpfs mountpoint where the GEM object will be created. If this parameter is NULL, then we fallback to `shmem_file_setup()`. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Tvrtko Ursulin <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit f2a4bcb usptream Replace the open-coded v3d_perfmon_find() with the real thing. Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Maíra Canal <[email protected]> Signed-off-by: Maíra Canal <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 56cf76e upstream If the scheduler initialization fails, GEM initialization must fail as well. Therefore, if `v3d_sched_init()` fails, free the DMA memory allocated and return the error value in `v3d_gem_init()`. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit eb8d395 upstream Create a separate "tmpfs" kernel mount for V3D. This will allow us to move away from the shmemfs `shm_mnt` and gives the flexibility to do things like set our own mount options. Here, the interest is to use "huge=", which should allow us to enable the use of THP for our shmem-backed objects. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 8dd6074 upstream Currently, we are using an alignment of 128 kB to insert a node, which ends up wasting memory as we perform plenty of small BOs allocations (<= 4 kB). We require that allocations are aligned to 128Kb so for any allocation smaller than that, we are wasting the difference. This implies that we cannot effectively use the whole 4 GB address space available for the GPU in the RPi 4. Currently, we can allocate up to 32000 BOs of 4 kB (~140 MB) and 3000 BOs of 400 kB (~1,3 GB). This can be quite limiting for applications that have a high memory requirement, such as vkoverhead [1]. By reducing the page alignment to 4 kB, we can allocate up to 1000000 BOs of 4 kB (~4 GB) and 10000 BOs of 400 kB (~4 GB). Moreover, by performing benchmarks, we were able to attest that reducing the page alignment to 4 kB can provide a general performance improvement in OpenGL applications (e.g. glmark2). Therefore, this patch reduces the alignment of the node allocation to 4 kB, which will allow RPi users to explore the whole 4GB virtual address space provided by the hardware. Also, this patch allow users to fully run vkoverhead in the RPi 4/5, solving the issue reported in [1]. [1] zmike/vkoverhead#14 Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit e4c1772 upstream The V3D MMU also supports 64KB and 1MB pages, called big and super pages, respectively. In order to set a 64KB page or 1MB page in the MMU, we need to make sure that page table entries for all 4KB pages within a big/super page must be correctly configured. In order to create a big/super page, we need a contiguous memory region. That's why we use a separate mountpoint with THP enabled. In order to place the page table entries in the MMU, we iterate over the 16 4KB pages (for big pages) or 256 4KB pages (for super pages) and insert the PTE. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Tvrtko Ursulin <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 20d69e8 upstream Although Big/Super Pages could appear naturally, it would be quite hard to have 1MB or 64KB allocated contiguously naturally. Therefore, we can force the creation of large pages allocated contiguously by using a mountpoint with "huge=within_size" enabled. Therefore, as V3D has a mountpoint with "huge=within_size" (if user has THP enabled), use this mountpoint for BO creation if available. This will allow us to create large pages allocated contiguously and make use of Big/Super Pages. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Tvrtko Ursulin <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 0df4a13 upstream Add a modparam for turning off Big/Super Pages to make sure that if an user doesn't want Big/Super Pages enabled, it can disabled it by setting the modparam to false. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Tvrtko Ursulin <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 9f8e1c9 upstream Add a new V3D parameter to expose the support of Super Pages to userspace. The userspace might want to know this information to apply optimizations that are specific to kernels with Super Pages enabled. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit d28292a upstream Function drm_gem_shmem_create_with_mnt() creates an object without using the mountpoint if gemfs is NULL. Drop the else branch calling drm_gem_shmem_create(). Signed-off-by: Matthias Brugger <[email protected]> Signed-off-by: Maíra Canal <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit e987e22 upstream When the new register addresses were introduced for V3D 7.x, we added new masks for performance counter sources on V3D 7.x. Nevertheless, we never apply these new masks when setting the sources. Fix the performance counter source settings on V3D 7.x by introducing a new macro, `V3D_SET_FIELD_VER`, which allows fields setting to vary by version. Using this macro, we can provide different values for source mask based on the V3D version, ensuring that sources are correctly configure on V3D 7.x. Fixes: 0ad5bc1 ("drm/v3d: fix up register addresses for V3D 7.x") Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Iago Toral Quiroga <[email protected]> Reviewed-by: Christian Gmeiner <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 21f1435 upstream If the active performance monitor (`v3d->active_perfmon`) is being destroyed, stop it first. Currently, the active perfmon is not stopped during destruction, leaving the `v3d->active_perfmon` pointer stale. This can lead to undefined behavior and instability. This patch ensures that the active perfmon is stopped before being destroyed, aligning with the behavior introduced in commit 7d1fd36 ("drm/v3d: Stop the active perfmon before being destroyed"). Cc: [email protected] # v5.15+ Fixes: 26a4dc2 ("drm/v3d: Expose performance counters to userspace") Signed-off-by: Christian Gmeiner <[email protected]> Signed-off-by: Maíra Canal <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit c6eabba upstream Add a new ioctl, DRM_IOCTL_V3D_PERFMON_SET_GLOBAL, to allow configuration of a global performance monitor (perfmon). Use the global perfmon for all jobs to ensure consistent performance tracking across submissions. This feature is needed to implement a Perfetto datasources in user-space. Signed-off-by: Christian Gmeiner <[email protected]> Reviewed-by: Maíra Canal <[email protected]> Signed-off-by: Maíra Canal <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit 4ee06e3 upstream. This commit fixes several miscellaneous documentation errors. Mostly, delete/update comments that are outdated or are leftovers from past code changes. Apart from that, remove double-spaces in several comments. Signed-off-by: Maíra Canal <[email protected]> Acked-by: Iago Toral Quiroga <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

Commit dc4afc0 upstream. CPU jobs, like Cache Clean jobs, execute synchronously once the DRM scheduler starts running them. Consequently, a global `v3d->cpu_job` variable is unnecessary, as everything is managed within the `v3d_cpu_job_run()` function. This commit removes the `v3d->cpu_job` pointer, as it is not needed. Signed-off-by: Maíra Canal <[email protected]> Reviewed-by: Jose Maria Casanova Crespo <[email protected]> Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]

6by9 · 2025-01-27T13:13:19Z

Branch rebased.
I haven't compile tested it as the rebase was clean, so please let CI complete.

This PR doesn't include e4b5ccd drm/v3d: Ensure job pointer is set to NULL after job completion, so doesn't need the fix discussed in #6624.
e4b5ccd was merged to drm-misc-fixes as a fix of 14d1d19 (from Jun 2018) . It hasn't made it to drm-misc-next which is where I cherry-picked patches from. We can pick up both here, but they should come back to stable releases automatically anyway.

6by9 · 2025-01-27T17:39:33Z

Oops, forgot this had the draft tag. Now removed.

kernel: v3d backports from 6.13 See: raspberrypi/linux#6620

popcornmix force-pushed the rpi-6.12.y branch from f42ab21 to 8007665 Compare January 24, 2025 15:39

pelwell approved these changes Jan 27, 2025

View reviewed changes

mairacanal and others added 16 commits January 27, 2025 12:56

6by9 force-pushed the rpi-6.12.y-v3d branch from 57227b5 to a740b53 Compare January 27, 2025 13:11

6by9 marked this pull request as ready for review January 27, 2025 17:39

popcornmix merged commit 3852486 into raspberrypi:rpi-6.12.y Jan 27, 2025
11 of 12 checks passed

popcornmix added a commit to raspberrypi/firmware that referenced this pull request Jan 27, 2025

kernel: Bump to 6.12.11

5eaa424

kernel: v3d backports from 6.13 See: raspberrypi/linux#6620

popcornmix added a commit to raspberrypi/rpi-firmware that referenced this pull request Jan 27, 2025

kernel: Bump to 6.12.11

9ac1505

kernel: v3d backports from 6.13 See: raspberrypi/linux#6620

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

v3d backports from 6.13 #6620

v3d backports from 6.13 #6620

Uh oh!

6by9 commented Jan 21, 2025

Uh oh!

popcornmix commented Jan 22, 2025

Uh oh!

pelwell commented Jan 22, 2025

Uh oh!

roliver-rpi commented Jan 27, 2025

Uh oh!

pelwell left a comment

Uh oh!

6by9 commented Jan 27, 2025

Uh oh!

6by9 commented Jan 27, 2025

Uh oh!

Uh oh!

Uh oh!

v3d backports from 6.13 #6620

v3d backports from 6.13 #6620

Uh oh!

Conversation

6by9 commented Jan 21, 2025

Uh oh!

popcornmix commented Jan 22, 2025

Uh oh!

pelwell commented Jan 22, 2025

Uh oh!

roliver-rpi commented Jan 27, 2025

Uh oh!

pelwell left a comment

Choose a reason for hiding this comment

Uh oh!

6by9 commented Jan 27, 2025

Uh oh!

6by9 commented Jan 27, 2025

Uh oh!

Uh oh!

Uh oh!