
storage controller: don't hold detached tenants in memory #10264

Open · wants to merge 10 commits into main
Conversation

@jcsp jcsp commented Jan 3, 2025

Problem

Typical Neon deployments have some tenants that stay in use continuously, plus a background churning population of tenants that are created, fall idle, and are then configured into the Detached state. Currently, this churn of short-lived tenants results in an ever-increasing memory footprint in the storage controller.

Closes: #9712

Summary of changes

  • At startup, only load shards that do not have the Detached placement policy
  • In process_result, check whether all of a tenant's shards are Detached with an empty observed state (observed == {}); if so, drop them from memory (see the sketch after this list)
  • In tenant_location_conf and the other tenant mutators, load the tenant's shards on demand if they are not already in memory
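
The heart of the change is the drop-from-memory check in the second bullet. Below is a minimal, self-contained sketch of that logic, using simplified stand-in types rather than the controller's real ones (the actual TenantShard, ObservedState, and shard-id types carry much more state):

use std::collections::{BTreeMap, HashMap};

// Simplified stand-ins for the controller's types.
#[derive(PartialEq)]
enum PlacementPolicy {
    Attached(usize),
    Detached,
}

#[derive(Default)]
struct ObservedState {
    // pageserver node id -> last observed location config (stand-in type)
    locations: HashMap<u64, String>,
}

struct TenantShard {
    tenant_id: String,
    policy: PlacementPolicy,
    observed: ObservedState,
}

/// Drop a tenant's shards from the in-memory map once every shard is Detached
/// and has an empty observed state, i.e. nothing is attached anywhere and
/// there is nothing left to reconcile.
fn maybe_drop_tenant(tenant_id: &str, shards: &mut BTreeMap<(String, u8), TenantShard>) {
    let droppable = shards
        .values()
        .filter(|s| s.tenant_id == tenant_id)
        .all(|s| s.policy == PlacementPolicy::Detached && s.observed.locations.is_empty());
    if droppable {
        shards.retain(|_, s| s.tenant_id != tenant_id);
    }
}

A tenant only leaves memory once every one of its shards is droppable; any shard that is still attached, or that still has observed locations left to reconcile, keeps the whole tenant resident.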

@jcsp jcsp added the t/feature (Issue type: feature, for new features or requests) and c/storage/controller (Component: Storage Controller) labels on Jan 3, 2025

github-actions bot commented Jan 3, 2025

7238 tests run: 6867 passed, 11 failed, 360 skipped (full report)


Failures on Postgres 17

Failures on Postgres 16

Failures on Postgres 15

Failures on Postgres 14

# Run all failed tests locally:
scripts/pytest -vv -n $(nproc) -k "test_tenant_s3_restore[release-pg14] or test_tenant_s3_restore[release-pg15] or test_tenant_s3_restore[release-pg16] or test_tenant_s3_restore[release-pg17] or test_tenant_s3_restore[debug-pg17]"
Flaky tests (3)

Postgres 17

Postgres 14

  • test_physical_replication_config_mismatch_too_many_known_xids: release-arm64

Test coverage report is not available

The comment gets automatically updated with the latest test results
dd47681 at 2025-01-06T19:00:20.682Z :recycle:

@jcsp jcsp marked this pull request as ready for review January 3, 2025 13:06
@jcsp jcsp requested a review from a team as a code owner January 3, 2025 13:06
@jcsp jcsp requested review from arpad-m and VladLazar January 3, 2025 13:06
@@ -330,11 +330,20 @@ impl Persistence {

/// At startup, load the high level state for shards, such as their config + policy. This will
/// be enriched at runtime with state discovered on pageservers.
///
/// We exclude shards configured to be detached. During startup, if we see any attached locations
/// for such shards, they will automatically be detached as 'orphans'.
pub(crate) async fn list_tenant_shards(&self) -> DatabaseResult<Vec<TenantShardPersistence>> {
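
For context, a hedged sketch of the startup filtering that the new doc comment describes: shards whose placement policy is Detached are simply not loaded. The row type and the string-encoded policy below are simplified stand-ins, and whether the real filter lives in the SQL query or in Rust after loading is not shown here:

// Hypothetical, simplified persistence row; the real TenantShardPersistence
// carries much more state (generations, config, splitting flags, ...).
struct TenantShardPersistence {
    tenant_id: String,
    shard_number: u8,
    // Stand-in for the stored placement policy column.
    placement_policy: String,
}

// Keep only shards that are not configured as Detached: these are the ones
// worth holding in the storage controller's memory at startup.
fn filter_out_detached(rows: Vec<TenantShardPersistence>) -> Vec<TenantShardPersistence> {
    rows.into_iter()
        .filter(|row| row.placement_policy != "Detached")
        .collect()
}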
Contributor

nit: rename to load_active_tenant_shards?

storage_controller/src/service.rs (resolved review thread)
Comment on lines 1166 to 1168
if tenant.policy == PlacementPolicy::Detached {
    self.maybe_drop_tenant(tenant.tenant_shard_id.tenant_id, &mut locked);
}
Contributor

This is a bit scary. Since the service state sits behind a sync lock, we follow this pattern:

  1. acquire lock and get some in-mem state
  2. do something async
  3. acquire lock again and update something

If step (3) doesn't expect the removal, then we run into trouble. I couldn't find any place with problematic expect or unwrap calls. Generally, this should be pretty safe since we wait on the reconcile spawned by the detach in tenant_location_config, and that holds the tenant exclusive lock, but we might run into issues if detaches end up taking a long time.
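
To make the concern concrete, here is a small invented example of that three-step pattern using a std sync lock and hypothetical names; the point is that step (3) must tolerate the tenant having been dropped between the two lock acquisitions rather than unwrapping the lookup:

use std::collections::HashMap;
use std::sync::RwLock;

struct ServiceState {
    // tenant id -> some in-memory state (stand-in)
    tenants: HashMap<String, u32>,
}

fn update_after_async_work(state: &RwLock<ServiceState>, tenant_id: &str) {
    // 1. acquire lock and read some in-mem state
    let seen = state.read().unwrap().tenants.contains_key(tenant_id);

    // 2. do something async (lock is released here); a concurrent
    //    process_result may drop the tenant from memory in the meantime

    // 3. acquire lock again and update, without assuming the entry survived
    if seen {
        let mut locked = state.write().unwrap();
        match locked.tenants.get_mut(tenant_id) {
            Some(entry) => *entry += 1,
            None => {
                // Tenant was dropped between steps: bail out instead of expect()/unwrap().
            }
        }
    }
}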

Contributor

This comment isn't really actionable, but I'm curious about your thoughts on this.

Collaborator Author (@jcsp, Jan 6, 2025)

In request handlers, the tenant_op_locks for the tenant should prevent any shenanigans like this.

However, you make an excellent point in this particular context: process_result does not hold that lock. If a request handler took the lock, observed that the tenant exists, and then raced with the processing of a reconciler completion, that could violate the assumption that, while holding the lock, a tenant that is in memory stays in memory.

I think the neatest solution is to get an exclusive lock around this, and to make our maybe_load and maybe_drop functions take refs to lock handles to prove the callers aren't using them outside the lock. Let's try that...
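
A rough sketch of that idea with invented stand-in types: requiring callers to pass a borrow of the per-tenant exclusive lock guard makes the compiler enforce that the lock is held across the call:

use std::collections::HashMap;

// Hypothetical stand-in for the guard returned by the per-tenant
// exclusive operation lock (tenant_op_locks in the real code).
struct TenantOperationGuard;

struct Service {
    tenants: HashMap<String, u32>,
}

impl Service {
    // The `_guard` parameter is the proof-of-lock: this function cannot be
    // called without a live exclusive guard for the tenant.
    fn maybe_drop_tenant(&mut self, tenant_id: &str, _guard: &TenantOperationGuard) {
        self.tenants.remove(tenant_id);
    }

    fn maybe_load_tenant(&mut self, tenant_id: &str, _guard: &TenantOperationGuard) {
        // Load the tenant's shards on demand if absent (database access elided).
        self.tenants.entry(tenant_id.to_string()).or_insert(0);
    }
}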

Collaborator Author

Tightened up the use of op locks in 61bf182

In the process I made the interesting observation that tenant_location_conf holds the lock much longer than it needs to: really it should drop it before waiting for reconcilers. I don't want to make that change inline here in case it has any spooky side effects.

@jcsp jcsp requested a review from VladLazar January 6, 2025 18:22

jcsp commented Jan 6, 2025

Let's get this merged and give it a week in staging -- there's a lot of detach/attach churn there, so we should have an excellent chance of spotting any unforeseen issues.

Development

Successfully merging this pull request may close these issues.

storage controller: don't hold detached projects in memory
2 participants