Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add document for info schema cache #19018

Merged
merged 6 commits into from
Oct 16, 2024
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions TOC.md
Original file line number Diff line number Diff line change
Expand Up @@ -1020,6 +1020,7 @@
- [`schema_unused_indexes`](/sys-schema/sys-schema-unused-indexes.md)
- [Metadata Lock](/metadata-lock.md)
- [TiDB Accelerated Table Creation](/accelerated-table-creation.md)
- [schema cache](/schema-cache.md)
hfxsd marked this conversation as resolved.
Show resolved Hide resolved
- UI
- TiDB Dashboard
- [Overview](/dashboard/dashboard-intro.md)
Expand Down
38 changes: 38 additions & 0 deletions schema-cache.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,38 @@
---
title: schema cache
hfxsd marked this conversation as resolved.
Show resolved Hide resolved
aliases: ['/docs-cn/dev/information-schema-cache']
lilin90 marked this conversation as resolved.
Show resolved Hide resolved
summary: TiDB adopts an LRU-based caching mechanism for schema information, which significantly reduces memory usage and improves performance in scenarios with a large number of databases and tables.
hfxsd marked this conversation as resolved.
Show resolved Hide resolved
---

# schema cache
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

In some multi-tenant scenarios, there may be tens of thousands or even millions of databases and tables. Loading all the schema information of these databases and tables into memory can consume a large amount of memory and degrade access performance. To address this issue, TiDB introduces a schema caching mechanism similar to LRU. Only the schema information of the most recently accessed databases and tables is cached in memory.
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

> **Warning:**
>
> This feature is currently an experimental feature and it is not recommended to use in a production environment. This feature might change or be removed without prior notice. If you find a bug, please give feedback by raising an [issue](https://github.com/pingcap/tidb/issues) on GitHub.
lilin90 marked this conversation as resolved.
Show resolved Hide resolved

## Configuration
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

You can enable the schema caching feature by configuring the system variable [`tidb_schema_cache_size`](/system-variables.md#tidb_schema_cache_size-new-in-v800)
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

## Best Practices
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

- In scenarios with a large number of databases and tables (e.g., over 100,000 databases and tables) or when the number of databases and tables is large enough to impact system performance, it is recommended to enable the schema caching feature.
- You can monitor the hit rate of the schema cache by observing the subpanel "Infoschema v2 Cache Operation" under the "Schema load" section in TiDB monitoring. If the hit rate is low, you can increase the value of [`tidb_schema_cache_size`](/system-variables.md#tidb_schema_cache_size-new-in-v800.
- You can monitor the current size of the schema cache being used by observing the subpanel "Infoschema v2 Cache Size" under the "Schema load" section in TiDB monitoring.
- It is recommended to disable [`performance.force-init-stats`](/tidb-configuration-file.md#force-init-stats-new-in-v657-and-v710) to reduce TiDB startup time.
- If you need to create a large number of tables (e.g., over 100,000 tables), it is recommended to set this parameter to false [`split-table`](/tidb-configuration-file.md#split-table) to reduce the number of regions and thus decrease TiKV's memory usage.
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

## Known Limitations

In scenarios with a large number of databases and tables, the following known issues exist:
- When the tables that need to be accessed are irregularly accessed, such as one set of tables accessed by t1 and another set accessed by t2, and the tidb_schema_cache_size setting is small, the schema information may be frequently evicted and cached, leading to performance fluctuations. This feature is more suitable for scenarios where frequently accessed databases and tables are relatively fixed.
- Statistics information may not be collected in a timely manner.
- Access to some metadata information may become slower.
- Switching the schema cache on or off requires a waiting period.
- Operations that involve enumerating all metadata information may become slower, such as:
hfxsd marked this conversation as resolved.
Show resolved Hide resolved

- `SHOW FULL TABLES`
- `FLASHBACK`
- `ALTER TABLE ... SET TIFLASH MODE ...`
Loading