indexer: convert IndexerRead to using async connections #19213

bmwill · 2024-09-04T20:11:02Z

Description

Describe the changes or additions included in this PR.

Test plan

How did you test the new or updated feature?

Release notes

Check each box that your changes affect. If none of the boxes relate to your changes, release notes aren't required.

For each box you select, include information after the relevant heading that describes the impact of your changes that a user might notice and any actions they must take to implement updates.

vercel · 2024-09-04T20:11:13Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
sui-docs	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Sep 5, 2024 3:20pm

3 Skipped Deployments

Name	Status	Preview	Updated (UTC)
multisig-toolkit	⬜️ Ignored (Inspect)	Visit Preview	Sep 5, 2024 3:20pm
sui-kiosk	⬜️ Ignored (Inspect)	Visit Preview	Sep 5, 2024 3:20pm
sui-typescript-docs	⬜️ Ignored (Inspect)	Visit Preview	Sep 5, 2024 3:20pm

…e_numbers

wlmyng

iiuc, by using AsyncConnection, use diesel_async::RunQueryDsl, we can get rid of a lot of the existing code around handling blocking connections? pretty rad

to play well with the ObjectStore trait though we need to make the connection blocking

wlmyng · 2024-09-05T16:02:22Z

crates/sui-indexer/src/indexer_reader.rs

        &self,
        filter: EventFilter,
        cursor: Option<EventID>,
        limit: usize,
        descending_order: bool,
    ) -> IndexerResult<Vec<SuiEvent>> {
-        let pool = self.get_pool();
+        use diesel_async::RunQueryDsl;


how come we cant just import this once at the top of the file?

Because the two traits conflict with each other so i did this to make it easier to convert things

wlmyng · 2024-09-05T17:00:06Z

crates/sui-indexer/src/indexer_reader.rs

+        let object_store = ConnectionAsObjectStore::from_pool(&self.pool)
+            .await
+            .map_err(|e| IndexerError::PgPoolConnectionError(e.to_string()))?;
+
+        let system_state = tokio::task::spawn_blocking(move || {
+            sui_types::sui_system_state::get_sui_system_state(&object_store)


can we use self.get_pool (the blocking pool) here?

no, we're trying to eliminate the blocking pool entirely

wlmyng · 2024-09-05T17:01:52Z

crates/sui-indexer/src/indexer_reader.rs

+
+        let stored_epoch = epochs::table
+            .into_boxed()
+            .pipe(|query| {


TIL about pipe for keeping the method chaining style, pretty dope

wlmyng · 2024-09-05T17:07:06Z

crates/sui-indexer/src/indexer_reader.rs

+    pub fn get_pool(&self) -> BlockingConnectionPool {
+        self.blocking_pool.clone()
    }
 }


is this actually still used anywhere? seems like we just do self.pool.get().await now?

its used in graphql to get connections, in a follow up PR i'll work to convert graphql codebase which will then let us remove the blocking pool entirely

bmwill · 2024-09-05T17:38:47Z

to play well with the ObjectStore trait though we need to make the connection blocking

Yeah so to handle that case we take the async connection and wrap it in a sync layer (that uses the async connection under the hood) which lets us still remove a dependency on the sync connection pool

wlmyng

🚢

gegaowp

great move overall, with a comment regarding singleton object read perf.

gegaowp · 2024-09-05T18:01:16Z

crates/sui-indexer/src/indexer_reader.rs

+        let mut connection = self.pool.get().await?;
+
+        let object = match objects::table
+            .filter(objects::object_type.eq(type_.to_canonical_string(/* with_prefix */ true)))


object_type does not have its own index, this query will be a full-scan and very slow, shall we keep the logic of get_single_obj_id_from_package_publish & package_obj_type_cache?

Graphql does have the ability to do a singleton lookup based on type. I was under the impression we did have sufficient indexes for this because this is essentially how graphql does this

Note that GraphQL makes this query on objects_history which does have an index on just type:

sui/crates/sui-indexer/migrations/pg/2023-08-19-044023_objects/up.sql

Line 72 in 1ebf5f8

CREATE INDEX objects_history_type ON objects_history (checkpoint_sequence_number, object_type);

I think @gegaowp is right that today, this query will result in a full table scan. objects does have an index that you can leverage:

sui/crates/sui-indexer/migrations/pg/2023-08-19-044023_objects/up.sql

Line 40 in 1ebf5f8

CREATE INDEX objects_package_module_name_full_type ON objects (object_type_package, object_type_module, object_type_name, object_type);

But it needs additional filters -- I'll put up a PR for this. It was created this way so that the same index could be used to support filtering by just the type's package, module, name and then the full type.

Fixed in #19247

vercel bot deployed to Preview – sui-docs September 4, 2024 20:15 View deployment

bmwill added 5 commits September 4, 2024 15:51

indexer: perform database reset via async connection

5d13eef

indexer-reader: instantiate async connection pool

ca4df48

indexer-writer: instantiate async connection pool

a5f1f7b

indexer: use async connection for package resolver

be5bdad

indexer: use async connection for get_checkpoint

9b6fcb2

bmwill force-pushed the indexer-async branch from d90587e to 9b6fcb2 Compare September 4, 2024 20:51

vercel bot deployed to Preview – sui-docs September 4, 2024 20:55 View deployment

bmwill added 8 commits September 4, 2024 16:03

indexer: use async connection for get_epoch_info

9b6c745

indexer: remove unused get_consistent_read_range method

4b9cf7b

indexer: use async connection for get_epochs

2d1b39b

indexer: use async connection for get_latest_checkpoint

ff5e42e

indexer: use async connection for get_checkpoints

abe34e6

indexer: use async connection for get_coin_metadata and get_total_supply

57816bd

indexer: use async connection for get_display_object_by_type

20f9b7a

indexer: use async connection for get_owned_coins

562456c

vercel bot deployed to Preview – sui-docs September 5, 2024 00:36 View deployment

bmwill added 12 commits September 4, 2024 19:37

indexer: use async connection for get_coin_balances

c94d6da

indexer: use async connection for get_object_refs

322e622

indexer: use async connection for get_dynamic_fields

5fcd86c

indexer: use async connection for multi_get_transactions

5df37ff

indexer: use async connection for multi_get_transactions_with_sequenc…

47cf19b

…e_numbers

indexer: use async connection for get_transaction_events

1b55d06

indexer: use async connection for query_events

9bdfb17

indexer: use async connection for query_transaction_blocks

fbd5db3

indexer: use async connection for multi_get_objects

788647d

indexer: use async connection for get_owned_objects

3480186

indexer: use async connection for get_object_read

7b9f912

indexer: remove impl ObjectStore for IndexerReader

5be834d

bmwill added 2 commits September 5, 2024 10:04

indexer: use async connection for get_object

1965437

indexer: use async connection for SystemPackageTask

c5db231

bmwill requested review from emmazzz and gegaowp September 5, 2024 15:19

bmwill marked this pull request as ready for review September 5, 2024 15:19

bmwill requested review from amnn, wlmyng, stefan-mysten and suiwombat as code owners September 5, 2024 15:19

vercel bot deployed to Preview – sui-docs September 5, 2024 15:20 View deployment

bmwill changed the title ~~indexer: convert to using async connections~~ indexer: convert IndexerRead to using async connections Sep 5, 2024

bmwill enabled auto-merge (rebase) September 5, 2024 15:42

wlmyng reviewed Sep 5, 2024

View reviewed changes

bmwill requested a review from wlmyng September 5, 2024 17:37

wlmyng approved these changes Sep 5, 2024

View reviewed changes

bmwill merged commit 28feff4 into MystenLabs:main Sep 5, 2024
68 of 78 checks passed

bmwill deleted the indexer-async branch September 5, 2024 17:46

gegaowp reviewed Sep 5, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

indexer: convert IndexerRead to using async connections #19213

indexer: convert IndexerRead to using async connections #19213

bmwill commented Sep 4, 2024

vercel bot commented Sep 4, 2024 •

edited

Loading

wlmyng left a comment

wlmyng Sep 5, 2024

bmwill Sep 5, 2024

wlmyng Sep 5, 2024

bmwill Sep 5, 2024

wlmyng Sep 5, 2024

wlmyng Sep 5, 2024

bmwill Sep 5, 2024

bmwill commented Sep 5, 2024

wlmyng left a comment

gegaowp left a comment

gegaowp Sep 5, 2024

bmwill Sep 6, 2024

amnn Sep 6, 2024

amnn Sep 6, 2024

indexer: convert IndexerRead to using async connections #19213

indexer: convert IndexerRead to using async connections #19213

Conversation

bmwill commented Sep 4, 2024

Description

Test plan

Release notes

vercel bot commented Sep 4, 2024 • edited Loading

wlmyng left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bmwill commented Sep 5, 2024

wlmyng left a comment

Choose a reason for hiding this comment

gegaowp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vercel bot commented Sep 4, 2024 •

edited

Loading