Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: state sync optimization #346

Merged
merged 10 commits into from
Jan 6, 2025
Merged

Conversation

ogabrielides
Copy link
Collaborator

@ogabrielides ogabrielides commented Dec 26, 2024

Issue being fixed or feature implemented

This PR includes optimizations for state sync.
It includes also all fixes from #300.

What was done?

The major improvments made are:

  • Ability to open a merk by subtree prefix, optional root_key and is_sum_tree bool (in both tx and non-tx mode)
  • Changed the chunk identification to: SUBTREE_PREFIX:SIZE_ROOT_KEY:ROOT_KEY:IS_SUM_TREE:CHUNK_ID allowing fetch_chunk function (on source side) to open directly the corresponding merk
  • Discovery of new subtrees is made on the fly and any recursion was completly removed.

How Has This Been Tested?

Tested with tutorials replication and in Platform strategy test.

Breaking Changes

No

Checklist:

  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have added or updated relevant unit/integration/functional/e2e tests
  • I have made corresponding changes to the documentation

For repository code-owners and collaborators only

  • I have assigned this pull request to a milestone

Summary by CodeRabbit

  • New Features

    • Introduced methods for accessing storage contexts based on subtree prefixes.
    • Added session-based state synchronization functionality.
    • Enhanced database synchronization process with increased data handling capacity.
  • Bug Fixes

    • Improved error handling for session commits during synchronization.
  • Documentation

    • Updated the Cargo.toml to include the blake3 crate dependency.
  • Refactor

    • Streamlined internal logic for state synchronization and chunk processing.

Copy link
Contributor

coderabbitai bot commented Dec 26, 2024

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

The changes introduce enhancements to GroveDB's state synchronization and storage management system. The modifications focus on improving subtree handling, introducing new methods for opening Merks by prefix, and refactoring the replication process. Key improvements include session-based state synchronization, more flexible storage context retrieval, and expanded methods for managing database operations across different contexts.

Changes

File Change Summary
grovedb/src/lib.rs Added methods open_transactional_merk_by_prefix and open_non_transactional_merk_by_prefix. Updated open_merk_for_replication to return additional context about the Merk.
grovedb/src/replication.rs Introduced new methods for syncing sessions, modified chunk fetching, and updated state synchronization approach.
grovedb/src/replication/state_sync_session.rs Created new structs and methods for managing state synchronization sessions, including SubtreeStateSyncInfo and MultiStateSyncSession.
storage/src/rocksdb_storage/storage.rs Added methods for retrieving storage contexts by subtree prefix. Made SubtreePrefix publicly visible.
storage/src/storage.rs Added new methods to Storage trait for context retrieval by subtree prefix.
tutorials/Cargo.toml Added blake3 dependency (version 1.5.1).
tutorials/src/bin/replication.rs Updated synchronization demo to use new session-based approach, increased data insertion range.

Sequence Diagram

sequenceDiagram
    participant Client
    participant GroveDB
    participant StateSyncSession
    participant StorageContext

    Client->>GroveDB: start_syncing_session()
    GroveDB->>StateSyncSession: Create new session
    Client->>GroveDB: fetch_chunk()
    GroveDB->>StorageContext: Get context by prefix
    StorageContext-->>GroveDB: Return storage context
    GroveDB->>StateSyncSession: Apply chunk
    StateSyncSession-->>GroveDB: Process chunk
    Client->>GroveDB: commit_session()
    GroveDB->>StateSyncSession: Finalize synchronization
Loading

Poem

🐰 A Rabbit's Ode to Database Delight

In subtrees of data, we dance and we sync,
Prefixes and contexts, no longer we shrink!
Sessions now sparkle with magical might,
Merks open swiftly, our storage burns bright!

🌟 Hop, hop, database magic takes flight! 🚀


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

@ogabrielides ogabrielides mentioned this pull request Dec 26, 2024
5 tasks
@ogabrielides
Copy link
Collaborator Author

@coderabbitai review

Copy link
Contributor

coderabbitai bot commented Dec 27, 2024

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (12)
tutorials/src/bin/replication.rs (3)

21-22: Unify SubtreePrefix type definition
You define SubtreePrefix here as [u8; blake3::OUT_LEN], but the same alias appears elsewhere. Consider unifying it into one central definition to avoid duplication across modules.


104-104: Avoid panics by handling errors
Using .unwrap() here will panic on error. In production scenarios, consider replacing .unwrap() with the ? operator or custom error handling to provide more graceful error reporting.


246-270: Enhance logging and session commit flow

  1. start_time usage is good for measuring sync duration. Consider adopting a structured logging approach in place of println! for better production adaptability.
  2. Tracking num_chunks is helpful. You might also record chunk sizes to diagnose throughput or memory usage.
  3. commit_session silently discards potential costs or errors. If future versions return a Result, ensure to handle or log it.
grovedb/src/replication.rs (5)

25-27: Lack of error handling in commit_session
commit_session swallows the cost result. If in future the commit operation can fail, consider returning a Result to propagate potential errors.


49-49: Revised transaction parameter
Renaming tx to transaction improves consistency with the rest of the code.


68-70: Consider smaller helper method
This block might be refactored into a helper, e.g. open_merk_for_chunk(...), for improved readability if it grows further.


225-225: Reduce type complexity
Clippy warns about the return signature in util_split_global_chunk_id. Consider using a dedicated struct or alias to clarify meaning and reduce complexity:

🧰 Tools
🪛 GitHub Check: clippy

[warning] 225-225: very complex type used. Consider factoring parts into type definitions
warning: very complex type used. Consider factoring parts into type definitions
--> grovedb/src/replication.rs:225:6
|
225 | ) -> Result<(crate::SubtreePrefix, Option<Vec>, bool, Vec), Error> {
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#type_complexity
= note: #[warn(clippy::type_complexity)] on by default


214-214: Optimize string conversion
Clippy notes that borrowing &subtree is immediately dereferenced. Also, you can replace unwrap_or_else with unwrap_or for a constant fallback string.

- let string = std::str::from_utf8(&subtree).unwrap_or_else(|_| "<NON_UTF8_PATH>");
+ let string = std::str::from_utf8(subtree).unwrap_or("<NON_UTF8_PATH>");
🧰 Tools
🪛 GitHub Check: clippy

[warning] 214-214: this expression creates a reference which is immediately dereferenced by the compiler
warning: this expression creates a reference which is immediately dereferenced by the compiler
--> grovedb/src/replication.rs:214:42
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or_else(|_| "<NON_UTF8_PATH>");
| ^^^^^^^^ help: change this to: subtree
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow


[warning] 214-214: unnecessary closure used to substitute value for Result::Err
warning: unnecessary closure used to substitute value for Result::Err
--> grovedb/src/replication.rs:214:22
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or_else(|_| "<NON_UTF8_PATH>");
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_lazy_evaluations
= note: #[warn(clippy::unnecessary_lazy_evaluations)] on by default
help: use unwrap_or instead
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or("<NON_UTF8_PATH>");
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~

grovedb/src/replication/state_sync_session.rs (3)

43-43: Elide explicit lifetimes
The 'db lifetime in impl<'db> SubtreeStateSyncInfo<'db> could potentially be elided for clarity if no advanced lifetime constraints are required.

🧰 Tools
🪛 GitHub Check: clippy

[warning] 43-43: the following explicit lifetimes could be elided: 'db
warning: the following explicit lifetimes could be elided: 'db
--> grovedb/src/replication/state_sync_session.rs:43:6
|
43 | impl<'db> SubtreeStateSyncInfo<'db> {
| ^^^ ^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_lifetimes
help: elide the lifetimes
|
43 - impl<'db> SubtreeStateSyncInfo<'db> {
43 + impl SubtreeStateSyncInfo<'_> {
|


263-263: Avoid cloning bools
You’re calling .clone() on is_sum_tree which implements Copy. Removing .clone() is more concise and avoids minor overhead.

- subtree_state_sync.is_sum_tree.clone(),
+ subtree_state_sync.is_sum_tree,
🧰 Tools
🪛 GitHub Check: clippy

[warning] 263-263: using clone on type bool which implements the Copy trait
warning: using clone on type bool which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:263:21
|
263 | subtree_state_sync.is_sum_tree.clone(),
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ help: try removing the clone call: subtree_state_sync.is_sum_tree
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy
= note: #[warn(clippy::clone_on_copy)] on by default


406-408: Avoid cloning fixed-size arrays
Clippy warns about cloning [u8; 32]. As these are Copy, simply dereference instead of cloning:

- elem_value_hash.clone(),
- actual_value_hash.clone(),
- prefix.clone(),
+ *elem_value_hash,
+ *actual_value_hash,
+ *prefix,
🧰 Tools
🪛 GitHub Check: clippy

[warning] 408-408: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:408:21
|
408 | prefix.clone(),
| ^^^^^^^^^^^^^^ help: try dereferencing it: *prefix
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy


[warning] 407-407: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:407:26
|
407 | Some(actual_value_hash.clone()),
| ^^^^^^^^^^^^^^^^^^^^^^^^^ help: try dereferencing it: *actual_value_hash
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy


[warning] 406-406: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:406:21
|
406 | elem_value_hash.clone(),
| ^^^^^^^^^^^^^^^^^^^^^^^ help: try dereferencing it: *elem_value_hash
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy

grovedb/src/lib.rs (1)

384-391: High complexity return type
Clippy’s warning suggests factoring out (Merk<PrefixedRocksDbImmediateStorageContext<'tx>>, Option<Vec<u8>>, bool) into a named type or struct for clarity:

🧰 Tools
🪛 GitHub Check: clippy

[warning] 384-391: very complex type used. Consider factoring parts into type definitions
warning: very complex type used. Consider factoring parts into type definitions
--> grovedb/src/lib.rs:384:10
|
384 | ) -> Result<
| __________^
385 | | (
386 | | Merk<PrefixedRocksDbImmediateStorageContext<'tx>>,
387 | | Option<Vec>,
... |
390 | | Error,
391 | | >
| |_____^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#type_complexity

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7f900ea and 0667774.

📒 Files selected for processing (7)
  • grovedb/src/lib.rs (5 hunks)
  • grovedb/src/replication.rs (3 hunks)
  • grovedb/src/replication/state_sync_session.rs (1 hunks)
  • storage/src/rocksdb_storage/storage.rs (4 hunks)
  • storage/src/storage.rs (4 hunks)
  • tutorials/Cargo.toml (1 hunks)
  • tutorials/src/bin/replication.rs (5 hunks)
🧰 Additional context used
🪛 GitHub Check: clippy
grovedb/src/lib.rs

[warning] 384-391: very complex type used. Consider factoring parts into type definitions
warning: very complex type used. Consider factoring parts into type definitions
--> grovedb/src/lib.rs:384:10
|
384 | ) -> Result<
| __________^
385 | | (
386 | | Merk<PrefixedRocksDbImmediateStorageContext<'tx>>,
387 | | Option<Vec>,
... |
390 | | Error,
391 | | >
| |_____^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#type_complexity

grovedb/src/replication/state_sync_session.rs

[warning] 408-408: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:408:21
|
408 | prefix.clone(),
| ^^^^^^^^^^^^^^ help: try dereferencing it: *prefix
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy


[warning] 407-407: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:407:26
|
407 | Some(actual_value_hash.clone()),
| ^^^^^^^^^^^^^^^^^^^^^^^^^ help: try dereferencing it: *actual_value_hash
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy


[warning] 406-406: using clone on type [u8; 32] which implements the Copy trait
warning: using clone on type [u8; 32] which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:406:21
|
406 | elem_value_hash.clone(),
| ^^^^^^^^^^^^^^^^^^^^^^^ help: try dereferencing it: *elem_value_hash
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy


[warning] 263-263: using clone on type bool which implements the Copy trait
warning: using clone on type bool which implements the Copy trait
--> grovedb/src/replication/state_sync_session.rs:263:21
|
263 | subtree_state_sync.is_sum_tree.clone(),
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ help: try removing the clone call: subtree_state_sync.is_sum_tree
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#clone_on_copy
= note: #[warn(clippy::clone_on_copy)] on by default


[warning] 43-43: the following explicit lifetimes could be elided: 'db
warning: the following explicit lifetimes could be elided: 'db
--> grovedb/src/replication/state_sync_session.rs:43:6
|
43 | impl<'db> SubtreeStateSyncInfo<'db> {
| ^^^ ^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_lifetimes
help: elide the lifetimes
|
43 - impl<'db> SubtreeStateSyncInfo<'db> {
43 + impl SubtreeStateSyncInfo<'_> {
|

grovedb/src/replication.rs

[warning] 225-225: very complex type used. Consider factoring parts into type definitions
warning: very complex type used. Consider factoring parts into type definitions
--> grovedb/src/replication.rs:225:6
|
225 | ) -> Result<(crate::SubtreePrefix, Option<Vec>, bool, Vec), Error> {
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#type_complexity
= note: #[warn(clippy::type_complexity)] on by default


[warning] 214-214: this expression creates a reference which is immediately dereferenced by the compiler
warning: this expression creates a reference which is immediately dereferenced by the compiler
--> grovedb/src/replication.rs:214:42
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or_else(|_| "<NON_UTF8_PATH>");
| ^^^^^^^^ help: change this to: subtree
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#needless_borrow


[warning] 214-214: unnecessary closure used to substitute value for Result::Err
warning: unnecessary closure used to substitute value for Result::Err
--> grovedb/src/replication.rs:214:22
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or_else(|_| "<NON_UTF8_PATH>");
| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
= help: for further information visit https://rust-lang.github.io/rust-clippy/master/index.html#unnecessary_lazy_evaluations
= note: #[warn(clippy::unnecessary_lazy_evaluations)] on by default
help: use unwrap_or instead
|
214 | let string = std::str::from_utf8(&subtree).unwrap_or("<NON_UTF8_PATH>");
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~

🔇 Additional comments (20)
tutorials/src/bin/replication.rs (3)

3-3: Use Duration import or remove it if unused
If you plan on using Duration for performance measurements beyond Instant, that's great. Otherwise, consider removing the unused import to keep the code clean.


42-42: Verify performance impact of increased range
Expanding the loop from 5 to 100 can significantly increase inserted records (up to 5050 items). This is helpful for testing larger datasets but may slow down tutorial runs. Ensure that these higher ranges won’t overly impact performance or memory usage.


49-49: Confirm consistency with prior loop changes
This loop mirrors the previous one, inserting the same expanded range of values. If this duplication of data is intentional, no changes are needed.

grovedb/src/replication.rs (4)

1-2: Module import clarity
The new mod state_sync_session; integrates well. Ensure any future additions to state_sync_session remain consistent with overarching replication design.


21-23: Session-based sync approach
Introducing start_syncing_session is a clear improvement toward session-based replication. Be mindful of transaction boundaries in concurrent scenarios.


33-44: Document newly introduced chunk ID format
This doc comment thoroughly describes [SUBTREE_PREFIX:SIZE_ROOT_KEY:ROOT_KEY:IS_SUM_TREE:CHUNK_ID]. Ensure all call sites and maintainers reference this properly for clarity.


64-66: Validate root hash usage
root_app_hash is used to detect if global_chunk_id matches the root. Confirm that upstream logic always provides a valid application hash.

grovedb/src/lib.rs (4)

184-185: Confirm ChunkProducer usage
Ensure that the newly imported ChunkProducer is invoked where intended, given that replicated chunk retrieval logic now resides in replication.rs.


222-223: Check usage of util_encode_vec_ops
Since util_encode_vec_ops is central to chunk serialization, confirm it’s used consistently throughout replication code to avoid diverging chunk formats.


337-376: Open transactional Merk by prefix
The approach is logical: you retrieve a specialized storage context from get_transactional_storage_context_by_subtree_prefix. Verify that concurrency handling is consistent with open Merk usage in other methods.


520-557: Open non-transactional Merk by prefix
Similar to the transactional version, ensure the logic for root key checks and error handling remains parallel to avoid divergence between the two flows.

storage/src/storage.rs (4)

46-47: Type alias alignment
pub type SubtreePrefix = [u8; blake3::OUT_LEN]; is consistent with your usage; however, confirm you aren’t duplicating this alias in multiple places. If you are, consider referencing it from one source.


94-101: Context creation by subtree prefix
Providing a batch-ready storage context by prefix is a welcome addition. If concurrency or partial writes become more complex, consider separate modules or methods for clarity.


113-121: Transactional context by subtree prefix
This extension mirrors the batch context style. Keep an eye on error-handling consistency around partial commits or rollbacks for the prefix-based approach.


132-139: Immediate storage context by prefix
Ensure that replicators or direct-writers supply consistent prefix values so that separate subtrees don’t accidentally overlap.

storage/src/rocksdb_storage/storage.rs (4)

61-61: Elevating SubtreePrefix to pub scope looks good
It aligns with the need to share prefix creation across the codebase, but make sure that no inadvertent external usage is introduced without careful access control or documentation.


475-482: Consider adding cost-based validation for imported prefixes
In get_storage_context_by_subtree_prefix, the prefix is assumed to be correct and the method wraps it with a default cost. If the prefix is invalid or empty, there’s no direct error handling.


498-506: Transactional context creation is consistent
The logic for building a prefixed transactional context by subtree prefix is coherent. Ensure that error pathways are considered if prefix usage eventually fails.


521-528: Immediate context creation is consistent, but watch for concurrency
When retrieving an immediate storage context by prefix, ensure that concurrent modifications to the same subtree prefix or transaction references do not cause issues.

tutorials/Cargo.toml (1)

17-17: Dependency on blake3 is properly introduced
This aligns with the usage in SubtreePrefix definitions. Double-check for any potential version conflicts in other crates referencing blake3.

grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
@ogabrielides ogabrielides requested a review from fominok December 30, 2024 11:19
@ogabrielides ogabrielides force-pushed the feat/state_sync_optimization branch from 47374aa to 995aec7 Compare December 30, 2024 12:35
grovedb/src/replication.rs Outdated Show resolved Hide resolved
grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
return Ok(vec![]);
}

let completed_path = subtree_state_sync.get_current_path();
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

seems like this should return a reference instead of owned vec

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this one seems to be not possible. As it will violate the borrowing rule: we already have a mutable reference to subtree_state_sync a few lines above

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

okay

grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
grovedb/src/replication/state_sync_session.rs Outdated Show resolved Hide resolved
Copy link
Member

@QuantumExplorer QuantumExplorer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not a big fan of Pin, but I tried to do it without pinning myself and realized it would take a lot of time. So this is good enough if it works.

@ogabrielides ogabrielides merged commit 0cce59e into develop Jan 6, 2025
8 checks passed
@ogabrielides ogabrielides deleted the feat/state_sync_optimization branch January 6, 2025 10:22
@coderabbitai coderabbitai bot mentioned this pull request Jan 8, 2025
5 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants