
Merge bulk lookup proofs #163

Merged: 8 commits into facebook:main, Mar 9, 2022
Conversation

@eozturk1 (Contributor) commented Mar 7, 2022

This PR adds support for bulk lookup proofs (See Issue #103).

First, for each requested lookup proof, the necessary labels are calculated. Then, the nodes corresponding to their self- and sibling-prefixes are preloaded before the lookup operations (see the comments above the function build_lookup_prefixes_set for my understanding of why this is necessary).
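As a rough illustration of the preloading idea (a hypothetical sketch, not the actual akd implementation: the function name build_lookup_prefixes_set comes from the PR, but the signature and the string-based label representation here are invented for illustration), collecting every prefix of each looked-up label plus each prefix's sibling covers all the nodes a Merkle-style lookup proof touches, so they can be fetched in one batch:

```rust
use std::collections::HashSet;

// Hypothetical sketch: labels are binary strings; the real akd code
// works with NodeLabel and bit-level prefixes. For each label we
// collect every prefix (the path from the root down to the leaf) and
// each prefix's sibling (last bit flipped), since a lookup proof
// needs the hashes of the siblings along the path.
fn build_lookup_prefixes_set(labels: &[&str]) -> HashSet<String> {
    let mut prefixes = HashSet::new();
    for label in labels {
        prefixes.insert(String::new()); // root is shared by every proof
        for end in 1..=label.len() {
            let prefix = &label[..end];
            prefixes.insert(prefix.to_string());
            // Sibling prefix: same path with the final bit flipped.
            let mut sibling = prefix[..end - 1].to_string();
            sibling.push(if prefix.ends_with('0') { '1' } else { '0' });
            prefixes.insert(sibling);
        }
    }
    prefixes
}

fn main() {
    // For one 3-bit label we preload the root, 3 path prefixes, and
    // their 3 siblings: 7 nodes in one batch instead of 7 point gets.
    let set = build_lookup_prefixes_set(&["010"]);
    assert_eq!(set.len(), 7);
    assert!(set.contains("011")); // sibling of the leaf "010"
    println!("preload set size: {}", set.len());
}
```

Overlapping prefixes across labels are deduplicated by the set, which is part of why the bulk path issues far fewer storage calls than per-user lookups.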

To test it, I added a new feature to the MySQL implementation where each storage call is tracked by access type (read/write) and data type (e.g., HistoryTreeNode). Here is its output with the following parameters:

num_users: 50
num_epochs: 5
num_lookups: 100

running 1 test
Metrics after publish(es).
MySQL writes: 3279, MySQL reads: 6, Time read: 0.071272952 s, Time write: 1.13520928 s
        Tree size: 899
        Node state count: 1386
        Value state count: 250
Read call stats: [("get_direct:~Azks", 1), ("get_user_state_versions~", 5)]
Write call stats: [("internal_batch_set~", 20), ("internal_set~", 3)]

- 100 individual lookups took 3103 ms.
Metrics after individual lookups:
MySQL writes: 0, MySQL reads: 969, Time read: 2.041950956 s, Time write: 0 s
        Tree size: 899
        Node state count: 1386
        Value state count: 250
- Read call stats: [("get_direct:~Azks", 1), ("get_direct:~HistoryNodeState", 459), ("get_direct:~HistoryTreeNode", 459), ("get_user_state~", 50)]
Write call stats: []

+ 100 bulk lookups took 1732 ms.
Metrics after lookup proofs: 
MySQL writes: 0, MySQL reads: 75, Time read: 0.59272097 s, Time write: 0 s
        Tree size: 899
        Node state count: 1386
        Value state count: 250
+ Read call stats: [("batch_get~HistoryNodeState", 12), ("batch_get~HistoryTreeNode", 12), ("get_direct:~Azks", 1), ("get_user_state~", 50)]
Write call stats: []

Note the reduction in completion time (about 40%, though this may vary depending on the parameters) and that the get_direct calls are replaced with batch_get calls in bulk lookups.

This code was a bit tricky to test, since the MySQL metrics are reset when printed and nodes in the cache may change the resulting numbers. In addition, the specific users looked up might produce different time measurements. I tried to account for all of that, but if you notice any issues, please let me know!
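The per-call tracking described above can be sketched roughly as follows (a minimal sketch under my own assumptions; the real PR instruments the MySQL storage layer, and the type and method names here are illustrative only):

```rust
use std::collections::HashMap;

// Illustrative sketch of per-call metrics: each storage call is
// bucketed by an access kind (read/write) plus a call/data-type tag,
// mirroring entries like ("batch_get~HistoryTreeNode", 12).
#[derive(Default)]
struct CallStats {
    reads: HashMap<String, u64>,
    writes: HashMap<String, u64>,
}

impl CallStats {
    fn record_read(&mut self, tag: &str) {
        *self.reads.entry(tag.to_string()).or_insert(0) += 1;
    }
    fn record_write(&mut self, tag: &str) {
        *self.writes.entry(tag.to_string()).or_insert(0) += 1;
    }
    // As in the PR's output, stats are consumed (reset) when reported,
    // which is part of what made the numbers tricky to compare.
    fn flush(&mut self) -> (Vec<(String, u64)>, Vec<(String, u64)>) {
        let mut r: Vec<_> = self.reads.drain().collect();
        let mut w: Vec<_> = self.writes.drain().collect();
        r.sort();
        w.sort();
        (r, w)
    }
}

fn main() {
    let mut stats = CallStats::default();
    stats.record_read("get_direct:~Azks");
    stats.record_read("batch_get~HistoryTreeNode");
    stats.record_read("batch_get~HistoryTreeNode");
    let (reads, writes) = stats.flush();
    assert_eq!(reads[0], ("batch_get~HistoryTreeNode".to_string(), 2));
    assert!(writes.is_empty());
    // A second flush is empty: metrics reset on report.
    assert!(stats.flush().0.is_empty());
    println!("Read call stats: {:?}", reads);
}
```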

@facebook-github-bot added the CLA Signed label on Mar 7, 2022. (This label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed.)
@slawlor (Contributor) left a comment
It's close, but we need to fix the MySQL metrics gathering, and I think there are more optimizations possible for batch lookups. The second part could be pushed to a future PR, though.

akd/src/directory.rs: review threads (resolved)
let mut lookup_proofs = Vec::new();
for i in 0..unames.len() {
    lookup_proofs.push(
        self.lookup_with_info::<H>(
@slawlor (Contributor):
Why do we need the info? I think if we've done the proper BFS loading, we should only need to batch-get the value states to load them into the cache; then we can do the regular proof-generation operations, since we'll only be accessing local memory.

@eozturk1 (Contributor, Author) replied Mar 9, 2022:
This is to eliminate duplicate get_user_state queries: one to identify which labels are needed, and (previously) one inside lookup for the same purpose. This way the preloaded info is passed to the lookup function, so there are no re-loads.

Looking at the code for MySQL's get_user_state, are we missing looking up the queried state in cache?
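The deduplication being described might look something like this (a hypothetical sketch: lookup_with_info is the PR's function name, but the types and signature here are invented for illustration). The caller fetches the user state once while computing labels and hands it to the lookup, which only falls back to storage when nothing was passed in:

```rust
// Hypothetical sketch of passing preloaded user state into a lookup
// so storage is not queried a second time. The counter stands in for
// a real get_user_state round trip to MySQL.
#[derive(Clone)]
struct UserState {
    version: u64,
}

struct Storage {
    get_user_state_calls: u32,
}

impl Storage {
    fn get_user_state(&mut self, _uname: &str) -> UserState {
        self.get_user_state_calls += 1;
        UserState { version: 3 }
    }
}

// `info` carries the state already fetched while computing labels;
// only when it is None does the lookup hit storage itself.
fn lookup_with_info(
    storage: &mut Storage,
    uname: &str,
    info: Option<UserState>,
) -> UserState {
    info.unwrap_or_else(|| storage.get_user_state(uname))
}

fn main() {
    let mut storage = Storage { get_user_state_calls: 0 };
    // Bulk path: state fetched once up front, then passed in.
    let prefetched = storage.get_user_state("alice");
    let s = lookup_with_info(&mut storage, "alice", Some(prefetched));
    assert_eq!(s.version, 3);
    assert_eq!(storage.get_user_state_calls, 1); // no re-load
    // Old path: no info passed, so the lookup queries storage itself.
    lookup_with_info(&mut storage, "alice", None);
    assert_eq!(storage.get_user_state_calls, 2);
}
```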

@slawlor (Contributor):

Ah I see, yes, we've excluded get_user_state from the caching logic because of the filter parameters. How would you know a cached value matches "<= epoch" or some other filter without going to the backing store? It sounds like we might need a new query at the storage layer to retrieve the max versions for a collection of users that are <= a given epoch (if I'm remembering right). We could move forward with this, however, and simply open an issue for it, but we'll want to do it properly for batch proof generation. Here we will hit O(N) queries just to gather the user states; while not awful, that's still not ideal when we can do a better query.
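The semantics of the suggested storage query (entirely hypothetical; no such query exists in this PR, and the record layout below is invented for illustration) would be: for a set of users, return each user's maximum version whose epoch is <= the given epoch, in one round trip rather than N. Expressed over in-memory rows, that looks like:

```rust
use std::collections::HashMap;

// (username, version, epoch) tuples, standing in for rows of the
// user-state table in MySQL.
type Record = (&'static str, u64, u64);

// Hypothetical single-pass equivalent of the proposed batch query:
// for each requested user, the max version with epoch <= `epoch`.
// A real implementation would push this into one SQL statement
// (GROUP BY user with a MAX over the epoch-filtered rows).
fn max_versions_at_epoch(
    rows: &[Record],
    users: &[&str],
    epoch: u64,
) -> HashMap<String, u64> {
    let mut out = HashMap::new();
    for &(uname, version, ep) in rows {
        if ep <= epoch && users.contains(&uname) {
            let entry = out.entry(uname.to_string()).or_insert(version);
            if version > *entry {
                *entry = version;
            }
        }
    }
    out
}

fn main() {
    let rows = [
        ("alice", 1, 1),
        ("alice", 2, 3),
        ("alice", 3, 5),
        ("bob", 1, 2),
    ];
    let res = max_versions_at_epoch(&rows, &["alice", "bob"], 3);
    assert_eq!(res["alice"], 2); // version 3 is at epoch 5 > 3
    assert_eq!(res["bob"], 1);
    println!("{:?}", res);
}
```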

@eozturk1 (Contributor, Author):
Ah I see! Tracked in #169. I can take a stab at #166, #167 and #169 in a new PR. Does that sound good?

@slawlor (Contributor):
Sounds great, let's merge this in!

akd/src/node_state.rs: review thread (resolved)
akd_mysql/src/mysql.rs: review thread (resolved)
@eozturk1 (Contributor, Author) commented Mar 9, 2022
I'll work on #166, #167 and #169 in a different PR. Is there anything I should update in this PR to proceed with merging?

@eozturk1 eozturk1 merged commit 01e3e00 into facebook:main Mar 9, 2022
@eozturk1 eozturk1 deleted the merge-bulk-lookup-proofs branch March 10, 2022 23:50
@eozturk1 eozturk1 mentioned this pull request Aug 1, 2022