Initial Snap sync for arbitrum #275
base: master
Conversation
checkpoint is verified via skeleton
Sometimes when I run the test, it fails:
snap_sync.txt
```go
	log.Warn("hPeer not found on HandleLastConfirmed")
	return
}
peer.RequestCheckpoint(nil)
```
Should we check the response here?
This function only sends the checkpoint request message.
The peer is expected to respond with a checkpoint message, and that will be handled by the HandleCheckpoint function.
We might/should eventually add things like retrying this request or removing the peer if there is no response, but that should be part of a later PR.
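For intuition, here is a minimal toy model of that request/response split, assuming a simple message loop; all types, names, and channels below are illustrative and are not the PR's actual code:

```go
package main

import "fmt"

// Hypothetical message codes; the real protocol defines its own.
const (
	checkpointReqMsg = iota
	checkpointMsg
)

type msg struct {
	code    int
	payload string
}

type peer struct{ out chan msg }

// RequestCheckpoint only writes the request message; it never blocks
// waiting for a reply, which is why there is no response to check at
// the call site.
func (p *peer) RequestCheckpoint() {
	p.out <- msg{code: checkpointReqMsg}
}

// handleCheckpoint stands in for HandleCheckpoint: it runs when the
// peer's checkpoint response eventually arrives.
func handleCheckpoint(m msg) {
	fmt.Println("checkpoint received:", m.payload)
}

// messageLoop dispatches incoming messages, so responses are handled
// here rather than at the point where the request was sent.
func messageLoop(in <-chan msg) {
	for m := range in {
		if m.code == checkpointMsg {
			handleCheckpoint(m)
		}
	}
}

func main() {
	network := make(chan msg, 2)
	p := &peer{out: network}
	p.RequestCheckpoint() // fire-and-forget: nothing to check here
	// Simulate the remote peer answering later:
	network <- msg{code: checkpointMsg, payload: "block 1234"}
	close(network)
	messageLoop(network)
}
```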
arbitrum/handler_p2p.go (outdated)
```go
defer hPeer.mutex.Unlock()
hPeer.arb = nil
if hPeer.eth != nil {
	hPeer.eth.Disconnect(p2p.DiscSelf)
```
Is there a reason why we pass p2p.DiscSelf here instead of e.g. p2p.DiscUselessPeer? Like here:
Lines 475 to 481 in 021ff6f
```go
// removePeer requests disconnection of a peer.
func (h *handler) removePeer(id string) {
	peer := h.peers.peer(id)
	if peer != nil {
		peer.Peer.Disconnect(p2p.DiscUselessPeer)
	}
}
```
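If the goal is to mirror removePeer, the suggestion would amount to a one-word change in the snippet above (a sketch of the proposed edit, not a tested patch):

```go
if hPeer.eth != nil {
	// p2p.DiscUselessPeer tells the remote "you are of no use to us",
	// whereas p2p.DiscSelf means "connected to self", which reads oddly here.
	hPeer.eth.Disconnect(p2p.DiscUselessPeer)
}
```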
```go
if canonical == (common.Hash{}) {
	skeleton := rawdb.ReadSkeletonHeader(h.db, number)
	if skeleton == nil {
		log.Error("arbitrum handler_p2p: canonical not found", "number", number, "peer", peer.ID())
```
We probably should panic here with some descriptive message; otherwise the panic will happen at line 333, canonical = skeleton.Hash(). If we hit this error, then either there is a bug in waitBlockSync or the database is broken.
Or we could put line 333 into an else block and fail somewhere later?
nice catch!
we don't need to panic, just return (effectively: ignore the received checkpoint message)
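Put together, the agreed handling would look roughly like this (a sketch extending the snippet quoted above; the final assignment is the line-333 statement mentioned earlier):

```go
if canonical == (common.Hash{}) {
	skeleton := rawdb.ReadSkeletonHeader(h.db, number)
	if skeleton == nil {
		log.Error("arbitrum handler_p2p: canonical not found", "number", number, "peer", peer.ID())
		// Ignore the received checkpoint message rather than letting
		// skeleton.Hash() below panic on a nil header.
		return
	}
	canonical = skeleton.Hash()
}
```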
should we also drop the peer then?
I am still not sure how we should handle this corner case, because if we can't read the skeleton header from the database here, then either the database is corrupted or waitBlockSync didn't work.
In waitBlockSync (called a few lines earlier) we wait until filler.Resume sets h.syncedBlockNum to a number greater than or equal to the checkpoint block number. filler.Resume reads the number from the header returned by h.downloader.SkeletonHead(), which internally reads the header from the database. So it seems impossible to read the header from the database once and then fail to read it a second time, unless something is broken.
Does that make sense? And if so, should we log a critical error here (or panic)?
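A toy model of the invariant being argued (names follow the PR; the synchronization details below are assumptions made up for illustration):

```go
package main

import (
	"fmt"
	"sync"
)

// handler models just enough state for waitBlockSync. The real code
// lives in arbitrum/handler_p2p.go; everything here is illustrative.
type handler struct {
	mu             sync.Mutex
	cond           *sync.Cond
	syncedBlockNum uint64
}

func newHandler() *handler {
	h := &handler{}
	h.cond = sync.NewCond(&h.mu)
	return h
}

// resume plays the role of filler.Resume: it takes the skeleton head
// number (which the real code obtains via h.downloader.SkeletonHead(),
// reading the header from the database) and publishes it.
func (h *handler) resume(skeletonHeadNum uint64) {
	h.mu.Lock()
	h.syncedBlockNum = skeletonHeadNum
	h.mu.Unlock()
	h.cond.Broadcast()
}

// waitBlockSync blocks until the synced block number reaches the
// checkpoint's number - so by the time it returns, the skeleton header
// for that number has already been read from the database once.
func (h *handler) waitBlockSync(checkpointNum uint64) {
	h.mu.Lock()
	defer h.mu.Unlock()
	for h.syncedBlockNum < checkpointNum {
		h.cond.Wait()
	}
}

func main() {
	h := newHandler()
	go h.resume(1234) // the skeleton head reaches the checkpoint height
	h.waitBlockSync(1234)
	fmt.Println("synced; a second ReadSkeletonHeader should not fail now")
}
```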
I'm not sure how we could fail this test. I don't have a good scenario that doesn't require some bug somewhere.
Say we do have a race bug in waitBlockSync - in that case, a return is the right approach. Maybe if we get another checkpoint from this or another peer it will work.
Generally, if you get an unexpected error you should try to contain it. Just make sure you don't cause illegal values elsewhere. A panic would be the opposite of containing the error.
There is no reason to assume anything is wrong with this peer, so dropping it is unnecessary as well. (Later we should add logic to handle peers DoSing us, etc.)
arbitrum/handler_p2p.go (outdated)
```go
}
nodeIter, err := storageTrie.NodeIterator(origin[:])
if err != nil {
	log.Error("Failed node iterator to open storage trie", "root", acc.Root, "err", err)
```
seems that the error message needs fixing
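One possible rewording, for what it's worth (a suggestion, not the merged text):

```go
log.Error("Failed to open node iterator on storage trie", "root", acc.Root, "err", err)
```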