Replies: 4 comments 9 replies
-
Do you see any invalid merkle root errors in the logs? Can you show us the sync progress graph? It would also be good to know whether or not you're using Lighthouse checkpoint sync.
-
The node simply wasn't catching up: it kept looping through the 12 stages and going back to the start, processing only ~30 blocks at a time and spending 2-3 minutes on the merkle stage alone. After reading some issues here, I decided to stop reth, drop the whole stage, and restart reth, now with v0.2.0-beta.5.
:; sudo systemctl stop reth
:; reth stage drop merkle
(...update from beta.4 to beta.5...)
:; sudo systemctl start reth
The logs now show more info, such as the ETA and % done, which I had never seen before.
Will keep you posted on how it progresses.
-
Seriously, I'm lost here. Why would it take 10 minutes for merkle to execute on a ~90 block interval? I changed the log verbosity to debug and still can't find anything relevant, aside from what I consider acceptable network errors, like peers disconnecting and such.
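For a rough sense of whether these numbers let the node catch up at all, the figures quoted in this thread can be compared against the chain's block production rate. The sketch below is only an illustration, not part of reth: `net_blocks_gained` and `break_even_seconds` are hypothetical helpers, it assumes mainnet's 12-second slot time with a block in every slot, and the durations cover the merkle stage only, so a full pass through all twelve stages would take longer.

```python
SLOT_SECONDS = 12  # assumption: one mainnet block per 12-second slot


def net_blocks_gained(blocks_processed: int, cycle_seconds: float,
                      slot_seconds: float = SLOT_SECONDS) -> float:
    """Blocks gained on the chain tip in one cycle (negative means falling behind)."""
    blocks_produced = cycle_seconds / slot_seconds  # new blocks that arrive meanwhile
    return blocks_processed - blocks_produced


def break_even_seconds(blocks_processed: int,
                       slot_seconds: float = SLOT_SECONDS) -> float:
    """Longest cycle duration at which the node still keeps pace with the tip."""
    return blocks_processed * slot_seconds


# Figures quoted in this thread (merkle stage duration only).
print(net_blocks_gained(90, 10 * 60))  # 90 processed - 50 produced = 40.0
print(net_blocks_gained(30, 150))      # 30 processed - 12.5 produced = 17.5
print(break_even_seconds(90))          # 1080 seconds, i.e. 18 minutes
```

By this estimate, a ~90 block batch has to get through the whole pipeline in under 18 minutes for the gap to the tip to shrink.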
-
@pires did you ever solve this? I am curious which AWS instance type and EBS type ended up working for you.
-
Hello there,
What I'm doing: I'm running reth v0.2.0-beta.4 with Lighthouse v5.1.3 on the same hardware, an AWS m7a.4xlarge (AMD EPYC 4th gen, 16 vCPU, 64GB RAM) with an EBS volume w/ 16k IOPS and 1000MBps throughput, and I have synced from one of the mainnet snapshots kindly provided by the merkle.io folks.
What I'm curious about: the MerkleStage takes quite some time to process a small interval of blocks (as can be seen below), even after setting clean_threshold = 5000 and restarting. This keeps happening because when the node finally reaches the finalize stage, it repeats the twelve stages all over again.
Other than RAM cache being high, I don't see anything else that could point me to resource exhaustion:
A different machine with the exact same specs has caught up entirely from the network.
Logs:
What am I missing or what else should I be looking at?
Thank you!
[Graphs: RAM, CPU, Disk IOPS]