Skip to content

Commit

Permalink
gh-148 devp2p: ENR subsection added
Browse files Browse the repository at this point in the history
  • Loading branch information
raa-dev committed May 27, 2024
1 parent d3be53e commit 988b4ed
Show file tree
Hide file tree
Showing 2 changed files with 50 additions and 23 deletions.
66 changes: 43 additions & 23 deletions docs/wiki/EL/devp2p.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,10 +46,10 @@ The reason behind this is that the information exchange requires a reliable conn
so they can be able to both confirm the connection before sending the data and have a way to ensure that the data is delivered in the correct order and without errors (or at least to have a way to detect and correct them),
while the discovery process does not require the reliable connection, since it is enough to let other knows that the node is available to communicate.

### Discovery
### Discv protocol (Discovery)
The process of how the nodes find each other in the network starts with [the hard-coded bootnodes listed in the specification](https://github.com/ethereum/go-ethereum/blob/master/params/bootnodes.go).
The bootnodes are nodes that are known by all the other nodes in the networks (both Mainnet and testnets), and they are used to bootstrap the discovery peers process.
Using the Kademlia DHT (Distributed Hash Table) algorithm, the nodes are able to find each other in the network by referring to a routing table where the bootnodes are listed.
Using the Kademlia-like DHT (Distributed Hash Table) algorithm, the nodes are able to find each other in the network by referring to a routing table where the bootnodes are listed.
The TLDR of the Kademlia is that it is a peer-to-peer protocol that enables nodes to find each other in the network by using a distributed hash table, as Leffew mentioned in his article (2019).

That is to say, the connection process starts with a PING-PONG game where the new node send a PING message to the bootnode, and the bootnode responds with a PONG hashed message.
Expand All @@ -58,6 +58,7 @@ so it can repeat the PING-PONG game with them and bond with them as well.

![img.png](../../images/el-architecture/peer-discovery.png)

#### Wire protocol
The PING/PONG game is better known as the wire subprotocol, and it includes the next specifications:

**PING packet structure**
Expand Down Expand Up @@ -107,37 +108,56 @@ Where ENR is the Ethereum Node Record, a standard format for connectivity for no

---
Currently, the execution clients are using the [Discv4 protocol](https://github.com/ethereum/devp2p/blob/master/discv4.md) for the discovery process, although it is planned to be migrated to [Discv5](https://github.com/ethereum/devp2p/blob/master/discv5/discv5.md) in the future.
This Kademlia-like discovery protocol includes the routing table, which keeps information about other nodes in the neighbourhood consisting of *k-buckets* (where *k* is the number of nodes in the bucket, currently defined as 16).
This Kademlia-like protocol includes the routing table, which keeps information about other nodes in the neighbourhood consisting of *k-buckets* (where *k* is the number of nodes in the bucket, currently defined as 16).
Worth mentioning that all the table entries are sorted by *last seen/least-recently seen* at the head, and most-recently seen at the tail.
If one of the entities has not been responded to in 12 hours, it is removed from the table, and the next encounter node is added to the tail of the list.

### ENR: Ethereum Node Records
The ENR os a standard format for p2p connectivity, which was originally proposed in the [EIP-778](https://eips.ethereum.org/EIPS/eip-778).
A node record contains the node's network endpoints, such as the IP address and port, as well as the node's public key and the sequence number of the record.

The record content structure is as follows:

### ENR: Ethereum Node Records
* Standard format for connectivity for nodes
* Node's identity
* Object
* signature (hash)
* sequence number
* arbitrary key-value pairs

## DevP2P specs
* RLPx: encrypted and authenticated transport protocol (TCP based transport system for nodes communication)
* Handshake messaging
* Encryption
* Authentication
* RLP: Recursive Length Prefix encoding
* Multiplexing: multiple subprotocols
* Subprotocols
* LES: Light Ethereum Subprotocol
* Witness Subprotocol
* Wire Subprotocol
* SHH: Whisper Subprotocol
| Key | Value |
| --- |---------------------------------------------|
| id | id scheme, e.g "v4" |
| secp256k1 | compressed public key, 33 bytes |
| ip | IPv4 address, 4 bytes |
| tcp | TCP port, big endinan integer |
| udp | UDP port, big endinan integer |
| ip6 | IPv6 address, 16 bytes |
| tcp6 | IPv6-specific TCP port, big endinan integer |
| udp6 | IPv6-specific UDP port, big endinan integer |

All the fields are optional, except for the `id` field, which is required. If no `tcp6`/`udp6` port are provided, the `tcp`/`udp` ports are used for both IPv4 and IPv6.

The node record is composed of a `signature`, which is the cryptographic signature of record contents, and a `seq` field, which is the sequence number of the record (a 64-bit unsigned integer).
#### Encoding

The record is encoded as an RLP list of `[signature, seq, k, v,...]` with a maximum size of 300 bytes.
Signed records are encoded as follows:
```
content = [seq, k, v, ...]
signature = sign(content)
record = [signature, seq, k, v, ...]
```
In addition to the RLP encoding, there is a textual representation of the record, which is a base64 encoding of the RLP encoding. It is prefixed with `enr:`.
i.e. `enr:-IS4QHCYrYZbAKWCBRlAy5zzaDZXJBGkcnh4MHcBFZntXNFrdvJjX04jRzjzCBOonrkTfj499SZuOh8R33Ls8RRcy5wBgmlkgnY0gmlwhH8AAAGJc2VjcDI1NmsxoQPKY0yuDUmstAHYpMa2_oxVtw0RW_QAdpzBQA8yWM0xOIN1ZHCCdl8` which contains the loopback address `127.0.0.1` and the UDP port 30303. The node ID is `a448f24c6d18e575453db13171562b71999873db5b286df957af199ec94617f7`.

Despite of the fact that the ENR is a standard format for p2p connectivity, it is not mandatory to use it in the Ethereum network. The nodes can use any other format to exchange the information about their connectivity.
There are two additional formats able to be understand by an Ethereum node: multiaddr and enode.

* The multiaddr was the original one. For example, the multiaddr for a node with a loopback IP listening on TCP port 30303 and node ID `a448f24c6d18e575453db13171562b71999873db5b286df957af199ec94617f7` is `/ip4/127.0.0.1/tcp/30303/a448f24c6d18e575453db13171562b71999873db5b286df957af199ec94617f7`.
* The enode is a more human-readable format. For example, the enode for the same node is `enode://a448f24c6d18e575453db13171562b71999873db5b286df957af199ec94617f7@127.0.0.1:30303?discport=30301`. It is a URL-like format describing the node ID encoded before de @ sign, the IP address, the TCP port and the UDP port specified as "discport".

### RLPx protocol (Transport)


### Further Reading
* [Geth devp2p docs](https://geth.ethereum.org/docs/tools/devp2p)
* [Ethereum devp2p GitHub](https://github.com/ethereum/devp2p)
* [Ethereum networking layer](https://ethereum.org/en/developers/docs/networking-layer/)
* [Ethereum Addresses](https://ethereum.org/en/developers/docs/networking-layer/network-addresses/)
* Andrew S. Tanenbaum, Nick Feamster, David J. Wetherall (2021). *Computer Networks*. 6th edition. Pearson. London.
* Clause E. Shannon (1948). "A Mathematical Theory of Communication". *Bell System Technical Journal*. Vol. 27.
* Jim Kurose and Keith Ross (2020). *Computer Networking: A Top-Down Approach*. 8th edition. Pearson.
Expand Down
7 changes: 7 additions & 0 deletions wordlist.txt
Original file line number Diff line number Diff line change
Expand Up @@ -186,6 +186,7 @@ Dimitar
disambiguated
Discordo
discv
discport
distro
docsify
dogecoin
Expand Down Expand Up @@ -223,8 +224,10 @@ Elmore
ELs
EL's
Encodings
endian
Endian
env
enode
ENR
EOA
EOAs
Expand Down Expand Up @@ -358,6 +361,7 @@ interop
invariants
IOP
IPC
IPv
IRTF
ISA
Jitsi
Expand Down Expand Up @@ -403,6 +407,7 @@ locktime
Longrightarrow
lookahead
lookups
loopback
LSB
LSM
LSP
Expand Down Expand Up @@ -447,6 +452,7 @@ MSB
MSIZE
mstore
MUL
multiaddr
multiproofs
Murr
MVE
Expand All @@ -458,6 +464,7 @@ namespaces
Nand
natively
neighbours
neighbourhood
neq
Nethermind
neuder
Expand Down

0 comments on commit 988b4ed

Please sign in to comment.