streamline spec and add more sections

celestiaorg · Mar 1, 2024 · 7d02fb7 · 7d02fb7
1 parent 5a818be
commit 7d02fb7
Showing 1 changed file with 184 additions and 99 deletions.
diff --git a/specs/src/shwap/spec.md b/specs/src/shwap/spec.md
@@ -2,25 +2,53 @@
 
 ## Abstract
 
-This document specifies the Shwap p2p protocol. Shwap provides scalable and extensible framework for exchanging and 
-swapping of shared data for Celestia's Data Availability network and beyond. 
+This document specifies Shwap - the simple and expressive, yet extensible and future-proof messaging framework aiming to
+solve critical inefficiencies and standardise messaging of Celestia's Data Availability p2p network. 
+
+Shwap defines messaging framework to be exchanged around the DA p2p network in trust-minimized way and without enforcing
+transport(QUIC/TCP or IP) or application layer protocol semantics(e.g HTTP/x). Using this framework, Shwap 
+declares the most common messages and provides options on how to stack them with lower-level protocols. 
+Shwap can be stacked together with application protocol like HTTP/x, [KadDHT][kaddht], [Bitswap][bitswap] or any custom 
+protocol.
 
 ## Motivation
 
-The current Data Availability Sampling (DAS) network protocol is inefficient. A _single_ sample operation takes log2(k) network
-round-trips(where k is the square size). This is not practical and does not scale for the theoretically unlimited data
-square that the Celestia network enables. The main motive here is a protocol with O(1) round-trip for _multiple_ samples, preserving
-the assumption of having 1/n honest peers connected.
+The current Data Availability Sampling (DAS) network protocol is inefficient. A _single_ sample operation takes log2(k) 
+network round-trips(where k is the square size). This is not practical and does not scale for the theoretically unlimited 
+data square that the Celestia network enables. The main motive here is a protocol with O(1) round-trip for _multiple_ 
+samples, preserving the assumption of having 1/n honest peers connected.
 
 Initially, Bitswap and IPLD were adopted as the basis for the DA network protocols, including DAS,
 block synchronization (BS), and blob/namespace data retrieval (ND). They gave battle-tested protocols and tooling with
 pluggability to rapidly scaffold Celestia's DA network. However, it came with the price of scalability limits and
 round-trips resulting in BS slower than block production. Before the network launch, the transition
 to the optimized [ShrEx protocol][shrex] for BS and integrating [CAR and DAGStore-based storage][storage] happened
-optimizing BS and ND. However, DAS was left untouched, preserving its weak scalability and roundtrip inefficiency. Shwap
-addresses these and provides an extensible and flexible framework for BS, ND, and beyond.
+optimizing BS and ND. However, DAS was left untouched, preserving its weak scalability and roundtrip inefficiency. 
+
+Shwap messaging stacked together with Bitswap protocol directly addresses described inefficiency and provides foundation
+for efficient communication for BS, ND, and beyond.
+
+## Rationale
+
+The atomic primitive of Celestia's DA network is a share. Shwap standardize messaging and serialization for shares.
+Shares are grouped together forming more complex data types(Rows, Blobs, etc). These data types are encapsulated in
+containers, e.g. Row container groups shares of a particular row. Containers can be identified with share identifiers
+in order to request, advertise or index the containers. The combination of containers and identifiers provides extensible
+and expressive messaging framework for groups of shares and enable efficient single round-trip request-response 
+communication.
+
+There are many share groups or containers known in Celestia network and systemizing this is the main reason behind setting
+up this simple messaging framework. There needs to be a single place with all the possible Celestia DA messages defined
+which node software and protocol researchers can rely and coordinate on. Besides, this framework is designed to be 
+future-proof and sustain changes in the core protocol's data structures and proving system, as long shares stays the 
+de facto atomic data type.
+
+Besides, there needs to be systematization and common knowledge-base with all the edge cases for possible protocol 
+compositions of Shwap with lower-level protocols Bitswap, KadDHT or Shrex, which Shwap aims to describe.
+
+## Specification
 
-## Terms and Definitions
+### Terms and Definitions
 
 The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
 "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and
@@ -48,46 +76,44 @@ _**Client**_: The Peer that requests content by content identifies over Shwap.
 
 _**Server**_: The Peer that responds with content over Shwap.
 
-_**Node**_: The peer that namespacesis both the client and the server.
+_**Node**_: The peer that is both the client and the server.
 
-_**Proof**_: Merkle inclusion proof of the data in the DataSquare.
+_**Proof**_: A Merkle inclusion proof of the data in the DataSquare.
 
-## Rationale
+### Message Framework
 
-### Multihashes and CID
+This sections defines messaging framework of Shwap. Every group of shares that needs to be exchanged over the network 
+MUST define its [share identifier](#share-identifiers) and [share container](#share-containers), as well as, follow 
+their described rules.
 
-Shwap takes inspiration from content addressability, but breaks-free from hash-based only model to optimize message sizes
-and data request patterns. In some way, it hacks into multihash abstraction to make it contain data that isn't in fact a
-hash. Furthermore, the protocol does not include hash digests in the multihashes. The authentication of the messages
-happens using externally provided data commitment.
+#### Share Identifiers
 
-## Protocol Dependencies
+Share identifiers defined by Shwap can be used to uniquely identify any [share container](#share-containers) over a chain
+with arbitrary number of [DataSquares][square], like a range of [shares][shares], a row or a [blob][blob]. Every share 
+identifier relates to a respective share container and vise-versa.
 
-### Bitswap
+Identifiers MUST have a fixed size for their fields. Subsequently, protobuf SHOULD NOT be used for CID serialization due
+to varints and lack of fixed size arrays. Instead, identifiers use simple binary big endian serialization.
 
-Shwap depends on Bitswap for swapping bits in fully distributed p2p-manner.
+Identifiers MAY embed each other to narrow down the scope of needed shares. For example, [SampleID](#sampleid) embeds
+[RowID](#rowid) as every sample lay on a particular row.
 
-## Share Identifiers
+#### Share Containers
 
-This section defines list of supported share identifiers. Share identifiers defined by Shwap can be used to uniquely
-identify any [share container](#share-containers) over a chain with arbitrary number of [DataSquares][square], like a range of 
-[shares][shares], a row or a [blob][blob]. Every share identifier relates to a respective share container and wise-versa.
+Share containers encapsulate a set of data shares with [DAH][dah] inclusion proof. Share containers are identified by 
+[share identifiers](#share-identifiers).
 
-Identifiers are embeddable to narrow down to the needed content. (TODO: Describe better)
+#### Versioning
 
-Identifiers MUST have a fixed size for their fields. Subsequently, protobuf can't be used for CID serialization due to
-varint usage. Instead, identifiers use simple binary big endian serialization.
+In case defined share container or identifier requires an incompatible change the new message type MAY be introduced 
+suffixed with new major version starting from v1. E.g. if Row message needs a revision, RowV1 is created.
 
-Table of supported identifiers with their respective multihash and codec codes. This table is supposed to be extended
-whenever any new identifier is added.
+### Messages
 
-| Name     | Multihash | Codec  |
-|----------|-----------|--------|
-| RowID    | 0x7811    | 0x7810 |
-| SampleID | 0x7801    | 0x7800 |
-| DataID   | 0x7821    | 0x7820 |
+This section defines all the supported Shwap messages which includes share identifiers and share containers. All the new
+future messages should be described in here.
 
-### RowID
+#### RowID
 
 RowID identifies the [Row shares container](#row-container) in a [DataSquare][square].
 
@@ -109,55 +135,7 @@ The fields with validity rules that form RowID are:
 
 Serialized RowID MUST have length of 10 bytes.
 
-### SampleID
-
-SampleID identifies a Sample container of a single share in a [DataSquare][square].
-
-SampleID identifiers are formatted as shown below:
-```
-SampleID {
-    RowID;
-    ShareIndex: u16; 
-}
-```
-
-The fields with validity rules that form SampleID are:
-
-[**RowID**](#rowid): A RowID of the sample. It MUST follow [RowID](#rowid) formatting and field validity rules.
-
-**ShareIndex**: A uint16 representing the index of the sampled share in the row. It MUST not exceed the number of Column
-roots in [DAH][dah].
-
-Serialized SampleID MUST have length of 12 bytes.
-
-### DataID
-
-DataID identifies [namespace][ns] Data container of shares within a _single_ Row. That is, namespace shares spanning 
-over multiple Rows are identified with multiple identifiers.
-
-DataID identifiers are formatted as shown below:
-```
-DataID {
-    RowID;
-    Namespace;
-}
-```
-
-The fields with validity rules that form DataID are:
-
-[**RowID**](#rowid): A RowID of the namespace data. It MUST follow [RowID](#rowid) formatting and field validity rules.
-
-[**Namespace**][ns]: A fixed-size bytes array representing the Namespace of interest. It MUST follow [Namespace][ns] 
-formatting and its validity rules.
-
-Serialized DataID MUST have length of 39 bytes.
-
-## Share Containers
-
-This section defines list of supported share containers. Share containers encapsulate a set of data shares with [DAH][dah]
-inclusion proof. Share containers are identified by [share identifiers](#share-identifiers).
-
-### Row Container
+#### Row Container
 
 Row containers encapsulate Row of the [DataSquare][square].
 
@@ -175,12 +153,33 @@ The fields with validity rules that form Row containers are:
 
 [**RowID**](#rowid): A RowID of the Row Container. It MUST follow [RowID](#rowid) formatting and field validity rules.
 
-**RowHalf**: A two-dimensional variable size byte arrays representing left half of shares in the row. It MUST be equal 
-to the number of Columns roots in [DAH][dah] divided by two. These shares MUST only be from the left half of the row. 
-The right half is computed using Leopard GF16 Reed-Solomon erasure-coding. Afterward, the [NMT][nmt] is built over both 
+**RowHalf**: A two-dimensional variable size byte arrays representing left half of shares in the row. It MUST be equal
+to the number of Columns roots in [DAH][dah] divided by two. These shares MUST only be from the left half of the row.
+The right half is computed using Leopard GF16 Reed-Solomon erasure-coding. Afterward, the [NMT][nmt] is built over both
 halves and the computed NMT root MUST be equal to the respective Row root in [DAH][dah].
 
-### Sample Container
+#### SampleID
+
+SampleID identifies a Sample container of a single share in a [DataSquare][square].
+
+SampleID identifiers are formatted as shown below:
+```
+SampleID {
+    RowID;
+    ColumnIndex: u16; 
+}
+```
+
+The fields with validity rules that form SampleID are:
+
+[**RowID**](#rowid): A RowID of the sample. It MUST follow [RowID](#rowid) formatting and field validity rules.
+
+**ColumnIndex**: A uint16 representing the column index of the sampled share; in other words share index in the row. It 
+MUST not exceed the number of Column roots in [DAH][dah].
+
+Serialized SampleID MUST have length of 12 bytes.
+
+#### Sample Container
 
 Sample containers encapsulate single shares of the [DataSquare][square].
 
@@ -206,15 +205,37 @@ The fields with validity rules that form Sample containers are:
 [**SampleID**](#sampleid): A SampleID of the Sample container. It MUST follow [SampleID](#sampleid) formatting and field
 validity rules.
 
-**SampleShare**: A variable size array representing the share contained in the sample. Each share MUST follow [share 
+**SampleShare**: A variable size array representing the share contained in the sample. Each share MUST follow [share
 formatting and validity][shares-format] rules.
 
-**Proof**: A [protobuf formated][nmt-pb] [NMT][nmt] proof of share inclusion. It MUST follow [NMT proof verification][nmt-verify] 
+**Proof**: A [protobuf formated][nmt-pb] [NMT][nmt] proof of share inclusion. It MUST follow [NMT proof verification][nmt-verify]
 and be verified against the respective root from Row or Column axis in [DAH][dah]. The axis is defined by ProofType field.
 
-**ProofType**: An enum defining which root the Proof is coming from. It MUST be either RowProofType or ColumnProofType. 
+**ProofType**: An enum defining which root the Proof is coming from. It MUST be either RowProofType or ColumnProofType.
+
+#### DataID
+
+DataID identifies [namespace][ns] Data container of shares within a _single_ Row. That is, namespace shares spanning 
+over multiple Rows are identified with multiple identifiers.
+
+DataID identifiers are formatted as shown below:
+```
+DataID {
+    RowID;
+    Namespace;
+}
+```
+
+The fields with validity rules that form DataID are:
+
+[**RowID**](#rowid): A RowID of the namespace data. It MUST follow [RowID](#rowid) formatting and field validity rules.
+
+[**Namespace**][ns]: A fixed-size bytes array representing the Namespace of interest. It MUST follow [Namespace][ns] 
+formatting and its validity rules.
+
+Serialized DataID MUST have length of 39 bytes.
 
-### Data Container
+#### Data Container
 
 Data containers encapsulate user submitted data under [namespaces][ns].
 
@@ -243,19 +264,83 @@ and be verified against the respective root from Row or Column axis in [DAH][dah
 Namespace data may span over multiple rows in which case all the data is encapsulated in multiple containers. This is 
 done
 
-## Protocol Extensions
+## Protocol Compositions
 
-This section is a placeholder for future protocol extensions like new new identifiers and containers.
+This sections specifies compositions of Shwap with other protocols. While Shwap is transport agnostic there are rough 
+edges on the protocol integration which every composition specifications has to describe.
 
-## Considerations
+### Bitswap
+
+[Bitswap][bitswap] is an application-level protocol designed for sharing verifiable data across peer-to-peer networks. 
+Bitswap operates as a dynamic want-list exchange among peers in a network. Peers continuously update and share their 
+want-lists of desired data in real-time. If at least one connected peer has the needed data, it is promptly fetched. 
+This ongoing exchange ensures that as soon as any peer acquires the sought-after data, it can instantly share it with 
+those in need.
+
+Shwap is designed to be synergetic with Bitswap, as that's the primary composition to be deployed in Celestia's DA 
+network. Bitswap provides 1/N peers guarantee and can parallelize fetching across multiple peers. Both of these properties
+greatly contribute to efficient DAS protocol of Celestia.
+
+Bitswap runs over libp2p stack which provides QUIC transport integration. Subsequently, Shwap will benefit from features
+libp2p provides together with transport protocol advancements introduced in QUIC.
 
-### Bitswap CID integration
+#### Multihashes and CID
+
+Bitswap is tightly coupled with Multihash and CID notions establishing the content addressability property. Shwap takes 
+inspiration from content addressability, but breaks-free from hash-based only model to optimize message sizes
+and data request patterns. In some way, it hacks into multihash abstraction to make it contain data that isn't in fact a
+hash. Furthermore, the protocol does not include hash digests in the multihashes. The authentication of the messages
+happens using externally provided data commitment.
+
+However, Bitswap still requires multihashes and CID codecs to be registered. Therefore, we provide a table for the 
+supported [share identifiers](#share-identifiers) with their respective multihash and CID codec codes. This table 
+is supposed to be extended whenever any new share identifier is added.
+
+| Name     | Multihash | Codec  |
+|----------|-----------|--------|
+| RowID    | 0x7811    | 0x7810 |
+| SampleID | 0x7801    | 0x7800 |
+| DataID   | 0x7821    | 0x7820 |
 
 The naive question would be: "Why not to make content verification after Bitswap provided it back over its API?"
-Intuitively, this would simplify a lot and wouldn't require "hacking" CID. However, this has an important downside - 
-the Bitswap in such case would consider the content as fetched and valid, sending DONT_WANT message to its peers, while
-the message might be invalid according to the verification rules.
+Intuitively, this would simplify a lot and wouldn't require "hacking" CID. However, this has an important downside -
+the Bitswap in such case would consider the request finalized and the content as fetched and valid, sending DONT_WANT 
+message to its peers, while the message might stillbe invalid according to the verification rules. 
+
+## Backwards Compatibility
+
+Swap is incompatible with the old sampling protocol.
+
+After rigorous investigation, celestia-node team decided against _implementing_ backward compatibility with
+the old protocol into the node client due to the immense complications it brings. Instead, the simple and time-efficient
+strategy is transiently deploying infrastructure for old and new versions, allowing network participants to migrate
+gradually to the latest version. We will first deprecate the old version, and once the majority has migrated, we will
+terminate the old infrastructure.
+
+## Considerations
+
+### Security
+
+Shwap does not change the security model of Celestia's Data Availability network and changes the underlying
+protocol for data retrieval.
+
+Essentially, the network and its codebase get simplified and require less code and infrastructure to operate. This in turn
+decreases the amount of implementation vulnerabilities, DOS vectors, message amplification, and resource exhaustion attacks.
+Although, new bug may be introduced as with any new protocol.
+
+### Protobuf Serialization
+
+Protobuf is widely adopted serialization format and is used within Celestia's protocols. This was quite an obvious choice
+for consistency reason, even though we could choose other more efficient and advanced formats like Cap'n Proto.
+
+## Reference Implementation
+
+- [Go reference implementation with Bitswap composition][gimpl]
+- [Rust implementation with Bitswap composition][rimpl]
 
+[shrex]: https://github.com/celestiaorg/celestia-node/blob/0abd16bbb05bf3016595498844a588ef55c63d2d/docs/adr/adr-013-blocksync-overhaul-part-2.md
+[storage]: https://github.com/celestiaorg/celestia-node/blob/a33c80e20da684d656c7213580be7878bcd27cf4/docs/adr/adr-011-blocksync-overhaul-part-1.md
+[bitswap]: https://docs.ipfs.tech/concepts/bitswap/
 [square]: https://celestiaorg.github.io/celestia-app/specs/data_structures.html#2d-reed-solomon-encoding-scheme
 [shares]: https://celestiaorg.github.io/celestia-app/specs/shares.html#abstract
 [shares-format]: https://celestiaorg.github.io/celestia-app/specs/shares.html#share-format