feat: parallel EVM claim
rakita committed Jul 9, 2023
1 parent 7d113ba commit 36ecc4c
Showing 67 changed files with 154 additions and 3,340 deletions.
Binary file modified .DS_Store
Binary file added content/.DS_Store
2 changes: 1 addition & 1 deletion content/_index.md
@@ -5,7 +5,7 @@ title = "Welcome to my blog!"
# The homepage contents
[extra]
lead = 'I write about core blockchain ideas (mostly Ethereum) and rust.'
url = "/docs/getting-started/introduction/"
url = "/blog"
url_button = "Get started"
repo_version = "GitHub v0.1.0"
repo_license = "Open-source MIT License."
2 changes: 1 addition & 1 deletion content/authors/draganrakita.md
@@ -1,6 +1,6 @@
+++
title = "draganrakita"
description = "Author of revm, core dev at reth, worked on openethereum."
description = "Author of revm, reth core dev, worked on openethereum."
date = 2021-04-01T08:50:45+00:00
updated = 2021-04-01T08:50:45+00:00
draft = false
Binary file added content/blog/.DS_Store
16 changes: 8 additions & 8 deletions content/blog/2d_transformations.md
@@ -7,7 +7,7 @@ draft = false
template = "blog/page.html"

[taxonomies]
authors = ["Public"]
authors = ["draganrakita"]

[extra]
lead = "Post about 2d transformations that I did in my previous work life."
@@ -44,13 +44,13 @@ Rotation is little bit more complex (it has little bit more to do) but in same r

And we come to scaling, the best part of this post (it has pictures) :D. We will gradually introduce the few things that need to be done in scaling, and we will see how we handle rotation and shift controls (shift is usually used for aspect ratio lock).

![Naive Scale](./naive_scale.png)
![Naive Scale](/2d_transformations/naive_scale.png)

Nice, let's start with a basic example where our element is not rotated or translated and we just want to scale it. We will use a `ref_point` (usually a corner or side) and its `anchor_point`, and of course we will need a `current_point` to tell us where we want to scale to. We calculate `diff = current_point - anchor_point`, get the scale as `s = scale(diff/element_size)` and we are done, we have a scale matrix that we can add to our transformation.

Okay, let's now look at an example where we want to take the top left corner as `ref_point` (you can follow the picture below). In that case our `anchor_point` is positioned at the bottom and, if we want to scale properly, to the top and left. The first difference from the previous example is that we need to move our object so that `anchor_point` is at the `(0,0)` coordinate! We still need `diff` and we calculate it the same as before, but because our axes are now flipped (this is the second difference) we need to reverse the sign: `diff_new = Vector(-diff.x, -diff.y)`. Note that reversing `y` is needed for a top side `ref_point` and reversing `x` for a left side `ref_point`. We get the scale as `s = scale(diff_new/element_size)`. And the third and final difference from the previous example is that after all this we need to take the translation of the anchor `T = translate(anchor_point)`, calculate its inverse `Tinv = inverse(T)` and bind it all together (from left to right): `S = T*s*Tinv`.

![Scale](./scale.png)
![Scale](/2d_transformations/scale.png)

As you can see, the diff vector is oriented negatively with respect to our axes. This is the reason why we need to flip it; if we didn't do the flipping, you would get a smaller scale when moving away from the top left corner.

@@ -62,7 +62,7 @@ That's great, but how to append scale in current matrix, when scale is something

Shift scale is scaling where the aspect ratio is not changed. This means that the scale on both axes is equal and we need to choose which axis orientation we take as primary. We could make it simple: depending on which `corner_id` is selected, take it modulo two and choose the x or y scale. This will work but will be unintuitive. A better solution, where we depend on the position of the mouse relative to the diagonal of the element, gives us a smoother transition between the x and y orientations. See the picture below:

![Naive Scale](./shyft_scale.png)
![Naive Scale](/2d_transformations/shyft_scale.png)

With transparent colors we can see the zones where we want to take only `x` (blue color) or only `y` (marked with red). As is noticeable, our object is in its original position, which means our `original_points` are calculated the same as in the example with the rotated object. The slopes of the diagonals that make these zones are calculated from `original_size` with the equation `line_slope = original_size.y/original_size.x`. For the second diagonal it is enough to just flip the sign and we get the second slope. What we want to check is whether the point is in the blue or red space, and we can do that with the following if statement (abbreviating `op` for `original_point` and `ls` for `line_slope`): `(op.y < ls*op.x && op.y > -ls*op.x) || (op.y > ls*op.x && op.y < -ls*op.x)`. If this statement is true do `scale.y=scale.x`, and if it is false do the opposite. And lastly, when you are overriding one scale, don't forget to preserve its sign: in the example from the picture we are taking the `y` scale and overriding the `x` scale, but we need to preserve the `x` sign to properly scale our element, `x=sign(x)*abs(y)`.

@@ -71,12 +71,12 @@ With transparent colors we can see zones where we want to take only `x` ( blue c
Summary of functions that were called throughout the text:

Translation:
```text
```rust
M = M*translation(current_point-ref_point)
```

Rotation:
```text
```rust
crp = ref_point - center_point
ccp = current_point - center_point
angle = atan2(norm(cross(crp,ccp)), dot(crp,ccp))
@@ -88,7 +88,7 @@ M = M*Ra
```

Scale:
```dwda
```rust
Minv = inverse(M)
relative_position = Minv * current_position
original_anchor_point = original_corners[handler_id]
@@ -102,7 +102,7 @@ M=M*Sa
```

Shift scale:
```dwda
```rust
scale = (x,y)
line_slope = original_size.y/original_size.x
if (op.y < ls*op.x && op.y > -ls*op.x) || (op.y > ls*op.x && op.y < -ls*op.x) {
135 changes: 135 additions & 0 deletions content/blog/parallel_evm_claim.md
@@ -0,0 +1,135 @@
+++
title = "Parallel EVM claim"
description = "How to verify claim of parallel execution"
date = 2023-07-09T22:00:00+00:00
updated = 2023-07-09T22:00:00+00:00
draft = false
template = "blog/page.html"

[taxonomies]
authors = ["draganrakita"]
+++


This post is not what you would expect: it is not about how to find the order and dependencies of transaction execution, as there are already a few approaches to that. The first can be done with access lists (UTXO, Solana), and the main paper for the second approach is to brute force it with probabilistic execution, aka [Block-STM](https://arxiv.org/abs/2203.06871), pioneered by Nova/Aptos; some EVM-type blockchains emulated this and gained a good performance boost ([Polygon PoS](https://polygon.technology/blog/innovating-the-main-chain-a-polygon-pos-study-in-parallelization), [Binance Chain](https://www.bnbchain.org/tr/blog/new-milestone-the-implementation-of-parallel-evm-2-0/) both got similar performance).

The idea is for the builder to (somehow) find the transactions that can be done in parallel (the great thing is that this part can be treated as a black box and can evolve on its own) and share that claim in the form of a transaction [DAG](https://en.wikipedia.org/wiki/Directed_acyclic_graph) with other peers/validators; the builder would be rewarded for doing this correctly. The verifier then needs to execute those transactions in parallel following that DAG and **verify** the integrity of the claim. We will talk about how to verify this claim. (The split of builder and verifier is imho a very powerful idea that is a little bit undervalued; it allows us cleaner system modelling.)

So far I haven't found anything related to this, and the topic seems a lot more interesting to explore. You don't need to increase your transaction size with an access list and you don't need to do expensive probabilistic execution (at least not on a lot of nodes), so verifiers have less work to do but still fully and consistently verify execution. And there is an additional benefit for archive sync that I will talk about later.

Parallel claim verification creates a potential path to introduce parallel execution inside Ethereum, as the focus would not be on finding parallel txs but just on making sure that there are no inconsistencies when the given txs are run in parallel. This path is long and requires more research to fully comprehend the change. As this topic is complex, I will introduce a few simple examples and slowly build them up to encompass a working solution. But even with that, there are still a lot of pending topics that need to be addressed for this to become integrated into the protocol (parallel gas, aka multidimensional gas accounting, for example).

# Algorithm explained

All examples start from the point that we received a DAG of transactions and the builder claims those transactions can be done in parallel. We want to execute them in parallel and be sure that the claim is correct and that no inconsistencies (data races) can happen.


### Example 1: two simple parallel transactions

We have two transactions that read/write to the **same** state (there is only one state that all of them share) and the update to that state is atomic. The example here is very simple but it allows us to set up some groundwork and initial ideas of what is checked.

[Mermaid graph](https://mermaid.live/edit#pako:eNpdTrsOwjAQ-5XII2oGOmZgYmViJAyn5gqRmgSlFwSq-u8cMCDhyfJD9oKhBIbDLCS8j3SplOy999koTpuzsXZn5LH9F3p0SFwTxaDt5W17yJUTezilgUdqk3j4vGqUmpTjMw9wUht3aLfw24MbaZpV5RCl1MP30efY-gKkKDNp)

![](/parallel_evm_claim/example_2tx.png)


For the sake of explanation we simplify state and see it as a list of accounts. These "accounts" can be a balance/nonce/code hash(code)/storage slot; it is just easier to reason and think about in this simpler form.

Additionally, we consider both reads and writes of accounts as the same thing. The distinction can be explored as a follow-up, but for the first iteration it is easier to omit it. This means that a transaction's touched state consists of both the reads and the writes that this transaction did, and with this, having an account read by two different parallel transactions is considered invalid.

Now, the idea here is to mark, on every access of an account (read or write), that account in the state as accessed by that transaction. This means that if account `0x01` is accessed by `tx1` it will be marked as such, and if `tx2` tries to access account `0x01` we will notice that the account is already marked and see that there is an inconsistency and a data race in place.

So every "account" has additional information that represents the transaction that last touched it.

Running transactions in parallel is more of an implementation detail and will depend on the programming language.
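As a rough illustration of this marking scheme, here is a minimal sketch under simplifying assumptions: state is a flat map from address strings to accounts, and `State`, `Account`, and `access` are hypothetical names made up for this example, not revm APIs.

```rust
use std::collections::HashMap;

type TxId = usize;

/// Simplified "account": just a value plus the id of the
/// transaction that last touched it (`None` = original state).
struct Account {
    balance: u128,
    touched_by: Option<TxId>,
}

#[derive(Default)]
struct State {
    accounts: HashMap<&'static str, Account>,
}

impl State {
    /// Any access (read or write) marks the account. If it is already
    /// marked by a *different* transaction, the parallelism claim is invalid.
    fn access(&mut self, addr: &'static str, tx: TxId) -> Result<&mut Account, String> {
        let acc = self
            .accounts
            .entry(addr)
            .or_insert(Account { balance: 0, touched_by: None });
        match acc.touched_by {
            Some(other) if other != tx => {
                Err(format!("data race: {addr} touched by tx{other} and tx{tx}"))
            }
            _ => {
                acc.touched_by = Some(tx);
                Ok(acc)
            }
        }
    }
}

fn main() {
    let mut state = State::default();
    // tx1 and tx2 are claimed to be parallel and touch disjoint accounts.
    state.access("0x01", 1).unwrap().balance += 10;
    state.access("0x02", 2).unwrap().balance += 20;
    // tx2 now touches an account already marked by tx1 -> claim invalid.
    assert!(state.access("0x01", 2).is_err());
}
```

A real implementation would mark per-slot and per-field, but the check itself stays this simple: one compare per access.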


### Example 2: Chains, transaction dependencies

The second example adds a third transaction that depends on the first one.

![](/parallel_evm_claim/example_chain.png)


[Graph](https://mermaid.live/edit#pako:eNpdjjEOwjAMRa8SeUTNQMuUgYmViZEwWI0LkZoEpU4Fqnp3DC1CwtPX-7b1JmiTIzAwMDIdPF4zBj3WNiqZ8-aitN4rfmwXIGEFzRc0K9j9n9RQQaAc0Dv5P71rC3yjQBaMREcdlp4t2DjLKhZOp2dswXAuVEG5u58RmA77QSg5zykfF-eP-vwC_v88KA)

This is the first example of dependent transactions: `tx3` can access only accounts that are in the original state or touched by `tx1`. If both `tx3` and `tx2` access the same account, this makes the parallelism claim invalid.

This example shows us that the marking of state can be done with the id of the chain that a tx belongs to, and we would get the same outcome. Without this, `tx4` would need to check whether the account state is original, marked by `tx1`, or marked by `tx3`, and that wouldn't be efficient. I will use the terms chain and transaction interchangeably.

### Example 3: Chain forks and joins

Modelling dependencies can be tricky, but in parallel execution there are only two synchronizations that can happen: forks and joins. Both of them can be seen in the picture.

![](/parallel_evm_claim/example_fork_join.png)


[Graph](https://mermaid.live/edit#pako:eNpdj7EOwjAMRH-l8oiagRSWDEysTIwNg9W4EKlJUOogUNV_J9BWFXg6vTtZdwM0wRAo6BmZjhavEZ14SO2LfPXmUghxKPi5nUAWM6gWUM1g95_YL0D-gvWphBIcRYfW5AbDx9bAN3KkQWVpqMXUsQbtxxzFxOH88g0ojolKSHezdgbVYtdnSsZyiKdp1Xfc-AaEXkTp)

There is one fork here: `tx1` forks its state to the chains of `tx5` and `tx3`. This means that there is a dependency between `tx5` and `tx1` and between `tx3` and `tx1`, but there is no dependency between `tx3` and `tx5`, and they can be run in parallel.

The mechanism of marking the state works the same as in the first example. `tx5` can now access original accounts or accounts of `tx1` or `tx2`; if it accessed the state of `tx3`, this would make the parallel claim invalid.


### Example 4: Diamond pattern

This is a good example that tests our initial mechanism of marking accessed state.

[Graph](https://mermaid.live/edit#pako:eNpd0D0PgjAQBuC_Qm40MMiHJB2cXJ0crcOFHkpCKSlXoyH8d6uUmPSmy3PvcHczNEYRCJgYmU4d3i3q7JnLIfF13d2SLDsm_Nqv4JsAxQZFgDJOVBvkMZQBDhtUAeo4Ucd75JCCJquxU37p-TuWwA_SJEH4VlGLrmcJclh8FB2by3toQLB1lIIb1f9MEC32k1dSHRt7Xh_x-8fyAQIhUhg)

![](/parallel_evm_claim/example_diamont.png)


All previous statements should be valid here.

For example, `tx7` can only touch the original state or `tx1`, `tx2`, `tx3`, `tx4`, `tx5` but not `tx6`; and the same goes for `tx6`: it can't touch the state of `tx7`.

## How to check marks

Every transaction could have a list of its previous dependent transactions, and when checking the mark inside the database we check whether that mark is found inside the list.

This list can be sorted so that finding a particular value can be done by binary search. The list size depends on the number of dependent transactions.
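A minimal sketch of that check, assuming each account's mark is an `Option<usize>` transaction id; `mark_allowed` is a hypothetical helper name used only for this illustration:

```rust
/// `sorted_deps` is the ascending list of this transaction's predecessors.
/// A mark of `None` means the account is still in its original state.
fn mark_allowed(mark: Option<usize>, sorted_deps: &[usize], this_tx: usize) -> bool {
    match mark {
        None => true,                       // original state is always fine
        Some(tx) if tx == this_tx => true,  // we touched it ourselves
        // Otherwise the marker must be a known predecessor: binary search.
        Some(tx) => sorted_deps.binary_search(&tx).is_ok(),
    }
}

fn main() {
    // Diamond example: tx7 depends on tx1..tx5; tx6 is a parallel chain.
    let deps: [usize; 5] = [1, 2, 3, 4, 5];
    assert!(mark_allowed(None, &deps, 7));
    assert!(mark_allowed(Some(3), &deps, 7));
    // Account marked by the parallel chain tx6 -> claim invalid.
    assert!(!mark_allowed(Some(6), &deps, 7));
}
```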

# Miner fee

The problem with the current setup is that a transaction pays its execution fee to the miner when it finishes. This means that every transaction depends on its predecessor's update of the miner balance with that transaction's fee. A simple solution is to just move the miner's fee balance increments to the end of the block, after all transactions are executed. This is a small consensus change without a lot of general impact, but as said, it is a "consensus change"; to parallelize transactions with only DAG hints we need a different solution.

This solution requires a lot of small things to make it consistent:
* We should mark those transactions in the DAG that need miner information and make them dependent on all previous transactions.
* We need an additional atomic vector, sized to the number of transactions, that contains the increments of the miner balance; it is updated asynchronously when a transaction's execution finishes.
* We need a hint for when the miner balance information is needed; this is only possible in a few situations. This hint or flag should be checked against the flag set inside the DAG to see that this information is used correctly:
  * The BALANCE opcode is called on the miner.
  * The miner account is a contract and it transfers funds or calls the SELFBALANCE opcode. We can be a little loose and say: whenever the miner as a contract is called.
  * The miner account is an ordinary account and there is a transaction with the miner as sender.
* After all transactions are executed, apply the rest of the transaction fees to the miner.

The solution requires a little bit of hoop jumping, but it is possible to build it and have verifiable parallel execution.
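The per-transaction atomic fee vector described above could be sketched like this (a minimal illustration; `MinerFees` and its methods are hypothetical names, not revm or reth APIs):

```rust
use std::sync::atomic::{AtomicU64, Ordering};

/// One slot per transaction. Each worker thread records its transaction's
/// miner fee here when execution finishes, without touching the miner
/// account in state, so no transaction depends on the miner balance.
struct MinerFees {
    increments: Vec<AtomicU64>,
}

impl MinerFees {
    fn new(tx_count: usize) -> Self {
        Self {
            increments: (0..tx_count).map(|_| AtomicU64::new(0)).collect(),
        }
    }

    /// Called asynchronously by whichever thread finishes transaction `tx`.
    fn record(&self, tx: usize, fee: u64) {
        self.increments[tx].store(fee, Ordering::Relaxed);
    }

    /// Applied to the miner balance once, after all transactions are done.
    fn total(&self) -> u64 {
        self.increments.iter().map(|a| a.load(Ordering::Relaxed)).sum()
    }
}

fn main() {
    let fees = MinerFees::new(3);
    // Transactions can finish in any order across threads.
    fees.record(0, 100);
    fees.record(2, 50);
    fees.record(1, 25);
    assert_eq!(fees.total(), 175);
}
```

Because every transaction writes only its own slot, the vector needs no locking; the single summation at the end of the block is the only point that reads all slots.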


# Usage

When live syncing, builders could potentially get more rewards if they find transactions that can be done in parallel. This would mean more throughput without state increase.

On history sync, we can obtain DAGs from centralized sources. If we have verifiable parallel execution we don't need to trust those DAGs, as we can verify the parallel execution on our own; if a received DAG is not correct we can just fall back to serial execution. This can potentially speed up the initial archive sync by a significant factor.


# Further works

### Split of reads and writes

This could make transactions even more parallel.

The main idea behind this is that the chain that writes an account should be the only one that can read that account. A few more checks need to be done:
* If the account is read by two chains but written by one, it is considered a potential data race.
* On every account read we should append the transaction number, and on every write we should check whether those reads all connect to the same chain, then mark the account as written and clear the read list. If there is a transaction from a different chain/predecessor that has written the account before us, this makes the claim invalid. This means that every account now carries the last transaction that wrote it and the list of transactions that have read it.
* If we want to read an account that was written, we should first check that it was written by a predecessor; if it was not, this makes the claim invalid.


This is an optimization of this architecture.
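A sketch of how such read/write tracking could look. Everything here is hypothetical: `AccessInfo`, `read`, and `write` are made-up names, and `is_predecessor` stands in for a lookup in the transaction DAG.

```rust
/// Per-account metadata when reads and writes are tracked separately:
/// the last transaction that wrote it, plus all transactions that read it.
#[derive(Default)]
struct AccessInfo {
    last_writer: Option<usize>,
    readers: Vec<usize>,
}

/// Reading is allowed only if any previous writer is a predecessor of us.
fn read(info: &mut AccessInfo, tx: usize, is_predecessor: impl Fn(usize, usize) -> bool) -> bool {
    match info.last_writer {
        Some(w) if w != tx && !is_predecessor(w, tx) => false,
        _ => {
            info.readers.push(tx);
            true
        }
    }
}

/// Writing requires all earlier readers (and any earlier writer) to belong
/// to our own chain of predecessors, otherwise two chains raced.
fn write(info: &mut AccessInfo, tx: usize, is_predecessor: impl Fn(usize, usize) -> bool) -> bool {
    if info.readers.iter().any(|&r| r != tx && !is_predecessor(r, tx)) {
        return false;
    }
    if let Some(w) = info.last_writer {
        if w != tx && !is_predecessor(w, tx) {
            return false;
        }
    }
    info.readers.clear();
    info.last_writer = Some(tx);
    true
}

fn main() {
    // Toy DAG: tx1 precedes tx2; tx3 is a parallel chain.
    let pred = |a: usize, b: usize| (a, b) == (1, 2);
    let mut acc = AccessInfo::default();
    assert!(write(&mut acc, 1, pred)); // tx1 writes the account
    assert!(read(&mut acc, 2, pred)); // tx2 may read: tx1 is its predecessor
    assert!(!read(&mut acc, 3, pred)); // tx3 reading tx1's write -> data race
}
```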

### Gas calculation
One of the pending things that needs to be defined is how parallel transactions are considered for inclusion. This can probably be done by some calculation on the DAG and its weights (gas).

Separation of the current gas accounting into CPU gas and disk IO gas, first specified in [multidimensional EIP-1559](https://ethresear.ch/t/multidimensional-eip-1559/11651), is probably desirable, but not required. Gas calculation is always a sensitive topic as it can be abused if not done correctly.

And as CPU cores are limited, we can impose limitations on the transaction DAG format.
