Refactor `extract_key` logic in Taproot compiler #732

apoelstra · 2024-08-30T14:55:51Z

Right now our Taproot compiler does some complicated analysis to identify a key within a policy that may be extracted to use as an internal key. Specifically, if you look at the "disjunction tree" at the root of a policy, every leaf of this tree can be a tapbranch and any leaf which is just a pubkey-check and nothing else is a candidate to be the internal key.

However, our logic to do this is convoluted, has many unnecessary error paths, and lots of gratuitous allocations (which ironically make the code harder, not easier, to follow).

This PR smooths all that out by introducing a new iterator over the potential tapbranches of a policy (along with their probabilities). It also cleans up much of the error handling in the policy module, eliminating 3 calls to the stringly-typed errstr function.

This test takes ~28 seconds on my system, but it is doing two independent compilations in serial. If I split it into two, then they can run in parallel and the total time is about 16 seconds.

The Taproot compiler has a couple algorithms used to find an optimal internal key to extract from a policy. One is `to_tapleaf_prob_vec` which is a fundamental part of the compiler, which iterates over the disjunction-only root of the tree and returns every branch along with its probabilities. This is currently implemented using both allocations and recursion. By pulling the logic out into an iterator we can get a clearer, faster algorithm that cannot stack-overflow. This algorithm is then fed into Concrete::extract_key. The goal of this algorithm is to find the highest-probability tapbranch consisting of just a single key. This algorithm inexplicably works by: * Lifting the policy to a semantic policy, maybe returning an error * Iterating over the entire policy, extracting a list of every key. * Calling to_tapleaf_prob_vec to get a vector of tapleaves, which it then iterates over to find the key-only branches, which it collects into a BTreeMap mapping the key to probabilities. * Iterating over the extracted lists of keys, and for each key, reducing the semantic policy by that key and comparing to Trivial (logically, this is asking "is this key in a tree of disjunctions from the root"). * For each such key that it finds, looking it up in the map, potentially returning an error (actually this error path is impossible to hit). * and manually minimizing the looked up probability. With to_tapleaf_prob_vec replaced by an iterator there is a simpler and more direct algorithm: * Iterate through all the tapbranches/probability pairs, filtering for key-only branches, and maximizing by probability. This can only fail if there are no key-only branches, and this is reflected by only having one error branch.

We have a few error returns that are impossible to hit: * A sanity check on a tapleaf that just came out of the compiler (if this is hit it is a compiler bug and we want to know about it). * An error return from with_huffman_tree which can only happen if it's given an empty input (impossible) * An error if the final compilation (all tapleaves assembled into a tree) can't fit into a Descriptor::tr; but again, this is a compiler bug if we hit it. (Actually, I think that by manually constructing a policy that exceeds the maximum recursion depth you can trigger this error path, but the compiler output is not the place to flag this manual violation of invariants). The next commit will clean up the error types. These changes are in their own commit because they are potentially controversial.

This adds a couple variants to policy::compiler::Error and eliminates a couple calls to errstr. It also changes a few internal functions to return compiler errors instead of the top-level error.

sanket1729

ACK a195c81

Awesome. Thanks for the code cleanup.

apoelstra

Successfully ran local tests on a195c81.

shesek · 2024-09-17T19:17:32Z

We have a few error returns that are impossible to hit:
...
* An error return from with_huffman_tree which can only happen if it's given an empty input (impossible)

This one was not actually impossible. 😅 See #677 for a test (that now panics) and a fix.

apoelstra added 7 commits August 30, 2024 14:43

policy: split up error enum for concrete and semantic

7de96d3

compiler: split segwit_limits test into 2

9f53c01

This test takes ~28 seconds on my system, but it is doing two independent compilations in serial. If I split it into two, then they can run in parallel and the total time is about 16 seconds.

policy: use new tapleaf_probability_iter for num_tap_leaves

a3338fb

policy: clean up error types in compile_tr

a6ca4ce

This adds a couple variants to policy::compiler::Error and eliminates a couple calls to errstr. It also changes a few internal functions to return compiler errors instead of the top-level error.

fuzz: add compile_taproot fuzztest

a195c81

sanket1729 approved these changes Aug 30, 2024

View reviewed changes

apoelstra commented Aug 30, 2024

View reviewed changes

apoelstra merged commit 4490289 into rust-bitcoin:master Aug 30, 2024
30 checks passed

apoelstra deleted the 2024-08--taproot-compiler branch August 30, 2024 16:38

shesek mentioned this pull request Sep 17, 2024

Fix compilation of pk()-only policies into tr() descriptors #677

Merged

apoelstra mentioned this pull request Oct 23, 2024

Segwitv0 compilation of x-only keys panics #761

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor `extract_key` logic in Taproot compiler #732

Refactor `extract_key` logic in Taproot compiler #732

Uh oh!

apoelstra commented Aug 30, 2024

Uh oh!

sanket1729 left a comment

Uh oh!

apoelstra left a comment

Uh oh!

Uh oh!

shesek commented Sep 17, 2024

Uh oh!

Uh oh!

Refactor extract_key logic in Taproot compiler #732

Refactor extract_key logic in Taproot compiler #732

Uh oh!

Conversation

apoelstra commented Aug 30, 2024

Uh oh!

sanket1729 left a comment

Choose a reason for hiding this comment

Uh oh!

apoelstra left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

shesek commented Sep 17, 2024

Uh oh!

Uh oh!

Refactor `extract_key` logic in Taproot compiler #732

Refactor `extract_key` logic in Taproot compiler #732