BIP: 68 Title: Consensus-enforced transaction replacement signaled via sequence numbers (relative lock-time) Author: Mark Friedenbach <[email protected]> Status: Draft Type: Standards Track Created: 2015-05-28
This BIP describes a modification to the consensus-enforced semantics of the sequence number field to enable a signed transaction input to remain invalid for a defined period of time after confirmation of its corresponding outpoint, for the purpose of supporting consensus-enforced transaction replacement features.
Bitcoin has sequence number fields for each input of a transaction. The original idea appears to have been that the highest sequence number should dominate and miners should prefer it over lower sequence numbers. This was never really implemented, and the half-implemented code seemed to be making an assumption that miners would honestly prefer the higher sequence numbers, even if the lower ones were much much more profitable. That turns out to be a dangerous assumption, and so most technical people have assumed that kind of sequence number mediated replacement was useless because there was no way to enforce "honest" behavior, as even a few rational (profit maximizing) miners would break that completely. The change described by this BIP provides the missing piece that makes sequence numbers do something significant with respect to enforcing transaction replacement without assuming anything other than profit-maximizing behavior on the part of miners.
For transactions with an nVersion of 2 or greater, if the most significant bit (1 << 31) of a sequence number is clear, the remaining 31 bits are interpreted as an encoded relative lock-time. A sequence number with the most significant bit set is given no consensus meaning and can be included in any block, like normal, under all circumstances.
If the second most significant bit (1 << 30) is clear, the next 16 bits are interpreted as a minimum block-height constraint over the input's age. The remaining 14 bits have no consensus-enforced meaning. A sequence number of zero indicates a relative lock-time of zero blocks (bits 31 and 30 clear) and can be included in any block. A sequence number of 1 << 14 can be included in the next block after the input it is spending, or any block thereafter, but cannot be included in the same block as its parent. A sequence number of 2 << 14 can't be included until at least two blocks later, and so on.
Alternatively, if the second most significant bit (1 << 30) is set, the next 25 bits are interpreted as a minimum block-time constraint over the input's age. The remaining 5 bits have no consensus-enforced meaning. A sequence number with just that second most significant bit set (0x40000000) is interpreted as a relative lock-time of 0, measured in seconds, and can be included in the same block as the output being spent. Advancing that sequence number by 2^5 (0x40000020) constrains the transaction to be included in blocks with an nTime timestamp at least one second greater than the median time stamp of the 11 blocks prior to the block containing the coin being spent. Advancing the sequence number by an additional 2^5 (0x40000040) constrains the spend to be two seconds later, and so on.
This is proposed to be accomplished by replacing IsFinalTx() and CheckFinalTx(), existing consensus and non-consensus code functions that return true if a transaction's lock-time constraints are satisfied and false otherwise, with LockTime() and CheckLockTime(), new functions that return a non-zero value if a transaction's lock-time or sequence number constraints are not satisfied and zero otherwise:
enum { /* Interpret sequence numbers as relative lock-time constraints. */ LOCKTIME_VERIFY_SEQUENCE = (1 << 0), }; /* Setting nSequence to this value for every input in a transaction * disables nLockTime. */ const uint32_t CTxIn::SEQUENCE_FINAL = 0xffffffff; /* Threshold for CTxIn::nSequence: below this value it is interpreted * as arelative lock-time, otherwise ignored. */ const uint32_t CTxIn::SEQUENCE_LOCKTIME_THRESHOLD = (1 << 31); /* Threshold for CTxIn::nSequence when interpreted as a relative * lock-time: below this value it has units of blocks, otherwise * seconds. */ const uint32_t CTxIn::SEQUENCE_UNITS_THRESHOLD = (1 << 30); /* Number of reserved low-order bits (depending on units used) which * are shifted out prior to relative lock-time constraint checking. */ const int CTxIn::SEQUENCE_BLOCKS_OFFSET = 14; const int CTxIn::SEQUENCE_SECONDS_OFFSET = 5; int64_t LockTime(const CTransaction &tx, int flags, const CCoinsView* pCoinsView, int nBlockHeight, int64_t nBlockTime) { CCoins coins; bool fEnforceBIP68 = static_cast<uint32_t>(tx.nVersion) >= 2 && flags & LOCKTIME_VERIFY_SEQUENCE; // Will be set to the equivalent height- and time-based nLockTime // values that would be necessary to satisfy all relative lock- // time constraints given our view of block chain history. int nMinHeight = 0; int64_t nMinTime = 0; // Will remain equal to true if all inputs are finalized // (CTxIn::SEQUENCE_FINAL). bool fFinalized = true; BOOST_FOREACH(const CTxIn& txin, tx.vin) { // Set a flag if we witness an input that isn't finalized. if (CTxIn::SEQUENCE_FINAL == txin.nSequence) continue; else fFinalized = false; // Do not enforce sequence numbers as a relative lock time // unless we have been instructed to, and a view has been // provided. if (!(fEnforceBIP68 && pCoinsView)) continue; // Sequence numbers equal to or above the locktime threshold // are not treated as relative lock-times, nor are they given // any consensus-enforced meaning at this point. if (txin.nSequence >= CTxIn::SEQUENCE_LOCKTIME_THRESHOLD) continue; // Fetch the UTXO corresponding to this input. if (!pCoinsView->GetCoins(txin.prevout.hash, coins)) { // It is fully expected that coinbases inputs are not // found in the UTXO set. Proceed to the next intput... if (txin.prevout.IsNull()) continue; // If a non-coinbase input cannot be found, we cannot // be certain about whether lock-time constraints have // been satisfied. Note that it should only ever be // possible for this to happen with wallet transactions // that have unknown inputs. else return std::numeric_limits<int64_t>::max(); } // coins.nHeight is MEMPOOL_HEIGHT (an absurdly high value) // if the parent transaction was from the mempool. We can't // know what height it will have once confirmed, but we // assume it makes it in the same block. int nCoinHeight = std::min(coins.nHeight, nBlockHeight); if (txin.nSequence < CTxIn::SEQUENCE_UNITS_THRESHOLD) { // We subtract 1 from relative lock-times because a lock- // time of 0 has the semantics of "same block," so a lock- // time of 1 should mean "next block," but nLockTime has // the semantics of "last invalid block height." nMinHeight = std::max(nMinHeight, nCoinHeight + (int)( txin.nSequence >> CTxIn::SEQUENCE_BLOCKS_OFFSET) - 1); } else { // In two locations that follow we make reference to // chainActive.Tip(). To prevent a race condition, we // store a reference to the current tip. // // Note that it is not guaranteed that indexBestBlock will // be consistent with the passed in view. The proper thing // to do is to have the view return time information about // UTXOs. const CBlockIndex& indexBestBlock = *chainActive.Tip(); // The only time the negative branch of this conditional // is executed is when the prior output was taken from the // mempool, in which case we assume it makes it into the // same block (see above). int64_t nCoinTime = (nCoinHeight <= (indexBestBlock.nHeight+1)) ? indexBestBlock.GetAncestor(nCoinHeight-1)->GetMedianTimePast() : nBlockTime; // Time-based relative lock-times are measured from the // smallest allowed timestamp of the block containing the // txout being spent, which is the median time past of the // block prior. We subtract one for the same reason as // above. nMinTime = std::max(nMinTime, nCoinTime + (int64_t)(( txin.nSequence - CTxIn::SEQUENCE_UNITS_THRESHOLD) >> CTxIn::SEQUENCE_SECONDS_OFFSET) - 1); } } // If all sequence numbers are CTxIn::SEQUENCE_FINAL, the // transaction is considered final and nLockTime constraints // are not enforced. if (fFinalized) return 0; if ((int64_t)tx.nLockTime < LOCKTIME_THRESHOLD) nMinHeight = std::max(nMinHeight, (int)tx.nLockTime); else nMinTime = std::max(nMinTime, (int64_t)tx.nLockTime); if (nMinHeight >= nBlockHeight) return nMinHeight; if (nMinTime >= nBlockTime) return nMinTime; return 0; } int64_t CheckLockTime(const CTransaction &tx, int flags) { AssertLockHeld(cs_main); // By convention a negative value for flags indicates that the // current network-enforced consensus rules should be used. flags = std::max(flags, 0); // pcoinsTip contains the UTXO set for chainActive.Tip() CCoinsViewMemPool viewMemPool(pcoinsTip, mempool); const CCoinsView *pCoinsView = &viewMemPool; // CheckLockTime() uses chainActive.Height()+1 to evaluate // nLockTime because when LockTime() is called within // CBlock::AcceptBlock(), the height of the block *being* // evaluated is what is used. Thus if we want to know if a // transaction can be part of the *next* block, we need to call // LockTime() with one more than chainActive.Height(). const int nBlockHeight = chainActive.Height() + 1; // Timestamps on the other hand don't get any special treatment, // because we can't know what timestamp the next block will have, // and there aren't timestamp applications where it matters. const int64_t nBlockTime = GetAdjustedTime(); return LockTime(tx, flags, pCoinsView, nBlockHeight, nBlockTime); }
Code conditional on the return value of IsFinalTx() / CheckLockTime() has to be updated as well, since the semantics of the return value has been inverted.
Using sequence numbers for locking inputs makes sense, since no transaction can be in a block before its parent transactions. This means that a lower sequence number can always be included earlier than a higher one (even if the time the original coins being spent was unknown when the transaction was authored). Because of this, even rational miners should go along with the scheme: Take the lowest sequence number and collect the fees, or skip over it in the hopes that no one else takes a lower number before the next available higher number becomes spendable. And, of course, users are not constrained to make their sequence numbers go up only one at a time. So it's "take the most updated version, now, or gamble that no one in the next dozen blocks takes the most updated and that you manage to include the next to most when it finally becomes mineable." This is similar to how lock-time works. In fact, this scheme is precisely a per-input relative lock-time.
A bidirectional payment channel can be established by two parties funding a single output in the following way: Alice funds a 1btc output which is the 2-of-2 multisig of Alice AND Bob, or Alice's key only after a sufficiently long timeout, e.g. 30 days or 4320 blocks. The channel-generating transaction is signed by Alice and broadcast to the network.
Alice desires to send Bob a payment of 0.1btc. She does so by constructing a transaction spending the 1btc output and sending 0.1btc to Bob and 0.9btc back to herself. She provides her signature for the 2-of-2 multisig constraint, and sets a relative lock-time using the sequence number field such that the transaction will become valid 24-hours or 144 blocks before the refund timeout. Two more times Alice sends Bob a payment of 0.1btc, each time generating and signing her half of a transaction spending the 1btc output and sending 0.2btc, then 0.3btc to Bob with a relative lock-time of 29 days from creation of the channel.
Bob now desires to send Alice a refund of 0.25btc. He does so by constructing a transaction spending the 1btc output and sending 0.95btc (= 0.7btc + 0.25btc) to Alice and 0.05btc to himself. Since Bob will still have in his logs the transaction giving him 0.7btc 29 days after the creation of the channel, Alice demands that this new transaction have a relative lock-time of 28 days so she has a full day to broadcast it before the next transaction matures.
Alice and Bob continue to make payments to each other, decrementing the relative lock-time by one day each time the channel switches direction, until the present time is reached or either party desires to close out the channel. A close-out is performed by finalizing the input (nSequence = MAX_INT) and both parties signing.
A reference implementation is provided by the following pull request
Credit goes to Gregory Maxwell for providing a succinct and clear description of the behavior of this change, which became the basis of this BIP text.
We reuse the double-threshold switch over mechanism from BIPs 34 and 66, with the same thresholds, but for nVersion = 4. The new rules are in effect for every block (at height H) with nVersion = 4 and at least 750 out of 1000 blocks preceding it (with heights H-1000..H-1) also have nVersion = 4. Furthermore, when 950 out of the 1000 blocks preceding a block do have nVersion = 4, nVersion = 3 blocks become invalid, and all further blocks enforce the new rules.
It is recommended that this soft-fork deployment trigger include other related proposals for improving Bitcoin's lock-time capabilities, including BIP 65: OP_CHECKLOCKTIMEVERIFY, BIP 112: CHECKSEQUENCEVERIFY, and BIP 113 Median time-past as endpoint for lock-time calculations.
The only use of sequence numbers by the Bitcoin Core reference client software is to disable checking the nLockTime constraints in a transaction. The semantics of that application are preserved by this BIP.
There may be other uses for the sequence number field that are not commonplace or yet to be discovered. To allow for other uses of the sequence number field, it is only interpreted as a relative lock-time as described in this BIP if the most significant bit is clear. This allows up to 31 bits of the sequence number field to be used for other purposes in applications which don't simultaneously need a per-input relative lock-time. In addition, the unused low-order bits of the relative lock-time encoding are available for use by future soft-fork extensions.
The most efficient way to calculate sequence number from relative lock-time is with bit masks and shifts:
// 0 <= nHeight < 65,535 blocks (1.25 years) nSequence = nHeight << 14; nHeight = nSequence >> 14; // 0 <= nTime < 33,554,431 seconds (1.06 years) nSequence = 1<<30 | (nTime << 5); nTime = (nSequence ^ 1<<30) >> 5;
Bitcoin mailing list discussion: https://www.mail-archive.com/[email protected]/msg07864.html
BIP 34: Block v2, Height in Coinbase
BIP 65: OP_CHECKLOCKTIMEVERIFY
BIP 66: Strict DER signatures
BIP 112: CHECKSEQUENCEVERIFY
BIP 113: Median time-past as endpoint for lock-time calculations