Channel Splicing (feature 62/63) #1160

t-bast · 2024-05-02T09:33:54Z

Splicing allows spending the current funding transaction to replace it with a new one that changes the capacity of the channel, allowing both peers to add or remove funds to/from their channel balance.

Splicing takes place while a channel is quiescent, to ensure that both peers have the same view of the current commitments.

We don't want channels to be unusable while waiting for transactions to confirm, so channel operation returns to normal once the splice transaction has been signed and we're waiting for it to confirm. The channel can then be used for payments, as long as those payments are valid for every pending splice transactions. Splice transactions can be RBF-ed to speed up confirmation.

Once one of the pending splice transactions confirms and reaches acceptable depth, peers exchange splice_locked to discard the other pending splice transactions and the previous funding transaction. The confirmed splice transaction becomes the channel funding transaction.

Nodes then advertise this spliced channel to the network, so that nodes keep routing payments through it without any downtime.

This PR replaces #863 which contains a lot of legacy mechanisms for early versions of splicing, which didn't work in some edge cases (detailed in the test vectors provided in this PR). It can be very helpful to read the protocol flows described in the test vector: they give a better intuition of how splicing works, and how it deals with message concurrency and disconnections.

This PR requires the quiescence feature (#869) to start negotiating a splice.

Credits to @rustyrussell and @ddustin will be added in the commit messages once we're ready to merge this PR.

ProofOfKeags · 2024-05-02T19:37:37Z

Can I suggest we do this as an extension BOLT rather than layering it in with the existing BOLT2 text? It makes it easier to implement when all of the requirements deltas are in a single document than when it is inlined into the original spec. Otherwise, the PR/branch-diff itself is the only way to see the diff and that can get very messy during the review process as people's commentary comes in. While there are other ways to get at this diff without the commentary, it would make the UX of getting at this diff rather straightforward.

Given that the change is gated behind a feature bit anyway it also makes it easier for a new implementation to bootstrap itself without the splice feature by just reading the main BOLTs as is.

At some point in the future when splicing support becomes standard across the network we can consolidate the extension BOLT into the main BOLTs if people still prefer.

t-bast · 2024-05-03T08:43:18Z

Why not, if others also feel that it would be better as an extension bolt. I prefer it directly in Bolt 2, because of the following reasons:

Most of it is self contained in its own section(s) anyway.
It's an important part of the channel lifecycle: channels are opened, then during normal operation payments are relayed and splices happen, then the channel eventually closes. It is nicely reflected in the architecture of the Bolt 2 sections right now.
The few additions to existing message TLVs (commit_sig, tx_add_input, tx_signatures) should not be in a separate document when merging, because otherwise different features may use the same TLV tags without realizing it, with a risk of inadvertently shipping incompatible code. I think it's important that all TLVs for a given message are listed in that message's section, this way you know you don't have to randomly search the BOLTs for another place where TLVs may be defined.

But if I'm the only one thinking this is better, I'll move it to a separate document!

One thing to note is that we already have two implementations (eclair and cln), and maybe a 3rd one (LDK) who are very close to code-complete and have had months of experience on mainnet, which means the spec is almost final and we should be able to to merge it to the BOLTs in the not-so-distant future (:crossed_fingers:).

02-peer-protocol.md

ddustin · 2024-06-04T17:37:29Z

One thing I've been thinking about is with large splices across many nodes, if some node fails to send signatures (likely because two nodes in the cluster demand to sign last) than splice will hang one tx_signatures.

I believe we need two things to address this:

Timeout logic where splices are aborted
Being lax about having sent our tx_signatures but getting nothing back

Currently CLN fails the channel in this case as taking signatures and not responding is rather rude but this is bad because it could lead to clusters of splice channels being closed.

The unfortunate side effect of this is we have to be comfortable sending out signatures with no recourse for not getting any back.

I believe long term the solution is to maintain a signature-sending reputation for each peer and eventually blacklist peers from doing splices and / or fail your channels with that peer.

A reputation system may be beyond the needs of the spec but what to do with hanging tx_signatures (timeout etc) should be in the spec with a note about this problem.

t-bast · 2024-06-06T13:28:59Z

Timeout logic where splices are aborted

This is already covered at the quiescence level: quiescence will timeout if the splice doesn't complete (e.g. because we haven't received tx_signatures).

Being lax about having sent our tx_signatures but getting nothing back

I don't think this is necessary, and I think we should really require people to send tx_signatures when it is owed, to ensure that we get to a clean state on both peers.

if some node fails to send signatures (likely because two nodes in the cluster demand to sign last)

It seems like we've discussed this many times already: this simply cannot happen because ordering based on contributed amount fixes this? Can you detail a concrete scenario where tx_signatures ordering leads to a deadlock?

02-peer-protocol.md

ProofOfKeags · 2024-09-05T16:00:14Z

02-peer-protocol.md

+    - Either side has added an output other than the channel funding output
+      and the balance for that side is less than the channel reserve that
+      matches the new channel capacity.


What does it mean to have a channel reserve to "match the new channel capacity". AFAICT the channel_reserve is specified in satoshis and reading the negotiation process of this proposal doesn't seem to indicate that there is any change happening to that parameter during negotiation.

AFAICT the channel_reserve is specified in satoshis

Not with dual-funding, where the channel reserve is 1% of the channel capacity. That's why this is potentially changing "automatically" when splicing on top of a dual-funded channel if we want to keep using 1%.

But you're right to highlight this: the channel reserve behavior is very loosely specified for now, and there were a lot of previous discussions with @morehouse regarding what we should do when splicing. Another edge case that we must better specify is what happens when splicing on top of a non-dual-funded channel, where the channel reserve was indeed a static value instead of a proportional one!

The channel reserve behavior is IMO the only missing piece of this specification, that we should discuss, thanks for bringing it up!

Could be a good thing to discuss in Tokyo!

Also worth stepping back and double checking the reserve requirement makes sense in its current form generally 👀.

What do you think of the following behavior for handling channel reserves:

Whenever a splice happens, the channel is automatically enrolled into the 1% reserve policy, even if it wasn't initially a dual-funded channel (unless 0-reserve is used of course, see Add option_zero_reserve (FEATURE 64/65) #1140)

Splice-out is not allowed if you end up below your pre-splice reserve (your peer will reject that splice with tx_abort)

Otherwise, it's ok if one side ends up below the channel reserve after a splice: this is the same behavior as when a new channel is created. If we get into that state, the peer that is below the channel reserve:

is not allowed to send outgoing HTLCs

is allowed to receive incoming HTLCs

if it is paying the commit fees, it is allowed to dip further into its channel reserve to receive HTLCs (because of the added weight of the HTLC output), because we must be able to move liquidity to their side to get them above their reserve

When there are multiple unconfirmed splices, we use the highest channel reserve of all pending splices (ie requirements must be satisfied for all pending splice transactions)

As discussed during yesterday's meeting, there are subtle edge cases due to concurrent updates: this is inherent to the current commitment protocol, but will eventually become much simpler with #867

@ddustin @ProofOfKeags @rustyrussell @ziggie1984 @morehouse

related: ACINQ/eclair#2899 (comment), tries to specify the concurrent edge cases and also the requirement when we would already (without splicing) allow the peer paying the fees being dipped below its reserve.

@t-bast

That all seems reasonable to me. The one part where we could get into trouble is:

if it is paying the commit fees, it is allowed to dip further into its channel reserve to receive HTLCs (because of the added weight of the HTLC output), because we must be able to move liquidity to their side to get them above their reserve

This allows the reserve to be violated, potentially all the way down to 0. In that situation, there is ~zero incentive to broadcast the latest commitment on force close.

That said, I know the implementation details are hairy to do things completely safely. And we can also look forward to zero-fee commitments with TRUC and ephemeral anchors, which would obsolete the "dip-into-reserve to pay fees" exception entirely.

This allows the reserve to be violated, potentially all the way down to 0. In that situation, there is ~zero incentive to broadcast the latest commitment on force close.

Since we only allow this to happen when the node paying the fee receives HTLCs, the other node sending that HTLC can limit the exposure by controlling how many HTLCs they send in a batch (or keep pending the commit tx) when we're in this state.

There are unfortunately cases where even a single HTLC would make the node paying the fee have no output (small channels with high feerate), but when that happens you really don't have any other option, the channel is otherwise unusable, so your only other option is to force-close anyway which isn't great...

And we can also look forward to zero-fee commitments with TRUC and ephemeral anchors, which would obsolete the "dip-into-reserve to pay fees" exception entirely.

Exactly, this is coming together (look at this beautiful 0-fee commitment transaction: https://mempool.space/testnet4/tx/85f2256c8d6d61498c074d53912d1f0ef907ee508bb06f5701f3826432ba53b8) which will finally get rid of this kind of mess: I'm fine with using an imperfect but simple work-around in the meantime!

I wonder if this requirement would solely be used for the splicing case, allowing HTLC which dip the opener into its reserve or should we make this an overall requirement. If so there is the problem with backwards compatibility, because older nodes (speaking for LND nodes) will force close if the opener dips below its reserve. So maybe it makes sense to only activate it for splicing use cases so that we don't run into the backwards compatibility issues ?

02-peer-protocol.md

ddustin · 2024-11-07T18:15:44Z

I added a PR to fix the spec for short_channel_id post-splice: t-bast#3

ddustin · 2024-11-07T18:33:47Z

@t-bast asked me to put together a summary for Richard on how to implement the short_channel_id changes for splice_locked. The process is fairly straight forward but I'll write out some contextual information around it that might be helpful:

When receiving splice_locked, mark that it was received in a variable, and call check_splice_locked
When splice seen on chain at depth, send splice_locked, mark that it was sent in a variable, save the txid of the locked transaction into splice_locked_txid, and call check_splice_locked
When reconnecting without a pending splice but peer expects one (rules as in spec), resend splice_locked
When check_splice_locked is called; If sent and receive variables are set, then:
a) Clear send & receive variables
b) Find the splice inflight that matches splice_locked_txid
c) Set the channel's short_channel_id according to the locked tx
d) Update channel funding amounts according to the confirmed splice
e) Save it all to disk and clear all splice inflights and reset channel allsplice state variables to neutral

02-peer-protocol.md

bolt02/splicing-test.md

t-bast · 2025-01-02T17:13:32Z

@ddustin I've added more details for the announcement/gossip part in adf968c and finished implementing it in eclair. You can grab the last version of ACINQ/eclair#2887 for your cross-compatibility tests which should contain everything!

t-bast · 2025-01-14T10:00:16Z

Rebased to fix conflicts and squashed commits. Please carefully read the reconnection requirements, especially around handling of next_commitment_number: I have applied the same logic as #1214 to use next_commitment_number to indicate when we'd like commitment_signed to be retransmitted.

02-peer-protocol.md

t-bast · 2025-02-10T13:51:52Z

@ddustin the last commits add more information to channel_reestablish to help synchronize the splice_locked state and simplify retransmission: please review!

ddustin · 2025-02-10T18:28:32Z

@ddustin the last commits add more information to channel_reestablish to help synchronize the splice_locked state and simplify retransmission: please review!

Will do! Excited to get this all "locked" in

remyers · 2025-02-12T09:38:51Z

02-peer-protocol.md

+      `splice_locked` it has sent:
+      - MUST retransmit `splice_locked`.
+    - otherwise:
+      - MUST NOT retransmit `splice_locked`.


In the routing-gossip spec it says on reconnection we should wait for splice_locked before retransmitting announcement_signatures.

If nodes are disconnected after splice_locked messages are exchanged, but before announcement_signatures are sent, should announcement_signatures be sent after the channel_reestablish message, or should this situation also trigger the splice_locked message to be resent ?

Both nodes will know when they receive channel_reestablish that splice_locked has been exchanged but not announcement_signatures so could proceed directly to exchanging signatures rather then wait for splice_locked.

As currently written, both nodes will not send splice_locked because your_last_funding_locked_txid will match the most recent splice_locked each node has sent and so the announcement_signatures for a splice will not be exchanged.

Very good point! In order to be compatible with taproot, we need to re-exchange splice_locked in that case, because that's where we'll have the opportunity to provide the partial nonces necessary to sign the announcement.

We should do the same thing as what we're doing for the initial announcement_signatures : on reconnection, if one side has not received the remote announcement_signatures, it should re-send splice_locked. When receiving that splice_locked, the remote node should re-send splice_locked once (if not already sent) to provide its partial nonces. Then both nodes can exchange announcement_signatures again.

Thanks for pointing this, I'll fix it. I'll rebase the PR to fix the conflicts and will squash the reestablish commits.

I've done a clean rebase to fix conflicts with option_simple_close and took this opportunity to clean-up the commits:

43b5785 contains the bulk of the splicing protocol

5912705 contains the new TLV in channel_reestablish to clean-up splice_locked retransmission

c79dca9 describes the announcement logic and adds the retransmission requirements that make splicing future-proof with taproot

@ddustin I believe cln will currently be missing implementation of the last two commits, you can focus your review on those two commits and ignore the first one which is (I think) already fully implemented in cln.

Splicing allows spending the current funding transaction to replace it with a new one that changes the capacity of the channel, allowing both peers to add or remove funds to/from their channel balance. Splicing takes place while a channel is quiescent, to ensure that both peers have the same view of the current commitments. We don't want channels to be unusable while waiting for transactions to confirm, so channel operation returns to normal once the splice tx has been signed and we're waiting for it to confirm. The channel can then be used for payments, as long as those payments are valid for every pending splice transactions. Splice transactions can be RBF-ed to speed up confirmation. Once one of the pending splice transactions confirms and reaches acceptable depth, peers exchange `splice_locked` to discard the other pending splice transactions and the previous funding transaction. The confirmed splice transaction becomes the channel funding transaction. Nodes then advertize this spliced channel to the network, so that nodes keep routing payments through it without any downtime.

If one side sent `splice_locked` and the other side is ready to send its own `splice_locked` while they are disconnected, this creates a race condition on reestablish because `splice_locked` is retransmitted after `channel_reestablish`, and other channel updates can be inserted by the other node before receiving `splice_locked`. This will be an issue for taproot channels, because nonces will be missing. This race condition is described in more details in lightning#1223. We fix this race condition by adding TLVs to `channel_reestablish` that provide information about the latest locked transaction. This additional information also makes it easier to detect when we need to retransmit our previous `splice_locked`.

We make the requirements for `announcement_signatures` more clear. It is important that both nodes are able to generate the corresponding `channel_announcement` to allow them to create a new `channel_update` using the `short_channel_id` of the confirmed splice. We insist on exchanging `splice_locked` before generating signatures to ensure compatibility with taproot channels, where nonces will be exchanged in `splice_locked` messages. This means that we need to retransmit `splice_locked` on reconnection if `announcement_signatures` hasn't been fully exchanged. Importantly, after announcing a splice, nodes must still allow payments that use the previous `short_channel_id`, because remote nodes may not have processed the `channel_announcement` and `channel_update`s yet.

t-bast mentioned this pull request May 2, 2024

Lightning Specification Meeting 2024/05/06 #1161

Closed

25 tasks

t-bast mentioned this pull request May 14, 2024

Lightning Specification Meeting 2024/05/20 #1164

Closed

23 tasks

t-bast mentioned this pull request Jun 3, 2024

Lightning Specification Meeting 2024/06/03 #1167

Closed

23 tasks

ddustin reviewed Jun 4, 2024

View reviewed changes

02-peer-protocol.md Outdated Show resolved Hide resolved

ddustin reviewed Jun 4, 2024

View reviewed changes

02-peer-protocol.md Show resolved Hide resolved

ddustin reviewed Jun 4, 2024

View reviewed changes

02-peer-protocol.md Show resolved Hide resolved

optout21 reviewed Jun 7, 2024

View reviewed changes

02-peer-protocol.md Show resolved Hide resolved

t-bast mentioned this pull request Jun 17, 2024

Lightning Specification Meeting 2024/06/17 #1172

Closed

22 tasks

optout21 mentioned this pull request Jun 17, 2024

Update splice messages according to new spec draft lightningdevkit/rust-lightning#3129

Merged

t-bast mentioned this pull request Jun 26, 2024

Lightning Specification Meeting 2024/07/01 #1175

Closed

23 tasks

t-bast mentioned this pull request Jul 12, 2024

Lightning Specification Meeting 2024/07/15 #1183

Closed

22 tasks

This was referenced Jul 23, 2024

Use final spec values for splicing ACINQ/eclair#2887

Draft

Lightning Specification Meeting 2024/07/29 #1185

Closed

t-bast mentioned this pull request Aug 9, 2024

Lightning Specification Meeting 2024/08/12 #1187

Closed

21 tasks

t-bast mentioned this pull request Aug 23, 2024

Lightning Specification Meeting 2024/08/26 #1191

Closed

20 tasks

dunxen mentioned this pull request Aug 30, 2024

Dual-funded channels and Splicing Project Tracking lightningdevkit/rust-lightning#1621

Open

6 tasks

optout21 mentioned this pull request Sep 5, 2024

[Splicing] Update Splicing msgs lightningdevkit/rust-lightning#3293

Closed

ProofOfKeags reviewed Sep 5, 2024

View reviewed changes

t-bast mentioned this pull request Sep 6, 2024

Lightning Specification Meeting 2024/09/09 #1195

Closed

20 tasks

t-bast mentioned this pull request Oct 16, 2024

Lightning Specification Meeting 2024/11/04 #1206

Closed

20 tasks

ddustin reviewed Nov 7, 2024

View reviewed changes

02-peer-protocol.md Show resolved Hide resolved

remyers mentioned this pull request Nov 21, 2024

Update scid when splice funding tx confirms ACINQ/eclair#2941

Closed

ZmnSCPxj-jr reviewed Nov 21, 2024

View reviewed changes

02-peer-protocol.md Outdated Show resolved Hide resolved

t-bast mentioned this pull request Nov 22, 2024

Lightning Specification Meeting 2024/12/02 #1210

Closed

19 tasks

t-bast mentioned this pull request Dec 9, 2024

Lightning Specification Meeting 2024/12/16 #1213

Closed

19 tasks

t-bast commented Jan 2, 2025

View reviewed changes

bolt02/splicing-test.md Show resolved Hide resolved

t-bast mentioned this pull request Jan 6, 2025

Lightning Specification Meeting 2025/01/13 #1216

Closed

19 tasks

t-bast mentioned this pull request Jan 14, 2025

Do not unnecessarily retransmit commitment_signed in dual funding #1214

Open

t-bast force-pushed the splicing branch from 8eb508a to adf968c Compare January 14, 2025 09:58

optout21 reviewed Jan 15, 2025

View reviewed changes

02-peer-protocol.md Show resolved Hide resolved

t-bast mentioned this pull request Jan 22, 2025

Lightning Specification Meeting 2025/01/27 #1221

Closed

23 tasks

ddustin reviewed Jan 23, 2025

View reviewed changes

02-peer-protocol.md Outdated Show resolved Hide resolved

t-bast mentioned this pull request Jan 27, 2025

Improve splice_locked retransmission logic #1223

Open

t-bast mentioned this pull request Feb 4, 2025

Lightning Specification Meeting 2025/02/10 #1224

Closed

21 tasks

remyers reviewed Feb 12, 2025

View reviewed changes

t-bast force-pushed the splicing branch from fec0d44 to 99e3f03 Compare February 12, 2025 11:22

St333p mentioned this pull request Feb 17, 2025

RCP-240406A: Fallback single-use-seal RGB-WG/RFC#7

Open

t-bast added 2 commits February 18, 2025 16:10

t-bast force-pushed the splicing branch from 99e3f03 to c79dca9 Compare February 18, 2025 15:18

t-bast mentioned this pull request Feb 19, 2025

Lightning Specification Meeting 2025/02/24 #1229

Open

18 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Channel Splicing (feature 62/63) #1160

Channel Splicing (feature 62/63) #1160

t-bast commented May 2, 2024 •

edited

Loading

ProofOfKeags commented May 2, 2024

t-bast commented May 3, 2024 •

edited

Loading

ddustin commented Jun 4, 2024

t-bast commented Jun 6, 2024

ProofOfKeags Sep 5, 2024

t-bast Sep 5, 2024

ddustin Sep 5, 2024

t-bast Sep 10, 2024

ziggie1984 Sep 11, 2024 •

edited

Loading

morehouse Sep 11, 2024 •

edited

Loading

t-bast Sep 12, 2024

ziggie1984 Sep 13, 2024 •

edited

Loading

t-bast Sep 13, 2024

ddustin commented Nov 7, 2024

ddustin commented Nov 7, 2024

t-bast commented Jan 2, 2025 •

edited

Loading

t-bast commented Jan 14, 2025

t-bast commented Feb 10, 2025

ddustin commented Feb 10, 2025

remyers Feb 12, 2025

t-bast Feb 12, 2025

t-bast Feb 12, 2025 •

edited

Loading

Channel Splicing (feature 62/63) #1160

Are you sure you want to change the base?

Channel Splicing (feature 62/63) #1160

Conversation

t-bast commented May 2, 2024 • edited Loading

ProofOfKeags commented May 2, 2024

t-bast commented May 3, 2024 • edited Loading

ddustin commented Jun 4, 2024

t-bast commented Jun 6, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ziggie1984 Sep 11, 2024 • edited Loading

Choose a reason for hiding this comment

morehouse Sep 11, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ziggie1984 Sep 13, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ddustin commented Nov 7, 2024

ddustin commented Nov 7, 2024

t-bast commented Jan 2, 2025 • edited Loading

t-bast commented Jan 14, 2025

t-bast commented Feb 10, 2025

ddustin commented Feb 10, 2025

Choose a reason for hiding this comment

Choose a reason for hiding this comment

t-bast Feb 12, 2025 • edited Loading

Choose a reason for hiding this comment

t-bast commented May 2, 2024 •

edited

Loading

t-bast commented May 3, 2024 •

edited

Loading

ziggie1984 Sep 11, 2024 •

edited

Loading

morehouse Sep 11, 2024 •

edited

Loading

ziggie1984 Sep 13, 2024 •

edited

Loading

t-bast commented Jan 2, 2025 •

edited

Loading

t-bast Feb 12, 2025 •

edited

Loading