Feature/iter branch and bound #30

yancyribbens · 2024-01-26T22:07:20Z

Implement Branch and Bound search.

yancyribbens · 2024-03-06T12:19:24Z

No new code changes in the last push, just rearranged commits and touched-up docs. I consider this a candidate to merge soonish.

yancyribbens · 2024-03-07T10:29:47Z

Fixed a few more typos in the comments, no code changes.

murchandamus · 2024-03-07T19:52:04Z

Starting my review

murchandamus

Much better!

@murchandamus I looked over your checklist of additional tests to add as well as the questions you posed. I think your question of what will cause BnB to fail was the most interesting. Please see below.

What happens if you have multiple UTXOs that have only slightly different values?

select_coins_bnb_set_size_five() and select_coins_bnb_set_size_seven() both have slightly different values

What I meant to point out here was that if you have a UTXO pool that has many UTXOs with slightly different effective values, e.g.: the 200 UTXOs {0.00100000, 0.00100001, 0.00100002, 0.00100003, …, 0.00100198, 0.00100199}, you can quickly exhaust your iteration limit, if a combination of multiple of these is close to the target (here for example 0.005). But you got that covered now with the iteration limit tests!
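To put rough numbers on the example above: in that pool, every set of five UTXOs sums to between 0.00500010 and 0.00500985, just above the 0.005 target, so the search has a very large number of near-identical candidates to walk through. The sketch below counts them and compares the count against an assumed iteration limit of 100,000 (the value Bitcoin Core's BnB uses; the constant in this crate may differ). The function and constant names are hypothetical.

```rust
/// Number of ways to choose `k` items out of `n`.
fn binomial(n: u128, k: u128) -> u128 {
    let mut result: u128 = 1;
    for i in 0..k {
        // The product of i + 1 consecutive integers is divisible by (i + 1)!,
        // so this division is exact at every step.
        result = result * (n - i) / (i + 1);
    }
    result
}

/// Assumed iteration limit; the crate's actual constant may differ.
const ASSUMED_ITERATION_LIMIT: u128 = 100_000;

/// How many size-`k` candidate sets exist per allowed iteration (rounded down).
fn combinations_per_iteration(pool_size: u128, k: u128) -> u128 {
    binomial(pool_size, k) / ASSUMED_ITERATION_LIMIT
}
```

With 200 UTXOs there are `binomial(200, 5)` = 2,535,650,040 five-input sets, roughly 25,000 for every iteration the assumed limit allows, which is why the search can give up long before it has covered the space.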

What happens if you have many UTXOs?

select_coins_bnb_set_size_seven() has many UTXOs.

Regular end-user wallets easily venture into the territory of dozens of UTXOs, especially since DCA for small amounts has become so popular. Enterprise wallets can easily accumulate thousands of UTXOs. The biggest wallet I’ve seen had over 550'000 UTXOs. When I recommended testing a wallet with many UTXOs I was thinking of something like ~1000 UTXOs, not 7. In a follow-up PR, you could perhaps think about how you may explore a search tree more quickly that has many duplicates.

What happens if there are many alternative solutions that are not equivalent, when the first is not the best?

This is exemplified by the test more_inputs_when_cheap. The test sets the current fee_rate to 10 sat per kwu and the long-term fee rate to 20 sat per kwu, so the more inputs that are selected, the better. A target of 6 is created with a UTXO pool of [1, 2, 3, 4]. The search routine first finds the solution [2, 4], which is not the best solution. Since spending is cheaper now (long_term_fee is higher), each UTXO that is added reduces the waste. Therefore, the next solution that is found, [1, 2, 3], has a lower waste metric even though it is found second.
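The arithmetic behind this can be sketched as follows, using the usual waste definition for a changeless selection: total input weight times the difference between the current and long-term fee rates, plus any excess. The function name and the 1,000 wu per-input weight used in the note below are assumptions chosen to keep the numbers round; they are not taken from the test itself.

```rust
/// Waste of a changeless selection, in satoshis.
/// Fee rates are in sat per kwu, weights in weight units.
fn waste(
    input_weights_wu: &[i64],
    fee_rate: i64,
    long_term_fee_rate: i64,
    excess_sat: i64,
) -> i64 {
    let total_weight: i64 = input_weights_wu.iter().sum();
    // A negative result means spending these inputs now is cheaper than
    // spending them later at the long-term fee rate.
    total_weight * (fee_rate - long_term_fee_rate) / 1000 + excess_sat
}
```

At 10 sat per kwu now and 20 sat per kwu long term, two inputs of 1,000 wu give a waste of -20 and three inputs give -30, so the three-input solution scores better.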

Yup, good! This is also covered with select_coins_bnb_set_size_seven() now.

Instead of testing with a feerate of 0, make a helper function that allows you to create UTXOs per their effective value, then run your tests at high and low feerates. Adjust the expected results accordingly.

select_coins_bnb_consume_more_inputs_when_cheap() and select_coins_bnb_consume_less_inputs_when_expensive() both test the behavior with different combinations of long_term and short_term feerate. I don't mean to ignore your suggestion about a testing helper function that allows creating UTXOs at different effective values, but I was thinking of restructuring BnB so that it takes a list of effective_values instead of a list of UTXOs. That way, in lib, we would compute the effective_value once and pass the results to BnB, SRD, or whatever other search function is needed. However, I wasn't planning to do that as part of this PR, and I don't know how that might change your suggestion to add a helper that calculates effective_value.

That’s okay, you could do it in a follow-up. My general idea would be that your UTXOs should have a weight, and when you pass a feerate to your function, it automatically calculates the corresponding fee. That way it would be easy to create a UTXO that has a given effective value, or a UTXO that has a given amount but still a fee.
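A minimal sketch of that idea, assuming fee rates in sat per kwu and a hypothetical `Utxo` type that is not the crate's actual API. The helper `with_effective_value` works backwards from a desired effective value to the amount the UTXO must carry at a given feerate.

```rust
/// Hypothetical test UTXO: an amount plus the weight of spending it.
#[derive(Debug, Clone, Copy, PartialEq)]
struct Utxo {
    value_sat: u64,
    weight_wu: u64,
}

/// Fee for `weight_wu` at `fee_rate` (sat per kwu), rounded up.
fn fee_sat(weight_wu: u64, fee_rate: u64) -> u64 {
    (weight_wu * fee_rate + 999) / 1000
}

impl Utxo {
    /// Amount minus the fee to spend this UTXO; may be negative at high feerates.
    fn effective_value(&self, fee_rate: u64) -> i64 {
        self.value_sat as i64 - fee_sat(self.weight_wu, fee_rate) as i64
    }

    /// Builds a UTXO whose effective value at `fee_rate` is exactly
    /// `effective_value_sat`.
    fn with_effective_value(effective_value_sat: u64, weight_wu: u64, fee_rate: u64) -> Utxo {
        Utxo {
            value_sat: effective_value_sat + fee_sat(weight_wu, fee_rate),
            weight_wu,
        }
    }
}
```

With a helper like this, the same test pool can be rebuilt at a high and a low feerate while keeping the effective values, and therefore the expected selection, unchanged.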

Add tests in which BnB fails to find a solution or fails to find the best solution. When does this happen and why?

select_coins_bnb_exhaust fails to find a solution because the iteration limit is reached before a solution is found. //…snip…//

Looks good!

Test what happens when you run into the iteration limit (i.e. you should have cases in which you find no solution, not the best solution)

see select_coins_bnb_exhaust and select_coins_bnb_exhaust_v2

Yeah, they both do not find a solution, but what happens if you find a solution before running into the iteration limit, but you are not done searching?

Optionally make a test that generates a random UTXO pool and target, where you compare your outcome with the result of a:

I still have this as a TODO. I'm not sure if I'll include it as part of this PR or a follow-up.

Sure, that’s not the easiest. You could perhaps do something like this Knapsack test I wrote to generate a diverse UTXO set and then randomly pick seven UTXOs, sum them up, and use that as target for a BnB search. The test would then assert that you should find a solution, and that this solution has a waste that is equal or lower than if you had used the input set you randomly picked.
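The shape of such a test might look like the sketch below. It makes several simplifying assumptions: a brute-force search over a small pool stands in for the real BnB call, every input is taken to have the same weight and the current feerate is taken to be above the long-term one (so lower waste simply means fewer inputs), and a small xorshift generator replaces a proper RNG crate. All names are hypothetical.

```rust
/// Tiny deterministic xorshift generator so the sketch needs no external crates.
struct XorShift(u64);

impl XorShift {
    fn new(seed: u64) -> Self {
        // The state must never be zero, so force the lowest bit on.
        XorShift(seed | 1)
    }

    fn next(&mut self) -> u64 {
        self.0 ^= self.0 << 13;
        self.0 ^= self.0 >> 7;
        self.0 ^= self.0 << 17;
        self.0
    }
}

/// Stand-in for the real search: brute-force every subset and return the
/// smallest input count among those that hit `target` exactly.
fn fewest_inputs_exact_match(pool: &[u64], target: u64) -> Option<u32> {
    let mut best: Option<u32> = None;
    for mask in 1u32..(1u32 << pool.len()) {
        let sum: u64 = pool
            .iter()
            .enumerate()
            .filter(|(i, _)| (mask >> *i) & 1 == 1)
            .map(|(_, v)| *v)
            .sum();
        if sum == target {
            let inputs = mask.count_ones();
            best = Some(best.map_or(inputs, |b| b.min(inputs)));
        }
    }
    best
}

/// Builds a random pool, sums `picked` randomly chosen UTXOs to form the
/// target, and returns the input count of the best match the search finds.
fn random_case(seed: u64, pool_size: usize, picked: usize) -> Option<u32> {
    let mut rng = XorShift::new(seed);
    let pool: Vec<u64> = (0..pool_size).map(|_| rng.next() % 1_000_000 + 1).collect();

    // Partial Fisher-Yates shuffle: the first `picked` indices form a random sample.
    let mut indices: Vec<usize> = (0..pool_size).collect();
    for i in 0..picked {
        let j = i + (rng.next() as usize) % (pool_size - i);
        indices.swap(i, j);
    }
    let target: u64 = indices[..picked].iter().map(|&i| pool[i]).sum();

    fewest_inputs_exact_match(&pool, target)
}
```

A test would call `random_case(seed, 12, 7)` for a range of seeds and assert that the result is `Some(n)` with `n <= 7`: a solution must exist because the picked set is one, and it must be no worse than the picked set.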

I would also suggest that you write a test in which a solution with excess and fewer inputs is preferred over a solution with more inputs at high feerates.
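As an illustration of why that preference holds, the sketch below picks the lowest-waste candidate from a list. The `Candidate` type, the 272 wu per-input weight, and the fee rates in the note below are assumptions for illustration only, not values from the PR's tests.

```rust
/// Assumed weight of each input, in weight units.
const INPUT_WEIGHT_WU: i64 = 272;

/// A changeless candidate selection (hypothetical type for illustration).
#[derive(Debug, PartialEq)]
struct Candidate {
    input_count: i64,
    excess_sat: i64,
}

impl Candidate {
    /// Waste in satoshis; fee rates are in sat per kwu.
    fn waste(&self, fee_rate: i64, long_term_fee_rate: i64) -> i64 {
        self.input_count * INPUT_WEIGHT_WU * (fee_rate - long_term_fee_rate) / 1000
            + self.excess_sat
    }
}

/// Returns the candidate with the lowest waste, if any.
fn lowest_waste(
    candidates: &[Candidate],
    fee_rate: i64,
    long_term_fee_rate: i64,
) -> Option<&Candidate> {
    candidates
        .iter()
        .min_by_key(|c| c.waste(fee_rate, long_term_fee_rate))
}
```

At 1,250 sat per kwu now and 250 sat per kwu long term, one input with 100 sat of excess has a waste of 372, while two inputs with no excess have a waste of 544, so the single input wins. With the two fee rates swapped, the wastes become -172 and -544 and the two-input candidate wins instead.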

src/branch_and_bound.rs

yancyribbens · 2024-03-08T14:21:44Z

@murchandamus

I would also suggest that you write a test in which a solution with excess and fewer inputs is preferred over a solution with more inputs at high feerates.

Good idea. I added a test here: 92c4e6b

Yeah, they both do not find a solution, but what happens if you find a solution before running into the iteration limit, but you are not done searching?

Good question. I added a test to show what happens here: 0114f2a

I've made a note for the next PR, along with some other refactors, to:

use reasonable fee_rate and effective value
consolidate common tests with a helper function
keep at least one test with a zero feerate
optionally make a test that generates a random UTXO pool and target

I think the most critical comments have been addressed. If the fixups all look good I'll squash and merge. Thanks for all the feedback!

yancyribbens · 2024-03-08T14:24:09Z

As a side note, it would be nice maybe to return the iterations in some way which would make all of the iteration testing more meaningful.

murchandamus

The algorithm looks good to me now and the test coverage seems to be fairly complete.

What I haven’t looked into, and remains an open question to me is how the algorithm integrates with the greater code base. The tests set magic numbers for feerate, long term feerate estimate, fees, and cost_of_change. How is select_coins_bnb called from the transaction building? How do the UTXOs get preprocessed? How is the algorithm integrated into the transaction sending, and how do you ensure that the UTXOs are passed with correct and complete values for the call? When transaction building gets back an input set that does not create change, will it build the transaction correctly, skip the change output and drop the excess to the fees?

I’m not sure whether that’s in scope for this PR, but you may want to scan the PR one more time with the interface in mind, and perhaps consider adding some form of integration test that includes having a wallet set up with UTXOs and a transaction being built from there at a specific feerate that results in a changeless transaction that used BnB under the hood.

src/branch_and_bound.rs

murchandamus · 2024-03-08T16:27:48Z

As a side note, it would be nice maybe to return the iterations in some way which would make all of the iteration testing more meaningful.

In CoinGrinder I return the iteration count in the selection result object for exactly that reason.
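A minimal sketch of what returning the count could look like, assuming a hypothetical `SelectionResult` type and a simplified depth-first search that stops at the first match rather than the lowest-waste one. This is neither the crate's implementation nor CoinGrinder's.

```rust
/// Hypothetical result type: the selected values plus the iterations used.
#[derive(Debug, PartialEq)]
struct SelectionResult {
    selected: Vec<u64>,
    iterations: u32,
}

/// Searches `pool` for a subset summing to a value in
/// `[target, target + tolerance]`, visiting at most `limit` nodes.
fn search(pool: &[u64], target: u64, tolerance: u64, limit: u32) -> Option<SelectionResult> {
    let mut selected = Vec::new();
    let mut iterations = 0;
    let upper = target + tolerance;
    if dfs(pool, 0, 0, target, upper, limit, &mut selected, &mut iterations) {
        Some(SelectionResult { selected, iterations })
    } else {
        None
    }
}

fn dfs(
    pool: &[u64],
    index: usize,
    sum: u64,
    target: u64,
    upper: u64,
    limit: u32,
    selected: &mut Vec<u64>,
    iterations: &mut u32,
) -> bool {
    // Stop as soon as the iteration budget is spent.
    if *iterations >= limit {
        return false;
    }
    *iterations += 1;
    if sum > upper {
        return false; // Overshot the window: prune this branch.
    }
    if sum >= target {
        return true; // Inside the window: first match wins.
    }
    if index == pool.len() {
        return false; // No UTXOs left to add.
    }
    // Inclusion branch first, then the omission branch.
    selected.push(pool[index]);
    if dfs(pool, index + 1, sum + pool[index], target, upper, limit, selected, iterations) {
        return true;
    }
    selected.pop();
    dfs(pool, index + 1, sum, target, upper, limit, selected, iterations)
}
```

With the pool `[4, 3, 2, 1]` and a target of 6, this sketch returns `[4, 2]` after 5 iterations, and returns `None` if the limit is set to 4. A test can then assert on the exact count instead of only on success or failure. A fuller version might report the count on failure as well.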

yancyribbens · 2024-03-08T17:04:37Z

Enterprise wallets can easily accumulate thousands of UTXOs. The biggest wallet I’ve seen had over 550'000 UTXOs. When I recommended testing a wallet with many UTXOs I was thinking of something like ~1000

A few more TODOs I forgot to mention earlier for the next iteration:

Add testing for much larger sets.
In a follow-up PR, you could perhaps think about how you may explore a search tree more quickly that has many duplicates.

yancyribbens · 2024-03-08T17:32:38Z

The algorithm looks good to me now and the test coverage seems to be fairly complete.

Thanks!

What I haven’t looked into, and remains an open question to me is how the algorithm integrates with the greater code base. The tests set magic numbers for feerate, long term feerate estimate, fees, and cost_of_change. How is select_coins_bnb called from the transaction building? How do the UTXOs get preprocessed? How is the algorithm integrated into the transaction sending, and how do you ensure that the UTXOs are passed with correct and complete values for the call? When transaction building gets back an input set that does not create change, will it build the transaction correctly, skip the change output and drop the excess to the fees?

I've been looking into this, although I'm still exploring these questions. @Tibo-lg can speak more to this than I can, but Rust-DLC has a simple wallet interface that combines coin selection with electrs to create wallet functionality of some fashion. My intention with the project is to include this as a rust-bitcoin crate to make it available to anyone as a basic building block for wallet construction. I've been meaning to try out electrs and interface that with some simple wallet software as well. Anyway, I'm not yet sure of all the RPC mechanics needed to facilitate transaction creation, sending, etc., although I'm looking forward to exploring that more next :)

I’m not sure whether that’s in scope for this PR, but you may want to scan the PR one more time with the interface in mind, and perhaps consider adding some form of integration test that includes having a wallet set up with UTXOs and a transaction being built from there at a specific feerate that results in a changeless transaction that used BnB under the hood.

I'm not satisfied with the API yet enough to mark this as complete and ready for use, although I'm satisfied enough to merge this PR shortly. There's still some ongoing discussion about the interface and getting a review to bring this into rust-bitcoin. Also, there's still work to be done to add fuzz testing which I think is important to make sure it's a high quality implementation.

murchandamus · 2024-03-08T18:57:26Z

I'm not satisfied with the API yet enough to mark this as complete and ready for use, although I'm satisfied enough to merge this PR shortly. There's still some ongoing discussion about the interface and getting a review to bring this into rust-bitcoin. Also, there's still work to be done to add fuzz testing which I think is important to make sure it's a high quality implementation.

If you are going to make a crate that is called by completely unrelated software, you would probably want to put more verification of your assumptions into your code. E.g. you may want to ensure that every UTXO has a reasonable weight and/or fee beside the amount, if fee and weight are both present that they match with the feerate, etc.
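One way such checks could look, as a sketch under stated assumptions: the `CandidateUtxo` and `UtxoError` types are hypothetical, fee rates are in sat per kwu, fees are rounded up, and "reasonable weight" is taken to mean non-zero and no larger than Bitcoin Core's 400,000 wu standard transaction weight limit. The amount field is left out because these particular checks do not need it.

```rust
/// Upper bound for a plausible input weight: Bitcoin Core's standard
/// transaction weight limit. The choice of bound is an assumption.
const MAX_REASONABLE_WEIGHT_WU: u64 = 400_000;

#[derive(Debug, PartialEq)]
enum UtxoError {
    MissingWeightAndFee,
    UnreasonableWeight(u64),
    FeeMismatch { expected_sat: u64, found_sat: u64 },
}

/// Hypothetical input description handed to coin selection.
struct CandidateUtxo {
    weight_wu: Option<u64>,
    fee_sat: Option<u64>,
}

/// Checks the caller-supplied weight and fee against `fee_rate` (sat per kwu).
fn validate(utxo: &CandidateUtxo, fee_rate: u64) -> Result<(), UtxoError> {
    if utxo.weight_wu.is_none() && utxo.fee_sat.is_none() {
        return Err(UtxoError::MissingWeightAndFee);
    }
    if let Some(weight) = utxo.weight_wu {
        if weight == 0 || weight > MAX_REASONABLE_WEIGHT_WU {
            return Err(UtxoError::UnreasonableWeight(weight));
        }
        // When both are present, the fee must match weight times feerate.
        if let Some(found_sat) = utxo.fee_sat {
            let expected_sat = (weight * fee_rate + 999) / 1000;
            if expected_sat != found_sat {
                return Err(UtxoError::FeeMismatch { expected_sat, found_sat });
            }
        }
    }
    Ok(())
}
```

Running a check like this over the whole pool before the search starts would turn inconsistent caller input into an explicit error rather than a silently wrong selection.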

yancyribbens · 2024-03-09T14:35:18Z

If you are going to make a crate that is called by completely unrelated software, you would probably want to put more verification of your assumptions into your code. E.g. you may want to ensure that every UTXO has a reasonable weight and/or fee beside the amount, if fee and weight are both present that they match with the feerate, etc.

I'm not sure if you've seen InputWeightPrediction in Rust Bitcoin. I was thinking of next changing the API to accept a list of WeightPredictions as well as values. That way, every UTXO can have an associated weight prediction instead of requiring the consumer to calculate the satisfaction weight.

Move branch and bound to a seperate module

dbc77f3

yancyribbens mentioned this pull request Jan 26, 2024

Feature/iter branch and bound #28

Closed

yancyribbens force-pushed the feature/iter-branch-and-bound branch 5 times, most recently from 4bcb1f2 to fa9341f Compare February 1, 2024 17:26

yancyribbens mentioned this pull request Feb 2, 2024

Amount arithmetic operations are slow rust-bitcoin/rust-bitcoin#2434

Closed

yancyribbens force-pushed the feature/iter-branch-and-bound branch 9 times, most recently from 1004254 to bab87cf Compare February 7, 2024 11:39

This was referenced Feb 7, 2024

BnB computation not bounded #19

Closed

BnB optimizes for smallest selected amount, not smallest waste #18

Closed

BnB implementation does not account for fees #17

Closed

Implementation of BnB is not a Branch and Bound algorithm #16

Closed

yancyribbens requested a review from Tibo-lg February 7, 2024 12:13

yancyribbens force-pushed the feature/iter-branch-and-bound branch 3 times, most recently from b73cb54 to dc07501 Compare February 8, 2024 10:01

yancyribbens mentioned this pull request Feb 8, 2024

Examine all possible combinations #21

Closed

yancyribbens force-pushed the feature/iter-branch-and-bound branch 2 times, most recently from a5ff21e to 34a2424 Compare February 9, 2024 15:01

yancyribbens mentioned this pull request Feb 15, 2024

Return Iterator instead of Vector for SRD #33

Closed

yancyribbens force-pushed the feature/iter-branch-and-bound branch from efa5278 to 64f7def Compare February 17, 2024 20:09

yancyribbens force-pushed the feature/iter-branch-and-bound branch 4 times, most recently from 45ad1b1 to e153843 Compare March 6, 2024 10:56

yancyribbens force-pushed the feature/iter-branch-and-bound branch from e153843 to bd8dfec Compare March 7, 2024 10:01

murchandamus reviewed Mar 7, 2024


yancyribbens force-pushed the feature/iter-branch-and-bound branch 3 times, most recently from efa4668 to 0114f2a Compare March 8, 2024 13:56

murchandamus reviewed Mar 8, 2024


yancyribbens force-pushed the feature/iter-branch-and-bound branch from 0114f2a to f808f14 Compare March 11, 2024 10:03

yancyribbens added 5 commits March 11, 2024 11:04

Implement BnB search algorithm

ad57928

Replace cargo bench with criterion

2f927e9

Return Iterator instead of Vector for SRD

8b5c1b2

Bump MSRV

1204ec5

Bump version

f808f14

yancyribbens merged commit e4ce100 into p2pderivatives:master Mar 11, 2024
3 checks passed

This was referenced Mar 16, 2024

Missed Solution #20

Closed

BNB/SRD add tests with a much larger set #39

Open

BNB/SRD consolidate common tests with a helper function #41

Open
