Refactor syncing and introduce `ChainSource` #365

tnull · 2024-10-03T13:48:49Z

Based on #358
Closes #362

We refactor our chain syncing and introduce a new ChainSource enum, which is a prerequisite for adding new chain data sources in a follow-up.

To this end, we first move the syncing logic to ChainSource and then introduce incremental syncing for Esplora which is always used except for initial syncs (now permanently persisted via a new NodeMetrics struct).

TODO:

Move Esplora-specific config options to fields in a tba. EsploraConfig object, whose defaults can be overridden in the Builder::set_chain_source_esplora method.

src/chain/mod.rs

tnull · 2024-10-08T11:55:23Z

Now addressed the open TODO mentioned above: moved Esplora-specific sync options to a new EsploraSyncConfig.

tnull · 2024-10-09T07:34:10Z

Rebased after #366 landed.

tnull · 2024-10-09T17:25:55Z

Rebased after #358 landed.

src/chain/mod.rs

src/lib.rs

jkczyz · 2024-10-11T03:38:48Z

src/fee_estimator.rs

+pub(crate) fn get_num_block_defaults_for_target(target: ConfirmationTarget) -> usize {
+	match target {
+		ConfirmationTarget::OnchainPayment => 6,
+		ConfirmationTarget::ChannelFunding => 12,
+		ConfirmationTarget::Lightning(ldk_target) => match ldk_target {
+			LdkConfirmationTarget::MaximumFeeEstimate => 1,
+			LdkConfirmationTarget::UrgentOnChainSweep => 6,
+			LdkConfirmationTarget::MinAllowedAnchorChannelRemoteFee => 1008,
+			LdkConfirmationTarget::MinAllowedNonAnchorChannelRemoteFee => 144,
+			LdkConfirmationTarget::AnchorChannelFee => 1008,
+			LdkConfirmationTarget::NonAnchorChannelFee => 12,
+			LdkConfirmationTarget::ChannelCloseMinimum => 144,
+			LdkConfirmationTarget::OutputSpendingFee => 12,
+		},
+	}
+}
+
+pub(crate) fn get_fallback_rate_for_target(target: ConfirmationTarget) -> u32 {
+	match target {
+		ConfirmationTarget::OnchainPayment => 5000,
+		ConfirmationTarget::ChannelFunding => 1000,
+		ConfirmationTarget::Lightning(ldk_target) => match ldk_target {
+			LdkConfirmationTarget::MaximumFeeEstimate => 8000,
+			LdkConfirmationTarget::UrgentOnChainSweep => 5000,
+			LdkConfirmationTarget::MinAllowedAnchorChannelRemoteFee => FEERATE_FLOOR_SATS_PER_KW,
+			LdkConfirmationTarget::MinAllowedNonAnchorChannelRemoteFee => FEERATE_FLOOR_SATS_PER_KW,
+			LdkConfirmationTarget::AnchorChannelFee => 500,
+			LdkConfirmationTarget::NonAnchorChannelFee => 1000,
+			LdkConfirmationTarget::ChannelCloseMinimum => 500,
+			LdkConfirmationTarget::OutputSpendingFee => 1000,
+		},
+	}
+}
+
+pub(crate) fn get_all_conf_targets() -> [ConfirmationTarget; 10] {
+	[
+		ConfirmationTarget::OnchainPayment,
+		ConfirmationTarget::ChannelFunding,
+		LdkConfirmationTarget::MaximumFeeEstimate.into(),
+		LdkConfirmationTarget::UrgentOnChainSweep.into(),
+		LdkConfirmationTarget::MinAllowedAnchorChannelRemoteFee.into(),
+		LdkConfirmationTarget::MinAllowedNonAnchorChannelRemoteFee.into(),
+		LdkConfirmationTarget::AnchorChannelFee.into(),
+		LdkConfirmationTarget::NonAnchorChannelFee.into(),
+		LdkConfirmationTarget::ChannelCloseMinimum.into(),
+		LdkConfirmationTarget::OutputSpendingFee.into(),
+	]
+}
+
+pub(crate) fn apply_post_estimation_adjustments(
+	target: ConfirmationTarget, estimated_rate: FeeRate,
+) -> FeeRate {
+	match target {
+		ConfirmationTarget::Lightning(
+			LdkConfirmationTarget::MinAllowedNonAnchorChannelRemoteFee,
+		) => {
+			let slightly_less_than_background = estimated_rate.to_sat_per_kwu() - 250;
+			FeeRate::from_sat_per_kwu(slightly_less_than_background)
+		},
+		_ => estimated_rate,
+	}


Will these helpers be called more than once? Just wondering what the rationale for splitting the logic across files. If the only difference later is how convert_fee_rate is called, seems that update_fee_estimates could be parameterized by a trait instead.

Otherwise, all the fee estimation logic is in the chain module instead of here, which seems wrong. A similar argument can be made for other code moved to chain, IMO (e.g., broadcasting).

Will these helpers be called more than once? Just wondering what the rationale for splitting the logic across files. If the only difference later is how convert_fee_rate is called, seems that update_fee_estimates could be parameterized by a trait instead.

Yes, they will be used by the chain-source specific fee estimation implementations. Unfortunately convert_fee_rate isn't the only difference, each of the implementations will work out quite differently (e.g., bitcoind will deliver the estimates individually instead of all together, and in a different unit. Additionally we'll want to use the mempoolminfee value from the getmempoolinfo call to determine the mempool minimum, rather than a far out block-based target, etc. (cf. 47beb0f)).

Otherwise, all the fee estimation logic is in the chain module instead of here, which seems wrong.

I agree that it's not super intuitive, but unfortunately the logic for fee estimation and even broadcasting will work out pretty differently for Esplora, bitcoind RPC, and Electrum. So if we left them in fee_estimator and tx_broadcaster we will in the contrary need to pull in a lot of ChainSource-specifics in there, not sure whether that would be much preferable? When I wrote it there were also some concerns around introducing circular object dependencies, although I have to admit that I'm failing to pin point / remember them right now.

So for now it may be fine to implement the ChainSource-specific parts (which is most of the logic) in ChainSource? We should however do another round of refactoring/DRYing up where we can once it's all said an done.

Hmm... right, but it seems doable. For instance, the trait's method could take a list of confirmation targets and return an iterator over fee rates. This would keep the code fairly DRY, avoiding repeating much of the logic and keeping these functions internal to the fee_estimation module, IIUC.

the trait's method

Do I understand correctly that you referring to a new internal trait here, not LDK's FeeEstimator? Then I would agree that this would likely be a reasonable refactor if we end up with separating out individual ChainSource objects (rather than the single enum), which would lend themselves to implement this trait. We'll see, as a first step I want to get everything working as expected and then we can clean up the internal APIs afterwards. This prefactor PR is really mainly meant to lay the groundwork to get started with other chain sources in the first place.

Yeah, an internal trait but I believe it could be implemented on EsploraAsyncClient. You wouldn't need individual ChainSource objects.

Hmm... why couldn't the trait be internal to LDK Node?

Ah, nevermind, indeed it could live in LDK Node. We might then need to take a dependency on async_trait for it though, so not sure if going the trait route would be worth it, but that's tbd.

Hmm... or possibly returning some sort of async iterator? Alternatively, OnchainFeeEstimator::update_fee_estimates can be made into a macro and instantiated for each.

To be clear, all I'm suggesting is parameterizing OnchainFeeEstimator::update_fee_estimates by a trait that can be implemented for both EsploraAsyncClient and BitcoindRpcClient. Then we wouldn't need to move all the code just to move it back later.

Right, but that trait method would at the very least be async (meaning our <1.75 MSRV we'd need to take a dependency on async_trait) and given the difference in logic probably would need to be update_fee_estimates itself, minus the set_fee_rate_cache part, which still would need to be called afterwards based on the return value of said trait method? Then again, BDK currently doesn't offer an async version of their electrum client, so at least for the initial implementation we will need to fit in a blocking client in the otherwise async internal GossipSource APIs. This might be prohibitive for going the async trait route, or at least for implementing it directly on the clients themselves, as we'll need to hand in our runtime object and use spawn_blocking for every call we make.

Right, I was thinking something like this:

pub(crate) async fn update_fee_estimates<F: FeeSource>(&self, fee_source: f) -> Result<(), Error> { let mut new_fee_rate_cache = HashMap::with_capacity(10); let confirmation_targets = vec![/* .. */]; for (target, fee_rate) in fee_source.fee_rates(confirmation_targets) { let adjusted_fee_rate = /* .. */; new_fee_rate_cache.insert(target, adjusted_fee_rate); } if fee_estimator.set_fee_rate_cache(new_fee_rate_cache) { /* .. */ } }

Where FeeSource::fee_rates returns an iterator. I'm handwaving the async part -- so fair enough if our MSRV prevents this -- but I'd think the different logic could be encapsulated in FeeSource::fee_rates implementations since it is given all targets. For esplora, it can fetch all at once using get_fee_estimates -- so the trait method would need to be async as you said -- and return an iterator callinh convert_fee_rate on each item. For bitcoind, it can return an iterator that fetches them one-by-one.

Maybe there is a way to do it with a Fn trait parameter instead if using aync_trait is a problem? Something like: https://forum.dfinity.org/t/passing-async-function-or-closure-as-parameter/21785/9

Anyhow, happy to leave at is for now given the difficulties with async / blocking.

Yeah, as said above I don't disagree that we should explore something like this, but for now (and in this PR) I'd like to keep the focus on actually shipping the features. We'll then follow-up with more refactoring once we have a better understanding how to accommodate all the different requirements, i.e., after we're done with the bitcoind and Electrum integration.

I understand that we'd usually want to avoid touching the same code multiple times, but in this instance (and given the time constraints) I'd prefer to get something functional merged soon rather than spending more time on figuring out the best refactoring pattern (which would require to think through the details of the Electrum integration and accoommodate the anticipated changes).

TLDR: I agree we should (and will) do this, but let's do one step at a time to keep making progress.

jkczyz

LGTM. Please squash.

src/chain/mod.rs

.. which will allow us to switch between different chain sources.

.. which also gives us the opportunity to simplify and DRY up the logic between background and manual syncing.

Previously, we persisted some of the `latest_` fields exposed via `NodeStatus`. Here, we now refactor this via a persisted `NodeMetrics` struct which allows to persist more fields across restarts. In particular, we now persist the latest time we sync the on-chain wallet, resulting in only doing a full scan on first initialization, and doing incremental syncing afterwards. As both of these operations are really really lightweight, we don't bother to migrate the old persisted timestamps for RGS updates and node announcement broadcasts over to the new data format.

.. to further de-clutter the top-level docs.

.. to also expose it via the `config` module rather than at the top-level docs.

.. as other upcoming chain sources might not have the same config options such as syncing intervals, or at least not with the same semantics.

tnull · 2024-10-15T07:40:08Z

LGTM. Please squash.

Squashed the fixup and rebased on main to fix the build post-yanking of the v0.0.124 lightning-* crates.

tnull requested a review from jkczyz October 3, 2024 13:48

tnull force-pushed the 2024-10-chain-source-refactor branch 2 times, most recently from a704f7c to 1ddb475 Compare October 7, 2024 14:29

jkczyz reviewed Oct 8, 2024

View reviewed changes

src/chain/mod.rs Outdated Show resolved Hide resolved

src/chain/mod.rs Show resolved Hide resolved

src/chain/mod.rs Outdated Show resolved Hide resolved

tnull force-pushed the 2024-10-chain-source-refactor branch 3 times, most recently from b90363f to 996c8eb Compare October 8, 2024 11:53

tnull force-pushed the 2024-10-chain-source-refactor branch 3 times, most recently from 1df47fe to d9df8ac Compare October 9, 2024 07:33

tnull force-pushed the 2024-10-chain-source-refactor branch from d9df8ac to 10fffa9 Compare October 9, 2024 17:25

tnull mentioned this pull request Oct 10, 2024

Add bitcoind RPC support #370

Merged

3 tasks

jkczyz reviewed Oct 11, 2024

View reviewed changes

jkczyz reviewed Oct 14, 2024

View reviewed changes

src/chain/mod.rs Show resolved Hide resolved

src/chain/mod.rs Outdated Show resolved Hide resolved

tnull added 12 commits October 15, 2024 09:39

Add ChainSource enum

0265c2d

.. which will allow us to switch between different chain sources.

Move on-chain syncing to ChainSource

da65835

Move Lightning wallet syncing to ChainSource

4da77eb

Prefactor: Move fee estimator defaults to util methods

9096382

Move fee rate cache updating to ChainSource

3105a28

Move broadcast queue processing to ChainSource

fd0c26f

Move syncing tasks to ChainSource

d66edd4

.. which also gives us the opportunity to simplify and DRY up the logic between background and manual syncing.

Implement incremental Esplora syncing for the on-chain wallet

835baf4

Prefactor: Expose config objects via config module

66ca6be

.. to further de-clutter the top-level docs.

Prefactor: Move ChannelConfig to config

ad60e07

.. to also expose it via the `config` module rather than at the top-level docs.

Move Esplora-specific sync options to dedicated config

94ff68f

.. as other upcoming chain sources might not have the same config options such as syncing intervals, or at least not with the same semantics.

tnull force-pushed the 2024-10-chain-source-refactor branch from 10fffa9 to 94ff68f Compare October 15, 2024 07:39

tnull mentioned this pull request Oct 15, 2024

Replay events on event handling failures due to persistence failures. #374

Merged

tnull added this to the 0.4 milestone Oct 15, 2024

jkczyz approved these changes Oct 15, 2024

View reviewed changes

tnull merged commit 26e61e8 into lightningdevkit:main Oct 15, 2024
11 of 13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor syncing and introduce `ChainSource` #365

Refactor syncing and introduce `ChainSource` #365

tnull commented Oct 3, 2024 •

edited

Loading

tnull commented Oct 8, 2024

tnull commented Oct 9, 2024

tnull commented Oct 9, 2024

jkczyz Oct 11, 2024

tnull Oct 11, 2024 •

edited

Loading

jkczyz Oct 11, 2024

tnull Oct 11, 2024

jkczyz Oct 11, 2024 •

edited

Loading

tnull Oct 11, 2024

jkczyz Oct 11, 2024

tnull Oct 11, 2024 •

edited

Loading

jkczyz Oct 11, 2024

tnull Oct 12, 2024

jkczyz left a comment

tnull commented Oct 15, 2024

Refactor syncing and introduce ChainSource #365

Refactor syncing and introduce ChainSource #365

Conversation

tnull commented Oct 3, 2024 • edited Loading

tnull commented Oct 8, 2024

tnull commented Oct 9, 2024

tnull commented Oct 9, 2024

jkczyz Oct 11, 2024

Choose a reason for hiding this comment

tnull Oct 11, 2024 • edited Loading

Choose a reason for hiding this comment

jkczyz Oct 11, 2024

Choose a reason for hiding this comment

tnull Oct 11, 2024

Choose a reason for hiding this comment

jkczyz Oct 11, 2024 • edited Loading

Choose a reason for hiding this comment

tnull Oct 11, 2024

Choose a reason for hiding this comment

jkczyz Oct 11, 2024

Choose a reason for hiding this comment

tnull Oct 11, 2024 • edited Loading

Choose a reason for hiding this comment

jkczyz Oct 11, 2024

Choose a reason for hiding this comment

tnull Oct 12, 2024

Choose a reason for hiding this comment

jkczyz left a comment

Choose a reason for hiding this comment

tnull commented Oct 15, 2024

Refactor syncing and introduce `ChainSource` #365

Refactor syncing and introduce `ChainSource` #365

tnull commented Oct 3, 2024 •

edited

Loading

tnull Oct 11, 2024 •

edited

Loading

jkczyz Oct 11, 2024 •

edited

Loading

tnull Oct 11, 2024 •

edited

Loading