client: Start resending sooner during send_and_confirm_transactions_in_parallel
#348
Conversation
Backports to the beta branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule. Exceptions include CI/metrics changes, CLI improvements and documentation updates on a case by case basis.
Codecov Report
Attention: Patch coverage is

Additional details and impacted files

@@            Coverage Diff            @@
##           master    #348      +/-  ##
=========================================
- Coverage    81.9%    81.8%     -0.1%
=========================================
  Files         837      837
  Lines      226898   226900        +2
=========================================
- Hits       185871   185817       -54
- Misses      41027    41083       +56
Thanks for working on this! People in different Solana chats are complaining daily about deploy speed. For example, see this issue: solana-labs#34444
Depending on the importance of this task, we can prioritize running a deployment on testnet while measuring the time to deploy and other relevant metrics: solana-labs#32873
That's a good point. I've been using just a little test program that uses
        }
        .boxed_local(),
    ];
    join_all(futures).await.into_iter().collect::<Result<_>>()?;
Overall this looks great. Could you please add a timeout of 2 minutes (= 150 blockhashes) on the join_all? Sometimes the CLI gets stuck.
I've noticed that sometimes... do you have any idea what causes it? In my experience, I don't see the block height updating, which makes me think that something in confirm_transactions_till_block_height_... is hanging.

I'd prefer that we only abort when we know that the block height has been passed, rather than hardcoding 2 minutes. We can detach the future with tokio::spawn, loop checking is_finished() on it, and abort it if the blockhash is expired. What do you think?
Yeah, good idea: a Notify channel, and here we break if the block height exceeds the last transaction's block height, or something along those lines. This should be done for all joins and/or awaits.
Sweet, do you mind if I include it in a separate PR? It isn't directly related to the changes here and can be done independently.
Nah not at all
…ctions_in_parallel` (backport of anza-xyz#348) (anza-xyz#357)

client: Start resending sooner during `send_and_confirm_transactions_in_parallel` (anza-xyz#348)

client: Confirm sooner during send_and_confirm_in_parallel

(cherry picked from commit b2f4fb3)

Co-authored-by: Jon C <me@jonc.dev>
Problem
send_and_confirm_transactions_in_parallel is really great, but when one particular validator is slow or offline, sending and confirming transactions can take forever. This is because we wait to send all of the transactions before moving on to resending. With the congestion on the network nowadays, this makes deploying a program very difficult.
Summary of Changes
This really just moves things around to start retrying and confirming sooner.
Rather than sending all transactions first and then kicking off the resends, the idea is to add each transaction to the unconfirmed queue before sending it, and to start retrying two seconds after sending begins.
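The reordering described above can be sketched as follows. This is a hypothetical illustration, not the actual solana-client internals: the type, field, and constant names (ParallelSender, unconfirmed, RESEND_DELAY) are assumptions introduced for the example.

```rust
use std::collections::HashMap;
use std::time::{Duration, Instant};

// Assumed delay before the resend/confirm loop starts, per the description.
const RESEND_DELAY: Duration = Duration::from_secs(2);

// Illustrative sender state (not the real client's types).
struct ParallelSender {
    unconfirmed: HashMap<u64, String>, // index -> serialized tx (stand-in)
    send_started: Instant,
}

impl ParallelSender {
    fn new() -> Self {
        Self {
            unconfirmed: HashMap::new(),
            send_started: Instant::now(),
        }
    }

    fn send_all(&mut self, txs: Vec<String>) {
        for (index, tx) in txs.into_iter().enumerate() {
            // Register as unconfirmed BEFORE the initial send, so the retry
            // loop already owns the transaction even if this pass stalls.
            self.unconfirmed.insert(index as u64, tx);
            // ... best-effort initial send would happen here ...
        }
    }

    // The resend loop becomes eligible two seconds after sending began,
    // rather than only after the entire initial send pass completes.
    fn resend_due(&self) -> bool {
        self.send_started.elapsed() >= RESEND_DELAY
    }
}

fn main() {
    let mut sender = ParallelSender::new();
    sender.send_all(vec!["tx0".to_string(), "tx1".to_string()]);
    // Both transactions are tracked immediately, before any confirmation.
    assert_eq!(sender.unconfirmed.len(), 2);
    // Right after starting, the 2-second resend delay has not yet elapsed.
    assert!(!sender.resend_due());
}
```

The key property is that a slow or offline validator during the initial send no longer blocks retries: the unconfirmed queue is populated up front, so the resend loop can make progress in parallel.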
The impact of this change is tough to gauge, especially given volatile priority fees on mainnet, but I tried sending 100 self-transfers with 2 lamports (2 million micro-lamports) per CU in priority fees and a 450 CU limit. Over 5 trials, here's what I got:
While testing without the patch, I had to abort a couple of runs because the sending was absolutely crawling, only one send every 3 seconds.