Bound size of TransactionPool in memory #3284
Comments
This issue has been automatically marked as stale because it has not had recent activity in the last 2 months.
Not sure what to do if a pool reaches its maximum size but transactions keep coming in.
To get an idea of a reasonable tx pool size bound, I will add a metric.
We might need to stop accepting transactions at that point. cc @mm-near
This issue has been automatically marked as stale because it has not had recent activity in the last 2 months.
This fixes an issue where the transaction pool grows indefinitely on non-RPC nodes. Issue #3284
I'll pick up this issue in the context of the congestion work (https://github.com/near/nearcore/milestone/26, #8878). As a first step, I plan to investigate two naive approaches:
I highly suspect that both approaches will break some tests/assumptions that the clients make. My first goal will be to understand:
We do indeed have tests that rely on putting many transactions into the pool. Example failures are:
I'm adding explicit checks for the returned transaction status in #8976, which would make it much easier to catch tests broken by introducing transaction pool size limits. Retrying the failure in all those tests would likely be too burdensome, so we'll have to use some high-enough limit that covers the majority of tests and only introduce the retries in tests that actually exercise the congestion scenario.
I will proceed with implementing a hard capacity limit measured in bytes. The limit will be separate for each per-shard transaction pool. We will also start returning a new error type to the clients when we reach the transaction pool size limit, so that they can handle this error intelligently by retrying with a back-off.
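For illustration, a minimal sketch of what a byte-bounded insert with such an error type could look like. All names here (`SizeLimitedPool`, `InsertOutcome`, the fields) are hypothetical stand-ins, not the actual nearcore types:

```rust
use std::collections::BTreeMap;

/// Hypothetical per-shard pool that tracks its total size in bytes.
struct SizeLimitedPool {
    transactions: BTreeMap<Vec<u8>, Vec<u8>>, // key -> serialized transaction
    total_size: u64,                          // sum of stored transaction sizes
    size_limit: Option<u64>,                  // None means the limit is disabled
}

/// Result of trying to insert a transaction.
enum InsertOutcome {
    Inserted,
    /// The pool is at capacity; clients can retry later with a back-off.
    PoolFull,
}

impl SizeLimitedPool {
    fn insert(&mut self, key: Vec<u8>, tx: Vec<u8>) -> InsertOutcome {
        let tx_size = tx.len() as u64;
        // Reject the transaction if it would push the pool over its byte limit.
        if let Some(limit) = self.size_limit {
            if self.total_size + tx_size > limit {
                return InsertOutcome::PoolFull;
            }
        }
        // If the key was already present, account for the replaced transaction.
        if let Some(old) = self.transactions.insert(key, tx) {
            self.total_size -= old.len() as u64;
        }
        self.total_size += tx_size;
        InsertOutcome::Inserted
    }
}
```

Measuring the limit in bytes rather than in transaction count is what keeps large transactions from evading the bound.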
The PR introduces a limit on the size of the transaction pool for each shard and logic that rejects transactions that would go over this limit. The reasoning for this work is described in #3284. To start, the limit will be disabled (effectively set to infinity), so this PR shouldn't have any effect. Node operators can override it with a config option. In the future, we will come up with a safe value to set by default (probably between 10 MiB and 1 GiB). We also start with a simple option where the RPC client will not know that their transaction ran into this limit. We will need to rework this part in the future, but that will touch the transaction forwarding code in a non-trivial way, and I would prefer to work on it when we have a congestion test ready (#8920). Lastly, this PR adds some nuance to reintroducing transactions back into the pool after a reorg or after producing a chunk (the `reintroduce_transactions` method), by acknowledging that not all transactions might fit back in and logging the number of dropped transactions.
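Building on the hypothetical `SizeLimitedPool` sketch above, the dropped-transaction accounting during reintroduction could look roughly like this (nearcore's actual `reintroduce_transactions` differs in its types and logging):

```rust
/// Try to put transactions back into the pool after a reorg or after
/// producing a chunk; under a size limit, not all of them may fit.
fn reintroduce_transactions(pool: &mut SizeLimitedPool, txs: Vec<(Vec<u8>, Vec<u8>)>) {
    let mut dropped = 0usize;
    for (key, tx) in txs {
        if let InsertOutcome::PoolFull = pool.insert(key, tx) {
            dropped += 1;
        }
    }
    if dropped > 0 {
        // Acknowledge the loss instead of silently discarding transactions.
        eprintln!("dropped {dropped} transactions while reintroducing them into the pool");
    }
}
```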
I've also explored the option of garbage collecting transactions based on their age. While doable, the existing clients (e.g. jsonrpc) would not work well with this approach and would raise the error to a higher level, likely all the way to the user (e.g. the Wallet interface). It also introduces an attack vector: users could spam the system with many transactions, knowing that only a few of them will be included within the time limit, and effectively block transaction pool space at a lower cost. The remaining steps to finish this work:
This is one of the steps for near#3284
I've filed a bug for some follow-up work to simplify transaction pool code: #9060
This will allow us to understand how big the transaction pools get in practice and what a realistic limit to set for them would be. The logic within the pool iterator is a bit complex due to the need to return transactions back to the pool, and I'm working on a way to simplify it in a separate PR, but for now this accounting should do the job. This is one of the steps for #3284
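As a sketch of what such a metric could look like, here is a per-shard size gauge written against the `prometheus` crate directly; nearcore has its own metrics wrappers, and the metric name below is an assumption:

```rust
use once_cell::sync::Lazy;
use prometheus::{register_int_gauge_vec, IntGaugeVec};

// Gauge tracking the total size in bytes of each per-shard transaction pool.
static TRANSACTION_POOL_SIZE: Lazy<IntGaugeVec> = Lazy::new(|| {
    register_int_gauge_vec!(
        "near_transaction_pool_size", // illustrative metric name
        "Total size in bytes of the transactions in the pool, per shard",
        &["shard_id"]
    )
    .unwrap()
});

// Call whenever a pool's total size changes.
fn update_pool_size_metric(shard_id: u64, total_size: u64) {
    let shard_label = shard_id.to_string();
    TRANSACTION_POOL_SIZE
        .with_label_values(&[shard_label.as_str()])
        .set(total_size as i64);
}
```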
So that it will be read from the `config.json` that each node provides. This is a part of #3284
Right now the metric has noticeable blips due to rapid changes when transactions are drawn from the pool: https://nearinc.grafana.net/goto/YZwyDllVR?orgId=1. To avoid this, we only decrease the metric after the pool iterator is dropped. This is a part of #3284
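One way to express "decrease only after the pool iterator is dropped" is a `Drop` impl that settles the accounting at the end; this is a sketch continuing the hypothetical `SizeLimitedPool` example above, not nearcore's actual pool iterator:

```rust
/// Iterator that draws transactions out of the pool. The size accounting is
/// deliberately not updated while iteration is in progress, so the metric
/// does not blip up and down while a chunk is being produced.
struct PoolIterator<'a> {
    pool: &'a mut SizeLimitedPool,
    bytes_removed: u64, // total size of the transactions drawn so far
}

impl Drop for PoolIterator<'_> {
    fn drop(&mut self) {
        // Settle the accounting once, when the iterator is dropped...
        self.pool.total_size -= self.bytes_removed;
        // ...and only now publish the new value, e.g.
        // update_pool_size_metric(shard_id, self.pool.total_size).
    }
}
```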
This PR enables the limit discussed in #3284. I've [considered](https://near.zulipchat.com/#narrow/stream/297873-pagoda.2Fnode/topic/Adding.20a.20new.20field.20to.20config.2Ejson/near/358955785) another approach to rolling this out, changing the value in `config.json` distributed through S3, but that would require more work both on our side and on the validators' side without adding much benefit. Specifically, I've checked that we have a good safety margin here: over the last month on testnet, the max size of the transaction pool on the validators was < 40 KB: https://nearinc.grafana.net/goto/AhZFN__4R?orgId=1. @nikurt, what would be a good place to document this field for validators? I saw https://near-nodes.io/ but couldn't find the appropriate section there.
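For illustration, the config plumbing this implies might look like the following serde sketch; the field name and its exact placement are assumptions, so check the actual nearcore config schema before relying on them:

```rust
use serde::Deserialize;

#[derive(Deserialize)]
struct Config {
    /// Maximum total size in bytes of each per-shard transaction pool.
    /// Absent from config.json (the default) means the limit is disabled.
    #[serde(default)]
    transaction_pool_size_limit: Option<u64>,
}
```

With a shape like this, a node operator would opt in by setting e.g. `"transaction_pool_size_limit": 100000000` in `config.json` for a 100 MB cap, matching the rollout mentioned below.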
The limit of 100 MB on the per-shard transaction pool on each node will be active with the next release, which @marcelo-gonzalez will be shepherding.
Presently transactions are simply stored in a `BTreeMap` in memory; however, if we receive too many transactions too quickly, this could grow over time, especially if the transactions are large.