
Add cache for recent on-chain transaction hashes #1886

Conversation

@Qiao-Jin (Contributor) commented Aug 28, 2020

Partially closes #1734.

In this PR a new field, UInt256 LastBlockHash, is added to the Transaction class. It works together with uint ValidUntilBlock to prove that the transaction sender provided an honest ValidUntilBlock, so that others can be sure the transaction was created no earlier than Height = ValidUntilBlock - MaxValidUntilBlockIncrement.

We can use this feature to decide whether a transaction needs to be searched only in the cache to make sure no duplicate transactions go on-chain. This saves the large amount of time otherwise wasted on hard-disk reads during DB searches for incoming transactions.

@erikzhang (Member)

I don’t think it’s worth adding 32 bytes to every transaction.

@Qiao-Jin (Contributor, Author)

> I don’t think it’s worth adding 32 bytes to every transaction.

Or maybe we could add only part of the block hash as proof, say 8 bytes or less?

@erikzhang (Member) commented Aug 28, 2020

If you want to cache the recent transactions, why not just cache the transactions in the latest n blocks?

@Qiao-Jin (Contributor, Author)

> If you want to cache the recent transactions, why not just cache the transactions in the latest n blocks?

I cached transaction hashes instead of the transactions themselves to save RAM.

@@ -30,6 +30,7 @@ public class Transaction : IEquatable<Transaction>, IInventory, IInteroperable
    private long sysfee;
    private long netfee;
    private uint validUntilBlock;
    private UInt256 lastBlockHash = UInt256.Zero;
@Tommo-L (Contributor) commented Aug 29, 2020

If so, why not use Height or SendHeight, HeightAt, ValidAfter?

@Qiao-Jin (Contributor, Author) replied:

Such a value could be forged by the sender to get duplicate transactions on-chain.


namespace Neo.IO.Caching
{
    public class HashCache<T> where T : IEquatable<T>
@Qiao-Jin (Contributor, Author) replied:

The logic is somewhat different: e.g. it doesn't need a bucketCapacity, and it doesn't check Contains upon adding, since that has already been checked in Blockchain.OnTransaction.

@Qiao-Jin (Contributor, Author) commented Sep 1, 2020

After merging #1507, Blockchain.OnTransaction has become the new bottleneck, as each newly arriving transaction must be checked against the hard disk, as explained in the corresponding issue. We do need such a cache of recent on-chain transactions to get rid of avoidable disk reads, even though the price is adding some extra data to Transaction; the benefit far outweighs the cost.

@roman-khimov (Contributor)

The problem here, to me, is that this code makes too many assumptions about the way it'd be tested. It fine-tunes for a very specific transaction flow, and it will surely improve things for that flow, but for a real congested network under heavy load it doesn't look that good. Transactions will arrive late and you'll have to make a full search anyway. Think about other benchmarks too: we're testing nodes with neo-bench, and I can tell you right now that this change won't improve the numbers there in any way, just because we create all (signed) transactions in advance and they differ only in nonce values (and they're perfectly valid!).

@Qiao-Jin (Contributor, Author) commented Sep 2, 2020

> but for a real congested network under big load it doesn't look that good

Please provide test results for this assumption, as in the issue.

> Transactions will arrive late and you'll have to make a full search anyway

How late? 10 blocks, or 20 blocks? Again, you need test results for this assumption. On the other hand, as long as testing gives us an average delay, we can set the cached height accordingly, can't we? As long as most cases are covered, this PR is effective, isn't it?

> I can tell you right now that this change won't improve the number there in any way, just because we're creating all (signed) transactions in advance and they only differ in nonce values (and they're perfectly valid!).

You should read the corresponding issue more carefully. There I described some methods to prevent that situation. It involves other repos and functionality, so it is not shown in this PR.

"In any way"? Please read before saying so, OK?

And once more: you like to ask people for test results as proof in their issues and PRs. Then why do you yourself dismiss others' ideas without any tests, relying only on your own assumptions?

Please, do some experiments. Then you can see how much time is spent on DB checks for incoming transactions.

@erikzhang (Member)

public bool ContainsTransaction(UInt256 hash)
{
    TransactionState state = Transactions.TryGet(hash);
    return state != null;
}

Maybe if we can optimize StoreView.ContainsTransaction(), we can solve all the problems.

@shargon shargon mentioned this pull request Sep 2, 2020
@shargon (Member) commented Sep 2, 2020

> Maybe if we can optimize StoreView.ContainsTransaction(), we can solve all the problems.

Agreed, we can optimize this method and take new benchmarks with RocksDB and LevelDB.

@roman-khimov (Contributor)

> Please, do some experiments.

That's what I've been doing for the past ~2 weeks, going from

[pprof profile screenshot: the HasTransaction rectangle dominates]

(it's that pesky HasTransaction rectangle we're talking about here) to

[pprof profile screenshot: HasTransaction no longer significant]

(which suddenly doesn't really have it)

I can certainly feel your pain in trying to squeeze more juice out of the node. And I'm absolutely sympathetic to any attempt at improving the node's performance; it's not easy. Yet at the same time I think there is a lot of room for improvement without any protocol changes. And any protocol change should be weighed very carefully and applied only if there is a real improvement that justifies it.

@Qiao-Jin (Contributor, Author) commented Sep 2, 2020

> That's what I've been doing for the past ~2 weeks going from

What's the TPS of your test? Is this a test result from neo-go? And what do the times in each block mean?

@roman-khimov (Contributor)

> What's the TPS of your test?

Let me reference our latest publicly verifiable results for now (you can grab neo-bench and get something similar; I'll reiterate here that part of the C# "failures" were due to RPC settings that were not taken into account by neo-bench, which is fixed in subsequent revisions of it): https://medium.com/@neospcc/neo-3-0-0-preview3-nodes-benchmarking-e0f447fdf6af

We have improved since then.

> Is this test result of Neo-Go?

Yes. But the protocol is the same. And the problems we're facing are very similar.

> And what does the times in each block mean?

It's standard Go pprof output (originally SVG, but GitHub doesn't allow attaching it). "42.87s" on an incoming arrow means that amount of time is spent in calls to this function; "0.02s of 42.87s" inside a rectangle means that only 0.02 seconds of the 42.87 are spent inside the function itself, with the rest spent in its internal calls.

@Qiao-Jin (Contributor, Author) commented Sep 2, 2020

I see. And now I know why we understand optimization differently: in your picture HasTransaction is not the bottleneck, but in my local environment it is. That's because locally I have already optimized many of the blocks in your picture. I use Akka remoting to send transaction data in batches via remote actors located in different actor systems (as in #1874), so transaction transmission is not a bottleneck, or at least not the most serious one; I use parallelized transaction verification as shown in #1507, so verification is not the bottleneck; and the logic of this PR is also in my local environment. And yes, the VM consumes time during Persist, and I'm thinking about solutions for that too.

@@ -285,6 +296,8 @@ public virtual VerifyResult VerifyStateDependent(StoreView snapshot, Transaction
{
    if (ValidUntilBlock <= snapshot.Height || ValidUntilBlock > snapshot.Height + MaxValidUntilBlockIncrement)
        return VerifyResult.Expired;
    if (snapshot.GetBlock(LastBlockHash)?.Index != ValidUntilBlock - MaxValidUntilBlockIncrement)
A reviewer (Member) commented:
@Qiao-Jin, shouldn't it be <= instead of !=? If not, could you, please, explain the meaning of this line?

@Qiao-Jin (Contributor, Author) replied Sep 4, 2020:

Why? LastBlockHash is used to judge whether ValidUntilBlock was honestly derived as Height + MaxValidUntilBlockIncrement.

@AnnaShaleva (Member) commented Sep 3, 2020

We also tested this branch against 765c43a with the help of neo-bench. Two configurations were tested under a load of 30 workers: a single C# node and a four-node network. As a result, there is no significant TPS improvement:

Branch          Single node TPS   Four nodes TPS
765c43a         3320.284          503.935
tx_hash_cache   3118.608          533.828

You can find the full benchmark results here.

@Qiao-Jin (Contributor, Author) commented Sep 4, 2020

> We also tested this branch against 765c43a with the help of neo-bench. Two configurations were tested under the load of 30 workers: single C# node and four-nodes network. As a result, there's no significant TPS improvements:
>
> Branch          Single node TPS   Four nodes TPS
> 765c43a         3320.284          503.935
> tx_hash_cache   3118.608          533.828
>
> The full benchmark results you can find here.

Please refer to this, and to my test results in the corresponding issue. This will become a bottleneck once tx verification and message transmission are optimized.

@Qiao-Jin (Contributor, Author) commented Sep 4, 2020

@AnnaShaleva @roman-khimov

1. What does it solve?

It avoids a large number of preventable DB searches for incoming transactions. Upon each incoming transaction, we search the mempool and then the DB to make sure the transaction cannot go on-chain more than once. But this strategy is problematic.

For each newly created incoming transaction, we eventually have to search LevelDB/RocksDB down to the last level. What does this mean? A lot of time is spent on hard-disk reads for every transaction. Worse, it becomes a disaster as the LevelDB data grows: imagine running this strategy once the DB has grown to many GB over time. More seriously, the upcoming state-root logic will produce much more DB data than we have now. Think about it.

2. Does it make "too many assumptions"?

I don't think so. First, tell me: what is TPS? Doesn't it mean transactions per second? So aren't transactions the center of that definition? What should we care about most? Shouldn't it include things like transaction transmission, transaction verification, transaction persisting, etc.? So tell me, why does checking transactions a node has not seen before involve "too many assumptions"? Every incoming transaction has not been seen before by a node, unless the node created it itself.

3. Is this PR the whole logic?

No. Some functionality has not been included because I think further discussion is needed, e.g. how to prevent people from repeatedly sending transactions with a low ValidUntilBlock. Sadly, so far I am the only one to come up with ideas on this topic (I also don't like strategies such as charging more GAS for the DB check, and am thinking about better solutions). I am waiting for other ideas, and one reason I opened this PR is to attract more people's attention to finding better solutions.

@erikzhang (Member)

@Qiao-Jin Do you think #1910 solves this problem?

@erikzhang closed this Sep 11, 2020
Linked issue: Add cache for recent on-chain transaction hashes to enhance TPS
6 participants