Batch status check queries for pending transactions in txsub #3552

bartekn · 2021-04-20T09:59:12Z

What version are you using?

2.1.0

What did you do?

@tomerweller was running into performance issues when submitting thousands of transactions to a standalone network. It looks like the problem is related to this code:

go/services/horizon/internal/txsub/system.go

Lines 307 to 325 in 56d94a2

    
           for _, hash := range sys.Pending.Pending(ctx) { 
        
           	tx, err := txResultByHash(db, hash) 
        
           	if err == nil { 
        
           		logger.WithField("hash", hash).Debug("finishing open submission") 
        
           		sys.Pending.Finish(ctx, hash, Result{Transaction: tx}) 
        
           		continue 
        
           	} 
        
           	if _, ok := err.(*FailedTransactionError); ok { 
        
           		logger.WithField("hash", hash).Debug("finishing open submission") 
        
           		sys.Pending.Finish(ctx, hash, Result{Transaction: tx, Err: err}) 
        
           		continue 
        
           	} 
        
           	if err != ErrNoResults { 
        
           		logger.WithStack(err).Error(err) 
        
           	} 
        
           }

With many pending transactions getting its' results, even when separate queries are fast, can cumulate to a very long updates time (few seconds time). The chart below shows the time distribution of tx status updates after an example ledger ingestion:

What did you expect to see?

When checking the status of pending transaction we should send a single batch query instead of thousands queries asking for a single tx result. This should make this magnitudes faster.

When working on this please remember about two things:

Postgres param limit of around 65k. If a batch query needs to send more params it should be called multiple times.
The query used right now is searching for both normal txs and fee bump txs. The batch query should do the same thing.

ire-and-curses · 2021-04-20T16:16:56Z

How do we decide when we've accumulated enough status checks to send a single batch query? There's a tradeoff here between not waiting and batching for performance.

bartekn · 2021-04-20T16:19:23Z

Currently we check it on every app tick which is every second, even when there are no pending txs.

…ctions in txsub (#3563) This commit modifies `txsub.Tick` to check the status of transactions in the queue by sending a batch query instead of sending separate queries to get status of each transaction in the queue. This was done due performance reasons. If the number of transactions in the queue is large this slows down entire `Tick` function. Data in #3552 suggest that previous method can take even 4s per `Tick` execution for around 1000 transactions in the queue.

bartekn added performance issues aimed at improving performance horizon labels Apr 20, 2021

bartekn self-assigned this Apr 22, 2021

bartekn linked a pull request Apr 22, 2021 that will close this issue

services/horizon/txsub: Batch status check queries for pending transactions in txsub #3563

Merged

7 tasks

bartekn mentioned this issue Jun 8, 2021

services/horizon/txsub: Batch status check queries for pending transactions in txsub #3563

Merged

7 tasks

bartekn modified the milestone: Horizon 2.5.0 Jun 14, 2021

bartekn closed this as completed in #3563 Jun 28, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch status check queries for pending transactions in txsub #3552

Batch status check queries for pending transactions in txsub #3552

bartekn commented Apr 20, 2021

ire-and-curses commented Apr 20, 2021

bartekn commented Apr 20, 2021

Batch status check queries for pending transactions in txsub #3552

Batch status check queries for pending transactions in txsub #3552

Comments

bartekn commented Apr 20, 2021

What version are you using?

What did you do?

What did you expect to see?

ire-and-curses commented Apr 20, 2021

bartekn commented Apr 20, 2021