Memory optimizations for large address queries #392

braydonf · 2016-01-04T15:38:08Z

Switches to use transform streams for output and input queries for better memory usage
Adds configurable limits to queries to avoid several minute long queries being enabled by default
Refactors address history and address summary methods
Limits the number of files that leveldb keeps open in cache
Cleaner shutdown by removing bitcoin event listeners
Closes: Address Service: Resolve memory issues with large queries #372

New Address Service Options:

maxInputsQueryLength (default 50,000) // The maximum number of inputs per query
maxOutputsQueryLength (default 50,000) // The maximum number of outputs per query
maxHistoryQueryLength (default 100) // The maximum number of transactions per query
maxAddressesQuery (default 10,000) // The maximum number of addresses per query

New DB Service Options:

maxOpenFiles (default 200) // The maximum number of files that leveldb keeps in cache

…rge queries

… block heights

… time

- Restored functionality to be able to query the history of multiple addresses in one query - Sorted mempool transactions by timestamp in txid lists

- Refactored getAddressSummary and added several tests - Fixed bugs revealed from the integration regtests - Updated many unit tests

Querying addresses that have millions of transactions is supported however takes hundreds of seconds to fully calculate the balance. Creating a cache of previous results wasn't currently working because the `isSpent` query is always based on the current bitcoind tip. Thus the balance of the outputs would be included however wouldn't be removed when spent as the output wouldn't be checked again when querying for blocks past the last checkpoint. Including the satoshis in the inputs address index would make it possible to subtract the spent amount, however this degrades optimizations elsewhere. The syncing times or querying for addresses with 10,000 transactions per address. It may preferrable to have an additional address service that handles high-volume addresses be on an opt-in basis so that a custom running client could select high volume addresses to create optimizations for querying balances and history. The strategies for creating indexes differs on these use cases.

…ion`

braydonf · 2016-01-18T21:00:20Z

Ready for wider testing

There was an issue where streams would still be held open if "pause" was called before "end", this would lead to http requests from the insight-api not being returned with an error status as soon as possible but would instead stay open.

kleetus · 2016-01-27T19:45:02Z

LGTM

Memory optimizations for large address queries

braydonf added the in progress label Jan 4, 2016

Braydon Fuller added 7 commits January 11, 2016 16:17

Address Service: Start to use streams for memory optimization with la…

cab25cf

…rge queries

Address Service: Start to cache getAddressSummary based on range of…

40eb4f5

… block heights

Address Service: Limit the length of outputs that can be queried at a…

cef2f76

… time

Address Service: Use streams to combine inputs and outputs

8298e38

Address Service: Use address summary cache for pagination

5c4f3c4

Address Service: Restored multi-address history queries

8d2f69c

- Restored functionality to be able to query the history of multiple addresses in one query - Sorted mempool transactions by timestamp in txid lists

Address Service: Fixed HASH_TYPES_MAP naming issue

188ff28

braydonf force-pushed the large-queries branch 9 times, most recently from 71e892d to 7b0464d Compare January 13, 2016 22:11

Address Service: Fixed many bugs from tests

4fcec87

- Refactored getAddressSummary and added several tests - Fixed bugs revealed from the integration regtests - Updated many unit tests

braydonf force-pushed the large-queries branch from 7b0464d to 4fcec87 Compare January 13, 2016 22:16

Braydon Fuller added 4 commits January 14, 2016 17:17

Address Service: Updated tests and fixed various bugs

e79c00d

Address Service: More tests for history

3d9b6d5

Address Service: Added test for history `getAddressDetailsForTransact…

687400e

…ion`

braydonf force-pushed the large-queries branch from 820080d to af72d40 Compare January 18, 2016 19:57

Address Service: Removed event listeners prior to stopping

62934b4

braydonf force-pushed the large-queries branch from af72d40 to 62934b4 Compare January 18, 2016 20:01

Braydon Fuller added 2 commits January 18, 2016 15:06

Address Service: Removed nolonger used constant for cache

a166b6a

Address Service: Sort mempool txids

d4f2df5

braydonf force-pushed the large-queries branch from 336c316 to d4f2df5 Compare January 18, 2016 20:56

braydonf changed the title ~~WIP: Memory optimizations for large address queries~~ Memory optimizations for large address queries Jan 18, 2016

Braydon Fuller added 5 commits January 18, 2016 16:03

Address Service: Include default callback earlier

e498e0f

Address Service: Sort after unconfirmed and confirmed

4502903

Address Service: Bump maximum number of addresses default

39f8355

Address Service: Fixed test for max address limit

a2acc0c

Address Service: End stream without pausing first

3d7fb6f

There was an issue where streams would still be held open if "pause" was called before "end", this would lead to http requests from the insight-api not being returned with an error status as soon as possible but would instead stay open.

kleetus added a commit that referenced this pull request Jan 27, 2016

Merge pull request #392 from braydonf/large-queries

b0a0f62

Memory optimizations for large address queries

kleetus merged commit b0a0f62 into bitpay:master Jan 27, 2016

kleetus removed the in progress label Jan 27, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory optimizations for large address queries #392

Memory optimizations for large address queries #392

braydonf commented Jan 4, 2016

braydonf commented Jan 18, 2016

kleetus commented Jan 27, 2016

Memory optimizations for large address queries #392

Memory optimizations for large address queries #392

Conversation

braydonf commented Jan 4, 2016

braydonf commented Jan 18, 2016

kleetus commented Jan 27, 2016