power outage and corrupted db #199

adamkrellenstein · 2014-12-24T13:27:48Z

My database is on an external drive which was (accidentally) suddenly unplugged while btcd was running. Now I get this error when making a particular getrawtransaction call. What can I do to repair the database?

08:24:51 2014-12-24 [WRN] BCDB: unable to decode tx block 335644 00000000000000001259f32bc7942a729547e66fad32525c3803bc83720d86e4 txoff 545 txlen 192
08:24:51 2014-12-24 [ERR] RPCS: Error fetching tx: unexpected EOF

The API returns {'code': -5, 'message': 'No information available about transaction'}.

The text was updated successfully, but these errors were encountered:

davecgh · 2014-12-24T19:07:54Z

Can you dig up the database LOG file and paste it please? This is almost positively going to be an issue in goleveldb that we'll have to work with the upstream author on.

adamkrellenstein · 2014-12-28T14:09:19Z

LevelDB log and btcd log are here: https://gist.github.com/adamkrellenstein/4ad68fa2c8d6ee7ed80e

This commit contains a complete redesign and rewrite of the database package that approaches things in a vastly different manner than the previous version. This is the first part of several stages that will be needed to ultimately make use of this new package. Some of the reason for this were discussed in btcsuite#255, however a quick summary of some of the reasons for the major redesign are as follows: - The previous database could only contain blocks on the main chain and reorgs required deleting the blocks from the database. This made it impossible to store orphans and could make external RPC calls for information about blocks during the middle of a reorg fail. - The previous database interface forced a high level of bitcoin-specific intellegience such as spend tracking into each backend driver. - The aforementioned point led to making it difficult to implement new backend drivers due to the need to repeat a lot of non-trivial logic which is better handled at a higher layer, such as the blockchain package. - The old database stored all blocks in leveldb. This made it extremely inefficient to do things such as lookup headers and individual transactions since the entire block had to be loaded from leveldb (which entails it doing data copies) to get access. - The vast majority of database activity after the initial block download is read activity, however leveldb, as its name implies, is optimized for levelled write performance as opposed to read performance. This is really the exact opposite of the desired characteristics for a bitcoin database. In order to address all of these concerns, and others not mentioned, the database interface has been redesigned as follows: - Two main categories of functionality are provided: block storage and metadata storage - All block storage and metadata storage are done via read-only and read-write MVCC transactions with both manual and managed modes - Support for multiple concurrent readers and a single writer - Readers use a snapshot and therefore are not blocked by the writer - Some key properties of the block storage and retrieval API: - It is generic and does NOT contain additional bitcoin logic such spend tracking and block linking - Provides access to the raw serialized bytes so deserialization is not forced for callers that don't need it - Support for fetching headers via independent functions which allows implementations to provide significant optimizations - Ability to efficiently retrieve arbitrary regions of blocks (transactions, scripts, etc) - A rich meatdata storage API is provided: - Key/value with arbitrary data - Support for buckets and nested buckets - Bucket iteration through a couple of differente mechanisms - Cursors for efficient and direct key seeking - Supports registration of backend database implementations - Comprehensive test coverage - Provides strong documentation with example usage This commit also contains an implementation of the previously discussed interface named ffboltdb (flat file plus boltdb metadata backend). Here is a quick overview: - Highly optimized for read performance - All blocks are stored in flat files on the file system - Bulk block region fetching is optimized to perform linear reads which improves performance on spindle disks - The metadata storage uses boltdb under the hood which provides fast memory-mapped access - NOTE: Due to the fact the database can get quite large and due to how the OS does memory-mapped files this can result in the process showing huge amount of memory used. This is typical for memory-mapped files, but the key thing to remember is the reported memory is not _really_ used because the OS will relinquish it as soon as other processes need it - Anti-corruption mechanisms: - Flat files contain full block checksums to quickly an easily detect database corruption without needing to do expensive merkle root calculations - Metadata checksums - Open reconciliation - Extensive test coverage: - Comprehensive blackbox interface testing - Whitebox testing which uses intimate knowledge to exercise uncommon failure paths such as deleting files out from under the database - Corruption tests (replacing random data in the files) In addition, this commit also contains a new tool under the new database directory named dbtool which provides a few basic commands for testing the database. It is designed around commands, so it could be useful to expand on in the future. Finally, this commit addresses the following issues: - Adds support for and therefore closes btcsuite#255 - Fixes btcsuite#199 - Fixes btcsuite#201 - Implements and closes btcsuite#256 - Obsoletes and closes btcsuite#257 - Closes btcsuite#247 once the required chain and btcd modifications are in place to make use of this new code

This commit contains a complete redesign and rewrite of the database package that approaches things in a vastly different manner than the previous version. This is the first part of several stages that will be needed to ultimately make use of this new package. Some of the reason for this were discussed in btcsuite#255, however a quick summary of some of the reasons for the major redesign are as follows: - The previous database could only contain blocks on the main chain and reorgs required deleting the blocks from the database. This made it impossible to store orphans and could make external RPC calls for information about blocks during the middle of a reorg fail. - The previous database interface forced a high level of bitcoin-specific intellegience such as spend tracking into each backend driver. - The aforementioned point led to making it difficult to implement new backend drivers due to the need to repeat a lot of non-trivial logic which is better handled at a higher layer, such as the blockchain package. - The old database stored all blocks in leveldb. This made it extremely inefficient to do things such as lookup headers and individual transactions since the entire block had to be loaded from leveldb (which entails it doing data copies) to get access. - The vast majority of database activity after the initial block download is read activity, however leveldb, as its name implies, is optimized for leveled write performance as opposed to read performance. In order to address all of these concerns, and others not mentioned, the database interface has been redesigned as follows: - Two main categories of functionality are provided: block storage and metadata storage - All block storage and metadata storage are done via read-only and read-write MVCC transactions with both manual and managed modes - Support for multiple concurrent readers and a single writer - Readers use a snapshot and therefore are not blocked by the writer - Some key properties of the block storage and retrieval API: - It is generic and does NOT contain additional bitcoin logic such spend tracking and block linking - Provides access to the raw serialized bytes so deserialization is not forced for callers that don't need it - Support for fetching headers via independent functions which allows implementations to provide significant optimizations - Ability to efficiently retrieve arbitrary regions of blocks (transactions, scripts, etc) - A rich metadata storage API is provided: - Key/value with arbitrary data - Support for buckets and nested buckets - Bucket iteration through a couple of different mechanisms - Cursors for efficient and direct key seeking - Supports registration of backend database implementations - Comprehensive test coverage - Provides strong documentation with example usage This commit also contains an implementation of the previously discussed interface named ffboltdb (flat file plus boltdb metadata backend). Here is a quick overview: - Highly optimized for read performance - All blocks are stored in flat files on the file system - Bulk block region fetching is optimized to perform linear reads which improves performance on spindle disks - The metadata storage uses boltdb under the hood which provides fast memory-mapped access - NOTE: Due to the fact the database can get quite large and due to how the OS does memory-mapped files this can result in the process showing huge amount of memory used. This is typical for memory-mapped files, but the key thing to remember is the reported memory is not _really_ used because the OS will relinquish it as soon as other processes need it - Anti-corruption mechanisms: - Flat files contain full block checksums to quickly an easily detect database corruption without needing to do expensive merkle root calculations - Metadata checksums - Open reconciliation - Extensive test coverage: - Comprehensive blackbox interface testing - Whitebox testing which uses intimate knowledge to exercise uncommon failure paths such as deleting files out from under the database - Corruption tests (replacing random data in the files) In addition, this commit also contains a new tool under the new database directory named dbtool which provides a few basic commands for testing the database. It is designed around commands, so it could be useful to expand on in the future. Finally, this commit addresses the following issues: - Adds support for and therefore closes btcsuite#255 - Fixes btcsuite#199 - Fixes btcsuite#201 - Implements and closes btcsuite#256 - Obsoletes and closes btcsuite#257 - Closes btcsuite#247 once the required chain and btcd modifications are in place to make use of this new code

This commit contains a complete redesign and rewrite of the database package that approaches things in a vastly different manner than the previous version. This is the first part of several stages that will be needed to ultimately make use of this new package. Some of the reason for this were discussed in btcsuite#255, however a quick summary is as follows: - The previous database could only contain blocks on the main chain and reorgs required deleting the blocks from the database. This made it impossible to store orphans and could make external RPC calls for information about blocks during the middle of a reorg fail. - The previous database interface forced a high level of bitcoin-specific intelligence such as spend tracking into each backend driver. - The aforementioned point led to making it difficult to implement new backend drivers due to the need to repeat a lot of non-trivial logic which is better handled at a higher layer, such as the blockchain package. - The old database stored all blocks in leveldb. This made it extremely inefficient to do things such as lookup headers and individual transactions since the entire block had to be loaded from leveldb (which entails it doing data copies) to get access. - The vast majority of database activity after the initial block download is read activity, however leveldb, as its name implies, is optimized for leveled write performance at the expense of read performance. In order to address all of these concerns, and others not mentioned, the database interface has been redesigned as follows: - Two main categories of functionality are provided: block storage and metadata storage - All block storage and metadata storage are done via read-only and read-write MVCC transactions with both manual and managed modes - Support for multiple concurrent readers and a single writer - Readers use a snapshot and therefore are not blocked by the writer - Some key properties of the block storage and retrieval API: - It is generic and does NOT contain additional bitcoin logic such spend tracking and block linking - Provides access to the raw serialized bytes so deserialization is not forced for callers that don't need it - Support for fetching headers via independent functions which allows implementations to provide significant optimizations - Ability to efficiently retrieve arbitrary regions of blocks (transactions, scripts, etc) - A rich metadata storage API is provided: - Key/value with arbitrary data - Support for buckets and nested buckets - Bucket iteration through a couple of different mechanisms - Cursors for efficient and direct key seeking - Supports registration of backend database implementations - Comprehensive test coverage - Provides strong documentation with example usage This commit also contains an implementation of the previously discussed interface named ffboltdb (flat file plus boltdb metadata backend). Here is a quick overview: - Highly optimized for read performance - All blocks are stored in flat files on the file system - Bulk block region fetching is optimized to perform linear reads which improves performance on spindle disks - The metadata storage uses boltdb under the hood which provides fast memory-mapped access - NOTE: Due to the fact the database can get quite large and due to how the OS does memory-mapped files this can result in the process showing a huge amount of memory used. This is typical for memory-mapped files, but the key thing to remember is the reported memory is not really used because the OS will relinquish it as soon as other processes need it - Anti-corruption mechanisms: - Flat files contain full block checksums to quickly an easily detect database corruption without needing to do expensive merkle root calculations - Metadata checksums - Open reconciliation - Extensive test coverage: - Comprehensive blackbox interface testing - Whitebox testing which uses intimate knowledge to exercise uncommon failure paths such as deleting files out from under the database - Corruption tests (replacing random data in the files) In addition, this commit also contains a new tool under the new database directory named dbtool which provides a few basic commands for testing the database. It is designed around commands, so it could be useful to expand on in the future. Finally, this commit addresses the following issues: - Adds support for and therefore closes btcsuite#255 - Fixes btcsuite#199 - Fixes btcsuite#201 - Implements and closes btcsuite#256 - Obsoletes and closes btcsuite#257 - Closes btcsuite#247 once the required chain and btcd modifications are in place to make use of this new code

This commit contains a complete redesign and rewrite of the database package that approaches things in a vastly different manner than the previous version. This is the first part of several stages that will be needed to ultimately make use of this new package. Some of the reason for this were discussed in btcsuite#255, however a quick summary is as follows: - The previous database could only contain blocks on the main chain and reorgs required deleting the blocks from the database. This made it impossible to store orphans and could make external RPC calls for information about blocks during the middle of a reorg fail. - The previous database interface forced a high level of bitcoin-specific intelligence such as spend tracking into each backend driver. - The aforementioned point led to making it difficult to implement new backend drivers due to the need to repeat a lot of non-trivial logic which is better handled at a higher layer, such as the blockchain package. - The old database stored all blocks in leveldb. This made it extremely inefficient to do things such as lookup headers and individual transactions since the entire block had to be loaded from leveldb (which entails it doing data copies) to get access. In order to address all of these concerns, and others not mentioned, the database interface has been redesigned as follows: - Two main categories of functionality are provided: block storage and metadata storage - All block storage and metadata storage are done via read-only and read-write MVCC transactions with both manual and managed modes - Support for multiple concurrent readers and a single writer - Readers use a snapshot and therefore are not blocked by the writer - Some key properties of the block storage and retrieval API: - It is generic and does NOT contain additional bitcoin logic such spend tracking and block linking - Provides access to the raw serialized bytes so deserialization is not forced for callers that don't need it - Support for fetching headers via independent functions which allows implementations to provide significant optimizations - Ability to efficiently retrieve arbitrary regions of blocks (transactions, scripts, etc) - A rich metadata storage API is provided: - Key/value with arbitrary data - Support for buckets and nested buckets - Bucket iteration through a couple of different mechanisms - Cursors for efficient and direct key seeking - Supports registration of backend database implementations - Comprehensive test coverage - Provides strong documentation with example usage This commit also contains an implementation of the previously discussed interface named ffldb (flat file plus leveldb metadata backend). Here is a quick overview: - Highly optimized for read performance with consistent write performance regardless of database size - All blocks are stored in flat files on the file system - Bulk block region fetching is optimized to perform linear reads which improves performance on spindle disks - Anti-corruption mechanisms: - Flat files contain full block checksums to quickly an easily detect database corruption without needing to do expensive merkle root calculations - Metadata checksums - Open reconciliation - Extensive test coverage: - Comprehensive blackbox interface testing - Whitebox testing which uses intimate knowledge to exercise uncommon failure paths such as deleting files out from under the database - Corruption tests (replacing random data in the files) In addition, this commit also contains a new tool under the new database directory named dbtool which provides a few basic commands for testing the database. It is designed around commands, so it could be useful to expand on in the future. Finally, this commit addresses the following issues: - Adds support for and therefore closes btcsuite#255 - Fixes btcsuite#199 - Fixes btcsuite#201 - Implements and closes btcsuite#256 - Obsoletes and closes btcsuite#257 - Closes btcsuite#247 once the required chain and btcd modifications are in place to make use of this new code

…ug-fix blockchain: fix bridge node bug when enabling -utreexoproofindexmaxmemory=-1

davecgh added the bug label Dec 30, 2014

davecgh added the database label Jan 27, 2015

Roasbeef mentioned this issue Feb 17, 2015

--addrindex doesn't work on preprocessed blocks #303

Closed

davecgh mentioned this issue Apr 16, 2015

database: Major redesign of database package. #380

Merged

4 tasks

conformal-deploy closed this as completed in #380 Feb 3, 2016

davecgh mentioned this issue Mar 16, 2016

database: Major redesign of database package decred/dcrd#91

Merged

3 tasks

davecgh pushed a commit to davecgh/btcd that referenced this issue May 20, 2016

Use the correct heap sorting function (btcsuite#199)

c3d7ede

kcalvinalvin added a commit to kcalvinalvin/btcd that referenced this issue Nov 29, 2024

Merge pull request btcsuite#199 from kcalvinalvin/2024-09-02-bridge-b…

87a8b19

…ug-fix blockchain: fix bridge node bug when enabling -utreexoproofindexmaxmemory=-1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

power outage and corrupted db #199

power outage and corrupted db #199

adamkrellenstein commented Dec 24, 2014

davecgh commented Dec 24, 2014

adamkrellenstein commented Dec 28, 2014

power outage and corrupted db #199

power outage and corrupted db #199

Comments

adamkrellenstein commented Dec 24, 2014

davecgh commented Dec 24, 2014

adamkrellenstein commented Dec 28, 2014