After a restart, a node doesn't respond to chunk requests that have non-empty list of requested receipts #2916

SkidanovAlex · 2020-06-27T22:04:12Z

No description provided.

…#3036)

After this change stress.py node_restart passes relatively consistently, and is reintroduced to nightly.

Nearcore fixes:
- We had a bug in the syncing logic (with a low chance of being triggered in the wild): if a block is produced, and between 1/3 and 2/3 of block producers have received it while the rest have not, the system stalls, because no 2/3 of block producers have the same head, but nobody is the two blocks behind the highest peer that are needed to start syncing. Fixing it by forcing sync if we've been 1 block behind for too long. stress.py was reproducing this issue in every run.
- (#2916) We had an issue where, if a node produced a chunk and then crashed, on recovery it was not able to serve the chunk, because the storage from which we recover cache entries in the shards manager did not contain all the parts and receipts. Fixing it by always storing all the parts and receipts (redundantly) for chunks in the shards we care about.

Test fixes:
- [v] Fixing a scenario in which a failure to send a transaction to all validators resulted in recording an incorrect tx hash alongside the tx. Later, checking balances with the incorrect hash returned an incorrect success value, and thus incorrect corrections were applied to the expected balances.
- [v] Changing the order of magnitude of staking transactions, so that the validator set actually changes.

Other issues discovered while fixing stress.py:
- #2906
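The #2916 failure mode described above can be illustrated with a toy model. This is only a sketch, not nearcore code: `Chunk`, `ShardsManager`, `produce_chunk`, `restart`, `handle_request`, and `serves_after_restart` are hypothetical names, and the assumption that the pre-fix node persisted only the parts it owned and no receipts is a simplification made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class Chunk:
    chunk_hash: str
    parts: Dict[int, bytes]     # part ordinal -> encoded part
    receipts: Dict[int, bytes]  # destination shard id -> receipts for that shard


@dataclass
class ShardsManager:
    """Toy model (hypothetical): a persistent store plus an in-memory cache."""
    store: Dict[str, Chunk] = field(default_factory=dict)  # survives a restart
    cache: Dict[str, Chunk] = field(default_factory=dict)  # lost on restart

    def produce_chunk(self, chunk: Chunk, owned_parts: Set[int],
                      store_everything: bool) -> None:
        # The full chunk is always available in memory right after production.
        self.cache[chunk.chunk_hash] = chunk
        if store_everything:
            # Post-fix behaviour: persist all parts and receipts redundantly.
            persisted = chunk
        else:
            # ASSUMED pre-fix behaviour: persist only owned parts, no receipts.
            owned = {o: p for o, p in chunk.parts.items() if o in owned_parts}
            persisted = Chunk(chunk.chunk_hash, owned, {})
        self.store[chunk.chunk_hash] = persisted

    def restart(self) -> None:
        # After a crash the cache is rebuilt only from what was persisted.
        self.cache = dict(self.store)

    def handle_request(self, chunk_hash: str, part_ords: List[int],
                       receipt_shards: List[int]) -> Optional[Chunk]:
        entry = self.cache.get(chunk_hash)
        if entry is None:
            return None
        # Respond only if every requested part and receipt is available.
        if any(o not in entry.parts for o in part_ords):
            return None
        if any(s not in entry.receipts for s in receipt_shards):
            return None
        return Chunk(chunk_hash,
                     {o: entry.parts[o] for o in part_ords},
                     {s: entry.receipts[s] for s in receipt_shards})


def serves_after_restart(store_everything: bool) -> bool:
    """Produce a chunk, restart, then request a part together with receipts."""
    chunk = Chunk("c1", {0: b"p0", 1: b"p1"}, {0: b"r0", 1: b"r1"})
    node = ShardsManager()
    node.produce_chunk(chunk, owned_parts={0}, store_everything=store_everything)
    node.restart()
    response = node.handle_request("c1", part_ords=[0], receipt_shards=[1])
    return response is not None
```

In this model, a request with a non-empty receipt list goes unanswered after a restart unless everything was persisted, which matches the symptom in the issue title; a request for an owned part with no receipts is still served either way.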

SkidanovAlex added the A-chain Area: Chain, client & related label Jun 27, 2020

SkidanovAlex self-assigned this Jun 27, 2020

ilblackdragon added the C-bug Category: This is a bug label Jun 29, 2020

weekly-digest bot mentioned this issue Jul 3, 2020

Weekly Digest (26 June, 2020 - 3 July, 2020) #2941

Closed

SkidanovAlex added a commit that referenced this issue Jul 24, 2020

Naive fix of #2916

a908038

SkidanovAlex added a commit that referenced this issue Jul 24, 2020

Fixing #2916 for real

bb0894c

SkidanovAlex mentioned this issue Jul 24, 2020

(fix): Making stress.py with node_restart mode pass, and fixing #2916 #3036

Merged

SkidanovAlex added a commit that referenced this issue Jul 30, 2020

Naive fix of #2916

401e930

SkidanovAlex added a commit that referenced this issue Jul 30, 2020

Fixing #2916 for real

6b657a6

bowenwang1996 closed this as completed Aug 31, 2020
