validator: remove optional remote accounts hash consistency check #31279

t-nelson · 2023-04-20T05:22:45Z

Problem

old debug code lying around doing old debug code things

Summary of Changes

remove it

codecov · 2023-04-20T06:32:27Z

Codecov Report

Merging #31279 (02e6ee2) into master (7a393e4) will increase coverage by 0.0%.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master   #31279   +/-   ##
=======================================
  Coverage    81.5%    81.5%           
=======================================
  Files         733      733           
  Lines      207009   206941   -68     
=======================================
+ Hits       168731   168735    +4     
+ Misses      38278    38206   -72

brooksprumo

I'm OK with removing the --halt-on-trusted-validators-accounts-hash-mismatch CLI flag.

Without the flag, if a node calculates the accounts hash incorrectly, it'll find out later (basically whenever the account is next used, which has a maximum of rent collection duration). Maybe this is fine? I'm guessing most validators do not have this flag set anyway, so there's no change of behavior for them.

When the Epoch Accounts Hash feature is enabled, then the accounts hash will be part of consensus directly, since it'll be part of the bank hash once per epoch. That'll be the proper way to ensure safety for the whole cluster.

One interesting possibility is w.r.t. snapshot download in bootstrap. If a known validator calculates the accounts hash wrong due to a disk issue and an accounts storage file is bad, then it would be possible for a new validator to download this bad snapshot with the bad account. Again, it'll find out once that account is accessed next.

@HaoranYi, requesting your review here too, since you've recently been interacting with this code, specifically around the `accounts_hash_fault_injector`. Do you rely on --halt-on-trusted-validators-accounts-hash-mismatch for any testing? If not, can we also remove `accounts_hash_fault_injector`? (That would be for a different PR.)

HaoranYi · 2023-04-20T13:44:54Z

No, we don't rely on this cli argument for the fault injection test.

steviez

Given Brooks' insight about accounts hashes becoming part of consensus, I feel better about ripping this check out altogether.

I thought about whether keeping a warning in place would be useful, but I don't think it would be.

Suppose a node N is running with this flag for a set of known validators {K1, K2, ..., Kn}.
If one of the known validators Ki deviates, N would get a warning.
But N's operator can't do anything to fix Ki directly, so a warning seemingly isn't very helpful.
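To make the scenario concrete, here is a minimal sketch of the kind of comparison such a check performs: node N compares its own accounts hash for a slot against whatever the known validators reported for that slot. Every name below (`Slot`, `Hash`, `ValidatorId`, `mismatched_known_validators`) is a hypothetical stand-in for illustration and is not taken from the validator's code.

```rust
use std::collections::HashMap;

// Hypothetical stand-ins for illustration; not the validator's real types.
type Slot = u64;
type Hash = [u8; 32];
type ValidatorId = &'static str;

/// Returns the known validators whose reported accounts hash for `slot`
/// differs from ours. Validators with no report for that slot are skipped.
fn mismatched_known_validators(
    slot: Slot,
    our_hash: &Hash,
    known_reports: &HashMap<ValidatorId, HashMap<Slot, Hash>>,
) -> Vec<ValidatorId> {
    let mut mismatched: Vec<ValidatorId> = known_reports
        .iter()
        .filter_map(|(id, hashes)| match hashes.get(&slot) {
            Some(their_hash) if their_hash != our_hash => Some(*id),
            _ => None,
        })
        .collect();
    // Sort so the result does not depend on HashMap iteration order.
    mismatched.sort_unstable();
    mismatched
}
```

Note that a non-empty result only says that N and Ki disagree. It does not say which of the two computed the wrong hash, which is why neither halting N nor warning N's operator is an obviously correct response.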

steviez · 2023-04-20T15:34:30Z

validator/src/main.rs

    if matches.is_present("halt_on_known_validators_accounts_hash_mismatch") {
-        validator_config.halt_on_known_validators_accounts_hash_mismatch = true;
+        warn!("the `--halt-on-known-validators-accounts-hash-mismatch` argument is deprecated. please remove it from the command line");


Check out this struct and the code that follows it for deprecated args. It:

- Allows for consistent warning messages across deprecated args
- Gets deprecated arg handling out of the way of actual logic

Unfortunately, it's not immediately obvious that stuff should be moved there unless you're already aware of it.

solana/validator/src/cli.rs

Line 1636 in 04bbf3b

struct DeprecatedArg {
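
As a rough illustration of why a central table helps, the sketch below builds identically worded warnings from a list of deprecated flags. The struct name `DeprecatedArgSketch`, its fields, and both helper functions are assumptions made for this example; the real `DeprecatedArg` in `validator/src/cli.rs` may look quite different.

```rust
/// Hypothetical shape for illustration; the real `DeprecatedArg` in
/// `validator/src/cli.rs` may have different fields.
struct DeprecatedArgSketch {
    /// Long flag name, without the leading dashes.
    name: &'static str,
    /// Optional hint appended after the generic deprecation notice.
    usage_hint: Option<&'static str>,
}

/// Builds one uniformly worded warning for a deprecated flag.
fn deprecation_warning(arg: &DeprecatedArgSketch) -> String {
    let mut msg = format!("--{} is deprecated", arg.name);
    if let Some(hint) = arg.usage_hint {
        msg.push_str(": ");
        msg.push_str(hint);
    }
    msg
}

/// Returns a warning for every deprecated flag present on the command line.
fn warnings_for(args: &[&str], deprecated: &[DeprecatedArgSketch]) -> Vec<String> {
    deprecated
        .iter()
        .filter(|d| args.iter().any(|a| a.strip_prefix("--") == Some(d.name)))
        .map(deprecation_warning)
        .collect()
}
```

With a table like this, the wording lives in one place and the main argument-handling code never has to mention the deprecated flag at all.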

steviez · 2023-05-15T15:40:05Z

I think we still want this. I just had one minor request, and it looks like things need a merge resolution now.

steviez

LGTM.

Looking at this again after a few weeks, I still think ripping this out is the right move. If this flag had wide adoption, a single node experiencing a bug or fault could cause a domino effect.

Additionally, we can't know whether it was our node or the node we're checking against that deviated on a slot, yet this code makes only our node panic. Hypothetically, we could sample N nodes, but to make that robust we'd basically be implementing a stripped-down consensus. Better to kill this altogether and let the feature Brooks previously mentioned (accounts hash becoming part of consensus) take effect.

t-nelson · 2023-05-16T19:20:47Z

no idea why we're wasting ci/dev resources to keep this list sorted...


t-nelson requested review from brooksprumo, mvines and steviez April 20, 2023 05:22

brooksprumo reviewed Apr 20, 2023


brooksprumo requested a review from HaoranYi April 20, 2023 12:14

steviez reviewed Apr 20, 2023


github-actions bot added the stale label May 5, 2023

github-actions bot closed this May 15, 2023

steviez reopened this May 15, 2023

t-nelson force-pushed the avhp branch from 02e6ee2 to ed6d026 Compare May 15, 2023 23:31

t-nelson removed the stale label May 16, 2023

t-nelson force-pushed the avhp branch from ed6d026 to e53bc22 Compare May 16, 2023 05:26

steviez previously approved these changes May 16, 2023


validator: remove optional remote accounts hash consistency check

2027036

t-nelson dismissed steviez’s stale review via 2027036 May 16, 2023 19:21

t-nelson force-pushed the avhp branch from e53bc22 to 2027036 Compare May 16, 2023 19:21

steviez approved these changes May 16, 2023


t-nelson merged commit ad67fd5 into solana-labs:master May 16, 2023

t-nelson deleted the avhp branch May 16, 2023 20:23

CriesofCarrots mentioned this pull request May 17, 2023

Eradicate zombie RPC threads #31688

Merged

wen-coding pushed a commit to wen-coding/solana that referenced this pull request May 18, 2023

validator: remove optional remote accounts hash consistency check (so…

950f649

…lana-labs#31279)
