While importing blocks, the EL fails with various kinds of errors (bad merkle state, internal error, etc.) as well as possible timeout or connection-refused errors.
Currently all of these are treated as block verification errors and the peer is downscored. This leads to peers being downscored and banned, so lodestar soon loses all its peers and goes out of sync! lodestar doesn't find peers until the peerstore directory is removed, and then the same cycle happens again (as the EL errors again).
Expected:
The EL executePayload call should be wrapped in a try/catch for these errors, and executionEngine should respond with a separate status. The error should be logged, but lodestar shouldn't penalize these peers and should still accept these blocks, as it does with syncing.
However, there is a debate in the devnet debug group about whether CLs should try to discriminate between these errors (e.g. treat a merkle state error as invalid). Waiting until that discussion resolves.
Observed with: lodestar <> nethermind
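A rough sketch of the expected behavior described above, in TypeScript. All names here (`ExecutePayloadResult`, `ENGINE_ERROR`, `executePayloadSafe`, `decideImport`, `EngineCall`) are hypothetical and only illustrate the idea; they are not lodestar's actual API, and whether some EL errors (e.g. bad merkle state) should map to invalid instead is still the open question mentioned above.

```typescript
// Hypothetical types for illustration only; not lodestar's real interfaces.
type EngineStatus = "VALID" | "INVALID" | "SYNCING";
type ExecutePayloadStatus = EngineStatus | "ENGINE_ERROR";

interface ExecutePayloadResult {
  status: ExecutePayloadStatus;
  error?: string; // set only when the EL call itself failed
}

// Stand-in for the call to the EL. Assumed to throw on timeout,
// connection refused, or an internal EL error.
type EngineCall = (payload: unknown) => Promise<EngineStatus>;

// Wrap the EL call so that EL malfunctions become a separate status
// instead of surfacing as a block verification error.
async function executePayloadSafe(
  call: EngineCall,
  payload: unknown
): Promise<ExecutePayloadResult> {
  try {
    return {status: await call(payload)};
  } catch (e) {
    const message = e instanceof Error ? e.message : String(e);
    // Log the error, but do not rethrow it.
    console.warn(`EL error while executing payload: ${message}`);
    return {status: "ENGINE_ERROR", error: message};
  }
}

interface ImportDecision {
  acceptBlock: boolean;
  optimistic: boolean; // accepted without confirmation from the EL
  penalizePeer: boolean;
}

// Only an explicit INVALID from the EL penalizes the peer. An EL error is
// handled like SYNCING: accept the block and leave the peer score alone.
function decideImport(result: ExecutePayloadResult): ImportDecision {
  if (result.status === "VALID") {
    return {acceptBlock: true, optimistic: false, penalizePeer: false};
  }
  if (result.status === "INVALID") {
    return {acceptBlock: false, optimistic: false, penalizePeer: true};
  }
  return {acceptBlock: true, optimistic: true, penalizePeer: false};
}
```

With this shape, a refused connection to the EL yields `ENGINE_ERROR`, and `decideImport` returns `penalizePeer: false`, so the peer that served the block keeps its score.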
Oh good points! With downgrade do you mean downscore?
yes 🙂
g11tech changed the title from "CL<>EL interop: Peers are getting downgraded when EL errors on payload execution" to "CL<>EL interop: Peers are getting downscored when EL errors on payload execution" on Dec 17, 2021
@tuyennhv yes, this is the same issue. I have cleared the peerstore and restarted the nodes, and they now seem to have a good peer count. Once PR #3545 gets merged, peers shouldn't get penalized for EL malfunctions.