-
Notifications
You must be signed in to change notification settings - Fork 743
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate flaky sampling tests #6319
Comments
I recall the flakiness of this test to increase with #6256 as a pointer to start investigating |
When I reproduced the test failure, A solution that came to mind is to add another supernode (making a total of two) to ensure that the node can retry sampling. What do you think? lighthouse/beacon_node/network/src/sync/sampling.rs Lines 570 to 586 in f75a2cf
|
Oh I think it may also be related to #6303 - the lighthouse/beacon_node/lighthouse_network/src/peer_manager/peerdb.rs Lines 250 to 264 in c0b4f01
and with #6308 we no longer use ENR for choosing peers, instead we use metadata to populate lighthouse/beacon_node/lighthouse_network/src/peer_manager/mod.rs Lines 726 to 733 in c0b4f01
In the test rig, only the supernode has lighthouse/beacon_node/lighthouse_network/src/peer_manager/peerdb.rs Lines 720 to 726 in c0b4f01
So this means the remaining 100 peers we added here are not useful, because custody_subnets are empty for them: lighthouse/beacon_node/network/src/sync/block_lookups/tests.rs Lines 401 to 408 in ed7cd3b
I think we can either
|
+1 on being as close to reality as possible |
Fixed in #6382, thanks @ackintosh! 🚀 |
Description
The test
network sync::block_lookups::tests::sampling_with_retries
is failing on CI intermittently, seems like it could be a race condition somewhere.https://github.com/sigp/lighthouse/actions/runs/10592162826/job/29350969832?pr=6312
The text was updated successfully, but these errors were encountered: