-
Notifications
You must be signed in to change notification settings - Fork 754
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
network: Some network error occurred when fetching erasure chunk #6587
Comments
Reproduces on libp2p as well, not isolated to litep2p:
|
Are all the dial failures when dialing the same peer? maybe there are some bad validators that are not responsive |
After looking a bit into warnings, some overlaps are having the same We were able to reproduce this for a breath moment on libp2p as well. It might be a side-effect of the kusama spamming that happened yesterday, we'll try to reproduce this issue in versi with extra stress testing 🙏 |
LogsWarnigns happened for a brief period of 7 minutes over the weekend in kusama-waw10 (litep2p) validator:
Considering this recovered, I would lean towards authority-records not being discovered / propagated for litep2p component. Pending investigation. Code LogicWarning originates from availability-distribution: polkadot-sdk/polkadot/node/network/availability-distribution/src/requester/fetch_task/mod.rs Lines 437 to 451 in 9fe8cff
The request is initiated with polkadot-sdk/polkadot/node/network/availability-distribution/src/requester/fetch_task/mod.rs Lines 359 to 368 in 9fe8cff
This is then handled by the bridge/tx component, which updates metrics and forwards it to the network service: polkadot-sdk/polkadot/node/network/bridge/src/tx/mod.rs Lines 323 to 331 in 9fe8cff
A dial failure response is provided immediately if the authority discovery mechanism cannot translate the authority ID into a Multiaddr: polkadot-sdk/polkadot/node/network/bridge/src/network.rs Lines 293 to 309 in 9fe8cff
Possible Causes
|
I've seen this occasionally either at startup when the network wasn't full discovered or because we aren't connected to 100% percent of the nodes, this log should tell you how well connected a given node is: https://github.com/paritytech/polkadot-sdk/blob/master/polkadot/node/network/gossip-support/src/lib.rs#L608. |
Indeed, I confirm this happened at startup after another crash related to:
Grafana link: https://grafana.teleport.parity.io/goto/_2BODBnHg?orgId=1 We also see the following messages at sartup, @alexggh did you see them before? Do you think they are related?
|
Yeah, I've seen those in the past at startup as far as I can tell they are completely harmless. |
Kusama litep2p node is spamming the following earnings after a while:
Offhand it looks like the node is not able to dial peers:
DialFailure
. Pending confirmation on libp2p node.Version deployed: version 1.16.1-ca8beaed148
Grafana link: https://grafana.teleport.parity.io/goto/K-F2Od7NR?orgId=1
cc @paritytech/sdk-node
The text was updated successfully, but these errors were encountered: