-
Notifications
You must be signed in to change notification settings - Fork 2.6k
Finalization issue #2304
Comments
Could you try to collect some logs with |
4 validators( 3 validators in 14046, 1 validators in 14071), may be 3 validators afg not > 2/3 weight. may be restart 4 validators resume grandpa |
@gguoss I'm assuming the authorities are stuck on different rounds, if you collect logs for |
Logs from validators and some other nodes https://gist.github.com/xlc/82e9c35d95f9e400134de047d6dfea67 |
So it seems that one of the validators progressed to the next round (maybe because the other authorities didn't see its vote), while the other authorities are stuck in round 550 and probably don't have threshold stake to finalize. What you should do to get finality started again is disable all validators and copy the database from validator 0 into the other validators' nodes, this way when you restart the nodes they'll all be at round 551. We are working on improvements to fix these situations where it can get stuck with a small amount of validators (9631622 was recently merged which should help as well). |
Thanks. I will upgrade the substrate version and do the fix next week and report the results here. |
Tried copy the db of validator 0 to other validators and reset all other nodes and it breaks the connection somehow. Maybe relates to #2335. I am going to pull latest substrate and reset the testnet and see if this happens again. |
Not happening anymore. |
It happens again. Please let me know if there are anything you need to diagnostic this issue. The testnet is public now. Our telemetry server is not public but we are going to migrate to use polkadot one soon. Our web UI: https://cennznet.js.org/cennznet-ui/ Let me know if you need anything, like logs from our validators, or a validator seat. |
Most likely fixed in new version. |
Will this workable? I am using |
Based on 7c64746
All nodes stopped finalization at 14046 / 14071.
New node failed to get finalization at all.
Restarting doesn't help.
Anything we can do to diagnostic the issue and resume the finalization process?
The text was updated successfully, but these errors were encountered: