-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
lotus sync stucked with error "reorg channel is heavily backlogged, waiting a bit before trying to take process new tipsets" #7803
Comments
Can you get a goroutine dump when that happens - |
|
We have come across this issue too on v1.14.1 |
I have same error |
Hey, I've gotten this in |
Coming back to report that it is still happening on v1.15.1, and actually much more frequently than before. |
Also reporting that this issue is still present on (extract)
Which all workers are shown in the |
I just ha this happen after a restart of my daemon |
we have noticed this much more often in the latest versions |
After restarting it again the daemon continued. I don't think I've seen it happen while running yet. |
+1 |
When unsubscribing all topics, pubsub will drain the subscription channel to avoid deadlocks. See whyrusleeping/pubsub#1. fixes #7803
When unsubscribing _all_ topics, pubsub will drain the subscription channel to avoid deadlocks. See whyrusleeping/pubsub#1. fixes #7803
When unsubscribing _all_ topics, pubsub will drain the subscription channel to avoid deadlocks. See whyrusleeping/pubsub#1. fixes #7803
While we have high confidence that this issue was fixed by #8635, I'm going to leave it open for any more reports. We can close it when a few people who were experiencing the issue confirm the fix worked. In the meanwhile, if anyone else experiences this, please share your daemon logs here. |
Closing this now as there has been no reports of If someone still experiences this on Lotus v1.15.3 or higher, do not hesitate to reopen this, or open a new issue! |
v1.16.0 |
+1 , v1.16.0 also has this error, But it disappeared after a few minutes |
+1 on v1.16.0 - I have not seen this issue on any other version until now |
+1 v1.16.0 |
2022-07-06T14:04:03.764+0800 ERROR chainstore store/store.go:409 reorg channel is heavily backlogged, waiting a bit before trying to take process new tipsets +1 v1.16.0 |
+1 v1.16.0 |
+1 v1.16.0 |
People seeing this since v1.16.0 - are you running PoSt workers? |
This is relevant because it would appear that post-worker alive check will just spam heartbeats in a loop with sleep: Which in some cases, with multiple PoSt workers, because worker communication is over simple http RPCs, may exhaust all default http client connections.. If this is in fact the bug I'll have a fix for it tomorrow |
Yes. I haven't seen it in a while, but I've mainly stopped the post worker because of other issues (#8362) |
V1.23.0
|
Happening again in 1.23.2 |
1.24 again
|
Checklist
Latest release
, or the most recent RC(release canadiate) for the upcoming release or the dev branch(master), or have an issue updating to any of these.Lotus component
Lotus Version
Describe the Bug
Client subscribe mpool with MpoolSub and client's network stucked or changed, lotus may cannot publish message to client and chain sync stucked.
Logging Information
Repo Steps
The text was updated successfully, but these errors were encountered: