Node crashes with "Failed to write response" #6147
Comments
Thanks for this report. One of the first things I notice is that the Tendermint version is missing 🤔 We'll look into that. Did you use Also, I see that Desmos v0.15.1 is running Cosmos SDK v0.40. There's a new SDK release series out which fixes some halting bugs. The latest release is v0.41.3. I recommend you try upgrading first, and if this is still happening we can take a closer look.
I used
Thanks, we'll surely update on our next chain upgrade.
@RiccardoM if you'd like to get the Tendermint version in here:
add:
to your makefile.
Whoa, good tip, @marbar3778 - do we have that documented anywhere?
I think we only put it in the upgrading doc, but it doesn't seem like anyone read it 😃
added here: #6151
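For readers unfamiliar with how a Makefile can put a version string into a Go binary: the usual mechanism is link-time variable injection with `-ldflags "-X ..."`. The sketch below is a generic illustration of that mechanism only. It is not the snippet from the comment above, and `Version`, `versionLine`, and `example-node` are hypothetical names chosen for the example.

```go
package main

import "fmt"

// Version is a hypothetical package-level string variable.
// The Go linker can overwrite it at build time, for example:
//
//	go build -ldflags "-X main.Version=v1.2.3"
//
// If the flag is not passed, the default below is what the binary reports,
// which is one way a version field can end up missing or unset.
var Version = "unknown"

// versionLine formats the value a `version` subcommand might print.
func versionLine(name string) string {
	if Version == "" {
		return name + " version: (not set at build time)"
	}
	return fmt.Sprintf("%s version: %s", name, Version)
}

func main() {
	fmt.Println(versionLine("example-node"))
}
```

The `-X` flag only works on string variables addressed by their full import path, so a Makefile entry has to name the exact package and variable that the dependency exposes.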
I don't see any stacktrace. Are you sure the node has crashed and not simply hung? If crashed: Could you paste the stacktrace of the panic that led to the crash? Usually it's the last line of the log / stdout. If hung: Do you have a goroutine list? Note you can get it by killing the frozen node with
@melekes You're right, it hangs. I'm now running
Sorry, maybe I should've been more specific. I meant the Linux kill command: https://linux.die.net/man/1/kill
Yeah, I've run the
Cool. So what was the stacktrace?
I could not get any stacktrace. I just used that command to kill the service as the
Since it hangs, we'll need to see the stacktrace/goroutine list. I fixed the debug kill command, but I'm not sure if that landed in a point release or not.
Let's figure out if the debug kill command was fixed or not. @alexanderbez can you point me to the commit/PR where you fixed it, and I can see if it was released? Otherwise we can backport and include it in 0.34.9.
Here is the PR. I didn't add a backport label, so I don't think it exists in any release yet.
Closing as duplicate of #6184
Tendermint version (use tendermint version or git rev-parse --verify HEAD if installed from source):
ABCI app (name for built-in, URL for self-written if it's publicly available):
Desmos v0.15.1
Environment:
What happened:
Yesterday, one of our chain nodes stopped with the error "Failed to write response" for no apparent reason. This is not the first time this has happened, and we have yet to identify why.
What you expected to happen:
The node should not crash
Have you tried the latest version:
No
How to reproduce it (as minimally and precisely as possible):
I don't know yet.
Logs (paste a small part showing an error (< 10 lines) or link a pastebin, gist, etc. containing more of the log file):
https://pastebin.com/rbaw6FVH
Config (you can paste only the changes you've made):