-
Notifications
You must be signed in to change notification settings - Fork 1.7k
Secret Store Nodes eventually disconnects #10266
Comments
Hey! According to this log line:
, it is the node Could you also please specify - which git commit are you using for your tests? |
Also - if you leave nodes for some time after disconnect, are they reconnecting again? Does (at least) pinging works during disconnect period? |
Hi @svyatonik!, thanks for the answer.
Logs for node
Logs for node
Logs for node
Thanks! |
@jcortejoso Could you, please, share the whole (since restart) log of node |
Hi @svyatonik. I attach the logs file. Now the nodes that seems to disconnect is the
Thanks! |
Thanks, @jcortejoso - I think that's enough info for now. Definitely seems like issues are in SS code: (1) that node gets disconnected because of timeout (though seems like |
Seems like I found a reason - the SS' HTTP listener is currently written in a way that it won't work well with runtime shared with other components (i.e. it blocks the entire runtime thread until request is served). And since runtime is now (since #9888 - my bad :/) shared between ethereum sync + SS HTTP listener + SS network => some messages required for session completion could not be processed until session is completed (you could see it in logs - just after session is completed with timeout error, queued messages are processed). @jcortejoso Could. you please try if everything works with this commit 1fdfa1e6c6833ec18e171e67e5925e17a243a21b? |
Hi @svyatonik. We have running our SS cluster built from that commit since 2 hours ago. So far so good. I will update if there is any issue or if everything works as expected. Thanks! |
We have been testing our POA more intensively these days (running from commit 1fdfa1e) and I have the feeling that Secret Store reconnects correctly but Ethereum network does not. This is an example of the logs:
Maybe because it is an old commit there is some already solved bug that is causing the issue. We will test now from branch https://github.com/paritytech/parity-ethereum/tree/fix_ss_blocking_wait |
Hey, @jcortejoso ! There could have been fixes for PoA sync, for sure, since that commit. Could you please share results of |
Before filing a new issue, please provide the following information.
Your issue description goes here below. Try to include actual vs. expected behavior and steps to reproduce the issue.
Our setup consist in a PoA Network with 3 nodes that have the secret store API enabled, and also these nodes act as Authorities in the PoA. Also we have other clients connected to the Ethereum network. The problem we are observing is that everything work as expected, but after some time running (order of hours), we see some errors in the secret store logs and the encryption calls fail.
After restarting the parity process in the secret store nodes, the problem disappears for some hours. I have checked the status of the instance, and try to make the connections manually when the nodes are failing, and everything seems good and working.
These are the logs we can see in the secret store nodes:
And other node with the logger options
-l jsonrpc,secretstore_net,privatetx=trace
:This is the configuration file we are using for the secret store nodes:
If I can provide any further information please do not hesitate to ask.
Thanks!
The text was updated successfully, but these errors were encountered: