consul 0.9.2 - [ERR] memberlist: Failed fallback ping: write tcp 172.17.0.5:45890->a.b.c.d:8301: i/o timeout #3411
Comments
The same is happening here. I'm on EC2 instances, no Docker, Consul v0.9.0. It is a 3-server cluster only, no clients. Ports 8300-8500 are allowed for both UDP and TCP, but not 8600.
|
For what it's worth, I rechecked the ACLs in AWS and the UDP ports were missing. The log is not that helpful, though. I remember that previous versions stated that UDP was not getting through and that it was falling back to TCP; now the ping message isn't very helpful. Perhaps that changed in the new 0.9.2. |
Don't have any ACLs, and all ports are open, but still get random timeout messages. |
This morning we changed our infrastructure so that the consul container uses host networking and CONSUL_ALLOW_PRIVILEGED_PORTS=1, and we are seeing a lot of the same log messages:
I found the explanation and can see the use of it, so I would not like to disable it, but it is a little too excessive. The logs fill up for no obvious reason. https://github.com/hashicorp/consul/blob/v0.6.4/vendor/github.com/hashicorp/memberlist/state.go#L275-L299 @slackpad can you help? |
Hmm that error message did get more generic after a refactoring. We should look at making these messages more specific and actionable (and less spammy). |
That would be great, @slackpad. Also, can you do something about this type of message? We get them every couple of minutes, even on the latest version, 1.0.2
|
Hi @slackpad, is there a chance this will be resolved in the next release? I'm asking because we have a lot of nodes and these log messages are becoming too spammy. Thanks |
Seeing the same thing in 1.0.2. More specifically, the nodes having the issue are in different VPCs, but the VPCs are peered and I have verified they can reach each other bidirectionally on all of the required ports. The only thing I can think of is that all of the nodes are in a private subnet behind a NAT and that is somehow interfering, but they have appropriate direct routes set up. Debug messages don't shed any additional light. |
Hi, Consul version 1.2.0, all on the same LAN; every few minutes the logs fill up with:
|
I have been seeing the same for quite some time (and across versions) between my on-premises server and a cloud server. I have verified all ports in both directions using telnet, netcat, and iperf3. Consul version 1.2.2 |
+1 consul version 1.2.2 |
encountered same issue with 1.2.2
|
+1 consul version 1.4.0 |
The errors I was seeing have since gone away. Previously it was a software based VPN (StrongSwan). |
Hi, I've hit the same problem in Consul 1.5.1. Have you found a solution? Thanks |
Seeing the same issues with 1.8.3 on AWS peered VPCs |
consul:1.8.5, seeing it too; it causes the pod to not become ready |
Haven't used in a while, but I would first confirm if all the necessary ports were reachable back and forth, TCP and UDP. |
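A rough way to script the check suggested above is sketched here, in Python. It is only an illustration: the function names are made up for this example, and 8301 is used in the usage note because it is the port in this issue's title. TCP can be tested directly against the agent. UDP cannot, because a Consul agent is not expected to answer an arbitrary datagram, so the UDP helper assumes you temporarily run some echo listener of your own on the far side.

```python
import socket


def tcp_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False


def udp_echo_ok(host, port, timeout=2.0, payload=b"probe"):
    """Send one datagram and wait for the same bytes to come back.

    ASSUMPTION: something on the far side echoes UDP datagrams on this
    port (a temporary test listener). Without one, this returns False
    even when UDP is not blocked.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.settimeout(timeout)
        try:
            sock.sendto(payload, (host, port))
            data, _addr = sock.recvfrom(len(payload) + 64)
        except OSError:
            # Timeout or ICMP "port unreachable" surfaced by the OS.
            return False
        return data == payload
```

For example, `tcp_reachable("10.0.0.2", 8301)` run from each node toward every other node (the address is a placeholder) shows whether the TCP side is open in both directions. A False from `udp_echo_ok` only means something when the echo listener is known to be running.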
When I deployed consul with the consul-k8s helm chart my problem was that the memory limit was set too low and the server pods were crashing. That is why the ping request returned with a timeout. |
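If you suspect the same cause, one way to look for it is to scan the pod status for containers that were killed for exceeding their memory limit. The sketch below is an assumption-laden illustration, not part of consul-k8s: the function name is invented, and it assumes the crashes show up as a terminated container state with reason `OOMKilled` in the output of `kubectl get pods -o json`.

```python
import json


def oom_killed_containers(kubectl_json):
    """Return (pod, container, restart_count) tuples for OOM-killed containers.

    kubectl_json is the text printed by `kubectl get pods -o json`.
    """
    pod_list = json.loads(kubectl_json)
    hits = []
    for pod in pod_list.get("items", []):
        pod_name = pod.get("metadata", {}).get("name", "?")
        statuses = pod.get("status", {}).get("containerStatuses") or []
        for cs in statuses:
            # A container may currently be terminated, or may have been
            # restarted already, in which case the reason is in lastState.
            for key in ("state", "lastState"):
                terminated = (cs.get(key) or {}).get("terminated")
                if terminated and terminated.get("reason") == "OOMKilled":
                    hits.append((pod_name, cs.get("name"), cs.get("restartCount", 0)))
                    break
    return hits
```

Feed it the saved output of `kubectl get pods -n <namespace> -o json`. An empty list does not rule out memory pressure; it only means no container currently reports that reason.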
Based on #3411 (comment) it sounds like the issues were more network related. Closing as this issue is quite old; please open a new issue if further investigation is needed. |
@justas147 we ran into that problem too and your answer helped us a lot. I took the extra step of opening a PR (which may be declined) so others won't hit this in the future. This problem cost us several days, plus a lucky kubectl describe at exactly the right time: hashicorp/consul-k8s#1696 |
Running consul docker image 0.9.2
consul version for both Client and Server
Client:
0.7.5 -> upgraded to 0.9.2
Server:
0.7.5 -> upgraded to 0.9.2
consul info for both Client and Server
Client:
Server:
Operating system and Environment details
Ubuntu 16.04.3 LTS, docker 17.05.0-ce
Description of the Issue (and unexpected/desired result)
After upgrading Consul to v0.9.2, I am seeing a lot of messages like this in the log on every host, at random:
Reproduction steps
Upgrade the Consul docker image from 0.7.5 to v0.9.2; after that, log messages about fallback ping appear at random.
Tried to use -log-level=TRACE, but it is impossible to capture which host this is going to happen on. It is totally random.
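Since it is hard to catch a failure as it happens, one option is to tally the messages after the fact and see whether particular peers stand out. The Python sketch below is a hedged illustration: the function name is invented, and the pattern only assumes the line shape quoted in this issue's title (`Failed fallback ping: write tcp SRC->DST:PORT: i/o timeout`) with IPv4 or hostname destinations. Other message variants will not match.

```python
import re
from collections import Counter

# Pattern based only on the message quoted in the issue title.
FALLBACK_RE = re.compile(
    r"memberlist: Failed fallback ping: \w+ tcp "
    r"(?P<src>\S+?)->(?P<dst>[^\s:]+):(?P<port>\d+):"
)


def tally_fallback_failures(lines):
    """Count fallback-ping failures per destination address."""
    counts = Counter()
    for line in lines:
        match = FALLBACK_RE.search(line)
        if match:
            counts[match.group("dst")] += 1
    return counts
```

Calling `tally_fallback_failures(open("consul.log"))` (the file name is a placeholder) and then `.most_common()` on the result shows which destinations fail most often. An even spread across peers would suggest a problem local to the host; a few hot destinations would point at specific network paths.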
As far as I can see, all Docker ports are open:
In my test environment, I installed 2 docker hosts with consul 0.7.5, then upgraded to v0.8.5 and then to v0.9.0, and the fallback ping messages started. So I think this is caused by something in version 0.9.x.
No firewall, no iptables, nothing that could block connection and cause timeout.
Edit:
Also, seeing a lot of this on 2 servers on the same subnet/network: