Endpoint Disassociated #3387

Closed
JTrotta opened this issue Apr 14, 2018 · 8 comments

Comments

@JTrotta

JTrotta commented Apr 14, 2018

When creating a new issue, please make sure the following information is part of your issue description. (if applicable). Thank You!

  • Which Akka.Net version you are using: Latest
  • On which platform you are using Akka.Net: Windows & Linux
  • A list of steps to reproduce the issue. Or a gist or GitHub repo which can be easily used to reproduce your case.

Hello, I created a simple cluster between 2 servers: 1 Linux, 1 Windows. Both run the same application.
Firewalls are stopped.
As long as the cluster is not initialized, everything works fine. When one server starts the cluster manually, the cluster forms and the other server joins normally, but within a few seconds it becomes unreachable and a lot of errors show up in the log. One of the most significant errors is:
2018/04/14 11:41:18.760|Error|Akka.Logger.NLog.NLogLogger.LogEvent|Error caught channel 192.168.0.30:60066->192.168.0.31:32211.

The cluster becomes unstable.
Is there some configuration I missed? This is the config used by both servers:
akka {
  loglevel = DEBUG
  loggers = ["Akka.Logger.NLog.NLogLogger, Akka.Logger.NLog"]
  actor {
    provider = cluster
  }
  remote {
    dot-netty.tcp {
      port = 32211
      hostname = 192.168.0.30
      public-hostname = 192.168.0.30
    }
  }
  cluster {
    seed-nodes = []
  }
}

@Aaronontheweb
Member

@JTrotta what's the full error message? Are you getting a length-frame encoding error?

@JTrotta
Author

JTrotta commented Apr 15, 2018

@Aaronontheweb That's the full log:
logakka.txt
As you can see there are a lot of "Associated" and "Disassociated" events.

Could it be a serialization/deserialization issue, like the one in 3031?

@Aaronontheweb
Member

Could be, although I don't see anything in the error logs here that would indicate that.

Looks to me like you've never had a functional cluster, poking through the logs here, so this smells like a borked configuration more than anything else. Can you guarantee that the addresses you're using are reachable?
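
One way to check reachability at the transport level is to try opening a plain TCP connection to each node's remoting port. The following is a minimal sketch, assuming the two addresses that appear in the log line above (192.168.0.30 and 192.168.0.31) and the port 32211 from the posted config. `can_connect` is a hypothetical helper, not part of Akka.NET, and a successful connect only shows that the port accepts connections, not that the cluster handshake will succeed.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False


if __name__ == "__main__":
    # Assumed values: addresses taken from the log line, port from the config.
    for host in ("192.168.0.30", "192.168.0.31"):
        status = "reachable" if can_connect(host, 32211) else "NOT reachable"
        print(f"{host}:32211 is {status}")
```

Running this from each machine against the other covers both directions, since a node has to accept inbound connections as well as open outbound ones.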

@JTrotta
Author

JTrotta commented Apr 27, 2018

I ran many tests, but I'm still having the same issue. The addresses are definitely reachable, and the firewalls are switched off on both servers. What do you suggest?

@Aaronontheweb
Member

I don't see any serialization issue in the logs (which are formatted rather weirdly, FYI) - do you see that error showing up elsewhere?

@JTrotta
Author

JTrotta commented Apr 28, 2018

What do you mean by:

do you see that error showing up elsewhere?

I have two applications running: one on a CentOS server and one on my laptop for debugging. The log is copied and pasted from my console app.
On the server I have a similar log in an NLog file. It shows the same thing as my console app.
The server's NLog log is attached:
server.log

@Aaronontheweb
Member

It looks like there is something off with this node:

Akka.Remote.Transport.InvalidAssociationException: connection timed out: 192.168.0.103:32211

All of the disassociation errors are occurring when attempting to connect to this node. Are the serialization settings different on this node vs. the others?

@JTrotta
Author

JTrotta commented Apr 29, 2018

You were right, there were some differences between the serialization settings of the two nodes.
Problem solved.
Thank you @Aaronontheweb
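
The thread does not record which settings differed. As a rough illustration of how such a mismatch could be spotted, the sketch below flattens two nested settings dictionaries and reports every key whose value differs between nodes. It assumes each node's config has already been loaded into a Python dict by some other means. The helper names `flatten` and `diff_settings` and the sample values are hypothetical and are not taken from the original configs.

```python
def flatten(cfg: dict, prefix: str = "") -> dict:
    """Flatten nested dicts into {'a.b.c': value} form."""
    flat = {}
    for key, value in cfg.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, path))
        else:
            flat[path] = value
    return flat


def diff_settings(node_a: dict, node_b: dict) -> dict:
    """Return {path: (value_on_a, value_on_b)} for every setting that differs."""
    flat_a, flat_b = flatten(node_a), flatten(node_b)
    return {
        path: (flat_a.get(path), flat_b.get(path))
        for path in sorted(flat_a.keys() | flat_b.keys())
        if flat_a.get(path) != flat_b.get(path)
    }


if __name__ == "__main__":
    # Hypothetical sample values, for illustration only.
    windows_node = {"akka": {"actor": {"serializers": {"custom": "SerializerA"}}}}
    linux_node = {"akka": {"actor": {"serializers": {"custom": "SerializerB"}}}}
    for path, (a, b) in diff_settings(windows_node, linux_node).items():
        print(f"{path}: {a!r} != {b!r}")
```

A key that exists on only one node shows up with `None` on the other side, which also catches settings that were simply left out of one config.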

@JTrotta JTrotta closed this as completed Apr 29, 2018