It congests the web traffic on the node after running for a while #30
Comments
As I understand it, you don't need to connect to the brokers for anything the UI does right now. You can get everything from ZooKeeper, correct? |
I have seen this as well - after running the application for several days, the output of netstat showed many hundreds of open connections to kafka brokers. Eventually the application ceased to function because it could no longer open any new sockets - the process max file limit was reached. |
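For anyone who wants to watch how close a process is getting to that limit, here is a minimal Python sketch. It assumes a Linux host (it reads `/proc/<pid>/fd`), and the helper names `count_open_fds` and `soft_fd_limit` are made up for illustration; nothing here is part of the console.

```python
import os
import resource


def count_open_fds(pid="self"):
    # Assumption: Linux, where /proc/<pid>/fd has one entry per open
    # descriptor (files and sockets alike). For pid="self" the count also
    # includes the descriptor briefly opened to list the directory.
    return len(os.listdir(f"/proc/{pid}/fd"))


def soft_fd_limit():
    # RLIMIT_NOFILE is the per-process cap on open descriptors. This reads
    # the limit of the current Python process, not of another pid.
    soft, _hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    return soft


if __name__ == "__main__":
    print(count_open_fds(), "descriptors open, soft limit", soft_fd_limit())
```

To inspect the console rather than the script itself, pass the console's process id to `count_open_fds` (reading another user's `/proc` entry may need elevated permissions).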
The console regularly connects to brokers to retrieve partition log sizes so that you can view partition size over time. I've checked the code and the client is closed after retrieving the log size. This will require further investigation. |
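One general way a client can be "closed after use" and still leak is when the close only happens on the success path. Whether that is the cause here is not established in this thread; the Python sketch below only illustrates the pattern of closing on every path. `FakeClient` and `fetch_log_size` are hypothetical stand-ins, not the console's code or the finagle-kafka API.

```python
from contextlib import closing


class FakeClient:
    """Stand-in for a broker client; not a real Kafka API."""

    open_count = 0  # number of clients currently open

    def __init__(self, fail=False):
        self.fail = fail
        FakeClient.open_count += 1

    def log_size(self):
        if self.fail:
            raise ConnectionError("broker unreachable")
        return 42

    def close(self):
        FakeClient.open_count -= 1


def fetch_log_size(fail=False):
    # closing() calls client.close() when the block exits, whether
    # log_size() returns normally or raises.
    with closing(FakeClient(fail)) as client:
        return client.log_size()
```

After both a successful call and a failing one, `FakeClient.open_count` is back at 0, which is the property to look for in the real fetch path.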
@siyuanh & @unclebilly, as a temporary workaround, you can increase the Offset Fetch Interval in Settings to reduce the rate at which connections are created. |
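To get a rough feel for how much the interval matters, here is a back-of-the-envelope sketch in Python. It assumes one new connection per target (broker or partition) per fetch, which may not match what the console actually does, and the numbers are illustrative only.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def connections_per_day(fetch_interval_s, targets=1):
    # Assumption: every fetch opens exactly one new connection per target.
    return targets * SECONDS_PER_DAY // fetch_interval_s


print(connections_per_day(30, targets=3))   # 8640
print(connections_per_day(300, targets=3))  # 864
```

Under that assumption, a tenfold longer interval means a tenfold slower leak; it postpones hitting the file limit but does not remove the cause.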
Also have a stacktrace, if you are interested.
Oh, sorry for the edit after edit, but this is just lsof's royal stupidity, it uses an asterisk and 3 digits (*123) when it's above 9999.
|
I think I fixed the issue. I'm going to let it sit for a little while longer, but I'm not seeing any unbounded increase in the number of open files. If there aren't any additional problems, I'll push my changes to my fork tomorrow.
Update: When I run
Update 2: Ah, wait, the netstat return value just dropped by 1000... |
Alright, after >12 hours the netstat is still returning below 4000, the lsof below 200. I think this works. |
Well, this only fixed part of the issue, apparently. I didn't notice because I was only checking for connections on Port 9092, but it's still leaving open sockets on 9091 and 9090. The number of connections on port 9092 doesn't increase without bound anymore, however. The open file count is still less than 200, so that exception is gone, but instead of crashing it just continuously fails to open a socket. |
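Checking a single port is easy to get wrong, as the comment above shows. A small Python sketch that tallies connections per remote port from netstat-style text is given below. It assumes the Linux `netstat -tn` column layout (foreign address in the fifth column), and `connections_by_remote_port` is a name invented for this example.

```python
from collections import Counter


def connections_by_remote_port(netstat_output, ports=(9090, 9091, 9092)):
    counts = Counter()
    for line in netstat_output.splitlines():
        fields = line.split()
        if len(fields) < 6 or not fields[0].startswith("tcp"):
            continue  # skip headers and non-TCP lines
        # Assumption: the fifth column is the foreign address as host:port.
        port = fields[4].rsplit(":", 1)[-1]
        if port.isdigit() and int(port) in ports:
            counts[int(port)] += 1
    return counts


SAMPLE = """\
Proto Recv-Q Send-Q Local Address           Foreign Address         State
tcp        0      0 10.0.0.5:51234          10.0.0.9:9092           ESTABLISHED
tcp        0      0 10.0.0.5:51240          10.0.0.9:9091           CLOSE_WAIT
tcp        0      0 10.0.0.5:51241          10.0.0.9:9091           ESTABLISHED
tcp        0      0 10.0.0.5:22             10.0.0.7:40022          ESTABLISHED
"""

print(connections_by_remote_port(SAMPLE))
```

On a live host the input could come from something like `subprocess.run(["netstat", "-tn"], capture_output=True, text=True).stdout`; the sample addresses above are invented. Connections in every state are counted, so sockets stuck in states such as `CLOSE_WAIT` show up as well.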
Alright, I replaced the finagle-kafka library with a different kafka connection system and the number of open connections is consistently remaining below 20. I can only assume that okapies' library wasn't properly closing connections. |
Thanks for the fix, works wonderfully for us. Let's hope it gets merged. |
Has this been merged? Or should we keep using @ibanner56's fork? |
* ibanner56/master:
  Sped up LowLevelConsumer, since we don't need to worry about writes.
  Uncomment a logger statement
  Removed unnecessary line.
  future is deprecated, use Future
  Removed unused imports
  Added author attributions
  Actually fixes cjmamo#30.
  Typo. Closed on the wrong side of the braces.
  Fixes cjmamo#30
  Forgot to remove a debug line
  Forgot a failure case
  Modified delete workflow to prevent incomplete deletes.
I'm running your fork and initially it looked way better (I could use the app for more than a minute, which was not the case before), but after being up and running for a weekend it hangs again with a lot of open files. I haven't had time to investigate, but let me know if you need any additional info. |
After running this web-console for a while, the connection table is full of connections to the brokers.