Kafka output performance is worse than expected #2825
Comments
I tried the socket_writer output plugin over both UDP and TCP: UDP reaches about 100k metrics/s and TCP about 50k metrics/s. The kafka output plugin, however, only reaches about 10k metrics/s. Telegraf UDP config:
Result:
Telegraf TCP Config:
Result:
|
You could try enabling the compression codec. We try to reserve GitHub Issues for actionable bug reports or feature requests. Please ask general questions at the InfluxData Community site. |
@danielnelson Thank you, I have submitted a ticket to the InfluxData Community. I also tried enabling compression, but then the collection exceeds 1 minute.
|
I would also try setting these values back to their defaults; I believe this is why the statsd input is reporting that error. What happens is that we buffer so many metrics that it takes too long to flush, and during this time the internal metric channel fills and blocks the inputs. If you flush more frequently with less data, the inputs are blocked for a shorter period.
Defaults are 1000 and 10000. |
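To illustrate the blocking described above, here is a rough back-of-the-envelope model in Python. It is not Telegraf code: the function name, the channel capacity of 100, and the assumption that nothing drains the channel while a flush is in progress are all hypothetical. The 100k/s input rate and the 10k/s write rate are simply the figures reported in this issue.

```python
def longest_input_stall(batch_size, channel_capacity, input_rate, write_rate):
    """Estimate how long (in seconds) an input stays blocked during one flush.

    Simplified model, not Telegraf's actual implementation:
      - one flush writes `batch_size` metrics at `write_rate` metrics/s
      - while the flush runs, nothing drains the internal channel
      - the input keeps producing at `input_rate` metrics/s until the
        channel (`channel_capacity` slots) is full, then it blocks
    """
    flush_time = batch_size / write_rate
    time_until_full = channel_capacity / input_rate
    # The input is only blocked for the part of the flush that remains
    # after the channel has filled up.
    return max(0.0, flush_time - time_until_full)


if __name__ == "__main__":
    # Hypothetical channel capacity; rates taken from the numbers in this issue.
    for batch in (1_000, 10_000, 100_000):
        stall = longest_input_stall(batch, 100, 100_000, 10_000)
        print(f"batch={batch:>7}  blocked for about {stall:.3f}s per flush")
```

Under these assumptions the stall grows roughly linearly with the batch size, which is why smaller, more frequent flushes keep each individual block short even though the total work is the same.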
Dear @danielnelson, telegraf.conf (compression enabled)
telegraf.log (compression enabled)
telegraf.conf (compression disabled)
|
We are seeing the same issue with telegraf 1.3.1. We tested with a simple PHP script, and metrics are posted immediately. With the telegraf plugin, however, the performance is abysmal.
|
Wow, 6m is really bad. Do you think you could capture a trace? If you run telegraf with |
Hey Daniel, here is the file. I ran it for 5 minutes, though; I hope that is OK. Also, here is some debug output from during the trace. As you can see, it finally wrote to kafka at the 2 minute mark: 2017/06/15 19:18:26 I! Using config file: /etc/telegraf/telegraf.conf |
Sorry should have tagged you @danielnelson |
Looking through the trace, all of the time is being spent in the sarama library. We did update this library recently, though; do you think you could try with the latest nightly? |
@danielnelson Sure, I can test it out sometime this weekend. Let me know if I should reach out via email. Mention my name to Colm and he can get you the deets. |
@dennisjbr Can you add your kafka output's configuration? |
@danielnelson FYI, I replaced the IP address with y.y.y.y, replaced the topic with "topic", and removed commented-out lines for brevity. [[outputs.kafka]] |
@danielnelson No workie. Still is hung up on kafka and it hangs up the whole agent and stops sending data to all outputs. That 3rd party library is looking questionable ;-) Thanks in advance. |
Performance should be much improved in 1.8 with the changes made in #4491. If possible, can you help test using the nightly builds? |
@danielnelson I have the same problem running release 1.7.1. The performance is indeed much better in 1.8 with the changes in #4491! When will the 1.8 release be rolled out? |
@HansonHH 1.8 should be released by the end of the month with an RC coming before that. |
@glinton Thanks for the info! Looking forward to 1.8 with the latest and greatest! |
Dear all,
I have about 100k metrics/sec. But when I test the statsd input with the kafka output, I only get 600k metrics/min.
Has anyone tested telegraf performance, or can anyone give me some ideas?
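For reference, the two figures above use different time units, so a quick conversion helps when comparing them. The snippet below is only a sketch of that arithmetic; the helper name `per_second` is made up for illustration, and the inputs are the numbers quoted in this report.

```python
def per_second(count, window_seconds):
    """Convert a count observed over a time window into a per-second rate."""
    return count / window_seconds


offered_rate = 100_000                    # metrics/s sent to the statsd input
observed_rate = per_second(600_000, 60)   # 600k metrics/min seen at the kafka output
fraction_delivered = observed_rate / offered_rate

print(f"observed: {observed_rate:.0f} metrics/s")
print(f"fraction of offered load delivered: {fraction_delivered:.0%}")
```

In other words, 600k metrics/min is 10k metrics/s, about a tenth of the offered load.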
Machine type
1 telegraf server: 2 CPU cores, 4 GB memory
3 kafka servers: 2 CPU cores and 8 GB memory each
Software Version:
Telegraf 1.3.0
Kafka 0.10.2
telegraf config:
Test Script:
https://github.com/octo/statsd-tg
Test Result: