-
Notifications
You must be signed in to change notification settings - Fork 444
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[aws.vpcflow] Default max_number_of_messages to 1 #4599
[aws.vpcflow] Default max_number_of_messages to 1 #4599
Conversation
For users that are getting started with ingesting VPC flow logs with the aws-s3 input, using max_number_of_messages: 1 will provide a better experience. This is because each VPC flow log file usually contains many thousands of messages. For example a single S3 object might contain 100k events and this is suffiecient to keep the Agent's internal queue full. Having multiple S3 objects in flight by default often leads to timeouts or connection resets because the overall processing time for each object increases. This setting usually needs to be tuned in conjunction with the queue.mem and output.elasaticsearch settings, and I think `max_number_of_messages: 1` is better aligned to the queue and output defaults than 5. This change will not affect users that currently have the integration added to policies. It will only affect new additions to agent policies.
[git-generate] cd packages/aws elastic-package changelog add --link elastic#4599 --next minor --type enhancement --description "Change default max_number_of_messages for vpcflow to 1 because VPC flow log files normally contain a high number of events."
5e94bc0
to
18d1a70
Compare
🚀 Benchmarks reportTo see the full report comment with |
🌐 Coverage report
|
Hi! We just realized that we haven't looked into this PR in a while. We're sorry! We're labeling this issue as |
I'm going to close this because there were some changes to the aws-s3 input in elastic/beats#33658 that may mitigate this issue. We can monitor the situation and revive this if needed. |
What does this PR do?
For users that are getting started with ingesting VPC flow logs with the aws-s3 input, using max_number_of_messages: 1 will provide a better experience. This is because each VPC flow log file usually contains many thousands of messages. For example a single S3 object might contain 100k events and this is suffiecient to keep the Agent's internal queue full. Having multiple S3 objects in flight by default often leads to timeouts or connection resets because the overall processing time for each object increases.
This setting usually needs to be tuned in conjunction with the queue.mem and output.elasaticsearch settings, and I think
max_number_of_messages: 1
is better aligned to the queue and output defaults than 5.This change will not affect users that currently have the integration added to policies. It will only affect new additions to agent policies.
Checklist
changelog.yml
file.Related issues