[filebeat][gcs] - Removed bucket_timeout config option and replaced bucket context with parent program context #41970
…r bucket operations
This pull request does not have a backport label.
To fixup this pull request, you need to add the backport labels for the needed
Pinging @elastic/security-service-integrations (Team:Security-Service Integrations)
Suggested commit message body:
Removed bucket_timeout config option and replaced bucket context with parent
program context
The bucket_timeout option was used to specify the timeout for the individual
bucket operations. This option was difficult to understand in its effects
resulting in malconfiguration leading to processing timeouts and context
cancellation, ultimately causing gaps in the ingested data. To avoid this,
the bucket_timeout option has been removed. The input now uses the parent
program context to handle the contexts for bucket operations.
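To illustrate the failure mode described above, here is a minimal sketch in Go. It is not the Filebeat GCS input code: `fetchObject`, `withBucketTimeout`, `withParentContext`, `timedOut`, and `demo` are hypothetical names, and the use of `context.WithTimeout` to model the old `bucket_timeout` behaviour is an assumption made for illustration. The sketch shows a slow operation being cancelled when it is bound to a short per-operation timeout, and completing when it simply inherits the parent context.

```go
package main

import (
	"context"
	"errors"
	"fmt"
	"time"
)

// fetchObject is a hypothetical stand-in for a slow bucket operation.
// It finishes after `work` unless the context is cancelled first.
func fetchObject(ctx context.Context, work time.Duration) error {
	select {
	case <-time.After(work):
		return nil
	case <-ctx.Done():
		return ctx.Err()
	}
}

// withBucketTimeout models the assumed old behaviour: each operation
// gets its own context that expires after bucketTimeout.
func withBucketTimeout(parent context.Context, bucketTimeout, work time.Duration) error {
	ctx, cancel := context.WithTimeout(parent, bucketTimeout)
	defer cancel()
	return fetchObject(ctx, work)
}

// withParentContext models the new behaviour: the operation runs under
// the parent context and stops only when the parent is cancelled.
func withParentContext(parent context.Context, work time.Duration) error {
	return fetchObject(parent, work)
}

// timedOut reports whether err was caused by an expired deadline.
func timedOut(err error) bool {
	return errors.Is(err, context.DeadlineExceeded)
}

// demo runs the same 50ms operation both ways, using a 10ms timeout
// for the per-operation variant.
func demo() (oldErr, newErr error) {
	parent, stop := context.WithCancel(context.Background())
	defer stop()

	work := 50 * time.Millisecond
	oldErr = withBucketTimeout(parent, 10*time.Millisecond, work)
	newErr = withParentContext(parent, work)
	return oldErr, newErr
}

func main() {
	oldErr, newErr := demo()
	fmt.Println("per-operation timeout:", oldErr)
	fmt.Println("parent context:", newErr)
}
```

In this sketch the operation still stops promptly when the parent context is cancelled, for example on shutdown, so dropping the per-operation timeout does not remove cancellation altogether.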
@@ -23,6 +23,8 @@ The input can be configured to work with and without polling, though if polling
3. If any major error occurs which stops the main thread, the logs will be appropriately generated,
describing said error.

**Config Option Removal Notice** : The `bucket_timeout` config option has been removed from the google cloud storage input. The intention behind this removal is to simplify the configuration and to make it more user friendly. The `bucket_timeout` option was used to specify the timeout for the individual bucket operations. But this proved counter-intuitive because bucket operations could slow down due to a lot of external factors like network latency, file size, back pressure etc. This could lead to the input not being able to process the files in time, which would then lead to a context cancellation error. This error would then cause gaps in the ingested data. To avoid this, the `bucket_timeout` option has been removed. The input now uses the parent program context to handle the contexts for bucket operations. This ensures that the input is able to process the files in a timely manner and avoid most context cancellation errors.
I don't think we need to say so much. All we need to say is that it was confusing and had the potential to let users malconfigure the input, and so it's been removed.
Updated the documentation.
Thanks
…ucket context with parent program context (elastic#41970)
Type of change
Proposed commit message
The bucket_timeout option was used to specify the timeout for the individual bucket operations. This option was difficult to understand in its effects, resulting in malconfiguration leading to processing timeouts and context cancellation, ultimately causing gaps in the ingested data. To avoid this, the bucket_timeout option has been removed. The input now uses the parent program context to handle the contexts for bucket operations.
Checklist
CHANGELOG.next.asciidoc or CHANGELOG-developer.next.asciidoc
Disruptive User Impact
Since this config option has no impact on ingested data, its removal should not have any user impact.
Author's Checklist
How to test this PR locally
Related issues
Use cases
Screenshots
Logs