Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modify default Task Manager configuration for better throughput? #78851

Closed
mikecote opened this issue Sep 29, 2020 · 3 comments
Closed

Modify default Task Manager configuration for better throughput? #78851

mikecote opened this issue Sep 29, 2020 · 3 comments
Assignees
Labels
Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)

Comments

@mikecote
Copy link
Contributor

After the performance improvements below are completed. Should we change the default maxWorkers and pollInterval that task manager currently has to provide a better throughput out of the box?

Note from @kobelb:
We should also assess the impact that increasing these default values have on the rest of Kibana. Since task manager is sharing a process with the rest of Kibana, we don't want to inhibit other operations from occurring. Ex: route handlers to service end-user generated HTTP requests.

@mikecote mikecote added Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams) labels Sep 29, 2020
@elasticmachine
Copy link
Contributor

Pinging @elastic/kibana-alerting-services (Team:Alerting Services)

@mikecote mikecote changed the title Modify default Task Manager configuration for better throughput Modify default Task Manager configuration for better throughput? Sep 29, 2020
@mikecote
Copy link
Contributor Author

mikecote commented Oct 2, 2020

This question can be asked in context of alerting GA to determine what performance benchmarks we want to deliver for GA. What is the optimal default throughput do we want out of the box?

@mikecote mikecote mentioned this issue Oct 13, 2020
36 tasks
@mikecote mikecote self-assigned this Oct 21, 2020
@mikecote
Copy link
Contributor Author

It was mentioned during today's sync that it can be complex trying to figure out what the recommended change is for this. Changing the configuration will have an impact on the remainder of Kibana (http requests, ingestion, etc) and makes it hard to come up with a recommendation at the cost of background CPU usage that may not be acceptable on small deployments.

The current theoretical throughput is 200 tasks per minute [10 tasks (max_workers) running every 3 seconds (poll_interval) = 200 tasks per minute if they complete before the next poll interval]. Since there is no complaint at this time on the current throughput, it was agreed that it's better to document how to increase throughput with mentions of what the values depend on and what impact changing them will have.

I have added a note to the documentation meta issue (#81532) to document this and will work with @bmcconaghy to come up with content once we've completed our performance benchmark issue (#40264).

I will now close this issue and we can re-open if ever we think otherwise.

@kobelb kobelb added the needs-team Issues missing a team label label Jan 31, 2022
@botelastic botelastic bot removed the needs-team Issues missing a team label label Jan 31, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)
Projects
None yet
Development

No branches or pull requests

3 participants