-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Task Manager plugin API to bulk update task schedules #124850
Comments
Pinging @elastic/response-ops (Team:ResponseOps) |
Moving to the backlog as updating the interval is not a requirement for the first iteration of bulk update rules API. |
Pinging @elastic/security-detections-response (Team:Detections and Resp) |
Pinging @elastic/security-solution (Team: SecuritySolution) |
I am working on a proposal and meeting with @vitaliidm in the coming weeks. Thank you for helping with the PR! 🙏 |
I've updated the issue description to have a Requirements and Proposal section reflecting what was discussed and proposed to the team. |
After discussion with @mikecote and @XavierM we agreed on following implementation:
bulkUpdateSchedules(taskIds: string[], schedule: string)
newRunAt = oldRunAt - oldInterval + newInterval
|
^^ correct, I remove the handling of 409 errors from the issue description. 👍 (because 409 errors would usually mean the idle task changed to claiming or running and will update the schedule after it's run). |
Addresses: #124850 ## Summary - Adds new method Task Manager API `bulkUpdateSchedules` - Adds calling `taskManager.bulkUpdateSchedules` in rulesClient.bulkEdit to update tasks if updated rules have `scheduleTaskId` property - Enables the rest of operations for rulesClient.bulkEdit (set schedule, notifyWhen, throttle) - #### bulkUpdateSchedules Using `bulkUpdatesSchedules` you can instruct TaskManager to update interval of tasks that are in `idle` status. When interval updated, new `runAt` will be computed and task will be updated with that value ```js export class Plugin { constructor() { } public setup(core: CoreSetup, plugins: { taskManager }) { } public start(core: CoreStart, plugins: { taskManager }) { try { const bulkUpdateResults = await taskManager.bulkUpdateSchedule( ['97c2c4e7-d850-11ec-bf95-895ffd19f959', 'a5ee24d1-dce2-11ec-ab8d-cf74da82133d'], { interval: '10m' }, ); // If no error is thrown, the bulkUpdateSchedule has completed successfully. // But some updates of some tasks can be failed, due to OCC 409 conflict for example } catch(err: Error) { // if error is caught, means the whole method requested has failed and tasks weren't updated } } } ``` ### in follow-up PRs - use `taskManager.bulkUpdateSchedules` in rulesClient.update (#134027) - functional test for bulkEdit (#133635) ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios ### Release note Adds new method to Task Manager - bulkUpdatesSchedules, that allow bulk updates of scheduled tasks. Adds 3 new operations to rulesClient.bulkUpdate: update of schedule, notifyWhen, throttle.
@vitaliidm This issue is closed by #132637 right? |
Hey @ymao1
Let's close it, after the first one gets merged (RulesClient.update refactoring), when all scope of this task is addressed |
…kUpdateSchedules (#134027) ## Summary - follow-up to #132637, #124850 - replaces in `RulesClient.update` method TaskManager API `runNow` to `bulkUpdateSchedules` When using runNow in scale, there can be situations, when TaskManager capacity is full, thus leading failure of `runNow`. Instead, new API `bulkUpdateSchedules` will be used, which in case if rule schedule is getting updated: will update underlying task schedule and will calculate new `runAt` time. More details on new TaskManager API: #132637, #124850 ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios
To support #124715 and the existing functionality when updating a rule, we need to reflect the rule's schedule in the task's schedule. We previously ran the rule right away and let it update the task schedule at the end. However, when thinking of doing so in bulk, we encounter thundering herd and worker capacity problems.
We think of a way to allow bulk updates of tasks with a new schedule that doesn't bring thundering herd or Kibana capacity problems. Maybe something like rescheduling each task between now and the interval, at random, for each. This functionality could also be re-used in the current rule update API to keep behaviour the same (bulk vs single update).
Requirements
Proposal
For Alerting and Task Manager to support updating task schedules in bulk, we should avoid clogging the Task Manager queue and instead calculate the next
runAt
based on the new interval and the last run using the formula below. By only updating the task document, it allows any Kibana instance pick up these re-scheduled tasks.newRunAt = oldRunAt - oldInterval + newInterval
Note: If newRunAt is < now, we will set it to now
Task Manager will only update task schedules of non-running tasks. Tasks that are claiming or running will have their schedule updated automatically at the end of their run.
The
update(...)
function of therulesClient
will move to use the newly proposedupdateSchedule
API (see below) to keep the behaviour the same asbulkUpdate(...)
.Task Manager API proposal
updateSchedule(taskIds: string[], schedule: string)
Since the bulk edit API in alerting takes a single schedule and applies it to all alerting rules. We can do the same here by taking a single
schedule
parameter and applying it to alltasksIds
passed in.The function should do the following:
runAt
for each taskNote: The alerting API shouldn’t return an error if we can’t update all tasks successfully (they will eventually be updated, this is a best effort mechanism)
The text was updated successfully, but these errors were encountered: