[RAM] Update rule status #140882

XavierM · 2022-09-16T17:16:56Z

Summary

Resolves the parent issue: #136039

Also resolves the subtasks:

This is the backend portion of the consolidated rule status feature. It mainly contains changes to the rules_client.ts and task_runner.ts to support the new consolidated rule statuses.

This PR added a new property: lastRun to the rules saved object to hold the new rule outcome statuses (succeeded, warning, and failed) as the new simplified rule status over the existing executionStatus property. However, we are keeping the old executionStatus so we can slowly migrate the rest of the application to use the new lastRun outcomes.

In addition, we have enriched the monitoring property to be the source of truth for metrics related to the last run (as well as new fields that other plugins will find useful). We also added a monitoring service that allows other plugins to easily add data to the monitoring field.

To test this PR, please use #144466 since it has both the frontend and backend changes.

Checklist

Documentation was added for features that require explanation or tutorials
Unit or functional tests were updated or added to match the most common scenarios

…-ref HEAD~1..HEAD --fix'

…-fix'

…atus

…136039-rules-status

…-fix'

…-ref HEAD~1..HEAD --fix'

elasticmachine · 2022-11-02T17:08:18Z

Pinging @elastic/response-ops (Team:ResponseOps)

XavierM · 2022-11-02T19:04:56Z

x-pack/plugins/alerting/server/saved_objects/mappings.ts

@@ -255,5 +251,80 @@ export const alertMappings: SavedObjectsTypeMappingDefinition = {
        },
      },
    },
+    running: {


let's delete running, we are not using it and too scare to make another call to SO. Let's wait and see

This was going to be my question actually...if I create a rule, what is the status prior to the first run? Or if I disable it before its run?

we were thinking that we can tell an approximation time when the rule is going to run. if you disable it before its run, we won't show anything, do you have a better idea?

After chatting, we opted to simply show -- if the rule has never run. My initial concern was if I create a rule and see nothing in this column, I would wonder if the rule is starting or not. So for this scenario, we'll also show the -- but do a similar treatment as we do for Stat loading indicator. We felt this was the simplest solution at this time and still indicated to the user that something is happening with the rule after I create it.

ymao1

Did initial code review only of the task runner. Still need to look at the rules client and migration and pull it down to run it. Looking good tho!

x-pack/plugins/alerting/server/lib/monitoring.ts

x-pack/plugins/alerting/server/monitoring/rule_monitoring_client.ts

x-pack/plugins/alerting/server/task_runner/task_runner.ts

x-pack/plugins/alerting/server/types.ts

ymao1 · 2022-11-03T13:23:46Z

x-pack/plugins/alerting/server/rules_client/rules_client.ts

-      monitoring: getDefaultRuleMonitoring(),
+      running: false,
+      executionStatus: getRuleExecutionStatusPending(lastRunTimestamp.toISOString()),
+      monitoring: getDefaultMonitoring(lastRunTimestamp.toISOString()),


When a rule is created, before running, it looks like monitoring is set to this:

"monitoring": { "run": { "history": [], "calculated_metrics": { "success_ratio": 0 }, "last_run": { "timestamp": "2022-11-03T13:06:31.486Z", "metrics": { "duration": 0, "total_search_duration_ms": null, "total_indexing_duration_ms": null, "total_alerts_detected": null, "total_alerts_created": null, "gap_duration_s": null } } } }

Should those initial duration be null since we don't know it? Also is setting last_run.timestamp to the current date confusing since it hasn't run yet?

I'm thinking of a case where a rule is created in an overloaded Kibana cluster, so it might take longer than expected for the task to get claimed and run. In the UI, this last_run.timestamp would show up and give the impression that the rule has already run? Maybe I'm overthinking :)

That is a good point, having 0 for the duration is definitely misleading

As for the timestamp, we're essentially mirroring existing executionStatus -> lastExecutionDate field with the lastRun -> timestamp field. So I guess setting the timestamp during create was existing behavior but we could fix it for both of the existing behavior if it doesn't make sense.

That make sense!

ymao1 · 2022-11-03T13:27:39Z

x-pack/plugins/alerting/server/task_runner/task_runner.ts

@@ -783,7 +799,7 @@ export class TaskRunner<

        return { interval: retryInterval };
      }),
-      monitoring,
+      monitoring: this.ruleMonitoring.getMonitoring(),


After 1 execution, my rule saved object looks like:

{ ... other fields "executionStatus": { "status": "active", "lastExecutionDate": "2022-11-03T13:09:15.575Z", "error": null, "warning": null, "lastDuration": 7270 }, "monitoring": { "run": { "history": [{ "duration": 7270, "success": true, "timestamp": 1667480955574 }], "calculated_metrics": { "success_ratio": 1, "p99": 7270, "p50": 7270, "p95": 7270 }, "last_run": { "timestamp": "2022-11-03T13:09:15.574Z", "metrics": { "duration": 0, "total_search_duration_ms": null, "total_indexing_duration_ms": null, "total_alerts_detected": null, "total_alerts_created": null, "gap_duration_s": null } } } }, "lastRun": { "alertsCount": { "new": 5, "ignored": 0, "recovered": 0, "active": 5 }, "outcomeMsg": null, "warning": null, "outcome": "succeeded" }, }

The monitoring.run.last_run.duration field doesn't look like it's been updated to the last duration.

I'm sure this was covered in the RFC and I'm behind, but why a separate warning and outcomeMsg field? The outcome can be succeeded | failed | warning so I would expect any warnings to show up as outcome: 'warning', outcomeMsg: <warning message>?

Ah yes good catch WRT the duration, will fix that.

From the RFC, the warning field is described to act sort of like a decorator to the outcome. My understanding is the outcome could be successful but there might have been a special flag raised in the rule run that should be noted. So in this case the warning field gets mapped to reasons. @XavierM could correct me if I'm wrong here.

x-pack/plugins/alerting/server/saved_objects/migrations/8.6/index.ts

…atus

JiaweiWu · 2022-11-14T06:36:50Z

@elasticmachine merge upstream

…136039-rules-status

ymao1

LGTM!

JiaweiWu · 2022-11-14T16:59:18Z

@elasticmachine merge upstream

…136039-rules-status

JiaweiWu · 2022-11-14T17:53:33Z

@elasticmachine merge upstream

JiaweiWu · 2022-11-14T18:45:44Z

@elasticmachine merge upstream

kibana-ci · 2022-11-14T20:21:58Z

💚 Build Succeeded

Buildkite Build
Commit: 367b3d8

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`alerting`	379	406	+27

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id	before	after	diff
`alerting`	26	27	+1

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`alerting`	38.7KB	39.2KB	+443.0B

Saved Objects .kibana field count

Every field in each saved object type adds overhead to Elasticsearch. Kibana needs to keep the total field count below Elasticsearch's default limit of 1000 fields. Only specify field mappings for the fields you wish to search on or query. See https://www.elastic.co/guide/en/kibana/master/saved-objects-service.html#_mappings

id	before	after	diff
`alert`	74	95	+21

Unknown metric groups

API count

id	before	after	diff
`alerting`	388	415	+27

ESLint disabled in files

id	before	after	diff
`osquery`	1	2	+1

ESLint disabled line counts

id	before	after	diff
`alerting`	68	70	+2
`enterpriseSearch`	19	21	+2
`fleet`	59	65	+6
`osquery`	108	113	+5
`securitySolution`	441	447	+6
total			+21

References to deprecated APIs

id	before	after	diff
`alerting`	84	96	+12

Total ESLint disabled count

id	before	after	diff
`alerting`	69	71	+2
`enterpriseSearch`	20	22	+2
`fleet`	67	73	+6
`osquery`	109	115	+6
`securitySolution`	518	524	+6
total			+22

History

💔 Build #87656 failed 49d07a1
💔 Build #87623 failed c8e1da5
💔 Build #87562 failed 1869e17
💛 Build #87357 was flaky bbef4eb
💔 Build #86893 failed 897ec3f
💔 Build #86772 failed f0b1c66

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

* main: (65 commits) Migrate server-side `Root` and `Server` to packages (elastic#144990) [Discover] Handle no data views state for `esQuery` alert (elastic#145052) [ML] Allow updates for number of allocations and priority for trained model deployments (elastic#144704) [api-docs] 2022-11-15 Daily api_docs build (elastic#145203) [Security solution] remove guided onboarding feature flag (elastic#144247) [DOCS] Automate final case APIs (elastic#145007) [Enterprise Search] Name and description flyout for connectors (elastic#143827) [Guided onboarding] Update header button logic (elastic#144634) [Lens] Multi metric partition charts (elastic#143966) [Dashboard] [Controls] Add unmapped runtime field support to options list (elastic#144947) [Security Solution] Add Task Metric Collection to New Tasks (elastic#145181) [TriggersActionsUi] disable jest config in CI (elastic#145186) [TableListView] Enhance tag filtering (elastic#142108) [Cloud Posture] Compliance by CIS section table (elastic#145114) [8.6][Session View] Fix hidden alert flyout in session view (elastic#145141) [customIntegrations] async load all components (elastic#145166) Fix time for logs smoke tests in integration test (elastic#145130) [RAM] Update rule status (elastic#140882) Update babel (main) (elastic#145060) [Actionable Observability] Add context.alertDetailsUrl variable to action connector template for APM rule types (elastic#144791) ...

## Summary Parent issue for updating rule status: #136039 Frontend issue: #145191 Backend PR: #140882 Updates the rules list and rules details page to support the new consolidated statuses. With E2E and unit testing. Rules list: - Table cell values - Last response filter - Table cell filtering - Status aggregations Rule details: - Rule status summary - KPI headers renaming - Event log cells renaming ![dashdash](https://user-images.githubusercontent.com/74562234/201778676-775f58e9-6707-4972-a1ca-2dcf71befc5b.png) ![rule_details_consolidate](https://user-images.githubusercontent.com/74562234/201778792-f03c368a-3b0d-43cf-805e-f8151b4b96ae.png) ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios Co-authored-by: Xavier Mouligneau <xavier.mouligneau@elastic.co> Co-authored-by: kibanamachine <42973632+kibanamachine@users.noreply.github.com>

…ead of saved object (#147035) **Addresses:** #130966 **Based on:** #135127 ## Summary This PR deprecates the Sidecar SO of type `siem-detection-engine-rule-execution-info` in favour of storing Rule Execution Logging data within the rule itself, making use of the work previously done in the Alerting Framework: - #140882 - #147278 Work done: - **Pass execution statuses and metrics from rule executors to the Framework:** through the use of `RuleMonitoringService` and `RuleResultService` from within the rule execution log client for executor. `x-pack/plugins/security_solution/server/lib/detection_engine/rule_monitoring/logic/rule_execution_log/client_for_executors/client.ts` - **Fetch execution statuses and metrics from rules themselves instead of the sidecar `siem-detection-engine-rule-execution-info` saved objects**: through the use of the new function `createRuleExecutionSummary` in `x-pack/plugins/security_solution/server/lib/detection_engine/rule_monitoring/logic/rule_execution_log/create_rule_execution_summary.ts`, which extracts last execution information from the rule itself. - **Remove the siem-detection-engine-rule-execution-info saved objects type from the codebase. Mark it as deleted in Kibana Core:** added `siem-detection-engine-rule-execution-info` to `packages/core/saved-objects/core-saved-objects-migration-server-internal/src/core/unused_types.ts`; and got rid of the related Saved Object client. - **Make sure to keep backward compatibility in the Detection API endpoints and rule execution events we write into the Event Log**: API compatibility is maintained. No breaking changes. ### Checklist - [ ] [Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html) was added for features that require explanation or tutorials - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios

…ead of saved object (elastic#147035) **Addresses:** elastic#130966 **Based on:** elastic#135127 ## Summary This PR deprecates the Sidecar SO of type `siem-detection-engine-rule-execution-info` in favour of storing Rule Execution Logging data within the rule itself, making use of the work previously done in the Alerting Framework: - elastic#140882 - elastic#147278 Work done: - **Pass execution statuses and metrics from rule executors to the Framework:** through the use of `RuleMonitoringService` and `RuleResultService` from within the rule execution log client for executor. `x-pack/plugins/security_solution/server/lib/detection_engine/rule_monitoring/logic/rule_execution_log/client_for_executors/client.ts` - **Fetch execution statuses and metrics from rules themselves instead of the sidecar `siem-detection-engine-rule-execution-info` saved objects**: through the use of the new function `createRuleExecutionSummary` in `x-pack/plugins/security_solution/server/lib/detection_engine/rule_monitoring/logic/rule_execution_log/create_rule_execution_summary.ts`, which extracts last execution information from the rule itself. - **Remove the siem-detection-engine-rule-execution-info saved objects type from the codebase. Mark it as deleted in Kibana Core:** added `siem-detection-engine-rule-execution-info` to `packages/core/saved-objects/core-saved-objects-migration-server-internal/src/core/unused_types.ts`; and got rid of the related Saved Object client. - **Make sure to keep backward compatibility in the Detection API endpoints and rule execution events we write into the Event Log**: API compatibility is maintained. No breaking changes. ### Checklist - [ ] [Documentation](https://www.elastic.co/guide/en/kibana/master/development-documentation.html) was added for features that require explanation or tutorials - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios

XavierM added 3 commits September 13, 2022 14:17

re-structure migrations code

1dd3687

change mappings to match new design

d1ca8f0

wip

0b74e72

XavierM requested a review from a team as a code owner September 16, 2022 17:16

XavierM marked this pull request as draft September 16, 2022 17:17

XavierM added release_note:enhancement Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams) v8.5.0 labels Sep 16, 2022

XavierM changed the title ~~[RAM] Add Stats on top of rules log~~ [RAM] Update rule status Sep 16, 2022

XavierM added 8.6 candidate and removed v8.5.0 labels Sep 16, 2022

kibanamachine and others added 11 commits September 16, 2022 17:24

[CI] Auto-commit changed files from 'node scripts/precommit_hook.js -…

bbfc35a

…-ref HEAD~1..HEAD --fix'

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache -…

5579627

…-fix'

Merge branch 'main' of github.com:elastic/kibana into 136039-rules-st…

defdf85

…atus

wip to add a monitoring service

1707107

Merge branch 'main' of github.com:elastic/kibana into 136039-rules-st…

eb8ce8b

…atus

Merge branch '136039-rules-status' of github.com:XavierM/kibana into …

7ca2fc2

…136039-rules-status

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache -…

c882426

…-fix'

jiawei work on backend of rule

0fe91eb

[CI] Auto-commit changed files from 'node scripts/precommit_hook.js -…

6b4531f

…-ref HEAD~1..HEAD --fix'

Fix conflicts

4a1de41

Fix some of the missing functions used in rules_client

6430691

XavierM marked this pull request as ready for review November 2, 2022 17:08

JiaweiWu mentioned this pull request Nov 2, 2022

[RAM] Update Rule Status - UI Changes #144466

Merged

1 task

XavierM commented Nov 2, 2022

View reviewed changes

ymao1 reviewed Nov 2, 2022

View reviewed changes

Fix types and unit tests

4c57e09

ymao1 reviewed Nov 3, 2022

View reviewed changes

Removing running and fix associated tests

2151d4f

XavierM and others added 3 commits November 10, 2022 12:31

Merge branch 'main' of github.com:elastic/kibana into 136039-rules-st…

cae7541

…atus

Address comments

2ce6b6e

Update rule monitoring mock

d0b5298

Merge branch 'main' into 136039-rules-status

bbef4eb

JiaweiWu removed the request for review from a team November 14, 2022 06:37

XavierM added 3 commits November 14, 2022 08:48

Merge branch '136039-rules-status' of github.com:XavierM/kibana into …

1bc1db7

…136039-rules-status

add more test

d9a6b9d

unit test

1869e17

ymao1 approved these changes Nov 14, 2022

View reviewed changes

kibanamachine and others added 3 commits November 14, 2022 11:59

Merge branch 'main' into 136039-rules-status

c8e1da5

fix type

f2e4735

Merge branch '136039-rules-status' of github.com:XavierM/kibana into …

fad9da5

…136039-rules-status

Merge branch 'main' into 136039-rules-status

49d07a1

Merge branch 'main' into 136039-rules-status

367b3d8

XavierM enabled auto-merge (squash) November 14, 2022 20:13

XavierM merged commit e9feb06 into elastic:main Nov 14, 2022

kibanamachine added v8.6.0 backport:skip This commit does not require backporting labels Nov 14, 2022

jpdjere mentioned this pull request Dec 29, 2022

[Security Solution] Write and read Rule Execution Logs from rule instead of saved object #147035

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RAM] Update rule status #140882

[RAM] Update rule status #140882

XavierM commented Sep 16, 2022 •

edited by JiaweiWu

Loading

elasticmachine commented Nov 2, 2022

XavierM Nov 2, 2022

mdefazio Nov 2, 2022

XavierM Nov 9, 2022

mdefazio Nov 10, 2022

ymao1 left a comment

ymao1 Nov 3, 2022

JiaweiWu Nov 4, 2022

XavierM Nov 7, 2022

ymao1 Nov 3, 2022

ymao1 Nov 3, 2022

JiaweiWu Nov 4, 2022

XavierM Nov 9, 2022

JiaweiWu commented Nov 14, 2022

ymao1 left a comment

JiaweiWu commented Nov 14, 2022

JiaweiWu commented Nov 14, 2022

JiaweiWu commented Nov 14, 2022

kibana-ci commented Nov 14, 2022

API count

ESLint disabled in files

ESLint disabled line counts

References to deprecated APIs

Total ESLint disabled count

[RAM] Update rule status #140882

[RAM] Update rule status #140882

Conversation

XavierM commented Sep 16, 2022 • edited by JiaweiWu Loading

Summary

Checklist

elasticmachine commented Nov 2, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ymao1 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JiaweiWu commented Nov 14, 2022

ymao1 left a comment

Choose a reason for hiding this comment

JiaweiWu commented Nov 14, 2022

JiaweiWu commented Nov 14, 2022

JiaweiWu commented Nov 14, 2022

kibana-ci commented Nov 14, 2022

💚 Build Succeeded

Metrics [docs]

Public APIs missing comments

Public APIs missing exports

Page load bundle

Saved Objects .kibana field count

API count

ESLint disabled in files

ESLint disabled line counts

References to deprecated APIs

Total ESLint disabled count

History

XavierM commented Sep 16, 2022 •

edited by JiaweiWu

Loading