[Security Solution] Rule schemas tech debt #138606

banderror · 2022-08-11T12:15:00Z

Related to: #80792

Summary

TL;DR: We have multiple rule schemas on various levels which increases the costs of maintaining existing rule types and adding new rule types or parameters. We should have just one schema for reading rules + a few modifications of it for creating rules, importing rules, installing prebuilt rules, and such.

Problem

On a high level, we have the following rule schemas:

Schema of the data we store to Elasticsearch via RulesClient. These are the rule types and parameters that are known to the Alerting Framework.
Schema of rules that we return from our HTTP endpoints.
Schema of rules that can be accepted by the import endpoint.
Schema of rules that can be accepted by the "add prepackaged rules" endpoint.
POST, PUT and PATCH schemas of rules that can be accepted by the CRUD endpoints.
A completely separate schema of rules on the FE side.
Schemas of forms used on the Rule Creation/Editing pages.

Every time we need to add a new rule parameter (doesn't matter a common one or a rule type-specific one) or implement a new rule type, we need to correctly adjust the above 7 schemas + a big number of related files. The problem is:

It requires a lot of effort: it's about 40-80 files to change. Example PRs:
There's a high chance of human error. It's easy to overlook something because not all the functions are statically typed.

Solution

We should reduce the 7 schemas we have right now to the following ones:

We should have only one single schema for reading rules from the API. It should live in the common folder. This would be our "source of truth" and the main rule schema. We shouldn't have a separate FE schema.
We will have a few modifications of it for creating rules, importing rules, installing prebuilt rules, and such. These should be derivatives from the main schema.
We will have a schema for the data we store to Elasticsearch via RulesClient.
We will have schemas of forms used on the Rule Creation/Editing pages.

It will be a successful refactoring if, given the refactored schemas:

It is easy to add a new common or rule type-specific parameter.
It is easy to add a new rule type.
Adding them does not require changing a lot of files.
Adding them is low risk.
There are no duplicated schemas.
There is no copy-paste of fields in schemas and functions.
When passing a rule from one function to another, the rule is passed as a whole instead of passing its destructured parameters.

Details

Schemas we have

Backend level:

Alerting Framework rule types and params:
- security_solution/server/lib/detection_engine/schemas/rule_schemas.ts
- security_solution/server/lib/detection_engine/schemas/rule_converters.ts

HTTP API level:

common types (domain model):
- security_solution/common/detection_engine/schemas/common/schemas.ts
- security_solution/common/detection_engine/schemas/response/rules_schema.ts
- security_solution/common/detection_engine/schemas/request/rule_schemas.ts
- security_solution/common/detection_engine/schemas/request/add_prepackaged_rules_schema.ts
- security_solution/common/detection_engine/schemas/request/import_rules_schema.ts
request-response types:
- security_solution/common/detection_engine/schemas/request/*
- security_solution/common/detection_engine/schemas/response/*

Frontend level:

common types (domain model):
- security_solution/public/detections/containers/detection_engine/rules/types.ts
request-response types:
- security_solution/public/detections/containers/detection_engine/rules/types.ts
- security_solution/public/detections/containers/detection_engine/rules/api.ts

Sub-tasks

The text was updated successfully, but these errors were encountered:

elasticmachine · 2022-08-11T12:15:02Z

Pinging @elastic/security-solution (Team: SecuritySolution)

elasticmachine · 2022-08-11T12:15:02Z

Pinging @elastic/security-detections-response (Team:Detections and Resp)

…toring Rule Management (#142950) **Partially addresses:** #138600, #92169, #138606 **Addresses:** #136957, #136962, #138614 ## Summary In this PR we are: - Splitting the Detection Engine into subdomains ([ticket](#138600)). Every subdomain got its own folder under `detection_engine`, and we moved some (not all) code into them. More on that is below. New subdomains introduced: - `fleet_integrations` - `prebuilt_rules` - `rule_actions_legacy` - `rule_exceptions` - `rule_management` - `rule_preview` - `rule_schema` - `rule_creation_ui` - `rule_details_ui` - `rule_management_ui` - `rule_exceptions_ui` - Updating the CODEOWNERS file accordingly. - Refactoring the Rule Management page and the Rules table. Our main focus was on the way how we communicate with the API endpoints, how we cache and invalidate the fetched data, and how this code is organized in the codebase. More on that is below. - Increasing the bundle size limit. This is going to be decreased back in a follow-up PR ([ticket](#143532)) ## Restructuring folders into subdomains For the background and problem statement, please refer to #138600 We were focusing on code that is closely related to the Rules area: either owned by us de facto (we work on it) or owned by us de jure (according to the CODEOWNERS file). Or goal was to explicitly extract code that we don't own de facto into separate subdomains, transfer ownership to other area teams, and reflect this in the CODEOWNERS file. On the other hand, we wanted the code that we own to also be organized in clear subdomains that we could easily own via CODEOWNERS. We didn't touch the code that is already explicitly owned by other area teams, e.g. `x-pack/plugins/security_solution/server/lib/detection_engine/rule_types`. This is a draft "domain map" - an architectural diagram that shows how the Detection Engine _could_ be split into subdomains. It's more a TO-BE idea/aspiration rather than an AS-IS statement. Any feedback, critiques, and suggestions would be extremely appreciated! <img width="2592" alt="Screenshot 2022-10-18 at 16 08 40" src="https://user-images.githubusercontent.com/7359339/196453965-b65f5b49-9a33-4d90-bb48-1347e9576223.png"> It shows the flow of dependencies between subdomains and proposes some rules: - The whole graph of dependencies between all subdomains should be a DAG. There should not be bi-directional or circular dependencies between them. - **Generic subdomains** represent some general knowledge that can be used/applied outside of the Detection Engine. - Can depend on some generic kbn packages, npm packages or utils. - Can't depend on any other Detection Engine subdomains. - **Crosscutting subdomains** represent some code that can be common to / shared between many other subdomains. This could be some very common domain models and API schemas. - Can depend on generic subdomains. - Can depend on other crosscutting subdomains (dependencies between them must form a DAG). - Can't depend on core or UI subdomains. - **Core subdomains** contain most of the "meat" of the Detection Engine: domain models, server-side and client-side business logic, server-side API endpoints, client-side UI (potentially shareable between several pages). - Can depend on crosscutting and generic subdomains. - Can depend on other core subdomains (dependencies between them must form a DAG). - Can't depend on UI subdomains. - **UI subdomains** contain the implementation of pages related to the Detection Engine. Every page can easily depend on several core subdomains, so these subdomain are on top of everything. - Can depend on any other subdomains. Dependencies must form a DAG. Dashed lines show some existing dependencies that we think should be eliminated. Ownership TO-BE is color-coded. We updated the CODEOWNERS file according to the new folders. The folder restructuring is not 100% finished but we did a big part of it. Most of the FE code continues to live in legacy folders, e.g. see `x-pack/plugins/security_solution/public/detections`. So this work is to be continued... ## Refactoring of Rule Management FE - [x] #136957 For effective HTTP requests caching and deduplication, we've migrated all data fetching logic to `useQuery` and `useMutation` hooks from `react-query`. That allowed us to introduce the following improvements to our codebase: * All outgoing HTTP requests are now automatically deduplicated. That means that data fetching hooks like `useRule` could be used on any level in the component tree to access response data directly. So, no need to put the hook on the top level anymore and use prop-drilling to make the response data available to all children components that require it. * All HTTP responses are now cached with the default TTL of 5 minutes—no more redundant requests. With a hot cache, transitions to some pages now happen immediately. - [x] #136962 Data fetching hooks of the Rules Area are now organized in one place. `security_solution/public/detection_engine/rule_management/api/hooks` contains abstraction layer on top of the Kibana's HTTP client. All data fetching should happen exclusively through that layer to ensure that: * Mutation queries automatically invalidate associated cache entries. * Optimistic updates or updates from mutation responses could be implemented centrally where possible. - [x] #92169 From some of the Rule Management components, logic was extracted to hooks located in `security_solution/public/detection_engine/rule_management/logic`. ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios

banderror added the Feature:Detection Rules Security Solution rules and Detection Engine label Aug 11, 2022

banderror mentioned this issue Aug 11, 2022

[Security Solution] Expose EQL search API configs in EQL rules #130339 #132247

Merged

2 tasks

banderror mentioned this issue Sep 8, 2022

[Security Solution][Alerts] Remove legacy rules schema #137605

Merged

banderror mentioned this issue Sep 29, 2022

[Security Solution] Adds Cypress test for index action in detection rules #141069

Merged

banderror mentioned this issue Oct 17, 2022

[Security Solution] Restructuring folders of Detection Engine + refactoring Rule Management #142950

Merged

4 tasks

This was referenced Dec 13, 2022

[Security Solution] Rule filters: strict schema #147438

Open

[Security Solution] Rule filters: saved object migrations #147441

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Security Solution] Rule schemas tech debt #138606

[Security Solution] Rule schemas tech debt #138606

banderror commented Aug 11, 2022 •

edited

Loading

elasticmachine commented Aug 11, 2022

elasticmachine commented Aug 11, 2022

[Security Solution] Rule schemas tech debt #138606

[Security Solution] Rule schemas tech debt #138606

Comments

banderror commented Aug 11, 2022 • edited Loading

Summary

Problem

Solution

Details

Schemas we have

Sub-tasks

elasticmachine commented Aug 11, 2022

elasticmachine commented Aug 11, 2022

banderror commented Aug 11, 2022 •

edited

Loading