Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Security Solution] Rule schemas tech debt #138606

Open
6 of 10 tasks
banderror opened this issue Aug 11, 2022 · 2 comments
Open
6 of 10 tasks

[Security Solution] Rule schemas tech debt #138606

banderror opened this issue Aug 11, 2022 · 2 comments
Labels
epic Feature:Detection Rules Security Solution rules and Detection Engine refactoring Team:Detection Rule Management Security Detection Rule Management Team Team:Detections and Resp Security Detection Response Team Team: SecuritySolution Security Solutions Team working on SIEM, Endpoint, Timeline, Resolver, etc. technical debt Improvement of the software architecture and operational architecture

Comments

@banderror
Copy link
Contributor

banderror commented Aug 11, 2022

Related to: #80792

Summary

TL;DR: We have multiple rule schemas on various levels which increases the costs of maintaining existing rule types and adding new rule types or parameters. We should have just one schema for reading rules + a few modifications of it for creating rules, importing rules, installing prebuilt rules, and such.

Problem

On a high level, we have the following rule schemas:

  1. Schema of the data we store to Elasticsearch via RulesClient. These are the rule types and parameters that are known to the Alerting Framework.
  2. Schema of rules that we return from our HTTP endpoints.
  3. Schema of rules that can be accepted by the import endpoint.
  4. Schema of rules that can be accepted by the "add prepackaged rules" endpoint.
  5. POST, PUT and PATCH schemas of rules that can be accepted by the CRUD endpoints.
  6. A completely separate schema of rules on the FE side.
  7. Schemas of forms used on the Rule Creation/Editing pages.

Every time we need to add a new rule parameter (doesn't matter a common one or a rule type-specific one) or implement a new rule type, we need to correctly adjust the above 7 schemas + a big number of related files. The problem is:

Solution

We should reduce the 7 schemas we have right now to the following ones:

  1. We should have only one single schema for reading rules from the API. It should live in the common folder. This would be our "source of truth" and the main rule schema. We shouldn't have a separate FE schema.
  2. We will have a few modifications of it for creating rules, importing rules, installing prebuilt rules, and such. These should be derivatives from the main schema.
  3. We will have a schema for the data we store to Elasticsearch via RulesClient.
  4. We will have schemas of forms used on the Rule Creation/Editing pages.

It will be a successful refactoring if, given the refactored schemas:

  • It is easy to add a new common or rule type-specific parameter.
  • It is easy to add a new rule type.
  • Adding them does not require changing a lot of files.
  • Adding them is low risk.
  • There are no duplicated schemas.
  • There is no copy-paste of fields in schemas and functions.
  • When passing a rule from one function to another, the rule is passed as a whole instead of passing its destructured parameters.

Details

Schemas we have

Backend level:

  • Alerting Framework rule types and params:
    • security_solution/server/lib/detection_engine/schemas/rule_schemas.ts
    • security_solution/server/lib/detection_engine/schemas/rule_converters.ts

HTTP API level:

  • common types (domain model):
    • security_solution/common/detection_engine/schemas/common/schemas.ts
    • security_solution/common/detection_engine/schemas/response/rules_schema.ts
    • security_solution/common/detection_engine/schemas/request/rule_schemas.ts
    • security_solution/common/detection_engine/schemas/request/add_prepackaged_rules_schema.ts
    • security_solution/common/detection_engine/schemas/request/import_rules_schema.ts
  • request-response types:
    • security_solution/common/detection_engine/schemas/request/*
    • security_solution/common/detection_engine/schemas/response/*

Frontend level:

  • common types (domain model):
    • security_solution/public/detections/containers/detection_engine/rules/types.ts
  • request-response types:
    • security_solution/public/detections/containers/detection_engine/rules/types.ts
    • security_solution/public/detections/containers/detection_engine/rules/api.ts

Sub-tasks

@banderror banderror added refactoring technical debt Improvement of the software architecture and operational architecture Team:Detections and Resp Security Detection Response Team Team: SecuritySolution Security Solutions Team working on SIEM, Endpoint, Timeline, Resolver, etc. epic Team:Detection Rule Management Security Detection Rule Management Team labels Aug 11, 2022
@elasticmachine
Copy link
Contributor

Pinging @elastic/security-solution (Team: SecuritySolution)

@elasticmachine
Copy link
Contributor

Pinging @elastic/security-detections-response (Team:Detections and Resp)

@banderror banderror added the Feature:Detection Rules Security Solution rules and Detection Engine label Aug 11, 2022
banderror added a commit that referenced this issue Oct 21, 2022
…toring Rule Management (#142950)

**Partially addresses:** #138600, #92169, #138606
**Addresses:** #136957, #136962, #138614

## Summary

In this PR we are:

- Splitting the Detection Engine into subdomains ([ticket](#138600)). Every subdomain got its own folder under `detection_engine`, and we moved some (not all) code into them. More on that is below. New subdomains introduced:
  - `fleet_integrations`
  - `prebuilt_rules`
  - `rule_actions_legacy`
  - `rule_exceptions`
  - `rule_management`
  - `rule_preview`
  - `rule_schema`
  - `rule_creation_ui`
  - `rule_details_ui`
  - `rule_management_ui`
  - `rule_exceptions_ui`
- Updating the CODEOWNERS file accordingly.
- Refactoring the Rule Management page and the Rules table. Our main focus was on the way how we communicate with the API endpoints, how we cache and invalidate the fetched data, and how this code is organized in the codebase. More on that is below.
- Increasing the bundle size limit. This is going to be decreased back in a follow-up PR ([ticket](#143532))

## Restructuring folders into subdomains

For the background and problem statement, please refer to #138600

We were focusing on code that is closely related to the Rules area: either owned by us de facto (we work on it) or owned by us de jure (according to the CODEOWNERS file). Or goal was to explicitly extract code that we don't own de facto into separate subdomains, transfer ownership to other area teams, and reflect this in the CODEOWNERS file. On the other hand, we wanted the code that we own to also be organized in clear subdomains that we could easily own via CODEOWNERS. We didn't touch the code that is already explicitly owned by other area teams, e.g. `x-pack/plugins/security_solution/server/lib/detection_engine/rule_types`.

This is a draft "domain map" - an architectural diagram that shows how the Detection Engine _could_ be split into subdomains. It's more a TO-BE idea/aspiration rather than an AS-IS statement. Any feedback, critiques, and suggestions would be extremely appreciated!

<img width="2592" alt="Screenshot 2022-10-18 at 16 08 40" src="https://user-images.githubusercontent.com/7359339/196453965-b65f5b49-9a33-4d90-bb48-1347e9576223.png">

It shows the flow of dependencies between subdomains and proposes some rules:

- The whole graph of dependencies between all subdomains should be a DAG. There should not be bi-directional or circular dependencies between them.
- **Generic subdomains** represent some general knowledge that can be used/applied outside of the Detection Engine.
  - Can depend on some generic kbn packages, npm packages or utils.
  - Can't depend on any other Detection Engine subdomains.
- **Crosscutting subdomains** represent some code that can be common to / shared between many other subdomains. This could be some very common domain models and API schemas.
  - Can depend on generic subdomains.
  - Can depend on other crosscutting subdomains (dependencies between them must form a DAG).
  - Can't depend on core or UI subdomains.
- **Core subdomains** contain most of the "meat" of the Detection Engine: domain models, server-side and client-side business logic, server-side API endpoints, client-side UI (potentially shareable between several pages).
  - Can depend on crosscutting and generic subdomains.
  - Can depend on other core subdomains (dependencies between them must form a DAG).
  - Can't depend on UI subdomains.
- **UI subdomains** contain the implementation of pages related to the Detection Engine. Every page can easily depend on several core subdomains, so these subdomain are on top of everything.
  - Can depend on any other subdomains. Dependencies must form a DAG.

Dashed lines show some existing dependencies that we think should be eliminated.

Ownership TO-BE is color-coded. We updated the CODEOWNERS file according to the new folders.

The folder restructuring is not 100% finished but we did a big part of it. Most of the FE code continues to live in legacy folders, e.g. see `x-pack/plugins/security_solution/public/detections`. So this work is to be continued...

## Refactoring of Rule Management FE

- [x] #136957 For effective HTTP requests caching and deduplication, we've migrated all data fetching logic to `useQuery` and `useMutation` hooks from `react-query`. That allowed us to introduce the following improvements to our codebase:
  * All outgoing HTTP requests are now automatically deduplicated. That means that data fetching hooks like `useRule` could be used on any level in the component tree to access response data directly. So, no need to put the hook on the top level anymore and use prop-drilling to make the response data available to all children components that require it.
  * All HTTP responses are now cached with the default TTL of 5 minutes—no more redundant requests. With a hot cache, transitions to some pages now happen immediately. 
- [x] #136962 Data fetching hooks of the Rules Area are now organized in one place. `security_solution/public/detection_engine/rule_management/api/hooks` contains abstraction layer on top of the Kibana's HTTP client. All data fetching should happen exclusively through that layer to ensure that:
  * Mutation queries automatically invalidate associated cache entries.
  * Optimistic updates or updates from mutation responses could be implemented centrally where possible.
- [x] #92169 From some of the Rule Management components, logic was extracted to hooks located in `security_solution/public/detection_engine/rule_management/logic`. 

### Checklist

- [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic Feature:Detection Rules Security Solution rules and Detection Engine refactoring Team:Detection Rule Management Security Detection Rule Management Team Team:Detections and Resp Security Detection Response Team Team: SecuritySolution Security Solutions Team working on SIEM, Endpoint, Timeline, Resolver, etc. technical debt Improvement of the software architecture and operational architecture
Projects
None yet
Development

No branches or pull requests

2 participants