
Add better support for metric data types (TSDB) #74660

Closed
imotov opened this issue Jun 28, 2021 · 11 comments
Assignees
Labels
>feature Meta :StorageEngine/TSDB You know, for Metrics Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo)

Comments

imotov (Contributor) commented Jun 28, 2021

Phase 0 - Inception

  • Obtain schemas annotated with dimensions and metrics from the Metrics team (small) @nik9000
  • Prototyping the Lucene data pull mechanism (medium) @imotov
  • Prototyping the data pull mechanism in Elasticsearch @imotov

Phase 1 - Mappings

Phase 2 - Ingest

Phase 2.1 Ingest follow ups

- [ ] Build the _id from dimension values
- [ ] Investigate moving timestamp to the front of the _id to automatically get an optimization on _id searches. Not sure if worth it - but possible. #84928 could be an alternative

  • Bring back something in the spirit of the append-only optimization, but that works for tsdb. That'd greatly improve write performance. Extract append-only optimization from Engine #84771 is a partial prototype
  • We store the _id in lucene stored fields. We could regenerate it from the _source or from doc values for the @timestamp and the _tsid. That'd save some bytes per document.
  • Move IndexRequest#autoGeneratId? It's a bit spooky where it is, but I don't like it in any other place either.
  • Improve error messages in _update_by_query when modifying the dimensions or @timestamp
  • On translog replay, during recovery, and on replicas we regenerate the _id and assert that it matches the _id from the primary. Should we? Probably. Let's make sure.
  • Add tsdb benchmarks to the nightlies
    - [ ] Document best practices for using dimensions-based ID generator including how to use this with component templates
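The two _id items above (building the _id from dimension values, and putting the timestamp at the front) can be sketched roughly as follows. This is a hypothetical illustration only, not the actual Elasticsearch implementation; the hashing scheme, field layout, and function name are assumptions:

```python
import base64
import hashlib
import struct


def build_tsdb_id(dimensions: dict, timestamp_millis: int) -> str:
    """Derive a document _id from the dimension values plus the timestamp.

    Putting the timestamp first means documents in the same time range share
    an _id prefix, which is the optimization idea discussed above.
    """
    # Hash the sorted dimension name/value pairs into a stable _tsid-like value.
    digest = hashlib.sha256()
    for name in sorted(dimensions):
        digest.update(name.encode("utf-8"))
        digest.update(b"\0")
        digest.update(str(dimensions[name]).encode("utf-8"))
        digest.update(b"\0")
    tsid = digest.digest()[:16]
    # Timestamp first, big-endian, so the raw bytes sort in time order.
    raw = struct.pack(">q", timestamp_millis) + tsid
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")


doc_id = build_tsdb_id({"host": "a1", "pod": "web-0"}, 1_656_400_000_000)
```

The key property is determinism: the same dimension values and timestamp always produce the same _id, which is what makes regenerating the _id from doc values (instead of storing it) possible.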

Phase 3.1 QL storage API (Postponed)

Phase 3.2 - Search MVP

Plans for time series support in the _search API are superseded by plans for this in ES|QL.

Phase 3.3 - Rollup / Downsampling

Phase 3.4 - TSID aggs (superseded by tsdb in ES|QL)

~~- [ ] Update min, max, sum, avg pipeline aggs for intermediate result filtering optimization~~
~~- [ ] Sliding window aggregation~~
~~- [ ] A way to filter to windows within the sliding window. Like "measurements taken in the last 30 seconds of the window".~~
~~- [ ] Open transform issue for newly added time series aggs~~
~~- [ ] Benchmarks for the tsid agg~~

Phase 3.5 - Downsampling follow ups

  • Handling histograms
  • SQL support for downsampling

Phase 4.0 - Compression

Phase 5.0 - Follow-ups and Nice-to-have-s

  • Default the setting's value to all of the keyword dimensions
  • Support shard splitting on time_series indices
  • Make an object or interface for _id's values. Right now it's a String that we encode with Uid.encodeId. That was reasonable. Maybe it still is. But it feels complex, especially for tsdb, whose _id is always some bytes. And encoding it also wastes a byte about 1/128 of the time. It's a common prefix byte, so this is probably not really an issue. But still. This is a big change, but it'd make ES easier to read. Probably wouldn't really improve the storage though.
  • Figure out how to specify tsdb settings in component templates. For example, index.routing_path can be specified in a composable index template if the data stream template's index.mode is set to time_series. But if this setting is specified in a component template, then it is required to also set the index.mode index setting. This feels backwards. @martijnvg
  • In order to retrieve the routing values (defined in index.routing_path), the source needs to be parsed on the coordinating node. However, in the case that an ingest pipeline is executed, the source of the document will be parsed a second time. Ideally the routing values should be extracted when ingest is performed, similar to how the @timestamp field is already retrieved from a document during pipeline execution.
  • In order to determine the backing index a document should be added to, its timestamp is parsed into an Instant. The format being used is: strict_date_optional_time_nanos||strict_date_optional_time||epoch_millis. This allows the regular date format, the date nanos format, and epoch millis defined as a string. We can optimise the date parsing if we know the exact format being used. For example, if the data stream has a parameter that indicates the exact date format, we can optimise parsing by using just strict_date_optional_time_nanos, strict_date_optional_time, or epoch_millis.
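The format-fallback idea in the last item can be sketched in a few lines of Python. This is not the actual Java parsing in Elasticsearch (and nanosecond precision is dropped, since Python datetimes stop at microseconds); it just illustrates why knowing the exact format up front avoids the try-each-format-in-order cost:

```python
from datetime import datetime, timezone

# Date formats tried in order, most specific first (a rough stand-in for
# strict_date_optional_time_nanos || strict_date_optional_time).
FORMATS = ("%Y-%m-%dT%H:%M:%S.%f%z", "%Y-%m-%dT%H:%M:%S%z")


def parse_timestamp(value: str) -> datetime:
    """Parse a timestamp the way the fallback chain above describes."""
    if value.isdigit():
        # epoch_millis supplied as a string of digits
        return datetime.fromtimestamp(int(value) / 1000.0, tz=timezone.utc)
    for fmt in FORMATS:
        try:
            return datetime.strptime(value, fmt)
        except ValueError:
            continue
    raise ValueError(f"unparseable timestamp: {value}")
```

If the exact format were known per data stream, the loop (and the exception-driven fallback) could be replaced with a single direct parse.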
@elasticmachine elasticmachine added the Team:Analytics Meta label for analytical engine team (ESQL/Aggs/Geo) label Jun 29, 2021
elasticmachine (Collaborator) commented:

Pinging @elastic/es-analytics-geo (Team:Analytics)

csoulios added a commit that referenced this issue Jul 9, 2021
This PR adds the following constraints to dimension fields:

    It must be an indexed field and must have doc values
    It cannot be multi-valued
    The number of dimension fields in the index mapping must not be more than 16. This should be configurable through an index property (index.mapping.dimension_fields.limit)
    keyword fields cannot be more than 1024 bytes long
    keyword fields must not use a normalizer

Based on the code added in PR #74450
Relates to #74660
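For illustration, a mapping that satisfies these constraints might look like the following. The index and field names are hypothetical, and the parameter is shown under its later name time_series_dimension (it was still called dimension when this commit landed):

```json
PUT my-tsdb-index
{
  "settings": {
    "index.mapping.dimension_fields.limit": 16
  },
  "mappings": {
    "properties": {
      "host": { "type": "keyword", "time_series_dimension": true },
      "pod":  { "type": "keyword", "time_series_dimension": true }
    }
  }
}
```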
csoulios added a commit that referenced this issue Sep 27, 2021
…rameters (#78265)

Backports the following PRs:

* Add dimension mapping parameter (#74450)

Added the dimension parameter to the following field types:

    keyword
    ip
    Numeric field types (integer, long, byte, short)

The dimension parameter is of type boolean (default: false) and is used
to mark that a field is a time series dimension field.

Relates to #74014

* Add constraints to dimension fields (#74939)

This PR adds the following constraints to dimension fields:

    It must be an indexed field and must have doc values
    It cannot be multi-valued
    The number of dimension fields in the index mapping must not be more than 16. This should be configurable through an index property (index.mapping.dimension_fields.limit)
    keyword fields cannot be more than 1024 bytes long
    keyword fields must not use a normalizer

Based on the code added in PR #74450
Relates to #74660

* Expand DocumentMapperTests (#76368)

Adds a test for setting the maximum number of dimensions setting and
tests the names and types of the metadata fields in the index.
Previously we just asserted the count of metadata fields. That made it
hard to read failures.

* Fix broken test for dimension keywords (#75408)

Test was failing because it was testing a 1024-byte-long keyword and the assertion was failing.

Closes #75225

* Checkstyle

* Add time_series_metric parameter (#76766)

This PR adds the time_series_metric parameter to the following field types:

    Numeric field types
    histogram
    aggregate_metric_double

* Rename `dimension` mapping parameter to `time_series_dimension` (#78012)

This PR renames dimension mapping parameter to time_series_dimension to make it consistent with time_series_metric parameter (#76766)

Relates to #74450 and #74014

* Add time series params to `unsigned_long` and `scaled_float` (#78204)

    Added the time_series_metric mapping parameter to the unsigned_long and scaled_float field types
    Added the time_series_dimension mapping parameter to the unsigned_long field type

Fixes #78100

Relates to #76766, #74450 and #74014

Co-authored-by: Nik Everett <nik9000@gmail.com>
imotov added a commit to imotov/elasticsearch that referenced this issue Oct 6, 2021
Exposes information about dimensions and metrics via field caps. This
information will be needed for PromQL support.

Relates to elastic#74660
imotov added a commit that referenced this issue Oct 13, 2021
Exposes information about dimensions and metrics via field caps. This
information will be needed for PromQL support.

Relates to #74660
@jrodewig jrodewig self-assigned this Nov 5, 2021
imotov added a commit that referenced this issue Nov 11, 2021
Adds basic support for selectors in TimeSeriesMetricsService

Relates to #74660
csoulios added a commit that referenced this issue Sep 7, 2022
This PR renames all public APIs for downsampling so that they contain the downsample
keyword instead of the rollup keyword that we had until now.

1. The API endpoint for the downsampling action is renamed to:

/source-index/_downsample/target-index

2. The ILM action is renamed to

PUT _ilm/policy/my_policy
{
  "policy": {
    "phases": {
      "warm": {
        "actions": {
          "downsample": {
            "fixed_interval": "24h"
          }
        }
      }
    }
  }
}

3.  unsupported_aggregation_on_rollup_index was renamed to unsupported_aggregation_on_downsampled_index

4. Internal transport actions were renamed:

    indices:admin/xpack/rollup -> indices:admin/xpack/downsample
    indices:admin/xpack/rollup_indexer -> indices:admin/xpack/downsample_indexer

5. Renamed the following index settings:

    index.rollup.source.uuid -> index.downsample.source.uuid
    index.rollup.source.name -> index.downsample.source.name
    index.rollup.status -> index.downsample.status

Finally, we renamed many internal variables and classes from *Rollup* to *Downsample*.
However, this effort will be completed in more than one PR so that we minimize conflicts with other in-flight PRs.

Relates to #74660
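For reference, the renamed endpoint from point 1 is invoked like this (source-index and target-index are placeholders, as in the commit message):

```json
POST /source-index/_downsample/target-index
{
  "fixed_interval": "1h"
}
```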
csoulios added a commit that referenced this issue Sep 15, 2022
This PR modifies downsampling operation so that it uses global ordinal to track tsid changes

PR depends on the work done in #90035

Relates to #74660
csoulios added a commit that referenced this issue Sep 19, 2022
This PR removes the feature flag for the time-series data ingestion and downsampling functionality,
making time-series indices and downsampling available

For more information about the released functionality, see #74660

Aggregation time_series still remains behind the feature flag
@oatkiller commented:

Sorry for my newbie question. Is this the same as https://www.elastic.co/guide/en/elasticsearch///reference/master/tsds.html ? Thanks

jrodewig (Contributor) commented Sep 19, 2022

@oatkiller Yes. It's the same thing. :) (I helped write the initial docs.)

Hope you enjoy Elastic! It's an awesome place to work.

martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Nov 8, 2022
Currently, the key is a map, which can make reducing large responses
more memory intensive than it should be. Also, the data structures
used during the reduce are not backed by big arrays, so they are not accounted for.

This commit changes how the key is represented internally,
by using a BytesRef instead of a Map. This commit doesn't change
how the key is represented in the response.
It also changes the reduce method to make use of the fact that
bucket keys are now bytes refs.

Relates to elastic#74660
juliaElastic (Contributor) commented:

@martijnvg Hi! I am working on a feature in the Fleet UI to enable the TSDB index setting, and trying to leave routing_path empty to rely on Elasticsearch's auto generation.

I'm getting this error when trying to set index.mode=time_series, tried on index template and also component template. Is there any way to work around this error and trigger the auto generation? Thanks!

   "caused_by": {
          "type": "illegal_argument_exception",
          "reason": "[index.mode=time_series] requires a non-empty [index.routing_path]"

martijnvg (Member) commented:

Hey @juliaElastic, can you point me to the composable index templates and component templates? Composable index templates are the place where this setting can be used. Typically with component templates, not all settings / mappings are present, and each component template needs to be valid on its own. So if the index.mode index setting has been specified in one component template, and the mappings or index.routing_path are in another component or composable index template, then storing the component template with the index.mode index setting fails: during validation it isn't valid on its own, because the index.mode index setting validation fails when there is no index.routing_path. Also, in the case index.routing_path is missing, the auto generation of the index.routing_path setting is only performed for composable index templates.
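A minimal sketch of a composable index template that passes this validation, keeping index.mode and index.routing_path together in the same template (the template name and field names are hypothetical):

```json
PUT _index_template/metrics-template
{
  "index_patterns": ["metrics-*"],
  "data_stream": {},
  "template": {
    "settings": {
      "index.mode": "time_series",
      "index.routing_path": ["host", "pod"]
    },
    "mappings": {
      "properties": {
        "host": { "type": "keyword", "time_series_dimension": true },
        "pod":  { "type": "keyword", "time_series_dimension": true }
      }
    }
  }
}
```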

juliaElastic (Contributor) commented Nov 10, 2022

I've tried to add it to the integrations index template here:
(screenshot)

As discussed on slack, the setting works fine on installing a package, and the routing_path is generated correctly on the data stream:
(screenshot)

I did see some errors when trying to add TSDB on existing templates, will check that again.

martijnvg added a commit that referenced this issue Nov 10, 2022
Currently, the key is a map, which can make reducing large responses
more memory intensive than it should be. Also, the map used during the
reduce to detect duplicate buckets is not taken into account by the circuit breaker.
This map can become very large when reducing large shard level responses.

This commit changes how the key is represented internally,
by using a BytesRef instead of a Map. This commit doesn't change
how the key is represented in the response. The reduce is also
changed to merge the shard responses without creating intermediate
data structures for detected duplicate buckets. This is possible
because the buckets in the shard level responses are sorted by tsid.

Relates to #74660
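The merge described in this commit can be sketched in a few lines. This is not the Java implementation, just the idea: assuming each shard response is a list of (tsid, doc_count) pairs sorted by tsid, duplicates can be collapsed during a streaming merge, with no intermediate map:

```python
import heapq


def reduce_sorted_buckets(shard_responses):
    """Merge shard-level bucket lists, each sorted by tsid, summing doc
    counts for duplicate tsids without building an intermediate map."""
    merged = []
    # heapq.merge streams the k sorted lists in globally sorted order,
    # so equal tsids arrive adjacently and can be combined on the fly.
    for tsid, doc_count in heapq.merge(*shard_responses):
        if merged and merged[-1][0] == tsid:
            merged[-1] = (tsid, merged[-1][1] + doc_count)
        else:
            merged.append((tsid, doc_count))
    return merged
```

Because duplicates are detected by adjacency rather than by lookup, peak memory is bounded by the output size instead of by a map over all distinct tsids.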
juliaElastic (Contributor) commented Nov 15, 2022

@martijnvg So I managed to add the "index.mode=time_series" setting without routing_path to the metrics-system.cpu index template without an issue; however, I am running into an error when trying to modify the component template metrics-system.cpu@custom, which is the parent of the index template.

Is there any workaround for this issue?

{
  "name": "ResponseError",
  "meta": {
    "body": {
      "error": {
        "root_cause": [
          {
            "type": "illegal_argument_exception",
            "reason": "updating component template [metrics-system.cpu@custom] results in invalid composable template [metrics-system.cpu] after templates are merged"
          }
        ],
        "type": "illegal_argument_exception",
        "reason": "updating component template [metrics-system.cpu@custom] results in invalid composable template [metrics-system.cpu] after templates are merged",
        "caused_by": {
          "type": "illegal_argument_exception",
          "reason": "[index.mode=time_series] requires a non-empty [index.routing_path]"
        }
      },
      "status": 400
    },
    "statusCode": 400,
    "headers": {
      "x-opaque-id": "59e2d33e-d6c8-4ed4-8d4a-14c412f64871;kibana::management:",
      "x-elastic-product": "Elasticsearch",
      "content-type": "application/json;charset=utf-8",
      "content-length": "549"
    },
    "meta": {
      "context": null,
      "request": {
        "params": {
          "method": "PUT",
          "path": "/_component_template/metrics-system.cpu%40custom",
          "body": "{\"template\":{\"settings\":{},\"mappings\":{\"properties\":{\"dummy\":{\"type\":\"text\"}}}},\"_meta\":{\"package\":{\"name\":\"system\"},\"managed_by\":\"fleet\",\"managed\":true}}",
          "querystring": "",
          "headers": {
            "user-agent": "Kibana/8.6.0",
            "x-elastic-product-origin": "kibana",
            "authorization": "Basic ZWxhc3RpYzpjaGFuZ2VtZQ==",
            "x-opaque-id": "59e2d33e-d6c8-4ed4-8d4a-14c412f64871;kibana::management:",
            "x-elastic-client-meta": "es=8.4.0p,js=16.18.1,t=8.2.0,hc=16.18.1",
            "content-type": "application/vnd.elasticsearch+json; compatible-with=8",
            "accept": "application/vnd.elasticsearch+json; compatible-with=8",
            "content-length": "154"
          }
        },
        "options": {
          "opaqueId": "59e2d33e-d6c8-4ed4-8d4a-14c412f64871;kibana::management:",
          "headers": {
            "x-elastic-product-origin": "kibana",
            "user-agent": "Kibana/8.6.0",
            "authorization": "Basic ZWxhc3RpYzpjaGFuZ2VtZQ==",
            "x-opaque-id": "59e2d33e-d6c8-4ed4-8d4a-14c412f64871",
            "x-elastic-client-meta": "es=8.4.0p,js=16.18.1,t=8.2.0,hc=16.18.1"
          }
        },
        "id": 2
      },
      "name": "elasticsearch-js",
      "connection": {
        "url": "http://localhost:9200/",
        "id": "http://localhost:9200/",
        "headers": {},
        "status": "alive"
      },
      "attempts": 0,
      "aborted": false
    },
    "warnings": null
  }
}

(screenshots)

martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Nov 16, 2022
Typically the time_series aggregation is wrapped by a date histogram aggregation.
This commit explores ideas around making things more efficient for the time series agg when this is the case.

This commit explores two main ideas:
* With the time series index searcher, docs are emitted in tsid and timestamp order. Because of this, within the docs of a tsid, the date histogram buckets are also emitted in order to sub aggs. This allows the time series aggregator to only keep track of the bucket belonging to the current tsid and bucket ordinal. This removes the need for using BytesKeyedBucketOrds, which in production is very heavy, given the fact that the tsid is a high cardinality field. For each tsid and bucket ordinal combination we keep track of the doc count and delegate to the sub agg. When the tsid / bucket ordinal combination changes, the time series agg creates a new bucket on the fly. Sub aggs of the time series agg only ever contain buckets for a single parent bucket ordinal, which allows always using a bucket ordinal of value 0. After each bucket has been created, the sub agg is cleared.
* If the buckets that the date histogram creates are contained within the index boundaries of the backing index that the searched shard belongs to, then reduction/pipeline aggregation can happen locally, on the fly, as the time series buckets are created. In order to support this, a TimestampBoundsAware interface was added, which can tell a sub agg of a date histogram whether the bounds of the parent bucket are within the bounds of the backing index. In this experiment the terms aggregator was hard coded to use the min bucket pipeline agg, which gets fed a time series bucket (with sub agg buckets) each time the tsid / bucket ordinal combo changes. If buckets are outside the backing index boundary, then buckets are kept around and the pipeline agg is executed in the reduce method of the InternalTimeSeries response class. This fundamentally changes the time series agg, since the response depends on the pipeline agg used.

The `TimeSeriesAggregator3` contains both of these changes.

Extra notes:
* Date histogram could use `AggregationExecutionContext#getTimestamp()` as the source for rounding values into buckets.
* I think there is no need for doc count if pipeline aggs reduce the buckets created by the time series agg on the fly.
* The date agg's filter-by-filter optimization has been disabled when the agg requires in-order execution. The time series index searcher doesn't work with the filter-by-filter optimization.

Relates to elastic#74660
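The first idea above (emit a bucket whenever the tsid / bucket ordinal combination changes, instead of keeping a BytesKeyedBucketOrds-style map) can be sketched as follows. This is an illustration only, not the Java aggregator; docs are assumed to be (tsid, timestamp_seconds) pairs already sorted by tsid and timestamp, and an hourly date-histogram bucket is assumed:

```python
def time_series_buckets(docs):
    """Stream docs sorted by (tsid, timestamp) and emit one
    (tsid, bucket_key, doc_count) triple per bucket, tracking only
    the current bucket instead of a map over all combinations."""
    buckets = []
    current = None  # the (tsid, bucket_key) being accumulated
    count = 0
    for tsid, timestamp in docs:
        key = (tsid, timestamp // 3600)  # hourly date-histogram bucket
        if key != current:
            if current is not None:
                # Combination changed: emit the finished bucket on the fly.
                buckets.append((current[0], current[1], count))
            current, count = key, 0
        count += 1
    if current is not None:
        buckets.append((current[0], current[1], count))
    return buckets
```

Because the input ordering guarantees each (tsid, bucket) combination arrives contiguously, a single running counter replaces the heavy per-tsid bookkeeping.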
martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Nov 22, 2022
…t ordinal and buck ordinal.

This avoids needlessly adding the same parent bucket ordinal or TSIDs to `BytesKeyedBucketOrds`.

Relates to elastic#74660
martijnvg added a commit to martijnvg/elasticsearch that referenced this issue Nov 23, 2022
…that docids are emitted in tsid and parent bucket ordinal.

This is true when the parent aggregation is a date histogram (which is typical),
due to the fact that TimeSeriesIndexSearcher emits docs in tsid and timestamp order.

Relates to elastic#74660
martijnvg added a commit that referenced this issue Nov 30, 2022
…t ordinal and buck ordinal (#91784)

This avoids needlessly adding the same parent bucket ordinal or TSIDs to `BytesKeyedBucketOrds`.

Relates to #74660
@martijnvg martijnvg mentioned this issue Aug 25, 2023
14 tasks
martijnvg (Member) commented:

Initial TSDB support was added a while ago. I moved the leftover tasks to #98877
