Series API impl + index support

## What

This issue describes the current problems facing an efficient implementation of the Prometheus Series API and a possible path towards mitigating them.

/cc @gouthamve @cyriltovena 

## Background

Currently Prometheus supports a [Series API](https://prometheus.io/docs/prometheus/latest/querying/api/#finding-series-by-label-matchers) which basically has the type `Matchers -> [LabelsSet]`.

Cortex has a special case for this -- [it only queries ingesters](https://github.com/cortexproject/cortex/blob/master/pkg/querier/querier.go#L207-L214) because the `start/end` times are not included in the `Querier.Select` call. This behavior makes sense; it's infeasible to query _all_ series across _all_ time ranges in Cortex.

## Possible implementation path

1) The `Queryable` interface already has support for bounding the time range. We should be able to use this.
2) We can then ignore the fact that `*SelectParams` are not passed to the `Querier.Select` call as these boundaries are already encoded via the `Queryable.Querier` invocation.

This approach also raises some new problems: Internally, we resolve a `SeriesSet` by resolving _all_ chunks for those matchers/time range. This makes sense when we're looking for the timeseries data _inside_ chunks, but is wasteful if we're only concerned with the _series_ themselves.

Starting in the [v9 schema](https://github.com/cortexproject/cortex/blob/master/pkg/chunk/schema.go#L693), `SeriesID`s are encoded in the index. There are even unexported functions [lookupSeriesByMetricNameMatchers](https://github.com/cortexproject/cortex/blob/master/pkg/chunk/series_store.go#L257) and [lookupChunksBySeries](https://github.com/cortexproject/cortex/blob/master/pkg/chunk/series_store.go#L384) to support this lookup. Therefore, we can extend the [`chunk.Store`](https://github.com/cortexproject/cortex/blob/master/pkg/chunk/composite_store.go#L21) interface by exporting these. Then, we'd only need to pull _one_ chunk per series instead of every chunk in the time range in order to check the labels.

The combination of time bounding `/series` lookups and only needing to pull _one_ chunk per series to check labels should make Series API parity attainable.

## Context

Most of this comes from (naively) implementing the Series API in Loki: https://github.com/grafana/loki/pull/1419 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Series API impl + index support #2313

What

Background

Possible implementation path

Context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Series API impl + index support #2313

Description

What

Background

Possible implementation path

Context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions