rax-maas/ceres: Simple multi-dimensional, time-series datastore backed by Cassandra with continuous downsampling

A simple multi-dimensional, time-series ingest and query service backed by Cassandra. Redis is also used to track downsample operations.

Quickstart

Start up the Cassandra and Redis containers:

docker-compose up -d

Watch the Cassandra container logs until it reports "Starting listening for CQL clients on /0.0.0.0:9042":

docker-compose logs -f cassandra

Create the keyspace:

docker-compose exec cassandra \
  cqlsh -e "CREATE KEYSPACE IF NOT EXISTS ceres WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}"

Start the ingest/query application:

mvn spring-boot:run

Write a metric

POST http://localhost:8080/api/put
Content-Type: application/json

{
  "metric": "cpu_idle",
  "tags": {
    "tenant": "t-1",
    "os": "linux",
    "host": "h-1",
    "deployment": "prod",
    "metricGroup": "Misc"
  },
  "timestamp": {{$timestamp}},
  "value": {{$randomInt}}
}

Write a bunch of metrics

POST http://localhost:8080/api/put
Content-Type: application/json

[
    {
        "metric": "cpu_idle",
        "tags": {
            "tenant": "t-1",
            "os": "linux",
            "host": "h-1",
            "deployment": "prod",
            "metricGroup": "Misc"
        },
        "timestamp": {{$timestamp}},
        "value": {{$randomInt}}
    },
    {
        "metric": "memory_free",
        "tags": {
            "tenant": "t-1",
            "os": "linux",
            "host": "h-1",
            "deployment": "prod",
            "metricGroup": "Misc"
        },
        "timestamp": {{$timestamp}},
        "value": {{$randomInt}}
    }
]

In the IntelliJ HTTP request snippets above, {{$timestamp}} is replaced with the current time in epoch seconds and {{$randomInt}} with a random integer value.
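For readers without IntelliJ, the same requests can be sketched in Python using only the standard library. This is a minimal sketch, not part of the project: the helper names `build_metric` and `build_put_request` are hypothetical, and the 0 to 1000 range for the random value is an assumption. The endpoint, headers, and body shape are taken from the examples above.

```python
import json
import random
import time
import urllib.request

# Endpoint taken from the quickstart examples above.
CERES_PUT_URL = "http://localhost:8080/api/put"

TAGS = {
    "tenant": "t-1",
    "os": "linux",
    "host": "h-1",
    "deployment": "prod",
    "metricGroup": "Misc",
}


def build_metric(name, tags, timestamp=None, value=None):
    """Build one metric body in the shape used by the /api/put examples."""
    return {
        "metric": name,
        "tags": dict(tags),
        # Current epoch seconds, standing in for {{$timestamp}}.
        "timestamp": int(time.time()) if timestamp is None else timestamp,
        # Random integer, standing in for {{$randomInt}} (range is an assumption).
        "value": random.randint(0, 1000) if value is None else value,
    }


def build_put_request(metrics):
    """Wrap a single metric (dict) or a batch (list of dicts) in a POST request."""
    return urllib.request.Request(
        CERES_PUT_URL,
        data=json.dumps(metrics).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


batch = [build_metric("cpu_idle", TAGS), build_metric("memory_free", TAGS)]
request = build_put_request(batch)
# To send it to a running instance:
#   urllib.request.urlopen(request)
```

Passing a single dictionary instead of a list produces the single-metric form of the request.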

Query metadata

Metric names, such as

GET http://localhost:8080/api/metadata/metricNames
X-Auth-Token: abc
X-Tenant: t-1

Response:

[
  "cpu_idle",
  "cpu_busy"
]

Tag keys, such as

GET http://localhost:8080/api/metadata/tagKeys?metricName=cpu_idle
X-Auth-Token: abc
X-Tenant: t-1

Response:

[
  "os",
  "deployment",
  "host",
  "metricGroup"
]

Tag values, such as

GET http://localhost:8080/api/metadata/tagValues?
  metricName=cpu_idle
  &tagKey=os
X-Auth-Token: abc
X-Tenant: t-1

Response:

[
  "linux"
]

Tags, such as

i. Using metricName

GET http://localhost:8080/api/metadata/tags?
  metricName=cpu_idle
X-Auth-Token: abc
X-Tenant: t-1

Response:

{
    "tenantId": "t-1",
    "metric": "cpu_idle",
    "tags": {
        "os": "linux",
        "host": "h-1",
        "deployment": "prod",
        "metricGroup": "Misc"
    }
}

ii. Using metricGroup

GET http://localhost:8080/api/metadata/tags?
  metricGroup=Misc
X-Auth-Token: abc
X-Tenant: t-1

Response:

{
    "tenantId": "t-1",
    "metricGroup": "Misc",
    "tags": {
        "os": "linux",
        "host": "h-1",
        "deployment": "prod",
        "metricGroup": "Misc"
    }
}
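The four metadata lookups above share the same base path and headers, so they can be built by one small helper. The following is a sketch under that assumption; `metadata_request` is a hypothetical name, and the token and tenant values are the placeholders used in the examples.

```python
import urllib.parse
import urllib.request

# Base path shared by the metadata examples above.
METADATA_URL = "http://localhost:8080/api/metadata"


def metadata_request(resource, tenant, token, **params):
    """Build a GET request for metricNames, tagKeys, tagValues, or tags."""
    url = f"{METADATA_URL}/{resource}"
    if params:
        url += "?" + urllib.parse.urlencode(params)
    return urllib.request.Request(
        url,
        headers={"X-Auth-Token": token, "X-Tenant": tenant},
        method="GET",
    )


metric_names = metadata_request("metricNames", "t-1", "abc")
tag_values = metadata_request(
    "tagValues", "t-1", "abc", metricName="cpu_idle", tagKey="os"
)
tags_by_group = metadata_request("tags", "t-1", "abc", metricGroup="Misc")
# To send one against a running instance:
#   urllib.request.urlopen(tag_values).read()
```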

Query data

Using metricName

GET http://localhost:8080/api/query?
  metricName=cpu_idle
  &tag=os=linux
  &start=2020-08-23T17:53:00Z
  &end=2020-08-23T17:54:40Z
X-Auth-Token: abc
X-Tenant: t-1

Responds with query results per series-set, such as:

[
  {
    "data": {
      "tenant": "t-1",
      "metricName": "cpu_idle",
      "tags": {
        "os": "linux",
        "host": "h-1",
        "deployment": "prod",
        "metricGroup": "Misc"
      },
      "values": {
        "2021-06-21T13:15:10.050Z": 81.0
      }
    },
    "metadata": {
      "aggregator": "raw",
      "startTime": "2021-05-13T13:49:52.270902Z",
      "endTime": "2021-06-22T13:49:52.274252Z"
    }
  }
]

Using metricGroup

GET http://localhost:8080/api/query?
  metricGroup=Misc
  &tag=os=linux
  &start=2020-08-23T17:53:00Z
  &end=2020-08-23T17:54:40Z
X-Auth-Token: abc
X-Tenant: t-1

Responds with query results per series-set, such as:

[
  {
    "data": {
      "tenant": "t-1",
      "metricName": "cpu_idle",
      "tags": {
        "os": "linux",
        "host": "h-1",
        "deployment": "prod",
        "metricGroup": "Misc"
      },
      "values": {
        "2021-06-21T13:15:10.050Z": 81.0
      }
    },
    "metadata": {
      "aggregator": "raw",
      "startTime": "2021-05-13T13:49:52.270902Z",
      "endTime": "2021-06-22T13:49:52.274252Z"
    }
  }
]
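As an illustration of the request and response shapes above, the sketch below builds a query URL with repeated `tag` parameters and flattens a response into rows. The function names are hypothetical, and requiring exactly one of `metric_name` or `metric_group` is an assumption drawn from the two examples, not a documented rule. The sample response is an abbreviated copy of the one shown above.

```python
import json
import urllib.parse
from datetime import datetime

QUERY_URL = "http://localhost:8080/api/query"


def build_query_url(start, end, tags, metric_name=None, metric_group=None):
    """Build a raw-data query URL; each tag becomes its own tag=key=value parameter."""
    if (metric_name is None) == (metric_group is None):
        # Assumption: the examples use one selector or the other, never both.
        raise ValueError("pass exactly one of metric_name or metric_group")
    if metric_name is not None:
        params = [("metricName", metric_name)]
    else:
        params = [("metricGroup", metric_group)]
    params += [("tag", f"{key}={value}") for key, value in tags.items()]
    params += [("start", start), ("end", end)]
    return QUERY_URL + "?" + urllib.parse.urlencode(params)


def flatten_results(response_text):
    """Turn the per-series-set response into (metricName, datetime, value) rows."""
    rows = []
    for series in json.loads(response_text):
        data = series["data"]
        for timestamp, value in data["values"].items():
            # Replace the trailing Z so older Python versions can parse it.
            when = datetime.fromisoformat(timestamp.replace("Z", "+00:00"))
            rows.append((data["metricName"], when, value))
    return rows


SAMPLE_RESPONSE = """
[{"data": {"tenant": "t-1", "metricName": "cpu_idle",
           "tags": {"os": "linux", "host": "h-1"},
           "values": {"2021-06-21T13:15:10.050Z": 81.0}},
  "metadata": {"aggregator": "raw"}}]
"""

url = build_query_url(
    "2020-08-23T17:53:00Z", "2020-08-23T17:54:40Z",
    {"os": "linux"}, metric_name="cpu_idle",
)
rows = flatten_results(SAMPLE_RESPONSE)
```

`urlencode` percent-encodes the `=` and `:` characters inside parameter values, which is equivalent to the unencoded form shown in the HTTP snippets once the server decodes the query string.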

Ingesting data from telegraf

The following telegraf config snippet can be used to output metrics collected by telegraf into ceres:

[[outputs.opentsdb]]
  ## prefix for metrics keys
  # prefix = "my.specific.prefix."
  host = "http://localhost"
  port = 8080
  http_batch_size = 50
  http_path = "/api/put"
  debug = false
  separator = "_"

Downsampling

Continuous downsampling is configured in the app.downsample application properties, as shown in the following example:

app:
  downsample:
    # For tracking during ingest
    partitions: 4
    time-slot-width: 2m

    # For downsample processing
    partitions-to-process: 0-3
    last-touch-delay: 1m
    downsample-process-period: 10s
    granularities:
      - width: 1m
        ttl: 12h
      - width: 2m
        ttl: 24h
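To make the values in this configuration concrete, the sketch below parses the duration and partition-range strings and shows how a timestamp could be aligned to a granularity width. Everything here is illustrative: the function names are hypothetical, only the simple `s`, `m`, `h`, `d` duration suffixes and the `first-last` range form are handled, and flooring to the start of the slot is an assumption about the alignment rule rather than a statement of how the service implements it (see DESIGN.md for the actual design).

```python
import re
from datetime import datetime, timedelta, timezone

_UNITS = {"s": "seconds", "m": "minutes", "h": "hours", "d": "days"}


def parse_duration(text):
    """Parse simple durations such as '10s', '2m', or '12h' into a timedelta."""
    match = re.fullmatch(r"(\d+)([smhd])", text)
    if match is None:
        raise ValueError(f"unsupported duration: {text!r}")
    return timedelta(**{_UNITS[match.group(2)]: int(match.group(1))})


def parse_partitions(text):
    """Expand an inclusive range such as '0-3' into [0, 1, 2, 3]."""
    first, last = (int(part) for part in text.split("-"))
    return list(range(first, last + 1))


def align_to_slot(timestamp, width):
    """Floor a timestamp to the start of its slot (assumed alignment rule)."""
    width_seconds = int(width.total_seconds())
    epoch = int(timestamp.timestamp())
    return datetime.fromtimestamp(epoch - epoch % width_seconds, tz=timezone.utc)


granularity = parse_duration("2m")
partitions = parse_partitions("0-3")
sample = datetime(2020, 9, 15, 16, 3, 25, tzinfo=timezone.utc)
slot_start = align_to_slot(sample, granularity)
```

Under these assumptions, a sample taken at 16:03:25 UTC falls into the 2-minute slot that starts at 16:02:00 UTC.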

Querying for downsampled data uses the same endpoint as raw data; the presence of the aggregator and granularity parameters indicates that downsampled data is requested. The following is an example of querying for downsampled data with 'min' aggregation at 2-minute granularity:

GET http://localhost:8080/api/query?
  metricName=cpu_idle
  &aggregator=min
  &granularity=PT2M
  &tag=os=linux
  &tag=deployment=prod
  &start=2020-09-15T16:00:00Z
  &end=2020-09-15T17:00:00Z
X-Auth-Token: abc
X-Tenant: t-1
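The granularity in the query is written as an ISO-8601 duration (PT2M), while the configuration uses the short form (2m). The sketch below converts between the two and builds the downsample query URL. The helper names are hypothetical, only `s`, `m`, and `h` widths are handled, and whether the requested granularity must match a configured width is not stated in this document.

```python
import urllib.parse

QUERY_URL = "http://localhost:8080/api/query"


def to_iso8601(width):
    """Convert a short width such as '2m' into an ISO-8601 duration such as 'PT2M'."""
    amount, unit = width[:-1], width[-1:]
    if unit not in ("s", "m", "h") or not amount.isdigit():
        raise ValueError(f"unsupported width: {width!r}")
    return f"PT{int(amount)}{unit.upper()}"


def build_downsample_url(metric_name, aggregator, width, tags, start, end):
    """Build a query URL that requests downsampled rather than raw data."""
    params = [
        ("metricName", metric_name),
        ("aggregator", aggregator),
        ("granularity", to_iso8601(width)),
    ]
    params += [("tag", f"{key}={value}") for key, value in tags.items()]
    params += [("start", start), ("end", end)]
    return QUERY_URL + "?" + urllib.parse.urlencode(params)


url = build_downsample_url(
    "cpu_idle", "min", "2m",
    {"os": "linux", "deployment": "prod"},
    "2020-09-15T16:00:00Z", "2020-09-15T17:00:00Z",
)
```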

Design

Design documentation is available in DESIGN.md.

Resetting metadata during development

During development it is sometimes necessary to wipe some or all of the metadata tables and then bring the Redis caching layer in line with the now-empty tables. In this and similar scenarios, the two datastores can be reset using the following operations.

Truncate the Cassandra tables using:

for table in metric_names series_set_hashes series_sets; do
  docker-compose exec cassandra cqlsh -e "truncate table ceres.$table"
done

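To reset the Redis keys, first list them with the command below. Note that SCAN only iterates over the keys and does not remove them, so the relevant keys still have to be deleted afterwards (for example with DEL):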

docker-compose exec redis redis-cli scan 0

Running with Skaffold / Cloud Code

The Cloud Code plugin for IntelliJ / VS Code, or Skaffold by itself, can be used to deploy a development instance of this application to a Kubernetes cluster.

With Cloud Code, create a "Cloud Code: Kubernetes" run config with at least the following settings:

Container image storage set to a repo path to which you have write access
"Run -> Deployment -> Switch context and deploy to" enabled and set to a specific kube config context
"Build / Deploy -> Skaffold configuration" set to the skaffold.yaml located in this module
