Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make a separate dashboard for Tarantool Data Grid metrics #134

Closed
opomuc opened this issue Mar 30, 2022 · 7 comments · Fixed by #148
Closed

Make a separate dashboard for Tarantool Data Grid metrics #134

opomuc opened this issue Mar 30, 2022 · 7 comments · Fixed by #148
Assignees
Labels
customer Customer request dashboard Related to dashboard template part of repo enhancement New feature or request

Comments

@opomuc
Copy link
Contributor

opomuc commented Mar 30, 2022

I have some panels for TDG, but they might be outdated:

"GraphQL -- query rps",

"expr": "rate(tdg_graphql_query_time_count{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval])",
"expr": "rate(tdg_graphql_query_time_sum{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval])/rate(tdg_graphql_query_time_count{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval])",
"title": "GraphQL -- avg. query time",
"expr": "histogram_quantile(0.95, sum(rate(tdg_graphql_query_time_bucket{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval])) by (le))",
"title": "GraphQL -- 95th percentile",
"expr": "sum by (method, type) (rate(tdg_rest_result_200{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval]))",
"title": "Success requests (code 2xx)",
"expr": "sum by (method, type) (rate(tdg_rest_exec_time_count{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval]) - rate(tdg_rest_result_200{job=~\"$job\", alias=~\"$instance\"}[$__rate_interval]))",
"title": "Error requests (code 4xx)",

Queries should be validated against current TDG master.

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

{
          "datasource": "Prometheus",
          "fieldConfig": {
            "defaults": {
              "color": {
                "mode": "palette-classic"
              },
              "custom": {
                "axisLabel": "",
                "axisPlacement": "auto",
                "barAlignment": 0,
                "drawStyle": "line",
                "fillOpacity": 0,
                "gradientMode": "none",
                "hideFrom": {
                  "legend": false,
                  "tooltip": false,
                  "viz": false
                },
                "lineInterpolation": "linear",
                "lineWidth": 1,
                "pointSize": 5,
                "scaleDistribution": {
                  "type": "linear"
                },
                "showPoints": "auto",
                "spanNulls": false,
                "stacking": {
                  "group": "A",
                  "mode": "none"
                },
                "thresholdsStyle": {
                  "mode": "off"
                }
              },
              "mappings": [],
              "thresholds": {
                "mode": "absolute",
                "steps": [
                  {
                    "color": "green",
                    "value": null
                  },
                  {
                    "color": "red",
                    "value": 80
                  }
                ]
              }
            },
            "overrides": []
          },
          "gridPos": {
            "h": 8,
            "w": 12,
            "x": 0,
            "y": 1
          },
          "id": 72,
          "options": {
            "legend": {
              "calcs": [],
              "displayMode": "table",
              "placement": "right"
            },
            "tooltip": {
              "mode": "single"
            }
          },
          "targets": [
            {
              "exemplar": true,
              "expr": "rate(tdg_graphql_query_time_count{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval])",
              "interval": "",
              "legendFormat": "{{alias}} - {{operation_name}}",
              "queryType": "randomWalk",
              "refId": "A"
            }
          ],
          "title": "GraphQL -- query rps",
          "type": "timeseries"
        },
        

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

{
          "datasource": "Prometheus",
          "fieldConfig": {
            "defaults": {
              "color": {
                "mode": "palette-classic"
              },
              "custom": {
                "axisLabel": "",
                "axisPlacement": "auto",
                "barAlignment": 0,
                "drawStyle": "line",
                "fillOpacity": 0,
                "gradientMode": "none",
                "hideFrom": {
                  "legend": false,
                  "tooltip": false,
                  "viz": false
                },
                "lineInterpolation": "linear",
                "lineWidth": 1,
                "pointSize": 5,
                "scaleDistribution": {
                  "type": "linear"
                },
                "showPoints": "auto",
                "spanNulls": false,
                "stacking": {
                  "group": "A",
                  "mode": "none"
                },
                "thresholdsStyle": {
                  "mode": "off"
                }
              },
              "mappings": [],
              "thresholds": {
                "mode": "absolute",
                "steps": [
                  {
                    "color": "green",
                    "value": null
                  },
                  {
                    "color": "red",
                    "value": 80
                  }
                ]
              },
              "unit": "ms"
            },
            "overrides": []
          },
          "gridPos": {
            "h": 8,
            "w": 12,
            "x": 12,
            "y": 1
          },
          "id": 74,
          "options": {
            "legend": {
              "calcs": [],
              "displayMode": "table",
              "placement": "right"
            },
            "tooltip": {
              "mode": "single"
            }
          },
          "targets": [
            {
              "exemplar": true,
              "expr": "rate(tdg_graphql_query_time_sum{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval])/rate(tdg_graphql_query_time_count{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval])",
              "interval": "",
              "legendFormat": "{{alias}} - {{operation_name}}",
              "queryType": "randomWalk",
              "refId": "A"
            }
          ],
          "title": "GraphQL -- avg. query time",
          "type": "timeseries"
        },
        

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

{
          "datasource": "Prometheus",
          "fieldConfig": {
            "defaults": {
              "color": {
                "mode": "palette-classic"
              },
              "custom": {
                "axisLabel": "",
                "axisPlacement": "auto",
                "barAlignment": 0,
                "drawStyle": "line",
                "fillOpacity": 0,
                "gradientMode": "none",
                "hideFrom": {
                  "legend": false,
                  "tooltip": false,
                  "viz": false
                },
                "lineInterpolation": "linear",
                "lineWidth": 1,
                "pointSize": 5,
                "scaleDistribution": {
                  "type": "linear"
                },
                "showPoints": "auto",
                "spanNulls": false,
                "stacking": {
                  "group": "A",
                  "mode": "none"
                },
                "thresholdsStyle": {
                  "mode": "off"
                }
              },
              "mappings": [],
              "thresholds": {
                "mode": "absolute",
                "steps": [
                  {
                    "color": "green",
                    "value": null
                  },
                  {
                    "color": "red",
                    "value": 80
                  }
                ]
              },
              "unit": "ms"
            },
            "overrides": []
          },
          "gridPos": {
            "h": 8,
            "w": 12,
            "x": 0,
            "y": 9
          },
          "id": 76,
          "options": {
            "legend": {
              "calcs": [],
              "displayMode": "list",
              "placement": "bottom"
            },
            "tooltip": {
              "mode": "single"
            }
          },
          "targets": [
            {
              "exemplar": true,
              "expr": "histogram_quantile(0.95, sum(rate(tdg_graphql_query_time_bucket{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval])) by (le))",
              "interval": "",
              "legendFormat": "95th percentile",
              "queryType": "randomWalk",
              "refId": "A"
            }
          ],
          "title": "GraphQL -- 95th percentile",
          "type": "timeseries"
        

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

{
          "aliasColors": {},
          "bars": false,
          "dashLength": 10,
          "dashes": false,
          "datasource": "Prometheus",
          "decimals": 3,
          "description": "Requests, processed with success (code 2xx) on Tarantool's side.\nGraph shows mean count per second.\nIf `No data` displayed for Prometheus panel,\ncheck up your 'rate_time_range' variable.\n",
          "fill": 0,
          "fillGradient": 0,
          "gridPos": {
            "h": 9,
            "w": 24,
            "x": 0,
            "y": 17
          },
          "hiddenSeries": false,
          "id": 77,
          "legend": {
            "alignAsTable": true,
            "avg": true,
            "current": true,
            "max": true,
            "min": false,
            "rightSide": true,
            "show": true,
            "sideWidth": null,
            "sort": "max",
            "sortDesc": true,
            "total": false,
            "values": true
          },
          "lines": true,
          "linewidth": 1,
          "links": [],
          "nullPointMode": "null",
          "options": {
            "alertThreshold": true
          },
          "percentage": false,
          "pluginVersion": "8.0.3",
          "pointradius": 5,
          "points": false,
          "renderer": "flot",
          "seriesOverrides": [],
          "spaceLength": 10,
          "stack": false,
          "steppedLine": false,
          "targets": [
            {
              "exemplar": true,
              "expr": "sum by (method, type) (rate(tdg_rest_result_200{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval]))",
              "format": "time_series",
              "interval": "",
              "intervalFactor": 2,
              "legendFormat": " {{method}} {{type}} ",
              "refId": "A"
            }
          ],
          "thresholds": [],
          "timeFrom": null,
          "timeRegions": [],
          "timeShift": null,
          "title": "Success requests (code 2xx)",
          "tooltip": {
            "shared": true,
            "sort": 2,
            "value_type": "individual"
          },
          "type": "graph",
          "xaxis": {
            "buckets": null,
            "mode": "time",
            "name": null,
            "show": true,
            "values": []
          },
          "yaxes": [
            {
              "decimals": 0,
              "format": "none",
              "label": "requests per second",
              "logBase": 1,
              "max": null,
              "min": null,
              "show": true
            },
            {
              "decimals": 3,
              "format": "none",
              "label": null,
              "logBase": 1,
              "max": null,
              "min": null,
              "show": true
            }
          ],
          "yaxis": {
            "align": false,
            "alignLevel": null
          }
        },
        

@opomuc
Copy link
Contributor Author

opomuc commented Mar 30, 2022

{
          "aliasColors": {},
          "bars": false,
          "dashLength": 10,
          "dashes": false,
          "datasource": "Prometheus",
          "decimals": 3,
          "description": "Requests, processed with 4xx error on Tarantool's side.\nGraph shows mean count per second.\nIf `No data` displayed for Prometheus panel,\ncheck up your 'rate_time_range' variable.\n",
          "fill": 0,
          "fillGradient": 0,
          "gridPos": {
            "h": 9,
            "w": 24,
            "x": 0,
            "y": 26
          },
          "hiddenSeries": false,
          "id": 78,
          "legend": {
            "alignAsTable": true,
            "avg": true,
            "current": true,
            "max": true,
            "min": false,
            "rightSide": true,
            "show": true,
            "sideWidth": null,
            "sort": "current",
            "sortDesc": true,
            "total": false,
            "values": true
          },
          "lines": true,
          "linewidth": 1,
          "links": [],
          "nullPointMode": "null",
          "options": {
            "alertThreshold": true
          },
          "percentage": false,
          "pluginVersion": "8.0.3",
          "pointradius": 5,
          "points": false,
          "renderer": "flot",
          "seriesOverrides": [],
          "spaceLength": 10,
          "stack": false,
          "steppedLine": false,
          "targets": [
            {
              "exemplar": true,
              "expr": "sum by (method, type) (rate(tdg_rest_exec_time_count{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval]) - rate(tdg_rest_result_200{job=~\"$job\", ris_sub=~\"$zone\", alias=~\"$instance\"}[$__rate_interval]))",
              "format": "time_series",
              "interval": "",
              "intervalFactor": 2,
              "legendFormat": "{{method}} -- {{type}}",
              "refId": "A"
            }
          ],
          "thresholds": [],
          "timeFrom": null,
          "timeRegions": [],
          "timeShift": null,
          "title": "Error requests (code 4xx)",
          "tooltip": {
            "shared": true,
            "sort": 2,
            "value_type": "individual"
          },
          "type": "graph",
          "xaxis": {
            "buckets": null,
            "mode": "time",
            "name": null,
            "show": true,
            "values": []
          },
          "yaxes": [
            {
              "decimals": 0,
              "format": "none",
              "label": "requests per second",
              "logBase": 1,
              "max": null,
              "min": null,
              "show": true
            },
            {
              "decimals": 3,
              "format": "none",
              "label": null,
              "logBase": 1,
              "max": null,
              "min": null,
              "show": true
            }
          ],
          "yaxis": {
            "align": false,
            "alignLevel": null
          }
        }

@DifferentialOrange DifferentialOrange added 5sp dashboard Related to dashboard template part of repo customer Customer request enhancement New feature or request labels Mar 31, 2022
@no1seman
Copy link

Seems that this issue: https://github.com/tarantool/tdg2/issues/1623 must be taken into account

DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add docker-compose file for cluster with TDG2 application. To start it,
you'll need a pre-build TDG2 image. Add TDG configuration example
for simple Kafka activity. Add Telegraf example for TDG2 application
(for now, only Kafka labels are considered).

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add docker-compose file for cluster with TDG2 application. To start it,
you'll need a pre-build TDG2 image. Add TDG configuration example
for simple Kafka activity. Add Telegraf example for TDG2 application
(for now, only Kafka labels are considered).

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add docker-compose file for cluster with TDG2 application. To start it,
you'll need a pre-build TDG2 image. Add TDG configuration example
for simple Kafka activity. Add Telegraf example for TDG2 application
(for now, only Kafka labels are considered).

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 2, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 3, 2022
Add "TDG tuples statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_scanned_tuples_count
- tdg_scanned_tuples_sum
- tdg_returned_tuples_count
- tdg_returned_tuples_sum
- tdg_scanned_tuples_max
- tdg_returned_tuples_max

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 3, 2022
Add "TDG file connectors statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_connector_input_file_processed_count
- tdg_connector_input_file_processed_objects_count
- tdg_connector_input_file_failed_count
- tdg_connector_input_file_size
- tdg_connector_input_file_current_bytes_processed
- tdg_connector_input_file_current_processed_objects

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add docker-compose file for cluster with TDG2 application. To start it,
you'll need a pre-build TDG2 image. Add TDG configuration example
for simple Kafka activity. Add Telegraf example for TDG2 application
(for now, only Kafka labels are considered).

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG tuples statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_scanned_tuples_count
- tdg_scanned_tuples_sum
- tdg_returned_tuples_count
- tdg_returned_tuples_sum
- tdg_scanned_tuples_max
- tdg_returned_tuples_max

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG file connectors statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_connector_input_file_processed_count
- tdg_connector_input_file_processed_objects_count
- tdg_connector_input_file_failed_count
- tdg_connector_input_file_size
- tdg_connector_input_file_current_bytes_processed
- tdg_connector_input_file_current_processed_objects

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_graphql_query_time_sum
- tdg_graphql_query_time_count
- tdg_graphql_query_fail
- tdg_graphql_mutation_time_sum
- tdg_graphql_mutation_time_count
- tdg_graphql_mutation_fail

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_iproto_data_query_exec_time_count
- tdg_iproto_data_query_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG REST API requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_rest_exec_time_count
- tdg_rest_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 6, 2022
Add "TDG tasks statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_jobs_started
- tdg_jobs_failed
- tdg_jobs_succeeded
- tdg_jobs_running
- tdg_jobs_execution_time_sum
- tdg_jobs_execution_time_count
- tdg_tasks_started
- tdg_tasks_failed
- tdg_tasks_succeeded
- tdg_tasks_stopped
- tdg_tasks_running
- tdg_tasks_execution_time_sum
- tdg_tasks_execution_time_count
- tdg_system_tasks_started
- tdg_system_tasks_failed
- tdg_system_tasks_succeeded
- tdg_system_tasks_running
- tdg_system_tasks_execution_time_sum
- tdg_system_tasks_execution_time_count

Closes #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG tuples statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_scanned_tuples_count
- tdg_scanned_tuples_sum
- tdg_returned_tuples_count
- tdg_returned_tuples_sum
- tdg_scanned_tuples_max
- tdg_returned_tuples_max

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG file connectors statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_connector_input_file_processed_count
- tdg_connector_input_file_processed_objects_count
- tdg_connector_input_file_failed_count
- tdg_connector_input_file_size
- tdg_connector_input_file_current_bytes_processed
- tdg_connector_input_file_current_processed_objects

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_graphql_query_time_sum
- tdg_graphql_query_time_count
- tdg_graphql_query_fail
- tdg_graphql_mutation_time_sum
- tdg_graphql_mutation_time_count
- tdg_graphql_mutation_fail

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_iproto_data_query_exec_time_count
- tdg_iproto_data_query_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG REST API requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_rest_exec_time_count
- tdg_rest_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 7, 2022
Add "TDG tasks statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_jobs_started
- tdg_jobs_failed
- tdg_jobs_succeeded
- tdg_jobs_running
- tdg_jobs_execution_time_sum
- tdg_jobs_execution_time_count
- tdg_tasks_started
- tdg_tasks_failed
- tdg_tasks_succeeded
- tdg_tasks_stopped
- tdg_tasks_running
- tdg_tasks_execution_time_sum
- tdg_tasks_execution_time_count
- tdg_system_tasks_started
- tdg_system_tasks_failed
- tdg_system_tasks_succeeded
- tdg_system_tasks_running
- tdg_system_tasks_execution_time_sum
- tdg_system_tasks_execution_time_count

Closes #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add docker-compose file for cluster with TDG2 application. To start it,
you'll need a pre-build TDG2 image. Add TDG configuration example
for simple Kafka activity. Add Telegraf example for TDG2 application
(for now, only Kafka labels are considered).

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add TDG2 Kafka sections:
- TDG Kafka common statistics
- TDG Kafka brokers statistics
- TDG Kafka topics statistics
- TDG Kafka consumer statistics
- TDG Kafka producer statistics

"TDG Kafka common statistics" section consists of graphs with following
metrics:
- tdg_kafka_replyq
- tdg_kafka_msg_size
- tdg_kafka_msg_cnt
- tdg_kafka_tx
- tdg_kafka_tx_bytes
- tdg_kafka_rx
- tdg_kafka_rx_bytes
- tdg_kafka_txmsgs
- tdg_kafka_txmsg_bytes
- tdg_kafka_rxmsgs
- tdg_kafka_rxmsg_bytes

"TDG Kafka brokers statistics" section consists of graphs with following
metrics:
- tdg_kafka_broker_stateage
- tdg_kafka_broker_connects
- tdg_kafka_broker_disconnects
- tdg_kafka_broker_wakeups
- tdg_kafka_broker_outbuf_cnt
- tdg_kafka_broker_outbuf_msg_cnt
- tdg_kafka_broker_waitresp_cnt
- tdg_kafka_broker_waitresp_msg_cnt
- tdg_kafka_broker_tx
- tdg_kafka_broker_txbytes
- tdg_kafka_broker_txerrs
- tdg_kafka_broker_txretries
- tdg_kafka_broker_txidle
- tdg_kafka_broker_req_timeouts
- tdg_kafka_broker_rx
- tdg_kafka_broker_rxbytes
- tdg_kafka_broker_rxerrs
- tdg_kafka_broker_rxcorriderrs
- tdg_kafka_broker_rxidle
- tdg_kafka_broker_rxpartial
- tdg_kafka_broker_req
- tdg_kafka_broker_int_latency
- tdg_kafka_broker_outbuf_latency
- tdg_kafka_broker_rtt
- tdg_kafka_broker_throttle

"TDG Kafka topics statistics" section consists of graphs with following
metrics:
- tdg_kafka_topic_age
- tdg_kafka_topic_metadata_age
- tdg_kafka_topic_batchsize
- tdg_kafka_topic_batchcnt
- tdg_kafka_topic_partitions_msgq_cnt
- tdg_kafka_topic_partitions_xmit_msgq_cnt
- tdg_kafka_topic_partitions_fetchq_cnt
- tdg_kafka_topic_partitions_msgq_bytes
- tdg_kafka_topic_partitions_xmit_msgq_bytes
- tdg_kafka_topic_partitions_fetchq_size
- tdg_kafka_partition_txmsgs
- tdg_kafka_partition_txbytes
- tdg_kafka_partition_rxmsgs
- tdg_kafka_partition_rxbytes
- tdg_kafka_partition_rx_ver_drops
- tdg_kafka_partition_msgs_inflight

"TDG Kafka consumer statistics" section consists of graphs with
following metrics:
- tdg_kafka_cgrp_stateage
- tdg_kafka_cgrp_rebalance_age
- tdg_kafka_cgrp_rebalance_cnt
- tdg_kafka_cgrp_assignment_size

"TDG Kafka producer statistics" section consists of graphs with
following metrics:
- tdg_kafka_eos_idemp_stateage
- tdg_kafka_eos_txn_stateage

See [1] for more detailed info about metrics.

Add a TDG2 dashboard template for Prometheus and InfluxDB. It consists
of all existing Tarantool/Tarantool Cartridge sections and TDG2 Kafka
sections.

1. https://github.com/edenhill/librdkafka/blob/2aadc45974d32b7c7ff91805588be3fc9951f4ad/STATISTICS.md

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add CPU thread time panels for TDG2 dashboard. Graphs are based on
tnt_cpu_thread metrics.

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG expirationd statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_expiration_checked_count
- tdg_expiration_expired_count
- tdg_expiration_restarts
- tdg_expiration_working_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG tuples statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_scanned_tuples_count
- tdg_scanned_tuples_sum
- tdg_returned_tuples_count
- tdg_returned_tuples_sum
- tdg_scanned_tuples_max
- tdg_returned_tuples_max

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG file connectors statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_connector_input_file_processed_count
- tdg_connector_input_file_processed_objects_count
- tdg_connector_input_file_failed_count
- tdg_connector_input_file_size
- tdg_connector_input_file_current_bytes_processed
- tdg_connector_input_file_current_processed_objects

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_graphql_query_time_sum
- tdg_graphql_query_time_count
- tdg_graphql_query_fail
- tdg_graphql_mutation_time_sum
- tdg_graphql_mutation_time_count
- tdg_graphql_mutation_fail

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG GraphQL requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_iproto_data_query_exec_time_count
- tdg_iproto_data_query_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG REST API requests" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_rest_exec_time_count
- tdg_rest_exec_time

Part of #134
DifferentialOrange added a commit that referenced this issue Jun 8, 2022
Add "TDG tasks statistics" section to TDG dashboard templates.
It consists of panels with following metrics:
- tdg_jobs_started
- tdg_jobs_failed
- tdg_jobs_succeeded
- tdg_jobs_running
- tdg_jobs_execution_time_sum
- tdg_jobs_execution_time_count
- tdg_tasks_started
- tdg_tasks_failed
- tdg_tasks_succeeded
- tdg_tasks_stopped
- tdg_tasks_running
- tdg_tasks_execution_time_sum
- tdg_tasks_execution_time_count
- tdg_system_tasks_started
- tdg_system_tasks_failed
- tdg_system_tasks_succeeded
- tdg_system_tasks_running
- tdg_system_tasks_execution_time_sum
- tdg_system_tasks_execution_time_count

Closes #134
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
customer Customer request dashboard Related to dashboard template part of repo enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants