Metrics request bytes #4982

NyaliaLui · 2022-06-01T05:16:27Z

Cover letter coming soon.

Add a topic-level metric, intended for the new endpoint, that measures the number of bytes in client payloads by request type.

VladLazar · 2022-06-01T09:15:55Z

src/v/kafka/topic_probe.h

+            return;
+        }
+
+        std::vector<sm::label_instance> labels{sm::label("request")("produce|consume")};


When I read the metric specification (see below) I interpreted it to mean that the request label should take either the produce or consume value depending on the request being processed. To achieve this you'd need to have two metrics in the add_group call: one with the consume label and the other with the produce label.

# HELP redpanda_kafka_request_bytes_total Number of bytes in client payloads by request type
# TYPE redpanda_kafka_request_bytes_total counter
redpanda_kafka_request_bytes_total{request="produce|consume",namespace,topic}
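To make the suggestion concrete, here is a minimal sketch of the "one series per label value" idea. It deliberately does not use the Seastar metrics API: `request_bytes_probe`, `request_type`, `add_bytes` and `series` are hypothetical stand-ins, and the rendered series string is only an illustration of what the spec seems to ask for. In the real probe this would presumably map to two counters registered in the same add_group call, one labelled `produce` and one labelled `consume`.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>
#include <utility>

// Hypothetical stand-in for a topic probe. It keeps one counter per value
// of the "request" label instead of a single counter labelled
// "produce|consume". This is NOT the Seastar metrics API.
class request_bytes_probe {
public:
    enum class request_type { produce, consume };

    request_bytes_probe(std::string ns, std::string topic)
      : _ns(std::move(ns))
      , _topic(std::move(topic)) {}

    // Account payload bytes against the counter matching the request type.
    void add_bytes(request_type t, uint64_t bytes) {
        (t == request_type::produce ? _produce_bytes : _consume_bytes) += bytes;
    }

    // Return the two series that a scrape would expose, keyed by the full
    // series name (metric name plus labels).
    std::map<std::string, uint64_t> series() const {
        return {
          {series_name("produce"), _produce_bytes},
          {series_name("consume"), _consume_bytes},
        };
    }

private:
    std::string series_name(const std::string& request) const {
        return "redpanda_kafka_request_bytes_total{request=\"" + request
               + "\",namespace=\"" + _ns + "\",topic=\"" + _topic + "\"}";
    }

    std::string _ns;
    std::string _topic;
    uint64_t _produce_bytes = 0;
    uint64_t _consume_bytes = 0;
};
```

With this shape, the produce handler would call `add_bytes(request_type::produce, n)` and the fetch handler `add_bytes(request_type::consume, n)`, so each request type gets its own series under the same metric name.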

VladLazar · 2022-06-01T09:33:13Z

src/v/kafka/server/request_context.h

@@ -192,6 +196,8 @@ class request_context {
    request_header _header;
    request_reader _reader;
    ss::lowres_clock::duration _throttle_delay;
+
+    kafka::topic_probe _topic_probe;


I think the lifetime of a request context object is roughly the same as the lifetime of a request in the system, which means it gets destroyed when request processing completes. The probe owns an ss::metrics::metric_groups, whose destructor removes all the registered metrics, so they're lost after each request.

The probe should probably be stashed somewhere else (to match the lifetime of the server maybe).
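The lifetime problem can be illustrated with a small self-contained sketch. Everything here is a hypothetical stand-in, not Redpanda or Seastar code: `registry` plays the role of the metrics registry, `scoped_probe` models a probe whose destructor unregisters its metrics, and `server` models a longer-lived owner.

```cpp
#include <cassert>
#include <cstdint>
#include <functional>
#include <map>
#include <string>
#include <utility>

// Hypothetical metrics registry: metric name -> callback reading the value.
struct registry {
    std::map<std::string, std::function<uint64_t()>> metrics;
};

// Hypothetical probe with RAII registration: the metric exists only while
// the probe object is alive, similar in spirit to a probe that owns its
// metric groups.
class scoped_probe {
public:
    scoped_probe(registry& r, std::string name)
      : _r(r)
      , _name(std::move(name)) {
        _r.metrics[_name] = [this] { return _bytes; };
    }
    ~scoped_probe() { _r.metrics.erase(_name); }

    // The registry callback captures `this`, so forbid copies and moves.
    scoped_probe(const scoped_probe&) = delete;
    scoped_probe& operator=(const scoped_probe&) = delete;

    void add_bytes(uint64_t n) { _bytes += n; }

private:
    registry& _r;
    std::string _name;
    uint64_t _bytes = 0;
};

// Problematic shape: the probe lives only as long as one request, so the
// metric is unregistered as soon as the request finishes.
inline void handle_request_with_local_probe(registry& r, uint64_t bytes) {
    scoped_probe probe(r, "request_bytes");
    probe.add_bytes(bytes);
}

// Suggested shape: a longer-lived object owns the probe and every request
// updates the same counter.
struct server {
    explicit server(registry& r)
      : probe(r, "request_bytes") {}
    void handle_request(uint64_t bytes) { probe.add_bytes(bytes); }
    scoped_probe probe;
};
```

After `handle_request_with_local_probe` returns, the registry no longer contains `request_bytes`, whereas a `server` instance keeps accumulating bytes across requests until the server itself is destroyed.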

NyaliaLui · 2022-06-03T16:15:08Z

src/v/kafka/server/handlers/produce.cc

@@ -186,6 +186,8 @@ static partition_produce_stages partition_append(
                    p.error_code = error_code::none;
                    partition->probe().add_records_produced(num_records);
                    partition->probe().add_bytes_produced(num_bytes);
+                    partition->probe_v2().add_produce_bytes_cluster_lvl(


The requirements for this new metric combine the old metrics generated from partition_probe::add_bytes_produced() and partition_probe::add_bytes_fetched(). But those metrics were at the cluster level. I still need to figure out where to put the topic-level metrics.

NyaliaLui · 2022-06-22T21:05:15Z

Closing since this was implemented in #5165

NyaliaLui requested review from BenPope and VladLazar June 1, 2022 05:16

github-actions bot added the area/redpanda label Jun 1, 2022

VladLazar reviewed Jun 1, 2022

NyaliaLui added 2 commits June 3, 2022 12:11

cluster: add new partition probe for new metrics

21dcf87

cluster: use the new partition probe

1879fdb

NyaliaLui force-pushed the metrics-request-bytes branch from 1217149 to 1879fdb Compare June 3, 2022 16:12

NyaliaLui closed this Jun 22, 2022
