Add semantic conventions for HTTP metrics #739

gfuller1 · 2020-07-24T23:46:44Z

This spec is written to describe WHAT http metric events should look like and not HOW they are generated. This spec DOES NOT describe the pipeline of how a metric event is created. This is to prevent confusion around coupling metrics and spans together in cases where there might not exist a span. By focusing on the expected outcome of metric generation and not the details of HOW they are generated, we can keep the spec simple and concise.

This spec PR is written to be a foundation for metric semantics and not the end all of this document. Thoughts on how this PR should change belong in comments below. If you have thoughts on what to add to this document I will kindly suggest that they go onto a follow up PR.

Context: This PR adds a spec to describe what HTTP metric events should look like when they are created. This will provide consistency and guidance for anyone looking for how to report HTTP metric data.

Thoughts:

All of the data needed to create the HTTP metric events can be found on an HTTP span.
Although all the data can be found on HTTP spans, the way the spec is written avoids coupling the two specs so people aren't confused if there isn't a span for them to get the data from.
The loose coupling leads to a bit of duplication in the HTTP metric and span spec, which could get out of sync down the road, but it should lead to less confusion for people interpreting the spec.

I am open to any and all feedback!

Related issues #738

…clude

… normative directives and more

… cleanup

linux-foundation-easycla · 2020-07-24T23:46:50Z

The committers are authorized under a signed CLA.

✅ grahamfuller1 (34eb0ba, 18a3e67, 028d345, 6f0e55d, d5d15cc, 09116fc, 14c0a98, b3e8bf2, 7e0c306, 4d2ec7f, b87fcb8)
✅ gfuller1 (e2ce1b3, 952d1ae)

specification/metrics/semantic_conventions/http-metrics.md

ebullient · 2020-08-04T18:24:46Z

Micrometer has a summary notion of outcome, which is a grouping/categorization of status codes. Just a snip of code because I'm lazy:

        if (status >= 100 && status < 200) {
            return INFORMATIONAL;
        } else if (status >= 200 && status < 300) {
            return SUCCESS;
        } else if (status >= 300 && status < 400) {
            return REDIRECTION;
        } else if (status >= 400 && status < 500) {
            return CLIENT_ERROR;
        } else if (status >= 500 && status < 600) {
            return SERVER_ERROR;
        }

This is really nice from a query perspective (to get all redirects, rather than 302 or 304, or pick a 400 response).

Should something like this be included?

specification/metrics/semantic_conventions/http-metrics.md

gfuller1 · 2020-08-06T00:13:11Z

Micrometer has a summary notion of outcome, which is a grouping/categorization of status codes. Just a snip of code because I'm lazy:
        if (status >= 100 && status < 200) {
            return INFORMATIONAL;
        } else if (status >= 200 && status < 300) {
            return SUCCESS;
        } else if (status >= 300 && status < 400) {
            return REDIRECTION;
        } else if (status >= 400 && status < 500) {
            return CLIENT_ERROR;
        } else if (status >= 500 && status < 600) {
            return SERVER_ERROR;
        }
This is really nice from a query perspective (to get all redirects, rather than 302 or 304, or pick a 400 response).

Should something like this be included?

This sounds very helpful and I think could stir up some good conversation. In order to prevent from scope creep on this PR I'd prefer to keep it as simple and small as possible, so I think this would be a great follow up PR for this spec!

jmacd · 2020-08-20T18:53:48Z

@tigrannajaryan and @bogdandrutu
We discussed this PR at length in the Metrics SIG call today. We feel that this should be merged as it is blocking progress.

We reached agreement that there is no intentional "subtle differences from equally named span attributes" here; the semantics should not change when an attribute is used on a span or on a metric. @grahamfuller1 will resolve any overt differences that are introduced here, aiming to get this merged quickly.

We reached agreement that there is no conceptual problem with converting a numeric value to a string value. I challenge the notion that the semantics of 404 and "404" are at all different; the semantics do not change when the data representation changes from integer to string.

The common difference between attributes on spans and labels on metrics is that we generally wish to avoid high-cardinality such as results from the use of request-size and response-size as span attributes. To address this, we recommend general guidance on avoiding labels with high cardinality, with examples given for the ones we know about, so that we don't have to update the specification for every new potentially high-cardinality HTTP semantic convention.

@justinfoote will follow this PR (which again, we hope to merge quickly) with a proposal that we refactor the OTel specification so that all semantic conventions for attributes and labels move into a general location, independent of spans/metrics/resources. Then where necessary the span and metrics specifications can refer to the general-purpose specification, but I think that should not be necessary very often.

jmacd · 2020-08-20T18:55:15Z

@ebullient re: #739 (comment)

Yes, I've also seen HTTP Status codes reduced to "2xx", "3xx", "4xx", and "5xx", but that is something that could be done in a processor and could probably be considered as a separate specification.

tigrannajaryan · 2020-08-20T18:59:26Z

@tigrannajaryan and @bogdandrutu
We discussed this PR at length in the Metrics SIG call today. We feel that this should be merged as it is blocking progress.

We reached agreement that there is no intentional "subtle differences from equally named span attributes" here; the semantics should not change when an attribute is used on a span or on a metric. @grahamfuller1 will resolve any overt differences that are introduced here, aiming to get this merged quickly.

We reached agreement that there is no conceptual problem with converting a numeric value to a string value. I challenge the notion that the semantics of 404 and "404" are at all different; the semantics do not change when the data representation changes from integer to string.

I agree. I came to the same realization here: #815 (comment)

We can explicitly legalize that using string representation in metric labels is valid and does not imply any semantic differences.

@justinfoote will follow this PR (which again, we hope to merge quickly) with a proposal that we refactor the OTel specification so that all semantic conventions for attributes and labels move into a general location, independent of spans/metrics/resources. Then where necessary the span and metrics specifications can refer to the general-purpose specification, but I think that should not be necessary very often.

Sounds good.

specification/metrics/semantic_conventions/http-metrics.md

tigrannajaryan · 2020-08-20T19:15:16Z

specification/metrics/semantic_conventions/http-metrics.md

+| `http.host`        | `client` & `server` | see [label alternatives](#label-alternatives) | The value of the [HTTP host header][]. When the header is empty or not present, this label should be the same. |
+| `http.scheme`      | `client` & `server` | see [label alternatives](#label-alternatives) | The URI scheme identifying the used protocol: `"http"` or `"https"` |
+| `http.status_code` | `client` & `server` | Optional          | [HTTP response status code][]. E.g. `200` (integer) |
+| `http.status_text` | `client` & `server` | Optional          | [HTTP reason phrase][]. E.g. `"OK"` |


https://tools.ietf.org/html/rfc7231#section-6.1 says that

reason phrases listed here are only recommendations -- they can be replaced by local equivalents

Not that I have seen this actually happening but it is a possibility. If source start sending different reason phrase for the same status code that would arguably make the status text less useful since aggregations would be difficult/impossible to do.

Since we already have http.status_code perhaps just drop this label?

tigrannajaryan · 2020-08-20T19:16:38Z

specification/metrics/semantic_conventions/http-metrics.md

+| `http.scheme`      | `client` & `server` | see [label alternatives](#label-alternatives) | The URI scheme identifying the used protocol: `"http"` or `"https"` |
+| `http.status_code` | `client` & `server` | Optional          | [HTTP response status code][]. E.g. `200` (String) |
+| `http.status_text` | `client` & `server` | Optional          | [HTTP reason phrase][]. E.g. `"OK"` |
+| `http.flavor`      | `client` & `server` | Optional          | Kind of HTTP protocol used: `"1.0"`, `"1.1"`, `"2"`, `"SPDY"` or `"QUIC"`. |


Is "flavor" the right term or "version"?

Also, I am not sure "SPDY" or "QUIC" belong to the same set.

Issue with status_code is resolved.

tigrannajaryan

LGTM, especially given that there is an intent to follow up with an improving PR.

gfuller1 · 2020-08-21T20:53:14Z

@tigrannajaryan since http.status_code changing seems like a very rare occurrence I'll address that in the follow up PR. flavor is what's used for the span event attribute. If we want to change the name of the attribute and this label that sounds like a great candidate for a small separate PR as to not add more conversation to this already long PR.

gfuller1 · 2020-08-24T16:27:30Z

@bogdandrutu see #739 (comment). Would love to hear back on this today.

bogdandrutu

I am happy to merge this as long as we create an issue to followup for the default list of labels that will be used, and how this impacts the cardinality of the metric that we produce for http.

gfuller1 · 2020-08-28T16:41:57Z

Here is the issue with the followup work once the common attribute/label list is created #897

lubingfeng · 2020-08-31T14:02:01Z

@bogdandrutu @jmacd Just want to check if we have finalized stats specifications. If not, when.

We are trying to complete statsreceiver (https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/master/receiver/statsdreceiver) - by middle September.

lubingfeng · 2020-09-02T22:53:35Z

@bogdandrutu @jmacd Just want to check if we have finalized stats specifications. If not, when.

We are trying to complete statsreceiver (https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/master/receiver/statsdreceiver) - by middle September.

@bogdandrutu @jmacd any comments on this?

jmacd · 2020-09-09T07:10:12Z

@lubingfeng we have released OTLP v0.5 that has support for delta aggregation temporality last week, and support is expected in this week's Collector release. Sorry for the delays! Let me know how else I can assist.

lubingfeng · 2020-09-09T13:52:49Z

Thank you for the info, @jmacd. Good to hear OTLP v0.5 had delta aggregation included and support in this week's Collector release. Do we know when OTLP for metrics can be finalized? Our implementation of statsd receiver
(https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/master/receiver/statsdreceiver) is based on open census based on Nick's discussion with you. We want to get it based on OTLP by the end of the month. Just want to check if its feasibility.

jmacd · 2020-09-09T15:04:47Z

You should be fully unblocked, now. Please start a new thread or PR and tag me, we’ll keep at it.

…

On Wed, Sep 9, 2020 at 6:53 AM Bingfeng ***@***.***> wrote: Thank you for the info, @jmacd <https://github.com/jmacd>. Good to hear OTLP v0.5 had delta aggregation included and support in this week's Collector release. Do we know when OTLP for metrics can be finalized? Our implementation of statsd receiver ( https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/master/receiver/statsdreceiver) is based on open census based on Nick's discussion with you. We want to get it based on OTLP by the end of the month. Just want to check if its feasibility. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#739 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AA3WFCKXPLIJ7LZGPHUBSTDSE6CEJANCNFSM4PHFAAJA> .

* add http metric label spec * typo * update attribute locations and clarify which to inlcude, alter and exclude * add metric instruments list * simplify intro sections, add requirement columns, clarify labels, fix normative directives and more * fix HTTP strings * add count metric instrument and shorten duration metric name * remove dependency on http.md, add more notes and examples and general cleanup * replace span.kind with type * add missing labels and cleanup links * substitution->alternatives and remove section not needed * make request count metric instrument plural * formatting * clarify type column * update intro and fix units * breakout metric instrument table * remove http.route since it is the same as http.target after http.target is simplified * update net labels to link to definition * add lowercase requirement to http.scheme * formatting Co-authored-by: gfuller <gfuller@newrelic.com> Co-authored-by: Bogdan Drutu <bogdandrutu@gmail.com>

grahamfuller1 added 11 commits July 16, 2020 11:09

add http metric label spec

34eb0ba

typo

18a3e67

update attribute locations and clarify which to inlcude, alter and ex…

028d345

…clude

add metric instruments list

6f0e55d

simplify intro sections, add requirement columns, clarify labels, fix…

d5d15cc

… normative directives and more

fix HTTP strings

09116fc

add count metric instrument and shorten duration metric name

14c0a98

remove dependency on http.md, add more notes and examples and general…

b3e8bf2

… cleanup

replace span.kind with type

7e0c306

add missing labels and cleanup links

4d2ec7f

substitution->alternatives and remove section not needed

b87fcb8

gfuller1 requested review from a team July 24, 2020 23:46

MrAlias mentioned this pull request Jul 27, 2020

Merge HTTP instrumentation open-telemetry/opentelemetry-go#974

Closed

5 tasks

gfuller1 changed the title ~~Http metrics spec~~ Add semantic conventions for Http metrics Jul 27, 2020

gfuller1 changed the title ~~Add semantic conventions for Http metrics~~ Add semantic conventions for HTTP metrics Jul 27, 2020

Oberon00 reviewed Jul 28, 2020

View reviewed changes

specification/metrics/semantic_conventions/http-metrics.md Show resolved Hide resolved

cijothomas reviewed Jul 28, 2020

View reviewed changes

specification/metrics/semantic_conventions/http-metrics.md Outdated Show resolved Hide resolved

gfuller1 added 2 commits July 30, 2020 15:09

make request count metric instrument plural

e2ce1b3

formatting

952d1ae

jkwatson reviewed Aug 4, 2020

View reviewed changes

specification/metrics/semantic_conventions/http-metrics.md Outdated Show resolved Hide resolved

justinfoote reviewed Aug 5, 2020

View reviewed changes

specification/metrics/semantic_conventions/http-metrics.md Outdated Show resolved Hide resolved

justinfoote reviewed Aug 5, 2020

View reviewed changes

specification/metrics/semantic_conventions/http-metrics.md Outdated Show resolved Hide resolved

gfuller1 added 2 commits August 5, 2020 16:35

clarify type column

c27d66d

update intro and fix units

7b59ac0

breakout metric instrument table

1ba7a15

justinfoote mentioned this pull request Aug 6, 2020

Add metrics semantic conventions for timed operations #657

Closed

MrAlias mentioned this pull request Aug 20, 2020

How to get metrics about gin/mux/kafka open-telemetry/opentelemetry-go-contrib#252

Closed

jmacd mentioned this pull request Aug 20, 2020

Proposal: Span Stats processor open-telemetry/opentelemetry-collector-contrib#403

Closed

update net labels to link to definition

7de9d45

tigrannajaryan reviewed Aug 20, 2020

View reviewed changes

tigrannajaryan approved these changes Aug 20, 2020

View reviewed changes

add lowercase requirement to http.scheme

d420059

justinfoote mentioned this pull request Aug 21, 2020

Proposal: Dictionary of common Attribute/Label definitions #855

Closed

gfuller1 mentioned this pull request Aug 26, 2020

Add support for HTTP metric events open-telemetry/opentelemetry-java-instrumentation#1109

Closed

bogdandrutu approved these changes Aug 27, 2020

View reviewed changes

gfuller1 mentioned this pull request Aug 28, 2020

Update http metric spec #897

Open

bogdandrutu and others added 3 commits August 28, 2020 10:14

Merge branch 'master' into http-metrics-spec

655a490

formatting

d607daf

Merge branch 'master' into http-metrics-spec

da2a2de

bogdandrutu merged commit a6db328 into open-telemetry:master Aug 30, 2020

gfuller1 deleted the http-metrics-spec branch August 31, 2020 01:30

tigrannajaryan mentioned this pull request Sep 8, 2020

No spec for HTTP specific metrics #738

Closed

ebullient mentioned this pull request Sep 24, 2020

Add outcome label for http conventions for metrics (only) #1000

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add semantic conventions for HTTP metrics #739

Add semantic conventions for HTTP metrics #739

gfuller1 commented Jul 24, 2020 •

edited

Loading

linux-foundation-easycla bot commented Jul 24, 2020 •

edited

Loading

ebullient commented Aug 4, 2020

gfuller1 commented Aug 6, 2020

jmacd commented Aug 20, 2020

jmacd commented Aug 20, 2020

tigrannajaryan commented Aug 20, 2020

tigrannajaryan Aug 20, 2020

tigrannajaryan Aug 20, 2020

tigrannajaryan left a comment

gfuller1 commented Aug 21, 2020

gfuller1 commented Aug 24, 2020

bogdandrutu left a comment

gfuller1 commented Aug 28, 2020

lubingfeng commented Aug 31, 2020

lubingfeng commented Sep 2, 2020

jmacd commented Sep 9, 2020

lubingfeng commented Sep 9, 2020

jmacd commented Sep 9, 2020 via email

Add semantic conventions for HTTP metrics #739

Add semantic conventions for HTTP metrics #739

Conversation

gfuller1 commented Jul 24, 2020 • edited Loading

linux-foundation-easycla bot commented Jul 24, 2020 • edited Loading

ebullient commented Aug 4, 2020

gfuller1 commented Aug 6, 2020

jmacd commented Aug 20, 2020

jmacd commented Aug 20, 2020

tigrannajaryan commented Aug 20, 2020

tigrannajaryan Aug 20, 2020

Choose a reason for hiding this comment

tigrannajaryan Aug 20, 2020

Choose a reason for hiding this comment

tigrannajaryan left a comment

Choose a reason for hiding this comment

gfuller1 commented Aug 21, 2020

gfuller1 commented Aug 24, 2020

bogdandrutu left a comment

Choose a reason for hiding this comment

gfuller1 commented Aug 28, 2020

lubingfeng commented Aug 31, 2020

lubingfeng commented Sep 2, 2020

jmacd commented Sep 9, 2020

lubingfeng commented Sep 9, 2020

jmacd commented Sep 9, 2020 via email

gfuller1 commented Jul 24, 2020 •

edited

Loading

linux-foundation-easycla bot commented Jul 24, 2020 •

edited

Loading