WSGI fixes #148

Oberon00 · 2019-09-18T11:38:46Z

Fix http.url (WSGI integration does nonstandard split between http.host and http.url #143)
Don't delay calling wrapped app (would be nicer if No way to use_span withouth ending it. #147 was fixed; see Implement WSGI middleware integration. #84 (comment) and following)

a-feld

@Oberon00 This looks great! I have some questions / suggestions, thanks for taking the time to improve this!

a-feld · 2019-09-18T16:10:38Z

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

+                if environ.get("SERVER_PORT", "443") != "443":
+                    host += ":" + environ["SERVER_PORT"]
+            elif environ.get("SERVER_PORT", "80") != "80":
+                host += ":" + environ["SERVER_PORT"]


Ah I didn't realize that http.host was nonstandard! Thanks for catching this!
Given that it's not standard, should we just remove this? It seems like a lot of overhead per request to compute the host string.

I don't think the calculation for host is particularly expensive and it could probably even be optimized. I'd wait until the data formats are reworked (this is something that will most likely happen).

I shortened & hopefully optimized it.

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

a-feld · 2019-09-18T16:37:30Z

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

+                    scheme = environ["wsgi.url_scheme"] + "://"
+                    if not urlparts.netloc:
+                        url = host + url
+                    url = scheme + url


Also, are these cases that we have to cover?

self.environ["RAW_URI"] = "/?" self.environ["RAW_URI"] = "http:///?" self.environ["RAW_URI"] = "http://?"

These cases seem to add a lot of complexity. I'm curious to know which WSGI servers write these RAW_URI strings?

Looking more at this, in order to be completely spec compliant it feels like calling url = wsgiref_util.request_uri(environ) on every request would probably be easiest rather than playing with RAW_URI and REQUEST_URI as an optimization. What are your thoughts on this?

On the one hand, I think using RAW_URI et al is valuable because it catches things like http://myhost/? vs http://myhost/, a difference that should not matter but could. And since one of the prime use cases of tracing is debugging, I think exactly these edge cases are interesting to handle. But on the other hand, I might have gone overboard with my handling of malformed/very unusual raw URIs. Maybe there is a middle ground to aim for.

Not sure if we can get something like http://foobar.org/abc#CrazyFragment and how do we plan to handle it.
I swear that I've seen something like this from server side (which means both client and server are super buggy).

If we get that all in RAW_URL, we'll set it to the http.url attribute as-is. If we get invalid data, we are not in the business to normalize it, instead we should make it visible.

I simplified it:

If the raw URL starts with / we assume it is relative and prepend scheme://host

Otherwise, if the url starts with "scheme:", we use it as-is

Otherwise we assume a bogus value and fall back to wsgiref.util.request_uri.

c24t · 2019-09-19T03:58:12Z

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

+            return iter_result()
+        except:  # noqa
+            span.end()
+            raise


Why raise again here instead of making this a finally block?

I guess it was because the assumption that Span.end() gets called in iter_result when there is no exception.
This piece of code is a bit risky and hard to maintain IMHO.

If we decided to go with this approach, I'll be okay if we can put a comment like this :)

# ATTENTION!!! HIGH VOLTAGE!!!

If we can determine if a span has ended, then finally: if not ended: end() would be good.

If we decided to go with this approach

I'm all for a better approach if somebody finds one that still doesn't delay the app invocation.

See also: #148 (comment)

I can't think of a way that doesn't have a slim chance of either ending the span twice or not ending it at all.

c24t

By making __call__ a generator, we only start the span once the server starts iterating.

That's very subtle, good catch @Oberon00.

This LGTM, but I agree that playing weird URI whack-a-mole may not be worth the effort.

reyang · 2019-09-19T04:16:53Z

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

+
+            return iter_result()
+        except:  # noqa
+            span.end()


This seems to be risky.
According to the API, implementation could raise exception if span.end() is called twice.

If an exception happened right before iter_result returns (but after span.end() is called), we end up calling end() twice and kill the user app due to unwanted exception.

We may want to change the API so that we don't raise on duplicate calls to end, which seems like a good idea regardless of this PR.

I've got an open PR in specs about suppressing errors in the API. The goal is to avoid exactly this kind of thing -- crashing the application because we're using the instrumentation layer wrong.

Seems there is no way to do it right with the current API then.

Probably leave a TODO comment for now.

BTW, the current SDK only logs a warning, which is fine because having end called twice here should really be an edge case.

For end in particular the spec already says that calling it multiple times is fine, see #154 (comment)

Oberon00 · 2019-09-19T14:29:38Z

~~The test failure seems to be due to #147. Since this is a real problem, a fix for that issue is required before merging this PR.~~

Fixed by #154, I rebased this PR.

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

Co-authored-by: Allan Feldman <6374032+a-feld@users.noreply.github.com>

Oberon00 · 2019-09-24T10:09:01Z

@a-feld Is there still something you want to have changed?

a-feld · 2019-09-24T15:31:41Z

Sorry for the delay @Oberon00 , I'm giving this another once over! 🙏
I'll also dismiss my review, this can be changed incrementally anyway over time if required 😄

a-feld · 2019-09-24T15:33:33Z

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py

+                and environ["wsgi.url_scheme"] == "http"
+                or port != "443"
+            ):
+                host += ":" + port


This logic appears to always attach port 80 in the presence of http. Is that intentional?

environ = {"SERVER_NAME": "example.com", "SERVER_PORT": "80", "wsgi.url_scheme": "http"} host = environ.get("HTTP_HOST") if not host: host = environ["SERVER_NAME"] port = environ["SERVER_PORT"] if port != "80" and environ["wsgi.url_scheme"] == "http" or port != "443": host += ":" + port print(host)

example.com:80

The logic is equivalent to:
port != "443" or environ["wsgi.url_scheme"] == "http" and port != "80" so all http ports will be reported?

Certainly not. I thought I had a unit test against exactly that. Will investigate.

dismiss review

a-feld

@Oberon00 Thanks for putting this together, from what I can see this looks great! I'll approve this now assuming that the http port investigation will conclude!

Oberon00 · 2019-09-25T09:03:33Z

@reyang I intented to fix the port issue before merging this, but oh well, I can also file a follow-up PR.

for start_time and end_time Make lint happy Addressing comments Addressing comments Allowing 0 as start and end time Fix lint issues Metrics API RFC 0003 cont'd (open-telemetry#136) * Create functions Comments for Meter More comments Add more comments Fix typos * fix lint * Fix lint * fix typing * Remove options, constructors, seperate labels * Consistent naming for float and int * Abstract time series * Use ABC * Fix typo * Fix docs * seperate measure classes * Add examples * fix lint * Update to RFC 0003 * Add spancontext, measurebatch * Fix docs * Fix comments * fix lint * fix lint * fix lint * skip examples * white space * fix spacing * fix imports * fix imports * LabelValues to str * Black formatting * fix isort * Remove aggregation * Fix names * Remove aggregation from docs * Fix lint * metric changes * Typing * Fix lint * Fix lint * Add space * Fix lint * fix comments * address comments * fix comments Adding a working propagator, adding to integrations and example (open-telemetry#137) Adding a full, end-to-end example of propagation at work in the example application, including a test. Adding the use of propagators into the integrations. Metrics API RFC 0009 (open-telemetry#140) * Create functions Comments for Meter More comments Add more comments Fix typos * fix lint * Fix lint * fix typing * Remove options, constructors, seperate labels * Consistent naming for float and int * Abstract time series * Use ABC * Fix typo * Fix docs * seperate measure classes * Add examples * fix lint * Update to RFC 0003 * Add spancontext, measurebatch * Fix docs * Fix comments * fix lint * fix lint * fix lint * skip examples * white space * fix spacing * fix imports * fix imports * LabelValues to str * Black formatting * fix isort * Remove aggregation * Fix names * Remove aggregation from docs * Fix lint * metric changes * Typing * Fix lint * Fix lint * Add space * Fix lint * fix comments * handle, recordbatch * docs * Update recordbatch * black * Fix typo * remove ValueType * fix lint Console exporter (open-telemetry#156) Make use_span more flexible (closes open-telemetry#147). (open-telemetry#154) Co-Authored-By: Reiley Yang <reyang@microsoft.com> Co-Authored-By: Chris Kleinknecht <libc@google.com> WSGI fixes (open-telemetry#148) Fix http.url. Don't delay calling wrapped app. Skeleton for azure monitor exporters (open-telemetry#151) Add link to docs to README (open-telemetry#170) Move example app to the examples folder (open-telemetry#172) WSGI: Fix port 80 always appended in http.host (open-telemetry#173) Build and host docs via github action (open-telemetry#167) Add missing license boilerplate to a few files (open-telemetry#176) sdk/trace/exporters: add batch span processor exporter (open-telemetry#153) The exporters specification states that two built-in span processors should be implemented, the simple processor span and the batch processor span. This commit implements the latter, it is mainly based on the opentelemetry/java one. The algorithm implements the following logic: - a condition variable is used to notify the worker thread in case the queue is half full, so that exporting can start before the queue gets full and spans are dropped. - export is called each schedule_delay_millis if there is a least one new span to export. - when the processor is shutdown all remaining spans are exported. Implementing W3C TraceContext (fixes open-telemetry#116) (open-telemetry#180) * Implementing TraceContext (fixes open-telemetry#116) This introduces a w3c TraceContext propagator, primarily inspired by opencensus. fix time conversion bug (open-telemetry#182) Introduce Context.suppress_instrumentation (open-telemetry#181) Metrics Implementation (open-telemetry#160) * Create functions Comments for Meter More comments Add more comments Fix typos * fix lint * Fix lint * fix typing * Remove options, constructors, seperate labels * Consistent naming for float and int * Abstract time series * Use ABC * Fix typo * Fix docs * seperate measure classes * Add examples * fix lint * Update to RFC 0003 * Add spancontext, measurebatch * Fix docs * Fix comments * fix lint * fix lint * fix lint * skip examples * white space * fix spacing * fix imports * fix imports * LabelValues to str * Black formatting * fix isort * Remove aggregation * Fix names * Remove aggregation from docs * Fix lint * metric changes * Typing * Fix lint * Fix lint * Add space * Fix lint * fix comments * handle, recordbatch * docs * Update recordbatch * black * Fix typo * remove ValueType * fix lint * sdk * metrics * example * counter * Tests * Address comments * ADd tests * Fix typing and examples * black * fix lint * remove override * Fix tests * mypy * fix lint * fix type * fix typing * fix tests * isort * isort * isort * isort * noop * lint * lint * fix tuple typing * fix type * black * address comments * fix type * fix lint * remove imports * default tests * fix lint * usse sequence * remove ellipses * remove ellipses * black * Fix typo * fix example * fix type * fix type * address comments Implement Azure Monitor Exporter (open-telemetry#175) Span add override parameters for start_time and end_time (open-telemetry#179) CONTRIBUTING.md: Fix clone URL (open-telemetry#177) Add B3 exporter to alpha release table (open-telemetry#164) Update README for alpha release (open-telemetry#189) Update Contributing.md doc (open-telemetry#194) Add **simple** client/server examples (open-telemetry#191) Remove unused dev-requirements.txt (open-telemetry#200) The requirements are contained in tox.ini now. Fx bug in BoundedList for Python 3.4 and add tests (open-telemetry#199) * fix bug in BoundedList for python 3.4 and add tests collections.deque.copy() was introduced in python 3.5, this commit changes that by the deque constructor and adds some tests to BoundedList and BoundedDict to avoid similar problems in the future. Also, improve docstrings of BoundedList and BoundedDict classes Move util.time_ns to API. (open-telemetry#205) Add Jaeger exporter (open-telemetry#174) This adds a Jeager exporter for OpenTelemetry. This exporter is based on https://github.com/census-instrumentation/opencensus-python/tree/master/contrib/opencensus-ext-jaeger. The exporter uses thrift and can be configured to send data to the agent and also to a remote collector. There is a long discussion going on about how to include generated files in the repo, so for now just put them here. Add code coverage Revert latest commit Fix some "errors" found by mypy. (open-telemetry#204) Fix some errors found by mypy (split from open-telemetry#201). Update README for new milestones (open-telemetry#218) Refactor current span handling for newly created spans. (open-telemetry#198) 1. Make Tracer.start_span() simply create and start the Span, without setting it as the current instance. 2. Add an extra Tracer.start_as_current_span() to create the Span and set it as the current instance automatically. Co-Authored-By: Chris Kleinknecht <libc@google.com> Add set_status to Span (open-telemetry#213) Initial commit Initial version

) * add readme for async-hooks scope manager package * add readme for base scope manager package * chore(readme): fix typos

Oberon00 requested review from a-feld, c24t, carlosalberto, lzchen, reyang and toumorokoshi as code owners September 18, 2019 11:38

Oberon00 force-pushed the wsgifixes branch 2 times, most recently from 88f5e91 to 8aee33c Compare September 18, 2019 13:24

a-feld previously requested changes Sep 18, 2019

View reviewed changes

a-feld reviewed Sep 18, 2019

View reviewed changes

c24t reviewed Sep 19, 2019

View reviewed changes

c24t approved these changes Sep 19, 2019

View reviewed changes

reyang reviewed Sep 19, 2019

View reviewed changes

Oberon00 force-pushed the wsgifixes branch from 4846503 to cbbf626 Compare September 19, 2019 12:06

Oberon00 mentioned this pull request Sep 19, 2019

Make use_span more flexible (closes #147). #154

Merged

Oberon00 force-pushed the wsgifixes branch from cbbf626 to 0f37eec Compare September 19, 2019 14:56

a-feld reviewed Sep 21, 2019

View reviewed changes

ext/opentelemetry-ext-wsgi/src/opentelemetry/ext/wsgi/__init__.py Outdated Show resolved Hide resolved

Oberon00 added 8 commits September 24, 2019 11:36

wsgi: Fix non-absolute http.url (fixes open-telemetry#143).

2bbc8be

wsgi: Don't delay calling wrapped app.

cfdcaef

Test query string handling.

48fa684

http.host & url: Fix and test edge cases.

77e641b

Simplify.

28e69b5

Optimize iter_result.

fa7aa21

Fix active span in WSGI instrumentation.

8329a2f

Lint, SERVER_PORT.

99fcb90

Oberon00 force-pushed the wsgifixes branch from 34aeb08 to 99fcb90 Compare September 24, 2019 09:39

Move iter_result to a global function.

df9ea84

Co-authored-by: Allan Feldman <6374032+a-feld@users.noreply.github.com>

Oberon00 requested a review from a-feld September 24, 2019 10:09

a-feld reviewed Sep 24, 2019

View reviewed changes

a-feld approved these changes Sep 24, 2019

View reviewed changes

reyang merged commit 7813924 into open-telemetry:master Sep 24, 2019

This was referenced Sep 25, 2019

wsgi: Fix port 80 always appended in http.host. #173

Merged

WSGI integration does nonstandard split between http.host and http.url #143

Closed

API: Span - should it provide HasEnded property open-telemetry/opentelemetry-specification#55

Closed

anna-git mentioned this pull request Jul 18, 2022

[ASM] Tags standardization, http.route, http.url, http.useragent, http.client_ip DataDog/dd-trace-dotnet#2915

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WSGI fixes #148

WSGI fixes #148

Oberon00 commented Sep 18, 2019 •

edited

Loading

a-feld left a comment

a-feld Sep 18, 2019

Oberon00 Sep 18, 2019

Oberon00 Sep 19, 2019

a-feld Sep 18, 2019

a-feld Sep 18, 2019

Oberon00 Sep 18, 2019 •

edited

Loading

reyang Sep 19, 2019

Oberon00 Sep 19, 2019 •

edited

Loading

Oberon00 Sep 19, 2019

c24t Sep 19, 2019

reyang Sep 19, 2019

reyang Sep 19, 2019

Oberon00 Sep 19, 2019

Oberon00 Sep 19, 2019 •

edited

Loading

c24t left a comment

reyang Sep 19, 2019

c24t Sep 19, 2019

Oberon00 Sep 19, 2019

reyang Sep 19, 2019

Oberon00 Sep 19, 2019

Oberon00 Sep 24, 2019

Oberon00 commented Sep 19, 2019 •

edited

Loading

Oberon00 commented Sep 24, 2019

a-feld commented Sep 24, 2019

a-feld Sep 24, 2019

Oberon00 Sep 24, 2019

a-feld left a comment

Oberon00 commented Sep 25, 2019

WSGI fixes #148

WSGI fixes #148

Conversation

Oberon00 commented Sep 18, 2019 • edited Loading

a-feld left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Oberon00 Sep 18, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Oberon00 Sep 19, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Oberon00 Sep 19, 2019 • edited Loading

Choose a reason for hiding this comment

c24t left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Oberon00 commented Sep 19, 2019 • edited Loading

Oberon00 commented Sep 24, 2019

a-feld commented Sep 24, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

a-feld left a comment

Choose a reason for hiding this comment

Oberon00 commented Sep 25, 2019

Oberon00 commented Sep 18, 2019 •

edited

Loading

Oberon00 Sep 18, 2019 •

edited

Loading

Oberon00 Sep 19, 2019 •

edited

Loading

Oberon00 Sep 19, 2019 •

edited

Loading

Oberon00 commented Sep 19, 2019 •

edited

Loading