Should instrumentations be able to interact with or know about other instrumentations #369

owais · 2021-03-12T14:52:47Z

There are some cases where an instrumentation may generate a completely valid span on its own, but the span may not make sense when looking at the overall trace. A couple of examples:

WSGI + django/flask/etc

If the WSGI and Django instrumentations are both enabled, WSGI will extract context from the incoming request and generate a SERVER span. Django will then extract context from the same request headers and generate a second SERVER span.

If the incoming request carries tracing context, the two spans will be siblings instead of the WSGI span being the parent of the Django span.

If the incoming request does not carry any tracing context, the two spans will not even be part of the same trace, and a single request will generate multiple traces.

One possible fix is for Django (and other server-side instrumentations) to check whether a parent span is already active in the current context and, if so, skip extracting the parent context from the request headers.
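A minimal sketch of that check, written as a toy model with only the standard library. Nothing here is the OpenTelemetry API: `Span`, `extract_parent`, `start_server_span`, `handle_request`, the `reuse_active` flag and the `x-toy-trace` header are all hypothetical names made up for illustration.

```python
import contextvars
import itertools
from dataclasses import dataclass
from typing import Optional

# Toy stand-ins for tracing concepts; none of these names are OpenTelemetry APIs.
_ids = itertools.count(1)
_active_span = contextvars.ContextVar("active_span", default=None)


@dataclass
class Span:
    name: str
    kind: str
    trace_id: int
    span_id: int
    parent_id: Optional[int]


def extract_parent(headers: dict) -> Optional[tuple]:
    """Pretend propagator: read (trace_id, span_id) from a hypothetical header."""
    value = headers.get("x-toy-trace")
    if value is None:
        return None
    trace_id, span_id = value.split("-")
    return int(trace_id), int(span_id)


def start_server_span(name: str, headers: dict, reuse_active: bool) -> Span:
    active = _active_span.get()
    if reuse_active and active is not None:
        # Proposed behaviour: an already-active span wins over the headers.
        trace_id, parent_id = active.trace_id, active.span_id
    else:
        remote = extract_parent(headers)
        if remote is not None:
            trace_id, parent_id = remote
        else:
            trace_id, parent_id = next(_ids), None  # new root span, new trace
    span = Span(name, "SERVER", trace_id, next(_ids), parent_id)
    _active_span.set(span)
    return span


def handle_request(headers: dict, reuse_active: bool) -> tuple:
    """Run the WSGI layer, then the framework layer, for a single request."""
    _active_span.set(None)
    wsgi = start_server_span("wsgi", headers, reuse_active)
    django = start_server_span("django", headers, reuse_active)
    return wsgi, django
```

With `reuse_active=False` the two spans end up as siblings (or in separate traces when no header is present); with `reuse_active=True` the framework span becomes a child of the WSGI span in both cases.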

Q. Could there be a situation where a parent span is present but was not generated for the current in-flight request? For example, a long-lived span representing the process/worker lifetime. It is unlikely, but it is something we should try to answer.

Example:
https://cloud-native.slack.com/archives/C01PD4HUVBL/p1614339078002900
https://cloud-native.slack.com/archives/C01PD4HUVBL/p1614340059009000

urllib3/requests + http.client

The urllib3 and requests instrumentations both generate CLIENT spans and inject tracing context into HTTP headers. Both libraries use http.client internally, and we may also have an instrumentation for http.client. Such an instrumentation would also generate a CLIENT span and inject tracing context again. If both the requests/urllib3 and http.client instrumentations are enabled, this will result in a parent > child span pair where both spans are of type CLIENT.

The solution proposed in PR #299 is for urllib3 to suppress the http.client instrumentation by setting a well-known key in the context. While this solves the problem of having multiple CLIENT spans, it completely silences the lower-level instrumentation, which might be able to add a lot more contextual information (DNS resolution time, cache hit/miss, etc.) to the spans. I'm not sure this is the ideal solution.

Another possible solution might be for urllib3 and requests to never generate CLIENT spans and instead declare the http.client instrumentation as a dependency, so that whenever the urllib3 or requests instrumentation is installed, the http.client instrumentation is always installed as well.

Example: #299 (comment)

Both of these cases call for some kind of system that either lets instrumentations discover each other and modify their behaviour, or lets them inspect parent/child spans and then modify their own span accordingly. I think letting instrumentations discover each other might not be as complex as it sounds, and it could solve both problems mentioned above. Would love to hear other thoughts.

owais · 2021-03-26T12:13:01Z

One simple solution would be for such child spans to modify parent spans as they see fit. For example, instead of the urllib3 instrumentation silencing the http.client instrumentation, the http.client instrumentation should check whether the parent span has its type set to CLIENT and update it to INTERNAL instead. Similarly, in the case of Django + WSGI, the Django instrumentation should check whether there is an active span in the current context, and perhaps also whether that span's type is SERVER. If it finds such a span in the current context, it should use that span as the parent instead of extracting the parent context from the incoming request. I think Java and Node do something similar, but I'll need to look deeper to confirm it.

We'll need to come up with some heuristics for such cases, but I'm fairly sure we can solve most if not all of them by letting a child span look at its parent before deciding on its own fields/attributes, or even modify its parent's fields/attributes.
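As a rough illustration of the "child adjusts its parent" heuristic for the urllib3/http.client case, here is a standard-library toy model. `Span` and `start_http_client_span` are hypothetical names, and the sketch simply assumes a span's kind can be changed after the span has been created, which a real tracing API may not allow.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Span:
    # Toy span; assumes `kind` may be mutated after creation (an assumption).
    name: str
    kind: str
    parent: Optional["Span"] = None


def start_http_client_span(parent: Optional[Span]) -> Span:
    """Start the lowest-level CLIENT span, demoting a CLIENT parent to INTERNAL."""
    if parent is not None and parent.kind == "CLIENT":
        # Heuristic from the comment above: only the innermost span stays CLIENT.
        parent.kind = "INTERNAL"
    return Span("http.client", "CLIENT", parent)


# Usage: a urllib3 span wrapping an http.client span.
outer = Span("urllib3", "CLIENT")
inner = start_http_client_span(outer)
```

After this runs, `outer.kind` is `"INTERNAL"` and `inner.kind` is `"CLIENT"`, so the trace contains a single CLIENT span while both instrumentations still contribute a span.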

github-actions · 2021-04-26T03:23:05Z

This issue was marked stale due to lack of activity. It will be closed in 30 days.

lzchen · 2021-04-27T00:16:59Z

@owais
Has this issue been brought up to the SIG?

owais · 2021-04-27T07:44:53Z

Yes, we brought it up once but did not reach a decision. We then proposed two concrete solutions to specific issues:

#446 and #445

There were no objections to 445 and everyone agreed to do it.

446 is a bit more involved as it needs changes to the API spec.

owais mentioned this issue Mar 12, 2021

Add urllib3 instrumentation #299

Merged

6 tasks

github-actions bot added the backlog label Apr 26, 2021

owais closed this as completed Apr 27, 2021
