Remove dashes to underscores normalization from http header attribute keys #369

trask · 2023-10-03T22:42:52Z

Fixes #304

Changes

Removes the dashes to underscores normalization from http header attribute keys.
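To illustrate the change, here is a minimal sketch of how an instrumentation might derive the attribute key before and after this PR. The function names are hypothetical and not taken from any SDK; only the `http.<direction>.header.<key>` pattern comes from the conventions.

```python
def header_attribute_key(direction: str, header_name: str) -> str:
    """Key after this PR: the header name is lowercased, dashes are kept."""
    return f"http.{direction}.header.{header_name.lower()}"


def legacy_header_attribute_key(direction: str, header_name: str) -> str:
    """Key before this PR: lowercased, with '-' replaced by '_'."""
    normalized = header_name.lower().replace("-", "_")
    return f"http.{direction}.header.{normalized}"


print(header_attribute_key("request", "Content-Type"))
# prints: http.request.header.content-type
print(legacy_header_attribute_key("request", "Content-Type"))
# prints: http.request.header.content_type
```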

Merge requirement checklist

CONTRIBUTING.md guidelines followed.
CHANGELOG.md updated for non-trivial changes.
~~schema-next.yaml updated with changes to existing conventions.~~

arminru

The same change also needs to be applied to response headers to stay consistent:

| Attribute | Description | Examples |
|---|---|---|
| `http.response.header.<key>` | HTTP response headers, `<key>` being the normalized HTTP Header name (lowercase, with `-` characters replaced by `_`), the value being the header values. [5] | `http.response.header.content_type=["application/json"]`; `http.response.header.my_custom_header=["abc", "def"]` |

docs/http/http-spans.md

pyohannes · 2023-10-04T09:15:47Z

> The same change also needs to be applied to response headers to stay consistent:

Definitely, and in case we decide to do this for HTTP, we probably should also do it for RPC. Likely not in this PR, but at least note it in issues:
https://github.com/open-telemetry/semantic-conventions/blob/main/docs/rpc/connect-rpc.md?plain=1#L23
https://github.com/open-telemetry/semantic-conventions/blob/main/docs/rpc/grpc.md?plain=1#L22

Oberon00 · 2023-10-04T14:55:41Z

> Likely not in this PR

I would say, please do it in this PR as well. This should stay consistent. I'd rather leave HTTP unchanged than change only HTTP and leave RPC in an inconsistent state if, in the RPC follow-up, we then figure out the reasons for the normalization.

RPC handling adapted to match HTTP in 16bc32a

jsuereth · 2023-10-10T15:15:30Z

Based on our discussion in Semconv, I'm against this change.

I think we should either:

- Not normalize at all, so conflicts are always obvious; this doesn't actually solve the underlying issue.
- Normalize for consistency, including _. The notion of multiple headers and conflicts is something we have to resolve no matter what if we do any normalization of the raw data received.
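The kind of conflict under discussion can be sketched as follows. This is an illustrative check with hypothetical helper names, assuming header names are compared case-insensitively: replacing `-` with `_` can map two distinct headers to the same key, while lowercasing alone cannot.

```python
from collections import defaultdict


def find_collisions(header_names, normalize):
    """Return normalized keys that more than one distinct header maps to."""
    groups = defaultdict(set)
    for name in header_names:
        # Header names are case-insensitive, so casing alone never makes
        # two names "distinct" here.
        groups[normalize(name)].add(name.lower())
    return {key: sorted(names) for key, names in groups.items() if len(names) > 1}


def lowercase_only(name):
    return name.lower()


def lowercase_and_underscore(name):
    return name.lower().replace("-", "_")


headers = ["My-Header", "my_header", "Content-Type"]
print(find_collisions(headers, lowercase_and_underscore))
# prints: {'my_header': ['my-header', 'my_header']}
print(find_collisions(headers, lowercase_only))
# prints: {}
```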

AlexanderWert · 2023-10-10T15:24:11Z

> Based on our discussion in Semconv, I'm against this change.
>
> I think we should either:
>
> - Not normalize at all, so conflicts are always obvious; this doesn't actually solve the underlying issue.
> - Normalize for consistency, including _. The notion of multiple headers and conflicts is something we have to resolve no matter what if we do any normalization of the raw data received.

+1 (not blocking it from my side though, if we end up with lowercasing only)

I'm leaning towards:

> Not normalize at all, so conflicts are always obvious; this doesn't actually solve the underlying issue.

Lowercasing would be for the sake of cardinality, right? But none of the attributes touched in this PR are part of metric dimensions, so cardinality should not be an issue.

arminru

Approving following the reasoning @trask provided in #369 (comment)

trask · 2023-10-10T15:47:34Z

> Lowercasing would be for the sake of cardinality, right?

Lowercasing is only for the keys (not the values, so not related to cardinality). If we don't lowercase the keys, then we end up with different keys, e.g. http.request.header.content-type and http.request.header.Content-Type.

trask · 2023-10-10T15:48:52Z

> I think we should either:
>
> Not normalize at all, so conflicts are always obvious

I don't believe Content-Type and content-type are truly conflicts, so I don't think we need to make that difference obvious.

pyohannes · 2023-10-10T16:28:40Z

> > Lowercasing would be for the sake of cardinality, right?
>
> Lowercasing is only for the keys (not the values, so not related to cardinality). If we don't lowercase the keys, then we end up with different keys, e.g. http.request.header.content-type and http.request.header.Content-Type.

👍 Agreeing with @trask, lowercasing makes sense here because we're "converting" from a case-insensitive to a case-sensitive format.

It doesn't cause conflicts (because the origin format is case-insensitive), and I think it drastically improves the user experience. Without that normalization, if I wanted to filter for a certain content type, I'd theoretically need to look for all http.request.header.[cC][oO][nN][tT][eE][nN][tT]-[tT][yY][pP][eE] keys.
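A rough sketch of that difference, using made-up span attributes (the data and helper names are hypothetical, for illustration only): without lowercasing, a lookup has to match the key case-insensitively; with lowercasing, an exact key lookup is enough.

```python
import re

PREFIX = "http.request.header."
CONTENT_TYPE_KEY = PREFIX + "content-type"

# Hypothetical attributes from three spans, recorded with on-the-wire casing.
raw_spans = [
    {"http.request.header.Content-Type": ["application/json"]},
    {"http.request.header.content-type": ["text/html"]},
    {"http.request.header.CONTENT-TYPE": ["application/json"]},
]


def normalize_keys(attributes):
    """Lowercase the header-name part of each request header attribute key."""
    normalized = {}
    for key, value in attributes.items():
        if key.startswith(PREFIX):
            key = PREFIX + key[len(PREFIX):].lower()
        normalized[key] = value
    return normalized


def content_types_by_pattern(spans):
    """Without normalization: match every casing of the key."""
    pattern = re.compile(re.escape(CONTENT_TYPE_KEY), re.IGNORECASE)
    return [v for attrs in spans for k, v in attrs.items() if pattern.fullmatch(k)]


def content_types_by_exact_key(spans):
    """With normalization: a plain dictionary lookup is sufficient."""
    return [attrs[CONTENT_TYPE_KEY] for attrs in spans if CONTENT_TYPE_KEY in attrs]


normalized_spans = [normalize_keys(attrs) for attrs in raw_spans]
print(content_types_by_exact_key(raw_spans))         # finds only one of the three spans
print(content_types_by_exact_key(normalized_spans))  # finds all three spans
```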

Flarna · 2023-10-10T18:54:15Z

Most HTTP frameworks do one sort or another of normalizing incoming headers because it eases usage. This affects casing and concatenation/arrayifying, simply because the spec is clear that casing doesn't matter, and that it also doesn't matter whether headers that are allowed multiple times (e.g. tracestate) are sent as a single concatenated line or as an array.

Similarly, on the sending side, HTTP frameworks might decide on such normalization.
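As an illustration of the casing and array handling described above, the following sketch groups raw (name, value) pairs into list-valued attributes under lowercased keys. The function name and input shape are assumptions, not any framework's actual API, and the sketch deliberately does not split comma-concatenated values, since that is not safe for every header.

```python
def headers_to_attributes(raw_headers, direction="request"):
    """Group (name, value) pairs into list-valued attributes with lowercased keys."""
    attributes = {}
    for name, value in raw_headers:
        key = f"http.{direction}.header.{name.lower()}"
        # Repeated headers, whatever their casing, end up in the same list.
        attributes.setdefault(key, []).append(value)
    return attributes


wire_headers = [
    ("Content-Type", "application/json"),
    ("tracestate", "vendor1=a"),
    ("TraceState", "vendor2=b"),
]
print(headers_to_attributes(wire_headers))
# prints: {'http.request.header.content-type': ['application/json'],
#          'http.request.header.tracestate': ['vendor1=a', 'vendor2=b']}
# (printed on a single line)
```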

In my opinion, the main use of tracing is not to debug low-level, framework-specific details like the actual casing on the wire. It's more to monitor the HTTP request from a functional point of view.
Allowing easy search/indexing/etc. in the backend by normalizing the input data without losing functional details seems to be the better choice here.

If there is a use case for also tracking the low-level bits and details (assuming the framework in use allows extracting them), I would add a separate set of attributes like http.request.raw_headers and http.response.raw_headers.

Remove dashes to underscores normalization from http header attribute…

be84f79

… keys

trask force-pushed the http-header-attr-keys branch from 41ad5b2 to be84f79 Compare October 3, 2023 22:43

trask marked this pull request as ready for review October 3, 2023 22:43

trask requested review from a team October 3, 2023 22:43

github-actions bot assigned reyang Oct 3, 2023

trask mentioned this pull request Oct 3, 2023

Normalizing HTTP header names is ambiguous #304

Closed

tables

151bbd7

arminru previously requested changes Oct 4, 2023

arminru reviewed Oct 4, 2023

all places

16bc32a

AlexanderWert approved these changes Oct 5, 2023

pyohannes approved these changes Oct 5, 2023

lmolkova approved these changes Oct 5, 2023

Merge remote-tracking branch 'upstream/main' into http-header-attr-keys

3c9fac6

Oberon00 approved these changes Oct 10, 2023

mateuszrzeszutek approved these changes Oct 10, 2023

Merge branch 'main' into http-header-attr-keys

c929b5a

MSNev approved these changes Oct 10, 2023

jack-berg approved these changes Oct 10, 2023

arminru approved these changes Oct 10, 2023

Flarna approved these changes Oct 10, 2023

joaopgrassi approved these changes Oct 16, 2023

Merge branch 'main' into http-header-attr-keys

052bc94

reyang approved these changes Oct 16, 2023

Merge branch 'main' into http-header-attr-keys

bc3c036

AlexanderWert merged commit b23075c into open-telemetry:main Oct 20, 2023
9 checks passed

trask deleted the http-header-attr-keys branch October 20, 2023 16:04

mateuszrzeszutek mentioned this pull request Oct 23, 2023

Don't normalize the '-' character in HTTP header names open-telemetry/opentelemetry-java-instrumentation#9735

Merged

aabmass mentioned this pull request Feb 23, 2024

Fix HTTP header normalization open-telemetry/opentelemetry-python-contrib#2260

Open

11 tasks
