Environment Variable Resource Detection in Kubernetes #195

dashpole · 2022-01-27T16:02:40Z

This OTEP proposes using a consistent set of environment variables to detect resource attributes available through the Kubernetes downward API. The Kubernetes downward API is designed to serve this function and is the most consistent way to detect Kubernetes resource attributes. This will improve user experience, and enable re-use of the same Kubernetes configuration across languages and when using the collector.

text/0195-kubernetes-resource-detection.md

dashpole · 2022-01-27T21:44:28Z

cc @Aneurysm9 @pmm-sumo @dmitryax
who might be interested

dmitryax

Thanks for putting it together!

text/0195-kubernetes-resource-detection.md

anuraaga

Just for reference we currently use OTEL_RESOURCE_ATTRIBUTES in the operator

https://github.com/open-telemetry/opentelemetry-operator/blob/a94a13888165ec688a5f83d4e444e9fbc934fca2/pkg/instrumentation/sdk.go#L144

If there were different recognized variables we would probably switch to it though, as far as I can tell it's not practical to set that purely with YAML so having dedicated env vars seems worth it.

pmm-sumo · 2022-01-28T08:15:00Z

I think that the downward API can currently also expose labels and annotations. While it's not included in the semantic conventions, k8sattributesprocessor can currently extract those into resource attributes (and this information is frequently very useful).

I am wondering if there's a space to include such capability via env resource detection. Currently, the labels/annotations are filtered by k8sattributesprocessor but perhaps we could include those in a separate namespace and let filtering happen (if needed) using e.g. resourceprocessor

text/0195-kubernetes-resource-detection.md

dashpole · 2022-01-28T16:14:25Z

If there were different recognized variables we would probably switch to it though, as far as I can tell it's not practical to set that purely with YAML so having dedicated env vars seems worth it.

Oh, wow. I just realized you can define dependent env vars, which might be a better solution than i'm proposing.

I could add:

- name: OTEL_RESOURCE_ATTRIBUTES
  value: k8s.pod.name=$(K8S_POD_NAME),k8s.pod.uid=$(K8S_POD_UID),k8s.namespace.name=$(K8S_NAMESPACE_NAME),k8s.node.name=$(K8S_NODE_NAME)

to achieve the goals of this proposal, without requiring any detectors to be installed. I've added this as an alternative for now.

mat-rumian · 2022-01-28T16:26:44Z

@dashpole I would like to share my opinion. What do you think about configuring additional env var holding resource attributes related to k8s in separated var like:

- name: OTEL_RESOURCE_ATTRIBUTES_K8S
  value: k8s.pod.name=$(K8S_POD_NAME),k8s.pod.uid=$(K8S_POD_UID),k8s.namespace.name=$(K8S_NAMESPACE_NAME),k8s.node.name=$(K8S_NODE_NAME)

and use it this way:

- name: OTEL_RESOURCE_ATTRIBUTES
  value: $(OTEL_RESOURCE_ATTRIBUTES_K8S),other=attribs...

I think it will be simpler, transparent and easier to handle and modify.

dashpole · 2022-01-28T16:45:22Z

I think that the downward API can currently also expose labels and annotations. ... I am wondering if there's a space to include such capability via env resource detection.

If we went with the current proposal (explicitly defined env vars that are detected with a kubernetes detector), we could also support K8S_LABELS and K8S_ATTRIBUTES. The kubernetes detector could accept a configuration option to allow elevating parsed labels or attributes to resource attributes.

If we used dependent environment variables, as we are discussing above (no detector needed), users could just fetch labels and attributes themselves:

- name: MY_CUSTOM_ATTRIBUTE
   valueFrom:
     fieldRef:
       fieldPath: metadata.labels['my-custom-attribute']
- name: OTEL_RESOURCE_ATTRIBUTES
  value: other.semantic.convention=$(MY_CUSTOM_ATTRIBUTE)

There are a lot of labels/attributes that would come in if the resourcedetectionprocessor added them be default. Filtering later seems tedious (e.g. what if my cluster admin adds a new label)...

WDYT?

dashpole · 2022-01-28T16:50:13Z

What do you think about configuring additional env var holding resource attributes related to k8s in separated var like ...

That seems like a perfectly reasonable thing to do, practically speaking. In terms of this proposal, I think the core question is whether or not we want to specify a new resource detector. I'd prefer to keep the yaml simpler for readability, if thats acceptable.

dashpole · 2022-01-28T17:00:17Z

text/0195-kubernetes-resource-detection.md

+
+Many kubernetes detectors currently use `HOSTNAME` environment variable, which defaults to the Pod name. However, the `HOSTNAME` can be [modified in a few ways](https://kubernetes.io/docs/concepts/services-networking/dns-pod-service/#pod-s-hostname-and-subdomain-fields) in the pod spec. Kubernetes resource detectors may fall back to detecting the pod name using `HOSTNAME` if `K8S_POD_NAME` is not available, but this may cause user confusion in some cases.
+
+### Alternative: Using Dependent Environment Variables


@tigrannajaryan
@jsuereth mentioned to me that this approach may not interact well with schemas, since OTEL_RESOURCE_ATTRIBUTES is merged into resource attributes without any notion of version according to the specification. Using this approach would seem to remove the benefits of #161. Is there a way to make OTEL_RESOURCE_ATTRIBUTES produce resource attributes associated with a particular version of the semantic conventions?

text/0195-kubernetes-resource-detection.md

aabmass · 2022-01-28T17:30:24Z

text/0195-kubernetes-resource-detection.md

+
+SDK's should add a Kubernetes environment variable-based resource detector.  The Kubernetes resource detector and the collector's `resourcedetectionprocessor` should support the following environment variables, and map them to the following semantic conventions:
+
+| Environment Variable | Semantic Convention |


I wonder if these environment variable names will run into conflicts. Should we prefix these with OTEL_?

I don't have a strong opinion. Is there precedent from other detectors?

I don't think there is a precedent. Since SDK resource detectors would need to interpret them, would you consider these SDK environment variables?

Likely an instrumented application would need these vars from downward API as well, but it's very unlikely that they would expect any different value. So conflict here is rather useful, otherwise they would need to repeat all these definitions with OTEL_ prefix and without.

The question here is whether we want to support an application that uses K8S_POD_NAME to represent a different value like deployment name or something else.

If we want to read these env vars from instrumentation libraries as is, we can probably support both prefixed (OTEL_K8S_POD_NAME) and not prefixed, defined as preferred (K8S_POD_NAME) just to not block use cases like the one described above. If we are still going to use only OTEL_RESOURCE_ATTRIBUTES in instrumentation libraries and compose it with k8s.pod.name=$(K8S_POD_NAME),k8s.pod.uid=$(K8S_POD_UID), I think we shouldn't use any prefix, users always can rename them if really needed.

Thinking about this more, I think we should switch to OTEL_ prefixed env vars. The reason isn't that someone might want opentelemetry to attach attributes that don't match your own identity. This is mostly relevant for container name (e.g. sidecar), but I could also imagine it for pod or even namespace.

@dashpole do you mean that OTEL_K8S_CONTAINER_NAME env var added to one container would mean something else other than a name of the container which it's added to, like name of an otel sidecar when added to an application container?

Yes, thats what I mean. container name isn't part of this proposal, but would be a likely future extension of this.

IMO in that case meaning of OTEL_K8S_CONTAINER_NAME becomes a bit confusing and inconsistent with other OTEL_K8S_* env vars. I, as a user, would expect OTEL_K8S_CONTAINER_NAME to have a value of the container where env var is defined, giving that other OTEL_K8S_* have values of other k8s objects where the env var is defined.

I think we should use another name for the sidecar use case, not OTEL_K8S_CONTAINER_NAME

I don't think multiple env vars for the same semantic convention would work, as resource detectors will ultimately have to choose one of the two to set in that resource attribute.

If we named it without a prefix, I'm not sure that would stop users from treating the env vars as a way to set attribute X in OTEL, even if we would prefer they use another mechanism in that scenario. We just risk introducing naming conflicts if we don't prefix.

text/0195-kubernetes-resource-detection.md

pavolloffay

This proposal is related to open-telemetry/opentelemetry-specification#2135 which proposes support for OTEL_RESOURCE_ATTRIBUTES_* env var. These env vars are added to OTEL_RESOURCE_ATTRIBUTES`

e.g.

OTEL_RESOURCE_ATTRIBUTES_K8S_POD_NAME=myapp
OTEL_RESOURCE_ATTRIBUTES=foo=bar

the final value of OTEL_RESOURCE_ATTRIBUTES would be foo=bar,k8s_pod_name=myapp

text/0195-kubernetes-resource-detection.md

bogdandrutu · 2022-02-01T09:40:31Z

@dashpole very nice OTEP, please consider to also refer to that proposal of supporting OTEL_RESOURCE_ATTRIBUTES_*

dashpole · 2022-02-04T15:01:21Z

please consider to also refer to that proposal of supporting OTEL_RESOURCE_ATTRIBUTES_*

Done.

text/0195-kubernetes-resource-detection.md

seh · 2022-05-06T13:34:00Z

On the subject of the difficulty of knitting together several environment variables into a final "OTEL_RESOURCE_ATTRIBUTES" variable, please see open-telemetry/opentelemetry-specification#1982.

Here's how I handled this in a kustomization I wrote recently, with the following in the "base" manifest for a Deployment:

- name: _OTEL_RESOURCES_ATTRIBUTES_UNDERLAY
  value: ""
- name: _OTEL_RESOURCES_ATTRIBUTES_OVERLAY
  value: ""
- name: OTEL_RESOURCE_ATTRIBUTES
  value: >-
    $(_OTEL_RESOURCES_ATTRIBUTES_UNDERLAY)
    k8s.container.name=my-thing,
    k8s.deployment.name=something,
    k8s.namespace.name=$(POD_NAMESPACE),
    k8s.node.name=$(NODE_NAME),
    k8s.pod.name=$(POD_NAME),
    k8s.pod.primary_ip_address=$(POD_IP_ADDRESS),
    k8s.pod.service_account.name=$(POD_SERVICE_ACCOUNT_NAME),
    k8s.pod.uid=$(POD_UID),
    net.host.ip=$(NODE_IP_ADDRESS)
    $(_OTEL_RESOURCES_ATTRIBUTES_OVERLAY)

Note the two empty "_OTEL_RESOURCES_ATTRIBUTES_UNDERLAY" and "_OTEL_RESOURCES_ATTRIBUTES_OVERLAY" variables. In overlay kustomizations, I can populate either or both of those, but I have to be careful to terminate the former with a trailing comma and prefix the latter with a leading comma.

dashpole · 2022-05-06T16:04:40Z

Thanks @seh, i've added that to the drawbacks of using dependent environment variables.

Co-authored-by: Dmitrii Anoshin <[email protected]>

dashpole requested a review from a team January 27, 2022 16:02

dashpole force-pushed the k8s_env branch from 9bc186e to 6239c32 Compare January 27, 2022 16:03

jpkrohling reviewed Jan 27, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Outdated Show resolved Hide resolved

dashpole force-pushed the k8s_env branch from 9aebd52 to 12d7598 Compare January 27, 2022 17:45

Aneurysm9 approved these changes Jan 27, 2022

View reviewed changes

dmitryax reviewed Jan 28, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Outdated Show resolved Hide resolved

anuraaga reviewed Jan 28, 2022

View reviewed changes

dmitryax reviewed Jan 28, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Outdated Show resolved Hide resolved

dashpole commented Jan 28, 2022

View reviewed changes

aabmass reviewed Jan 28, 2022

View reviewed changes

punya reviewed Jan 28, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Show resolved Hide resolved

text/0195-kubernetes-resource-detection.md Show resolved Hide resolved

dashpole force-pushed the k8s_env branch from 0ee26a4 to 857abfd Compare January 28, 2022 18:38

dmitryax reviewed Jan 28, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Outdated Show resolved Hide resolved

pavolloffay reviewed Jan 31, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Show resolved Hide resolved

dashpole force-pushed the k8s_env branch from af75095 to d6bdafc Compare January 31, 2022 16:39

dashpole force-pushed the k8s_env branch from d6bdafc to 2218d0b Compare February 4, 2022 14:15

mx-psi reviewed Feb 4, 2022

View reviewed changes

text/0195-kubernetes-resource-detection.md Show resolved Hide resolved

mx-psi approved these changes Feb 7, 2022

View reviewed changes

pmm-sumo mentioned this pull request Feb 15, 2022

K8sattribute processor documentation need updates open-telemetry/opentelemetry-collector-contrib#7912

Closed

dashpole mentioned this pull request Apr 26, 2022

Unify and improve GCP resource detection open-telemetry/opentelemetry-go-contrib#2223

Closed

jsuereth approved these changes Apr 28, 2022

View reviewed changes

jmacd approved these changes May 5, 2022

View reviewed changes

dashpole and others added 13 commits May 6, 2022 16:06

Add kubernetes resource detection OTEP

ba28f4a

the OpenTelemetry Operator would handle env var injection

827798d

fix table formatting

1acc897

rename kubernetes to k8s

85274fb

Update text/0195-kubernetes-resource-detection.md

49f5d27

Co-authored-by: Dmitrii Anoshin <[email protected]>

add dependent env var alternative

94303d8

add downsides to alternatives

c36f8ac

update intro section to mention dependent env vars alternative

76c9668

typo

d7aae11

add multiple env var alternative

1d23b97

note that _ to . is not always correct

70d18ca

prefix env vars with otel

973a39b

Add difficulties merging env vars together

ed8b534

dashpole force-pushed the k8s_env branch from 974e21f to ed8b534 Compare May 6, 2022 16:09

dashpole mentioned this pull request Jun 3, 2022

Request to Improve Namespace Detection in GKE Resource Detector open-telemetry/opentelemetry-go-contrib#2336

Open

This was referenced Jun 13, 2022

[exporter/datadog] Add Kubernetes hostname provider open-telemetry/opentelemetry-collector-contrib#10911

Merged

[exporter/datadog] Kubernetes node name resolution via the downward API open-telemetry/opentelemetry-collector-contrib#11033

Open

tedsuo added priority:p1 triaged labels Jan 30, 2023

dashpole closed this Feb 28, 2023

aabmass mentioned this pull request Jul 20, 2023

Point upstream @opentelemetry/resource-detector-gcp at the resource detector in this repo GoogleCloudPlatform/opentelemetry-operations-js#518

Open

dashpole mentioned this pull request Aug 4, 2023

Request to Add Detector for Kubernetes open-telemetry/opentelemetry-go-contrib#4136

Open

10 tasks

dmitryax mentioned this pull request May 14, 2024

[receiver/kubeletstats] Add k8s.container.cpu.node.utilization metric open-telemetry/opentelemetry-collector-contrib#32295

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Environment Variable Resource Detection in Kubernetes #195

Environment Variable Resource Detection in Kubernetes #195

dashpole commented Jan 27, 2022

dashpole commented Jan 27, 2022

dmitryax left a comment

anuraaga left a comment

pmm-sumo commented Jan 28, 2022

dashpole commented Jan 28, 2022 •

edited

Loading

mat-rumian commented Jan 28, 2022

dashpole commented Jan 28, 2022 •

edited

Loading

dashpole commented Jan 28, 2022 •

edited

Loading

dashpole Jan 28, 2022

aabmass Jan 28, 2022

dashpole Jan 28, 2022

aabmass Jan 28, 2022

dmitryax Jan 28, 2022 •

edited

Loading

dashpole Feb 4, 2022

dmitryax Feb 7, 2022

dashpole Feb 7, 2022

dmitryax Feb 8, 2022 •

edited

Loading

dashpole Feb 9, 2022

pavolloffay left a comment

bogdandrutu commented Feb 1, 2022

dashpole commented Feb 4, 2022

seh commented May 6, 2022 •

edited

Loading

dashpole commented May 6, 2022


		Many kubernetes detectors currently use `HOSTNAME` environment variable, which defaults to the Pod name. However, the `HOSTNAME` can be [modified in a few ways](https://kubernetes.io/docs/concepts/services-networking/dns-pod-service/#pod-s-hostname-and-subdomain-fields) in the pod spec. Kubernetes resource detectors may fall back to detecting the pod name using `HOSTNAME` if `K8S_POD_NAME` is not available, but this may cause user confusion in some cases.

		### Alternative: Using Dependent Environment Variables


		SDK's should add a Kubernetes environment variable-based resource detector. The Kubernetes resource detector and the collector's `resourcedetectionprocessor` should support the following environment variables, and map them to the following semantic conventions:

		\| Environment Variable \| Semantic Convention \|

Environment Variable Resource Detection in Kubernetes #195

Environment Variable Resource Detection in Kubernetes #195

Conversation

dashpole commented Jan 27, 2022

dashpole commented Jan 27, 2022

dmitryax left a comment

Choose a reason for hiding this comment

anuraaga left a comment

Choose a reason for hiding this comment

pmm-sumo commented Jan 28, 2022

dashpole commented Jan 28, 2022 • edited Loading

mat-rumian commented Jan 28, 2022

dashpole commented Jan 28, 2022 • edited Loading

dashpole commented Jan 28, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dmitryax Jan 28, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dmitryax Feb 8, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pavolloffay left a comment

Choose a reason for hiding this comment

bogdandrutu commented Feb 1, 2022

dashpole commented Feb 4, 2022

seh commented May 6, 2022 • edited Loading

dashpole commented May 6, 2022

dashpole commented Jan 28, 2022 •

edited

Loading

dashpole commented Jan 28, 2022 •

edited

Loading

dashpole commented Jan 28, 2022 •

edited

Loading

dmitryax Jan 28, 2022 •

edited

Loading

dmitryax Feb 8, 2022 •

edited

Loading

seh commented May 6, 2022 •

edited

Loading