Consolidate Kubernetes Watch and Stream calls at the Pod-level over Container-level. #519
Comments
go-vela/worker#303 is not panning out for fixing log streaming, but several other issues have been (or will soon be) fixed. And, the log stream has to be per-container, so managing the log streams at the pod level has not been helpful so far. For now, I'm going to step away from the log streaming part of this and see how the rest of the fixes affect things over time. If I notice any more logging issues, I will revisit this. Here are some of the logging changes that will hopefully improve things:
However, go-vela/worker#302 has been working out very well. I'm very happy with watching the pod at the pod-level (instead of per container) using the k8s-controller primitives. So far, go-vela/worker#302 allows us to watch for Pod changes (add/update/delete) for the pod itself. To get information about pull errors, however, we need to start watching the stream of events. These events do not affect the fields of the pod, so we need a separate stream. In go-vela/worker#279 I added plain watches for the Kubernetes events. I will create a new PR that re-implements (reuses?) that, possibly via the SharedInformer on the PodTracker.
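For anyone following along, here is a minimal sketch (not the actual PodTracker code) of what pod-level watching through client-go SharedInformers could look like: one informer for the pod's own add/update/delete and one for the Event stream that carries things like pull errors. All function names and handler bodies are illustrative assumptions, not the worker's implementation.

```go
// Sketch only: pod-level watching with client-go SharedInformers.
package main

import (
	"fmt"
	"time"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/cache"
)

// trackPod is a hypothetical helper, not the worker's PodTracker.
func trackPod(clientset kubernetes.Interface, namespace, podName string, stopCh <-chan struct{}) {
	factory := informers.NewSharedInformerFactoryWithOptions(
		clientset, 30*time.Second, informers.WithNamespace(namespace),
	)

	// pod changes (add/update/delete) for the pipeline pod itself
	factory.Core().V1().Pods().Informer().AddEventHandler(cache.ResourceEventHandlerFuncs{
		UpdateFunc: func(_, newObj interface{}) {
			pod := newObj.(*corev1.Pod)
			if pod.Name == podName {
				fmt.Printf("pod %s is now %s\n", pod.Name, pod.Status.Phase)
			}
		},
	})

	// Events do not change the pod's fields, so they need their own informer
	factory.Core().V1().Events().Informer().AddEventHandler(cache.ResourceEventHandlerFuncs{
		AddFunc: func(obj interface{}) {
			event := obj.(*corev1.Event)
			if event.InvolvedObject.Name == podName {
				fmt.Printf("event: %s %s\n", event.Reason, event.Message)
			}
		},
	})

	factory.Start(stopCh)
	factory.WaitForCacheSync(stopCh)
}
```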
go-vela/worker#390 massively improves log streaming reliability by allowing the streaming to take longer than the execution, using a configurable timer. I have one more WIP fix that will make sure the pod doesn't get deleted until streaming is "done" (finished or timed out).
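A rough sketch of the idea of letting streaming outlive execution, assuming a hypothetical helper rather than the actual go-vela/worker#390 code:

```go
// Sketch only: give the log stream its own configurable deadline so it can
// keep draining logs after the build's context is already done.
package main

import (
	"context"
	"time"
)

// streamWithGrace runs stream() under a deadline that is independent of the
// build context, so cancelling the build does not immediately cut the stream.
func streamWithGrace(streamTimeout time.Duration, stream func(context.Context) error) error {
	streamCtx, cancel := context.WithTimeout(context.Background(), streamTimeout)
	defer cancel()

	done := make(chan error, 1)
	go func() { done <- stream(streamCtx) }()

	select {
	case err := <-done:
		return err // streaming finished (or failed) on its own
	case <-streamCtx.Done():
		return streamCtx.Err() // the configurable timer expired first
	}
}
```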
@cognifloyd Just checking in on some older issues. What do you think about this? Still an issue? Something we should prioritize for the k8s runtime?
Description
Consolidate Kubernetes Watch and Stream calls at the Pod-level instead of the Container-level.

There is prior work we can use to inform the implementation as well:
- stern

If we do something similar, then we could:
- In runtime.AssembleBuild(), we start streaming pod changes + events (similar to how WaitContainer currently watches for pod changes, but only one stream of changes instead of one per step).
- runtime.AssembleBuild() also starts a goroutine that consumes the stream of events to do a number of things:
  - … (which runtime.TailContainer can access once Vela catches up recording state in the database)
  - … for the runtime.*Container methods to access.
- RunContainer can wait until image pull is successful before returning, or return any image pull errors identified in the container's event stream.
- WaitContainer can wait for the events to say that the container has completed.
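A rough sketch of the shape described in the list above, with all names illustrative rather than the worker's real API: one pod-level tracker fans pod updates out to per-container signals that RunContainer/WaitContainer-style methods can wait on.

```go
// Sketch only: a single pod-level tracker with per-container signal channels.
package main

import (
	"context"
	"fmt"
	"sync"

	corev1 "k8s.io/api/core/v1"
)

type podTracker struct {
	mu       sync.Mutex
	running  map[string]chan struct{} // closed once a container is running
	finished map[string]chan struct{} // closed once a container has terminated
}

// handlePodUpdate is driven by the single pod-level watch (e.g. an informer handler).
func (t *podTracker) handlePodUpdate(pod *corev1.Pod) {
	t.mu.Lock()
	defer t.mu.Unlock()
	for _, cs := range pod.Status.ContainerStatuses {
		if cs.State.Running != nil || cs.State.Terminated != nil {
			closeOnce(t.running[cs.Name])
		}
		if cs.State.Terminated != nil {
			closeOnce(t.finished[cs.Name])
		}
	}
}

// waitContainer blocks until the tracker has seen the container terminate,
// instead of each WaitContainer call opening its own per-container watch.
func (t *podTracker) waitContainer(ctx context.Context, name string) error {
	select {
	case <-t.finished[name]:
		return nil
	case <-ctx.Done():
		return fmt.Errorf("waiting for container %s: %w", name, ctx.Err())
	}
}

// closeOnce closes a signal channel exactly once (callers hold t.mu).
func closeOnce(ch chan struct{}) {
	if ch == nil {
		return
	}
	select {
	case <-ch:
		// already closed
	default:
		close(ch)
	}
}
```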
Value

Improve the reliability of the Kubernetes runtime, especially:
Hopefully, starting the container log stream as quickly as possible will mean that a log stream starts for each container before we modify it to add the step image (i.e. start streaming while the step still has the kubernetes/pause:latest image instead of the step image). The TailContainer can start reading from the stream for a given step once it is ready to do so.
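As a sketch of what opening the stream early could look like with client-go (the function name is illustrative, and this assumes the follow stream survives until TailContainer is ready to read from it):

```go
// Sketch only: open a follow-mode log stream for a step container as soon as
// the pod is running, so the stream is already attached when the real step
// image starts. TailContainer would later read from the returned ReadCloser.
package main

import (
	"context"
	"io"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/client-go/kubernetes"
)

func openContainerLogStream(ctx context.Context, clientset kubernetes.Interface, namespace, pod, container string) (io.ReadCloser, error) {
	opts := &corev1.PodLogOptions{
		Container: container,
		Follow:    true, // keep the stream open instead of returning a snapshot
	}
	// GetLogs returns a *rest.Request; Stream opens the HTTP response body.
	return clientset.CoreV1().Pods(namespace).GetLogs(pod, opts).Stream(ctx)
}
```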
Another benefit is reducing the Vela-generated load on Kubernetes API servers. The Vela Worker is effectively a "Kubernetes Controller" (e.g. the software that watches for ReplicaSet changes to create Pods) because it watches for events (albeit not sourced from Kubernetes resource changes) and converts those events into Pods, managing the lifecycle of those Pods. The libraries involved in building such a controller go to great lengths to avoid polling the Kubernetes API servers; they watch and then locally cache Kubernetes resources, allowing Kubernetes to push changes through the watch. Similar to a Kubernetes Controller, the Vela Worker's Kubernetes Runtime should minimize the API calls required to manage its Pipeline Pods (and any other resources it needs to retrieve in the course of creating the Pod).
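For illustration, a small client-go sketch of serving pod reads from an informer's watch-backed cache (a Lister) instead of issuing a GET against the API server per lookup; names are illustrative:

```go
// Sketch only: a cached pod lookup, the way a typical controller avoids polling.
package main

import (
	"time"

	corev1 "k8s.io/api/core/v1"
	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
)

func cachedPodLookup(clientset kubernetes.Interface, namespace, podName string, stopCh <-chan struct{}) (*corev1.Pod, error) {
	factory := informers.NewSharedInformerFactoryWithOptions(
		clientset, 30*time.Second, informers.WithNamespace(namespace),
	)
	podLister := factory.Core().V1().Pods().Lister()

	factory.Start(stopCh)
	factory.WaitForCacheSync(stopCh)

	// no API round trip here: the object comes from the watch-backed local cache
	return podLister.Pods(namespace).Get(podName)
}
```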
Definition of Done
Capture the build logs independently (e.g. with stern) and compare the captured logs with what Vela shows in its UI or CLI. There should be no difference in the log contents.

Effort (Optional)
Good question. I don't know, but I need it to work, so I'll be working on this.
Previous attempts at fixing these issues include:
Impacted Personas (Optional)
Anyone who uses the Kubernetes Runtime.
Depending on the implementation, some API changes may be needed between the Executor and Runtime interfaces (which are much easier to coordinate now that the Executor and Runtime packages are both in the Worker repo).