-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
remote: pod logs: catch-up delay #16132
Comments
Recent flakes, to remind everyone that this is still not fixed:
|
Another one in remote f36-aarch64 root:
Although in CI I only see this on podman-remote tests, yesterday I saw this flake on a 1mt VM:
No idea if there's a connection, and the VM is long gone. |
Here's another non-remote logs flake (in f37 root):
I'm not saying the bug is in podman-logs, but maybe I am. |
And another one in int tests, also a podman-logs that spits out nothing. |
This one has been a thorn in my side: it's a podman-log issue, but not remote, so I _almost_ retitled containers#16132 (removing "remote"). Nope, it's a bug in the tests themselves. One solution would be to podman-wait, but I see no reason for logs to be involved, so I went with podman start -a instead. This removes the k8s-log stuff which is no longer necessary. Cleanup all around. Signed-off-by: Ed Santiago <[email protected]>
It looks like containers#16132 was my fault: a missing 'wait' for a container to exit. Let's see if this fixes the flake. And, while poking through flake logs, I found another missing wait. And... in wait_for_output(), address a potential race. Signed-off-by: Ed Santiago <[email protected]>
I'm going to hope that this was fixed in #16437 |
[backport containers#16437] It looks like containers#16132 was my fault: a missing 'wait' for a container to exit. Let's see if this fixes the flake. And, while poking through flake logs, I found another missing wait. And... in wait_for_output(), address a potential race. Signed-off-by: Ed Santiago <[email protected]>
As best I can tell,
podman-remote pod logs
is sometimes not actually fully catching up, so a system test is failingThe code in question is doing
echo teststring | nc localhost PORT
, then immediately runningpodman logs
on a container in a pod.podman/test/system/200-pod.bats
Lines 300 to 307 in 2a9df07
Seems to have started 08-Sept, which suggests a real code problem because that test code hasn't changed since June 2020.
The text was updated successfully, but these errors were encountered: