Investigating the missing logs #1008

betatim · 2021-01-28T08:22:39Z

This is a follow up to #987

As we observed there the test_env.py tests break. Locally this also happens, but only sometimes. After about half a day of messing around and trying to figure out where the logs go I am resigning myself to not being able to figure it out :-/

It seems even with somewhat "extreme" measures like in the PR you either get the logs from the command executed (both in the "normal" logs and the newly added "late logs" or you don't get them in either).

A thing that seems to help is to add a ; sleep 1 to the command the test runs.

This makes me think that the issue is related to buffering of some kind, some where. I am tempted to suggest we add the sleep to the test and assume that the flakeyness is due to the very short execution time of the container, which isn't typical for repo2docker containers.

Unfortunately I don't understand enough about how the logging in docker works to know if changing config options in the logging driver https://docs.docker.com/config/containers/logging/configure/ would help or not :-/

A popular google result regarding "my docker logs are missing" is a note that journald rate limits log producers but I think this doesn't apply to us because I am running OSX. This means I don't have journald and as far as I understand it the default is logging driver is the json-file driver which doesn't use journald.

Thoughts welcome on how to fix this properly or the workaround.

minrk · 2021-02-16T15:18:40Z

I poked around for a bit and was able to reproduce it consistently until I got my debugging in place, then it promptly started succeeding 100% of the time and hasn't failed since (even after reverting changes).

Buffer ideas:

PYTHONUNBUFFERED might help
maybe calling sys.stdout / sys.stderr.flush()?
stdbuf -o0 might be the equivalent for bash?

minrk · 2021-02-16T15:25:10Z

Given that failure sems to have become more frequent, I think maybe we should merge this one as-is to unblock folks and proceed with.

Maybe the above stdbuf -o{log_fd} should go in the entrypoint in the right place?

minrk · 2021-02-17T10:03:50Z

repo2docker/app.py

+            )
+            # are there more logs? Let's send them back too
+            late_logs = container.logs().decode("utf-8")
+            for line in late_logs.split("\n"):


In my runs, late_logs is never empty, it's always the complete logs of the container. So this is always duplicating the logs and also missed the same logs.

Absolutely right. I added this thinking that maybe we missed some logs. But that isn't the problem.

If I remove/comment out the "delete stopped container" part of r2d and then run docker logs <container> afterwards the logs are also empty (when this problems occurs). So I think your ideas about PYTHONUNBUFFERED and stdbuf are the way to go investigating.

I think, the "late log" lines should be removed

minrk · 2021-02-17T10:04:05Z

repo2docker/buildpacks/repo2docker-entrypoint

@@ -15,10 +15,10 @@ if [[ ! -z "${R2D_ENTRYPOINT:-}" ]]; then
    if [[ ! -x "$R2D_ENTRYPOINT" ]]; then
        chmod u+x "$R2D_ENTRYPOINT"
    fi
-    exec "$R2D_ENTRYPOINT" "$@" >&"$log_fd"
+    exec "$R2D_ENTRYPOINT" "$@" 2>&1 >&"$log_fd"


I think this merge is a good thing to do, regardless

manics · 2021-02-17T10:12:24Z

Unfortunately I don't understand enough about how the logging in docker works to know if changing config options in the logging driver https://docs.docker.com/config/containers/logging/configure/ would help or not :-/

A bit late... but I don't think the Docker daemon logging options will make a difference as they only have an effect after the logs have been emitted from the container on stdout/stderr. They're more to do with how the Docker daemon stores or forwards them.

I'm also in favour of merging this to fix the tests, and to see if it somehow makes the logs appear on mybinder (so far in all containers I've launched ~/.jupyter-server-log.txt is always empty).

minrk · 2021-02-17T10:23:07Z

@manics good call. I'll go ahead and merge, and try to refine a bit as I attempted (unsuccessfully) in #1014

Extreme measures

6e372b5

minrk mentioned this pull request Feb 16, 2021

Update to JupyterLab 3.0 #996

Merged

5 tasks

minrk mentioned this pull request Feb 17, 2021

reimplement entrypoint in Python #1014

Merged

minrk reviewed Feb 17, 2021

View reviewed changes

minrk merged commit e862630 into jupyterhub:master Feb 17, 2021

henchbot mentioned this pull request Feb 17, 2021

repo2docker: 2021.01.0-18.g8582d44...2021.01.0-33.gf8d386a jupyterhub/mybinder.org-deploy#1839

Merged

minrk mentioned this pull request Feb 17, 2021

Refine buffered output debugging #1016

Merged

betatim deleted the missing-logs branch February 18, 2021 11:49

manics mentioned this pull request Jun 14, 2021

Feature request: exposing server logs to a JupyterHub user - but how? jupyterhub/team-compass#419

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigating the missing logs #1008

Investigating the missing logs #1008

betatim commented Jan 28, 2021

minrk commented Feb 16, 2021

minrk commented Feb 16, 2021

minrk Feb 17, 2021

betatim Feb 18, 2021

minrk Feb 17, 2021

manics commented Feb 17, 2021 •

edited

Loading

minrk commented Feb 17, 2021

Investigating the missing logs #1008

Investigating the missing logs #1008

Conversation

betatim commented Jan 28, 2021

minrk commented Feb 16, 2021

minrk commented Feb 16, 2021

minrk Feb 17, 2021

Choose a reason for hiding this comment

betatim Feb 18, 2021

Choose a reason for hiding this comment

minrk Feb 17, 2021

Choose a reason for hiding this comment

manics commented Feb 17, 2021 • edited Loading

minrk commented Feb 17, 2021

manics commented Feb 17, 2021 •

edited

Loading