Import deadlock with streams #193

Closed
rkern opened this issue Sep 23, 2016 · 3 comments · Fixed by jupyter/jupyter_client#206

@rkern (Contributor) commented Sep 23, 2016

Write this text to the file minimal.py:

import sys

# Write directly to the real stdout, bypassing IPython's capture.
sys.__stdout__.write('About to do\n')
sys.__stdout__.flush()
# Write through IPython's replacement sys.stdout (an OutputStream).
sys.stdout.write('Doing\n')
sys.stdout.flush()
sys.__stdout__.write('Done\n')
sys.__stdout__.flush()

Start up an IPython notebook under Python 2 (the issue may not exist in Python 3). Execute import minimal. The kernel will become unresponsive.

The problem is that sys.stdout.flush() calls the OutputStream.flush() method, which adds a callback to the event loop that sends a zmq message to the notebook. That callback is executed in another thread. To create the message to send, it creates a UUID using uuid.uuid4(), which has a local import of os in it. Python 2 has a global import lock, and it was acquired by the main thread, which is executing our import minimal. That import never completes because OutputStream.flush() is synchronous and is waiting for the event-loop callback to run. Deadlock.
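Here is a distilled sketch of the same deadlock pattern with the zmq machinery removed (a hypothetical module, not ipykernel code; it only deadlocks under Python 2, where the single global import lock exists):

# deadlocker.py -- importing this module under Python 2 reproduces the hang.
import threading

done = threading.Event()

def worker():
    # Runs on a second thread while the main thread is still inside
    # "import deadlocker" and therefore holds the global import lock.
    import os  # blocks acquiring the import lock, even though os is cached
    done.set()

threading.Thread(target=worker).start()
done.wait()  # the main thread never releases the import lock: deadlock

The key detail is that in Python 2 even an import of an already-loaded module acquires the global import lock before consulting sys.modules, so the worker's import os blocks despite os having been imported long ago.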

As far as I can tell, this is the only place where something is imported at runtime in the message-sending thread, so reimplementing uuid.uuid4() to avoid the local import would be a minimal fix for the issue.
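One possible replacement (a sketch only, not necessarily the fix that landed in jupyter/jupyter_client#206): build the UUID from os.urandom directly, with os imported at module level:

import os
import uuid

def random_uuid4():
    # Equivalent to uuid.uuid4(), but without the function-local
    # "import os" that takes Python 2's global import lock.
    return uuid.UUID(bytes=os.urandom(16), version=4)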

@rkern (Contributor, Author) commented Sep 23, 2016

Oh, the offending uuid.uuid4() calls are in jupyter_client/session.py, FWIW. Let me know if this issue should be moved over to that repo.

@minrk (Member) commented Sep 24, 2016

This is a fine place for the issue for now. We can consider removing the event-loop wait in flush. I'll need to investigate what that is there for (I think it might have to do with forked subprocesses exiting before sending completes).

@rkern (Contributor, Author) commented Sep 26, 2016

It's there for the semantics, I think. flush() is supposed to block until the buffer is actually, you know, flushed.
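A sketch of the blocking-flush pattern under discussion (hypothetical names, not ipykernel's actual implementation): the caller schedules the real send on the IO thread and waits for it, which gives flush() its blocking semantics but also opens the window for the deadlock above:

import threading

class EventLoopStream(object):
    def __init__(self, schedule):
        # schedule(fn) runs fn later on the IO/event-loop thread.
        self._schedule = schedule
        self._buffer = []

    def write(self, data):
        self._buffer.append(data)

    def flush(self):
        flushed = threading.Event()
        def _send():
            # ... serialize self._buffer into a zmq message and send it ...
            del self._buffer[:]
            flushed.set()
        self._schedule(_send)
        flushed.wait()  # block until the IO thread has actually sent it

If _send itself blocks (here, on the import lock held by the caller), flushed.wait() never returns.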

minrk added this to the no action milestone Nov 16, 2016
Carreau added a commit to Carreau/ipykernel that referenced this issue Feb 17, 2022
Fixes ipython#193.

This should make sure we properly cull all subprocesses at shutdown.
It changes one of the private methods from sync to async in order to
avoid using time.sleep or a thread, so this may affect subclasses,
though I doubt it.

It's also not completely clear to me whether this works on Windows, as
SIGINT, I believe, is not a thing there.

Regardless, as this affects things like dask and others that mostly run
on Unix, it should be an improvement.

It does the following, stopping as soon as it no longer finds any
children of the current process:

 - Send SIGINT to everything.
 - Immediately start sending SIGTERM in a loop with an exponential
   backoff from 0.01 to 1 second, roughly multiplying the delay until
   the next send by 3 each time.
 - Switch to sending SIGKILL with the same backoff.

There is no delay after SIGINT, as it is just a courtesy. The backoff
delays are not configurable; I can imagine that on slow systems it may
make sense to make them configurable.
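A synchronous sketch of the escalation loop described above (the commit itself does this asynchronously to avoid time.sleep; psutil is an assumption used here only to enumerate children, not necessarily what the PR uses):

import signal
import time

import psutil  # assumption: used only to list child processes

def cull_children():
    # Courtesy signal first; no delay afterwards.
    for child in psutil.Process().children(recursive=True):
        child.send_signal(signal.SIGINT)
    # Escalate to SIGTERM, then SIGKILL, each with exponential backoff.
    # Note: signal.SIGKILL does not exist on Windows.
    for sig in (signal.SIGTERM, signal.SIGKILL):
        delay = 0.01
        while delay <= 1.0:
            children = psutil.Process().children(recursive=True)
            if not children:
                return  # stop as soon as no children remain
            for child in children:
                try:
                    child.send_signal(sig)
                except psutil.NoSuchProcess:
                    pass
            time.sleep(delay)
            delay *= 3  # roughly triple each time: 0.01, 0.03, ..., 0.81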