Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error importing modin on Linux: Connection to IPC socket failed for pathname /tmp/ray/session... #468

Closed
carlosssanchez opened this issue Feb 9, 2019 · 26 comments
Labels
dependencies 🔗 Issues related to dependencies Memory 💾 Issues related to memory

Comments

@carlosssanchez
Copy link

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Manjaro Linux (Up date to Feb 9, 2019)
  • Modin installed from (source or binary): pip
  • Modin version: 0.3.1
  • Python version: 3.7.2
  • Exact command to reproduce: import modin.pandas as pd

Describe the problem

Modin fails to import when I try to import modin.pandas as pd (tested on different shells/venv and from a script). Running import modin alone does not result in error.

Source code / logs

import modin.pandas as pd
Process STDOUT and STDERR is being redirected to /tmp/ray/session_2019-02-09_13-26-56_29493/logs.
Waiting for redis server at 127.0.0.1:16725 to respond...
Waiting for redis server at 127.0.0.1:53530 to respond...
Starting Redis shard with 10.0 GB max memory.
WARNING: The object store is using /tmp instead of /dev/shm because /dev/shm has only 3623899136 bytes available. This may slow down performance! You may be able to free up space by deleting files in /dev/shm or terminating any running plasma_store_server processes. If you are inside a Docker container, you may need to pass an argument with the flag '--shm-size' to 'docker run'.
Starting the Plasma object store with 4.0 GB memory using /tmp.
WARNING: Logging before InitGoogleLogging() is written to STDERR
E0209 13:26:56.985399 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 50 more times
E0209 13:26:57.085587 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 49 more times
E0209 13:26:57.185729 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 48 more times
E0209 13:26:57.285882 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 47 more times
E0209 13:26:57.386044 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 46 more times
E0209 13:26:57.486229 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 45 more times
E0209 13:26:57.586522 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 44 more times
E0209 13:26:57.686915 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 43 more times
E0209 13:26:57.787245 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 42 more times
E0209 13:26:57.887514 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 41 more times
E0209 13:26:57.987782 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 40 more times
E0209 13:26:58.088189 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 39 more times
E0209 13:26:58.188467 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 38 more times
E0209 13:26:58.288707 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 37 more times
E0209 13:26:58.388949 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 36 more times
E0209 13:26:58.489233 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 35 more times
E0209 13:26:58.589490 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 34 more times
E0209 13:26:58.689779 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 33 more times
E0209 13:26:58.790030 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 32 more times
E0209 13:26:58.890283 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 31 more times
E0209 13:26:58.990538 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 30 more times
E0209 13:26:59.090790 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 29 more times
E0209 13:26:59.191045 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 28 more times
E0209 13:26:59.291301 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 27 more times
E0209 13:26:59.391549 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 26 more times
E0209 13:26:59.491786 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 25 more times
E0209 13:26:59.592041 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 24 more times
E0209 13:26:59.692338 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 23 more times
E0209 13:26:59.792659 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 22 more times
E0209 13:26:59.892916 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 21 more times
E0209 13:26:59.993175 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 20 more times
E0209 13:27:00.093627 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 19 more times
E0209 13:27:00.194108 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 18 more times
E0209 13:27:00.294461 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 17 more times
E0209 13:27:00.394773 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 16 more times
E0209 13:27:00.495095 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 15 more times
E0209 13:27:00.595379 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 14 more times
E0209 13:27:00.695606 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 13 more times
E0209 13:27:00.795925 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 12 more times
E0209 13:27:00.896240 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 11 more times
E0209 13:27:00.996529 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 10 more times
E0209 13:27:01.096850 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 9 more times
E0209 13:27:01.197140 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 8 more times
E0209 13:27:01.297454 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 7 more times
E0209 13:27:01.397768 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 6 more times
E0209 13:27:01.498037 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 5 more times
E0209 13:27:01.598280 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 4 more times
E0209 13:27:01.698601 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 3 more times
E0209 13:27:01.798869 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 2 more times
E0209 13:27:01.899190 29493 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store, retrying 1 more times
Traceback (most recent call last):
File "", line 1, in
File "/home/c/coins/venv-linux/lib/python3.7/site-packages/modin/pandas/init.py", line 134, in
initialize_ray()
File "/home/c/coins/venv-linux/lib/python3.7/site-packages/modin/pandas/init.py", line 126, in initialize_ray
object_store_memory=object_store_memory,
File "/home/c/coins/venv-linux/lib/python3.7/site-packages/ray/worker.py", line 1530, in init
driver_id=driver_id)
File "/home/c/coins/venv-linux/lib/python3.7/site-packages/ray/worker.py", line 1882, in connect
plasma.connect(info["store_socket_name"]))
File "pyarrow/_plasma.pyx", line 789, in pyarrow._plasma.connect
File "pyarrow/error.pxi", line 83, in pyarrow.lib.check_status
pyarrow.lib.ArrowIOError: Could not connect to socket /tmp/ray/session_2019-02-09_13-26-56_29493/sockets/plasma_store

@carlosssanchez
Copy link
Author

It seems the problems is with ray. I'm using ray version 0.6.2 if it helps.

@devin-petersohn
Copy link
Collaborator

Thanks @carlosssanchez for reporting this. It does look like plasma it unable to initialize in your environment. Does the issue persist with pip install ray==0.6.3?

cc @pcmoritz @robertnishihara

@carlosssanchez
Copy link
Author

@devin-petersohn I downloaded the new ray version but it didn't fix it. However, I fixed it by closing all my other running programs (Firefox and Code editors) and running from a new shell.

I think this line "Starting the Plasma object store with 4.0 GB memory using /tmp" tried to allocate 4GB of RAM when I didn't have it available. I have 8GB RAM total and I'm pretty sure I could've passed more than 4GB with Pycharm + Atom + Firefox open.

However I have a runtime error with column names being integers apparently. I'll open another issue on it. Is this a known modin modification? The error is I'm checking df column names to compare the last few characters in a function but it gives TypeError int object is not subscriptable.

@devin-petersohn
Copy link
Collaborator

I see, that makes sense. Typically plasma will throw an Out of Memory error if it can't allocate enough memory, but it seems like this may be another path that an OOM can present itself. Great find!

That would be great to open a new issue to track the columns issue. Looping through column names should work identically as it does in pandas.

@devin-petersohn devin-petersohn added the Memory 💾 Issues related to memory label Feb 9, 2019
@shaunstoltz
Copy link

Is there some guaranteed way of running this module? I have tried the simplest of imports on multiple machines and this lib simply dosn't work. My base machine is 16c32t/96gig mem/4 1080ti and get the same memory error.

4c/8gig/2 1080ti machine, same error complaining about not being able to connect.

Is there a docker image somewhere that actually guarantees this will work?

If not, I strongly suggest you get some none native way of allowing users to use this without drama, otherwise this is just a toy project, and has no real world value.

@devin-petersohn devin-petersohn added the dependencies 🔗 Issues related to dependencies label Feb 27, 2019
@devin-petersohn
Copy link
Collaborator

Hi @shaunstoltz, can you let me know if the following works for you:

import ray

object_store_memory = int(0.6 * ray.utils.get_system_memory() // 10 ** 9 * 10 ** 9)
ray.init(
            include_webui=False,
            ignore_reinit_error=True,
            plasma_directory="/tmp",
            object_store_memory=object_store_memory,
        )

If this also errors for you, it is an issue with the initialization in Ray. We should open an issue in Ray if this is the case because we definitely want this resolved.

Is there a docker image somewhere that actually guarantees this will work?

If not, I strongly suggest you get some none native way of allowing users to use this without drama, otherwise this is just a toy project, and has no real world value.

We have built docker images in the past for tutorials and testing, but they are not up to date. A simple way to build a docker image from this repo would be to use BinderHub (a great project). We also have a link in the badges of the README to a mybinder.org instance where you can try it, but those instances have <1GB in memory. Building a Docker image as a part of the release process is a great idea, I will add it to the backlog.

If you're interested in helping with the success this project, feel free to get involved! Creating an issue and following up on that issue is a great way to do that.

There are also several open issues flagged as Help Wanted if you are interested in contributing code.

@shaunstoltz
Copy link

Thanks for quick reply. Have tried the code snippet above, and same error:

import ray

object_store_memory = int(0.6 * ray.utils.get_system_memory() // 10 ** 9 * 10 ** 9)
ray.init(
            include_webui=False,
            ignore_reinit_error=True,
            plasma_directory="/tmp",
            object_store_memory=object_store_memory,
        )
2019-02-27 23:08:59,437	INFO node.py:278 -- Process STDOUT and STDERR is being redirected to /tmp/ray/session_2019-02-27_23-08-59_73139/logs.
2019-02-27 23:08:59,545	INFO services.py:396 -- Waiting for redis server at 127.0.0.1:12520 to respond...
2019-02-27 23:08:59,659	INFO services.py:396 -- Waiting for redis server at 127.0.0.1:31524 to respond...
2019-02-27 23:08:59,662	INFO services.py:798 -- Starting Redis shard with 10.0 GB max memory.
2019-02-27 23:08:59,679	WARNING services.py:1242 -- WARNING: object_store_memory is not verified when plasma_directory is set.
2019-02-27 23:08:59,680	INFO services.py:1360 -- Starting the Plasma object store with 60.0 GB memory using /tmp.
---------------------------------------------------------------------------
ArrowIOError                              Traceback (most recent call last)
<ipython-input-1-4b0bccaf545a> in <module>()
      6             ignore_reinit_error=True,
      7             plasma_directory="/tmp",
----> 8             object_store_memory=object_store_memory,
      9         )

~/anaconda3/lib/python3.6/site-packages/ray/worker.py in init(redis_address, num_cpus, num_gpus, resources, object_store_memory, redis_max_memory, node_ip_address, object_id_seed, num_workers, local_mode, driver_mode, redirect_worker_output, redirect_output, ignore_reinit_error, num_redis_shards, redis_max_clients, redis_password, plasma_directory, huge_pages, include_webui, driver_id, configure_logging, logging_level, logging_format, plasma_store_socket_name, raylet_socket_name, temp_dir, _internal_config, use_raylet)
   1492         mode=driver_mode,
   1493         worker=global_worker,
-> 1494         driver_id=driver_id)
   1495 
   1496     for hook in _post_init_hooks:

~/anaconda3/lib/python3.6/site-packages/ray/worker.py in connect(info, redis_password, object_id_seed, mode, worker, driver_id)
   1800     # Create an object store client.
   1801     worker.plasma_client = thread_safe_client(
-> 1802         plasma.connect(info["store_socket_name"]))
   1803 
   1804     raylet_socket = info["raylet_socket_name"]

~/anaconda3/lib/python3.6/site-packages/ray/pyarrow_files/pyarrow/_plasma.pyx in pyarrow._plasma.connect()

~/anaconda3/lib/python3.6/site-packages/ray/pyarrow_files/pyarrow/error.pxi in pyarrow.lib.check_status()

ArrowIOError: Could not connect to socket /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store

$ python --version
Python 3.6.4 :: Anaconda custom (64-bit)
$ python -m pip list
alabaster (0.7.10)
anaconda-client (1.6.9)
anaconda-navigator (1.7.0)
anaconda-project (0.8.2)
asn1crypto (0.24.0)
astroid (1.6.1)
astropy (2.0.3)
attrs (17.4.0)
Babel (2.5.3)
backports.shutil-get-terminal-size (1.0.0)
beautifulsoup4 (4.6.0)
bitarray (0.8.1)
bkcharts (0.2)
blaze (0.11.3)
bleach (2.1.2)
blessings (1.7)
bokeh (0.12.13)
boto (2.48.0)
Bottleneck (1.2.1)
branca (0.3.0)
catamount (0.9)
certifi (2018.10.15)
cffi (1.11.4)
chardet (3.0.4)
click (6.7)
cloudpickle (0.5.2)
clyent (1.2.2)
colorama (0.3.9)
colorlover (0.2.1)
conda (4.5.11)
conda-build (3.4.1)
conda-verify (2.0.0)
contextlib2 (0.5.5)
cryptography (2.1.4)
cufflinks (0.14.6)
cycler (0.10.0)
Cython (0.27.3)
cytoolz (0.9.0)
dask (0.16.1)
datashape (0.5.4)
decorator (4.2.1)
distributed (1.20.2)
docutils (0.14)
entrypoints (0.2.3)
et-xmlfile (1.0.1)
evdev (1.1.2)
ez-setup (0.9)
fastcache (1.0.2)
filelock (2.0.13)
Flask (0.12.2)
Flask-Cors (3.0.3)
flatbuffers (1.10)
folium (0.5.0)
funcsigs (1.0.2)
gevent (1.2.2)
Glances (3.0.2)
glob2 (0.6)
gmpy2 (2.0.8)
gpustat (0.5.0)
greenlet (0.4.12)
h5py (2.7.1)
heapdict (1.0.0)
html5lib (1.0.1)
idna (2.6)
imageio (2.2.0)
imagesize (0.7.1)
ipykernel (4.8.0)
ipython (6.2.1)
ipython-genutils (0.2.0)
ipywidgets (7.1.1)
isort (4.2.15)
itsdangerous (0.24)
jdcal (1.3)
jedi (0.11.1)
Jinja2 (2.10)
jsonschema (2.6.0)
jupyter (1.0.0)
jupyter-client (5.2.2)
jupyter-console (5.2.0)
jupyter-core (4.4.0)
jupyterlab (0.31.5)
jupyterlab-launcher (0.10.2)
kiwisolver (1.0.1)
lazy-object-proxy (1.3.1)
llvmlite (0.21.0)
locket (0.2.0)
lxml (4.1.1)
MarkupSafe (1.0)
matplotlib (2.2.2)
mccabe (0.6.1)
mistune (0.8.3)
modin (0.3.1)
moviepy (0.2.3.5)
mpmath (1.0.0)
msgpack-python (0.5.1)
multipledispatch (0.4.9)
navigator-updater (0.1.0)
nbconvert (5.3.1)
nbformat (4.4.0)
networkx (2.1)
nltk (3.2.5)
nose (1.3.7)
notebook (5.4.0)
numba (0.36.2)
numexpr (2.6.4)
numpy (1.14.0)
numpydoc (0.7.0)
nvidia-ml-py3 (7.352.0)
odo (0.5.1)
olefile (0.45.1)
openpyxl (2.4.10)
packaging (16.8)
pandas (0.23.4)
pandas-datareader (0.6.0)
pandocfilters (1.4.2)
parso (0.1.1)
partd (0.3.8)
path.py (10.5)
pathlib2 (2.3.0)
patsy (0.5.0)
pep8 (1.7.1)
pexpect (4.3.1)
pickleshare (0.7.4)
Pillow (5.1.0)
pip (9.0.1)
pkginfo (1.4.1)
plotly (3.4.1)
pluggy (0.6.0)
ply (3.10)
prompt-toolkit (1.0.15)
psutil (5.4.7)
ptyprocess (0.5.2)
py (1.5.2)
pycodestyle (2.3.1)
pycosat (0.6.3)
pycparser (2.18)
pycrypto (2.6.1)
pycurl (7.43.0.1)
pyflakes (1.6.0)
Pygments (2.2.0)
pylint (1.8.2)
pyodbc (4.0.22)
pyOpenSSL (17.5.0)
pyparsing (2.2.0)
PySocks (1.6.7)
pytest (3.3.2)
python-dateutil (2.6.1)
pytz (2017.3)
pyusb (1.0.2)
PyWavelets (0.5.2)
PyYAML (3.12)
pyzmq (16.0.3)
QtAwesome (0.4.4)
qtconsole (4.3.1)
QtPy (1.3.1)
ray (0.6.3)
redis (3.2.0)
requests (2.18.4)
requests-file (1.4.3)
requests-ftp (0.3.1)
retrying (1.3.3)
rope (0.10.7)
ruamel-yaml (0.15.35)
scikit-image (0.13.1)
scikit-learn (0.19.1)
scipy (1.0.0)
seaborn (0.8.1)
Send2Trash (1.4.2)
setproctitle (1.1.10)
setuptools (38.4.0)
simplegeneric (0.8.1)
singledispatch (3.4.0.3)
six (1.11.0)
snowballstemmer (1.2.1)
sortedcollections (0.5.3)
sortedcontainers (1.5.9)
Sphinx (1.6.6)
sphinxcontrib-websupport (1.0.1)
spyder (3.2.6)
SQLAlchemy (1.2.1)
sqlalchemy-dremio (0.1.0.dev0)
statsmodels (0.8.0)
sympy (1.1.1)
tables (3.4.2)
tblib (1.3.2)
terminado (0.8.1)
testpath (0.3.1)
toolz (0.9.0)
tornado (4.5.3)
tqdm (4.23.4)
traitlets (4.3.2)
twitter (1.18.0)
typing (3.6.2)
unicodecsv (0.14.1)
urllib3 (1.22)
utils (0.9.0)
wcwidth (0.1.7)
webencodings (0.5.1)
Werkzeug (0.14.1)
wheel (0.30.0)
widgetsnbextension (3.1.0)
wrapt (1.10.11)
xlrd (1.1.0)
XlsxWriter (1.0.2)
xlwt (1.3.0)
zict (0.1.3)

[I 23:08:49.734 LabApp] Kernel restarted: 5e6a7278-bd80-4aec-9596-ed1d4eefae79
[I 23:08:50.335 LabApp] Adapting to protocol v5.1 for kernel 5e6a7278-bd80-4aec-9596-ed1d4eefae79
[I 23:08:50.336 LabApp] Restoring connection for 5e6a7278-bd80-4aec-9596-ed1d4eefae79:5a901346bdf87f0d0015cfd29c8056b2
[I 23:08:50.336 LabApp] Replaying 5 buffered messages
WARNING: Logging before InitGoogleLogging() is written to STDERR
E0227 23:08:59.697196 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 50 more times
WARNING: Logging before InitGoogleLogging() is written to STDERR
E0227 23:08:59.708341 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 50 more times
E0227 23:08:59.797468 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 49 more times
E0227 23:08:59.808454 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 49 more times
E0227 23:08:59.897589 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 48 more times
E0227 23:08:59.908550 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 48 more times
E0227 23:08:59.997697 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 47 more times
E0227 23:09:00.008649 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 47 more times
E0227 23:09:00.097797 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 46 more times
E0227 23:09:00.108742 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 46 more times
E0227 23:09:00.197921 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 45 more times
E0227 23:09:00.208851 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 45 more times
E0227 23:09:00.298036 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 44 more times
E0227 23:09:00.308956 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 44 more times
E0227 23:09:00.398152 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 43 more times
E0227 23:09:00.409060 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 43 more times
E0227 23:09:00.498260 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 42 more times
E0227 23:09:00.509160 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 42 more times
E0227 23:09:00.598383 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 41 more times
E0227 23:09:00.609275 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 41 more times
E0227 23:09:00.698513 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 40 more times
E0227 23:09:00.709403 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 40 more times
E0227 23:09:00.798641 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 39 more times
E0227 23:09:00.809523 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 39 more times
E0227 23:09:00.898766 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 38 more times
E0227 23:09:00.909641 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 38 more times
E0227 23:09:00.998893 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 37 more times
E0227 23:09:01.009758 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 37 more times
E0227 23:09:01.099022 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 36 more times
E0227 23:09:01.109884 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 36 more times
E0227 23:09:01.199153 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 35 more times
E0227 23:09:01.210002 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 35 more times
E0227 23:09:01.299286 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 34 more times
E0227 23:09:01.310122 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 34 more times
E0227 23:09:01.399394 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 33 more times
E0227 23:09:01.410223 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 33 more times
E0227 23:09:01.499480 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 32 more times
E0227 23:09:01.510315 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 32 more times
E0227 23:09:01.599562 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 31 more times
E0227 23:09:01.610406 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 31 more times
E0227 23:09:01.699648 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 30 more times
E0227 23:09:01.710501 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 30 more times
E0227 23:09:01.799743 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 29 more times
E0227 23:09:01.810600 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 29 more times
E0227 23:09:01.899832 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 28 more times
E0227 23:09:01.910693 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 28 more times
E0227 23:09:01.999919 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 27 more times
E0227 23:09:02.010782 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 27 more times
E0227 23:09:02.100001 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 26 more times
E0227 23:09:02.110865 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 26 more times
E0227 23:09:02.200103 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 25 more times
E0227 23:09:02.210973 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 25 more times
E0227 23:09:02.300210 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 24 more times
E0227 23:09:02.311079 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 24 more times
E0227 23:09:02.400315 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 23 more times
E0227 23:09:02.411183 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 23 more times
E0227 23:09:02.500421 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 22 more times
E0227 23:09:02.511282 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 22 more times
E0227 23:09:02.600519 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 21 more times
E0227 23:09:02.611387 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 21 more times
E0227 23:09:02.700619 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 20 more times
E0227 23:09:02.711488 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 20 more times
E0227 23:09:02.800721 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 19 more times
E0227 23:09:02.811589 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 19 more times
E0227 23:09:02.900823 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 18 more times
E0227 23:09:02.911689 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 18 more times
E0227 23:09:03.000928 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 17 more times
E0227 23:09:03.011793 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 17 more times
E0227 23:09:03.101037 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 16 more times
E0227 23:09:03.111896 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 16 more times
E0227 23:09:03.201154 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 15 more times
E0227 23:09:03.212009 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 15 more times
E0227 23:09:03.301265 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 14 more times
E0227 23:09:03.312115 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 14 more times
E0227 23:09:03.401366 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 13 more times
E0227 23:09:03.412215 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 13 more times
E0227 23:09:03.501466 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 12 more times
E0227 23:09:03.512312 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 12 more times
E0227 23:09:03.601572 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 11 more times
E0227 23:09:03.612416 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 11 more times
E0227 23:09:03.701681 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 10 more times
E0227 23:09:03.712527 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 10 more times
E0227 23:09:03.801787 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 9 more times
E0227 23:09:03.812635 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 9 more times
E0227 23:09:03.901895 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 8 more times
E0227 23:09:03.912739 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 8 more times
E0227 23:09:04.001994 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 7 more times
E0227 23:09:04.012836 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 7 more times
E0227 23:09:04.102088 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 6 more times
E0227 23:09:04.112932 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 6 more times
E0227 23:09:04.202205 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 5 more times
E0227 23:09:04.213044 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 5 more times
E0227 23:09:04.302312 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 4 more times
E0227 23:09:04.313140 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 4 more times
E0227 23:09:04.402408 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 3 more times
E0227 23:09:04.413235 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 3 more times
E0227 23:09:04.502508 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 2 more times
E0227 23:09:04.513332 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 2 more times
E0227 23:09:04.602599 73188 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 1 more times
E0227 23:09:04.613422 73139 io.cc:168] Connection to IPC socket failed for pathname /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store, retrying 1 more times
F0227 23:09:04.702714 73188 object_store_notification_manager.cc:23]  Check failed: _s.ok() Bad status: IOError: Could not connect to socket /tmp/ray/session_2019-02-27_23-08-59_73139/sockets/plasma_store
*** Check failure stack trace: ***
*** Aborted at 1551326944 (unix time) try "date -d @1551326944" if you are using GNU date ***
PC: @                0x0 (unknown)
*** SIGABRT (@0x3e800011de4) received by PID 73188 (TID 0x7fd038fa0740) from PID 73188; stack trace: ***
    @     0x7fd038ba4390 (unknown)
    @     0x7fd037d55428 gsignal
    @     0x7fd037d5702a abort
    @           0x5b906d google::logging_fail()
    @           0x5b9096 google::LogMessage::Fail()
    @           0x5b8fe2 google::LogMessage::SendToLog()
    @           0x5b8966 google::LogMessage::Flush()
    @           0x5b8775 google::LogMessage::~LogMessage()
    @           0x503618 ray::RayLog::~RayLog()
    @           0x5633ed ray::ObjectStoreNotificationManager::ObjectStoreNotificationManager()
    @           0x51c5f3 ray::ObjectManager::ObjectManager()
    @           0x4c45d4 ray::raylet::Raylet::Raylet()
    @           0x4b3f97 main
    @     0x7fd037d40830 __libc_start_main
    @           0x4b9c91 (unknown)

@devin-petersohn
Copy link
Collaborator

Thanks @shaunstoltz, that helps a lot. I created an issue on Ray's repo (ray-project/ray#4193) to track the problem. We use Ray as a dependency, so we can move the conversation there where it can be resolved at the source. In the meantime, I will talk with the Ray team to see if there is some temporary patch we can do in Modin so that some users don't have this problem. Thanks again for the reply!

@pcmoritz
Copy link
Contributor

Thanks for the report @shaunstoltz, can you post the plasma logs in
/tmp/ray/session_2019-02-27_23-08-59_73139/logs?

@devin-petersohn
Copy link
Collaborator

I found a machine that can reproduce this. @pcmoritz here is the output:

I0307 12:03:15.429157 110195 store.cc:994] Allowing the Plasma store to use up to 649GB of memory.
I0307 12:03:15.429505 110195 store.cc:1024] Starting object store with directory /tmp and huge page support disabled
F0307 12:03:15.429574 110195 store.cc:1039] System memory request exceeds memory available in /tmp. The request is for 649000000000 bytes, and the amount available is 189231636480 bytes. You may be able to free up space by deleting files in /dev/shm. If you are inside a Docker container, you may need to pass an argument with the flag '--shm-size' to 'docker run'.
*** Check failure stack trace: ***
    @           0x43ce9d  google::LogMessage::Fail()
    @           0x44094c  google::LogMessage::SendToLog()
    @           0x43c9c3  google::LogMessage::Flush()
    @           0x43cbc9  google::LogMessage::~LogMessage()
    @           0x43aa88  arrow::util::ArrowLog::~ArrowLog()
    @           0x413a99  main
    @     0x7fda49789830  __libc_start_main
    @           0x414a41  (unknown)

@robertnishihara
Copy link
Contributor

@devin-petersohn @pcmoritz I wonder if it is this issue ray-project/ray#3670.

@robertnishihara
Copy link
Contributor

@devin-petersohn, how much memory does that machine actually have?

@devin-petersohn
Copy link
Collaborator

@robertnishihara ~1TB

I think it is the same issue that you linked above. This also makes out of core not possible on Modin.

I rolled back to Ray 0.6 and the same issue came up.

@shaunstoltz
Copy link

shaunstoltz commented Mar 7, 2019 via email

@devin-petersohn
Copy link
Collaborator

For now, a temporary fix is to manually set MODIN_MEMORY=<num_bytes> in the bash environment. That API is currently experimental, but should fix the issue in the short term. num_bytes needs to be an integer value for it to work.

@Q3g
Copy link

Q3g commented Mar 14, 2019

For now, a temporary fix is to manually set MODIN_MEMORY=<num_bytes> in the bash environment. That API is currently experimental, but should fix the issue in the short term. num_bytes needs to be an integer value for it to work.

It doesn't work, because there isn't a type cast on os.environ["MODIN_MEMORY"].
I temporarily solve this problem by modifing file modin/pandas/__init__.py
edit
object_store_memory = int( 0.6 * ray.utils.get_system_memory() // 10 ** 9 * 10 ** 9 )
to
object_store_memory = int( 0.3 * ray.utils.get_system_memory() // 10 ** 9 * 10 ** 9 )
It seems that there are some problems about the value of object_store_memory.
hope it helps

@devin-petersohn
Copy link
Collaborator

@LonelyFingertip thanks, you're right about MODIN_MEMORY.

The problem is that the plasma store is checking the filesystem instead of the memory amount. There is an open issue in ray (linked above) and I believe it is getting resolved.

@Austrie
Copy link

Austrie commented Mar 22, 2019

@LonelyFingertip I was able to get past one of my error thanks your 0.3 tip.

@Austrie
Copy link

Austrie commented Mar 22, 2019

However, I can't fit my dataset in my memory due to this, so I'll have to continue playing around with Modin's source code.

@devin-petersohn
Copy link
Collaborator

Hi @carlosssanchez, @shaunstoltz, @LonelyFingertip, @Austrie

The fix has just been published in Ray's nightly wheels, to install visit https://ray.readthedocs.io/en/latest/installation.html#trying-snapshots-from-master and find your OS + Python version in that table. It will give a warning saying that Modin is not compatible, but it should still work for the time. Ray is planning to release within a couple of weeks, and we will release at the same time so this is fixed. Thanks to all of you who helped debug the issue.

@Austrie
Copy link

Austrie commented Mar 22, 2019

@devin-petersohn I did pip install -U ray-0.7.0.dev1-cp27-cp27mu-manylinux1_x86_64.whl, still getting ArrowIOError: Could not connect to socket /tmp/ray/session_2019-03-22_19-16-25_48551/sockets/plasma_store

@devin-petersohn
Copy link
Collaborator

Thanks @Austrie, does this work for you?

import ray

object_store_memory = int(0.6 * ray.utils.get_system_memory() // 10 ** 9 * 10 ** 9)
ray.init(
            include_webui=False,
            ignore_reinit_error=True,
            plasma_directory="/tmp",
            object_store_memory=object_store_memory,
        )
import modin.pandas as pd

@pcmoritz: It is working for me on Python3, is there a difference in the plasma implementation for Python2 for this kind of error?

@Austrie
Copy link

Austrie commented Mar 22, 2019

@devin-petersohn It works now, but the error transformed into a different error:

<ipython-input-11-3fdb98b24005> in <module>()
      8 # combat this, I make the blocksize much large (500MBs), so there's less chance of the CSV being split improperly, since this is
      9 # a 5GB file, it will only have around 10 CSVs, compared to 100 CSVs if we use a small number like "50e6" (50MBs)
---> 10 rated_anime_df = pd.read_csv(directory + '/UserAnimeList.csv')

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/pandas/io.pyc in parser_func(filepath_or_buffer, sep, delimiter, header, names, index_col, usecols, squeeze, prefix, mangle_dupe_cols, dtype, engine, converters, true_values, false_values, skipinitialspace, skiprows, nrows, na_values, keep_default_na, na_filter, verbose, skip_blank_lines, parse_dates, infer_datetime_format, keep_date_col, date_parser, dayfirst, iterator, chunksize, compression, thousands, decimal, lineterminator, quotechar, quoting, escapechar, comment, encoding, dialect, tupleize_cols, error_bad_lines, warn_bad_lines, skipfooter, doublequote, delim_whitespace, low_memory, memory_map, float_precision)
     94     ):
     95         _, _, _, kwargs = inspect.getargvalues(inspect.currentframe())
---> 96         return _read(**kwargs)
     97 
     98     return parser_func

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/pandas/io.pyc in _read(**kwargs)
    107         kwargs: Keyword arguments in pandas.read_csv
    108     """
--> 109     pd_obj = BaseFactory.read_csv(**kwargs)
    110     # This happens when `read_csv` returns a TextFileReader object for iterating through
    111     if isinstance(pd_obj, pandas.io.parsers.TextFileReader):

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/data_management/factories.pyc in read_csv(cls, **kwargs)
     53     @classmethod
     54     def read_csv(cls, **kwargs):
---> 55         return cls._determine_engine()._read_csv(**kwargs)
     56 
     57     @classmethod

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/data_management/factories.pyc in _read_csv(cls, **kwargs)
     57     @classmethod
     58     def _read_csv(cls, **kwargs):
---> 59         return cls.io_cls.read_csv(**kwargs)
     60 
     61     @classmethod

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/engines/ray/pandas_on_ray/io.pyc in read_csv(cls, filepath_or_buffer, sep, delimiter, header, names, index_col, usecols, squeeze, prefix, mangle_dupe_cols, dtype, engine, converters, true_values, false_values, skipinitialspace, skiprows, nrows, na_values, keep_default_na, na_filter, verbose, skip_blank_lines, parse_dates, infer_datetime_format, keep_date_col, date_parser, dayfirst, iterator, chunksize, compression, thousands, decimal, lineterminator, quotechar, quoting, escapechar, comment, encoding, dialect, tupleize_cols, error_bad_lines, warn_bad_lines, skipfooter, doublequote, delim_whitespace, low_memory, memory_map, float_precision)
    361             "float_precision": float_precision,
    362         }
--> 363         return cls._read(**kwargs)
    364 
    365     @classmethod

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/engines/ray/pandas_on_ray/io.pyc in _read(cls, filepath_or_buffer, **kwargs)
    431         else:
    432             return cls._read_csv_from_file_pandas_on_ray(
--> 433                 filepath_or_buffer, filtered_kwargs
    434             )
    435 

/home/ubuntu/anaconda2/lib/python2.7/site-packages/modin/engines/ray/pandas_on_ray/io.pyc in _read_csv_from_file_pandas_on_ray(cls, filepath, kwargs)
    172                 f.seek(0, os.SEEK_SET)  # Return to beginning of file
    173 
--> 174             prefix_id = ray.put(prefix)
    175             partition_kwargs_id = ray.put(partition_kwargs)
    176             # Skip the header since we already have the header information and skip the

/home/ubuntu/anaconda2/lib/python2.7/site-packages/ray/worker.pyc in put(value, worker)
   2227             # In LOCAL_MODE, ray.put is the identity operation.
   2228             return value
-> 2229         object_id = worker.raylet_client.compute_put_id(
   2230             worker.current_task_id,
   2231             worker.task_context.put_index,

AttributeError: 'Worker' object has no attribute 'raylet_client'```

@Austrie
Copy link

Austrie commented Mar 22, 2019

Nevermind, solved this error by doing:
pip uninstall ray
touch /home/ubuntu/anaconda2/lib/python2.7/site-packages/easy-install.pth
conda update setuptools
pip install -U ray-0.7.0.dev1-cp27-cp27mu-manylinux1_x86_64.whl

@maidousj
Copy link

Hi, guys. I have met the same issue. Finally fixed by change the directory "/tmp" in plasma_directory="/tmp" to a directory (disk) that has enough space (more than 40G). My root directory only has 10G left. I have no idea if this can help, but it fixed my problem. 😂

@devin-petersohn
Copy link
Collaborator

Closing this. Feel free to reopen if the discussion should continue or if issue was not resolved.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dependencies 🔗 Issues related to dependencies Memory 💾 Issues related to memory
Projects
None yet
Development

No branches or pull requests

8 participants