-
-
Notifications
You must be signed in to change notification settings - Fork 8.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Grid searching with dask interface. #5567
Comments
This issue is fixed now. In case of EC2 machines, we need to open ports for communication and run dask workers on these ports so that they can communicate with the scheduler. |
My current setup is: The script is running but only 2% of cpu is being used for both workers. Using the script posted above. Why the full cpu power of worker is not being used ? |
Grid searching for dask is not yet supported. We need a specialized implementation. |
It was working here #5451 (comment) |
No, it's not working correctly. We need a specialized implementation that can split data locally on each worker. I will seek to implement it on dask ml once 1.1 is sorted out. If you are interested in doing so, feel free to let me know I will explain more details. :-) |
@trivialfis Trying without gridsearch. Still not working.
Commands to set up the cluster:
On Machine 2
|
What's the error message? |
The error is that only one worker is being used and that too with just 2% CPU usage. On 1st worker's terminal ( worker which is on the same machine as scheduler)
Remote worker's terminal shows connection timed out after 5-10 minutes and restarts. |
Could you check what dask is doing on the web interface? Also we have some bug fixes for dask on master branch. |
I am facing a similar issue. I am trying to add dask support for a project
The function I am using ^
Messages in the output (with the master branch) Have I done something wrong? (I am posting this here because I have the same error log and under-utilization of CPU (around 40%) like the OP) |
We need to start distributed training as soon as possible. How can I contribute ? |
Sorry, completely missed this.... To enable grid searching with dask, we need a grid search implementation that is aware of MPI based algorithm. I'm hoping I can jump into dask ml, but so many things piled up. |
Prefer #6525 instead. |
Hey guys,
We want to run dask xgboost on a cluster of EC2 machines. I've 2 EC2 machines with 2 cores and 4GB ram each. I am trying to run this script on the cluster.
When I run
clf.fit()
, I get a lot of the following messages in the worker logs:What am I missing here ?
The text was updated successfully, but these errors were encountered: