-
Notifications
You must be signed in to change notification settings - Fork 138
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training on the cloud / multiple instances / clusters #50
Comments
Thanks for your interest in AlphaZero.jl! AlphaZero.jl itself does not deal with any kind of cluster setup. It just gets a list of available workers using the Distributed module and splits the work equally between them. What's nice with JuliaHub is that it takes care of the details of configuring a cluster and spawning remote processes, but I guess it should not be hard to configure the system to work on your own cluster: see the documentation. I am not familiar with the package you linked to but it looks like a replacement for Distributed so it is may not be what you want here. If you want more general advice on running Julia code that relies on Distributed.jl on Azure (as is the case of AlphaZero), I would advise you to ask on Discourse or on the Julia Slack. :-) |
Sweet! This looks great - Also while I have you can I ask two super quick questions - Sorry - I'm really new to reinforcement learning and julia, kinda working this out as I go. |
Programming in |
Having
I am not sure I understand the question here. What are you calling "board"? In your case, if both players have inventories, these inventories should be part of the state. |
Sorry - Going off the examples state is board and player. |
Awesome awesome thank you so much! |
Speaking of the grid-world example, I steered away from it since it used CommonRLInterface as opposed to AlphaZero.GI so I wasn't entirely sure and it functioned incredibly different than the other examples. |
I agree that this example looks pretty different on the surface but remember that AlphaZero.jl only provides a thin wrapper over CommonRLInterface.jl. Therefore, it should not be too hard to translate the example so that it uses Good luck using AlphaZero on your game and please don't hesitate to report back about your results or experience! |
Any tips for running this on Azure without paying Julia hubs insane premium?
I'm trying to leverage spot pricing which is about 1/10th-1/20th the cost of juliahubs pricing.
I found this:
https://github.com/microsoft/AzureClusterlessHPC.jl
I'm not entirely sure how exactly Juliahub handles running this code on multiple machines together... Is there a command or something to connect multiple instances together or something built in similar to Ray? Or will this be an incredibly painful process of setting up the code for use with that previous github I linked?
The text was updated successfully, but these errors were encountered: