maxSimultaneousGames limited to 4000 #982

Open
hpckurt opened this issue Sep 10, 2024 · 2 comments

Comments

@hpckurt

hpckurt commented Sep 10, 2024

With the CUDA backend for distributed training, it always errors out when maxSimultaneousGames is set higher than 4000. NVIDIA H100s and H200s can easily handle more than 4000 simultaneous games. Can this limit be increased in the next version?

EDIT: This is using v1.15.3

@lightvector
Owner

Thanks for the feedback. What number of threads does the benchmark command conclude is optimal for you? And is this a setup with multiple GPUs, or only one?

@hpckurt
Author

hpckurt commented Sep 13, 2024

The benchmark tool says 320 is the optimal number of threads, but GPU utilization never grows beyond 10% and GPU memory usage barely hits 8 GB. That leaves a lot of performance on the table that just flat out isn't being used. This is an 8-GPU setup, but on a single GPU, the benchmark tool also says 320 is the optimal number.
