We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How many gpus are needed to finetune? I have tried 16 PPUs (96GB each) but got CUDA OUT OF MEMROY
The text was updated successfully, but these errors were encountered:
try --enable_liger_kernel and --use_unsloth_gc
Sorry, something went wrong.
use_unsloth_gc
Thanks.
BTW, I have encounter an error : Triton Error [CUDA]: device kernel image is invalid when --enable_liger_kernel.
Here are some pkg info: triton==3.1.0 transformers==4.44.2 torch=2.3.0 CUDA SDK == 12.3.2
Any suggestions?
No branches or pull requests
How many gpus are needed to finetune? I have tried 16 PPUs (96GB each) but got CUDA OUT OF MEMROY
The text was updated successfully, but these errors were encountered: