Issues: Lightning-AI/litgpt
Issues list
use initial_checkpoint_dir for continue-pretraining but can't load model correctly
documentation
Improvements or additions to documentation
enhancement
New feature or request
question
Further information is requested
#1729
opened Sep 18, 2024 by
wodelt
Question about tie_embeddings
question
Further information is requested
#1727
opened Sep 14, 2024 by
twaka
Is Support for the DeepSeek v2.5 model on the roadmap?
checkpoints
#1723
opened Sep 12, 2024 by
ColumbusAI
Cannot attend to 9904, block size is only 4096
question
Further information is requested
#1717
opened Sep 11, 2024 by
starjob42
"RuntimeError: All the chunks should have been deleted." on non-Studio machine
bug
Something isn't working
#1716
opened Sep 10, 2024 by
rasbt
Llama 3.1 RoPE not implemented -- Finetuning will lead to spelling/grammar mistakes
bug
Something isn't working
#1713
opened Sep 7, 2024 by
calvintwr
Data Loading bug in pretrain on resume over multiple epochs
bug
Something isn't working
#1712
opened Sep 7, 2024 by
fdalvi
Training not working with default script
bug
Something isn't working
#1698
opened Aug 28, 2024 by
ByteBrigand
adding a UI for the training and the finetuning
enhancement
New feature or request
#1683
opened Aug 20, 2024 by
Esmail-ibraheem
Llama3 finetuning and generation: Double begin_of_text, no eot_id
bug
Something isn't working
#1682
opened Aug 20, 2024 by
sanderland
attention mask is incorrect when generate with softcapping
bug
Something isn't working
#1672
opened Aug 13, 2024 by
twaka
Tensor parallelism generates non-sensical outputs
bug
Something isn't working
#1663
opened Aug 8, 2024 by
rasbt
access hidden layer(s) from a model
question
Further information is requested
#1642
opened Jul 30, 2024 by
Byungsooo
Implement prompt caching to speed up inference
enhancement
New feature or request
#1638
opened Jul 27, 2024 by
rasbt
Skip safetensors->bin file conversion
enhancement
New feature or request
#1625
opened Jul 24, 2024 by
rasbt
Support downloading and using quantized weights (GGUF)
enhancement
New feature or request
#1616
opened Jul 23, 2024 by
rasbt
adding Aya model (checkpoints) to the supported checkpoints list that litgpt support
enhancement
New feature or request
#1614
opened Jul 23, 2024 by
Esmail-ibraheem
Pretrain then finetune on multiple GPUs error
bug
Something isn't working
#1613
opened Jul 23, 2024 by
ZeguanXiao