Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TinyLlama exl2 quants for speculative decoding #159

Open
1 task
irthomasthomas opened this issue Dec 22, 2023 · 0 comments
Open
1 task

TinyLlama exl2 quants for speculative decoding #159

irthomasthomas opened this issue Dec 22, 2023 · 0 comments
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. llm Large Language Models llm-experiments experiments with large language models MachineLearning ML Models, Training and Inference ml-inference Running and serving ML models. Models LLM and ML model repos and links

Comments

@irthomasthomas
Copy link
Owner

EXL2 quants of TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T intended for use in speculative decoding.

@irthomasthomas irthomasthomas added inbox-url unclassified Choose this if none of the other labels (bar New Label) fit the content. llm Large Language Models llm-experiments experiments with large language models Algorithms Sorting, Learning or Classifying. All algorithms go here. MachineLearning ML Models, Training and Inference ml-inference Running and serving ML models. Models LLM and ML model repos and links labels Dec 22, 2023
@irthomasthomas irthomasthomas changed the title TinyLlama exl quants for speculative decoding TinyLlama exl2 quants for speculative decoding Dec 22, 2023
@irthomasthomas irthomasthomas removed the unclassified Choose this if none of the other labels (bar New Label) fit the content. label Dec 28, 2023
@ShellLM ShellLM removed the llama label May 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Algorithms Sorting, Learning or Classifying. All algorithms go here. llm Large Language Models llm-experiments experiments with large language models MachineLearning ML Models, Training and Inference ml-inference Running and serving ML models. Models LLM and ML model repos and links
Projects
None yet
Development

No branches or pull requests

2 participants