TinyLlama exl2 quants for speculative decoding #159
Labels
Algorithms
Sorting, Learning or Classifying. All algorithms go here.
llm
Large Language Models
llm-experiments
experiments with large language models
MachineLearning
ML Models, Training and Inference
ml-inference
Running and serving ML models.
Models
LLM and ML model repos and links
royallab/TinyLlama-1.1B-ckpt-2.5T-exl2 · Hugging Face
TinyLlama-1.1B-ckpt-2.5T-exl2
EXL2 quants of TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T intended for use in speculative decoding.
The text was updated successfully, but these errors were encountered: