
Support for AWQ quantization in TGI #59

Open
nigue3025 opened this issue May 14, 2024 · 1 comment

Comments


nigue3025 commented May 14, 2024

Hi,
When I deploy the 13B version in TGI, it works fine with bitsandbytes quantization.
However, when I try AWQ quantization in TGI, it fails with the error "Cannot load 'awq' weight, make sure the model is already quantized".
I am wondering whether AWQ is not yet supported for this model when deploying with TGI,
or if there is any suggestion or comment?
Thanks
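
For context, TGI's AWQ mode does not quantize weights on the fly; it expects a checkpoint that was already quantized with AWQ, which is what that error message is pointing at. Below is a minimal sketch of producing such a checkpoint with the AutoAWQ library; the paths and the quantization config values are placeholders/assumptions, not the exact settings used for this model:

```python
# Sketch: quantize an fp16 checkpoint to AWQ with AutoAWQ.
# Paths and quant_config values are placeholders.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "path/to/original-13b-model"      # hypothetical source checkpoint
quant_path = "path/to/original-13b-model-awq"  # where the AWQ weights will be written

quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# Run AWQ calibration and quantize the weights
model.quantize(tokenizer, quant_config=quant_config)

# Save the quantized checkpoint so a serving framework can load it directly
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```

The resulting directory (or an already-quantized -awq repo from the Hub) is what should be passed to TGI together with its `--quantize awq` option.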

@nigue3025 nigue3025 changed the title Support of AWQ quantization in TGI Support for AWQ quantization in TGI May 14, 2024
@adamlin120
Collaborator

For quantized models, I have only tried AWQ with vLLM. You can find the -awq model on my Hugging Face.
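
For reference, loading an already-quantized AWQ checkpoint with vLLM's offline API typically looks like the sketch below; the model id is a placeholder, not an actual repository name:

```python
# Sketch: run an AWQ-quantized checkpoint with vLLM.
# The model id below is a placeholder for an -awq repo on the Hub.
from vllm import LLM, SamplingParams

llm = LLM(model="your-hf-username/model-13b-awq", quantization="awq")

params = SamplingParams(temperature=0.7, max_tokens=128)
outputs = llm.generate(["Hello, how are you?"], params)
print(outputs[0].outputs[0].text)
```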
