Release v0.2.6 · casper-hansen/AutoAWQ

What's Changed

Cohere Support by @TechxGenus in #457
Add phi3 support by @pprp in #481
Support Weight-Only quantization on CPU device with QBits backend by @PenghuiCheng in #437
Fix typo by @wanyaworld in #486
Add updates + sponsorship by @casper-hansen in #495
Update README.md by @casper-hansen in #497
Update doc by @imba-tjd in #499
add support for Openbmb/MiniCPM by @LDLINGLINGLING in #504
Update RunPod support by @casper-hansen in #514
add deepseek v2 support by @TechxGenus in #508
NaN problem of Qwen2-72B quantization by @baoyf4244 in #519
Qwen NaN fix by @baoyf4244 in #522
fix deepseek v2 input feat by @TechxGenus in #524
Batched quantization by @casper-hansen in #516
Fix step size when computing clipping by @casper-hansen in #531
Pin torch version to 2.3.1 by @devin-ai-integration in #542
Revert "Pin torch version to 2.3.1 (#542)" by @casper-hansen in #547
CLI example + Runpod launch script by @casper-hansen in #548
Print warning if AutoAWQ cannot load extensions by @casper-hansen in #515
Remove progress bars by @casper-hansen in #550
Add test for chunked methods by @casper-hansen in #551
Llama with inputs_embeds only (LLaVA-v1.5 bug fixed) and LLaVA-v1.6 support by @WanBenLe in #471
Better CLI + RunPod Script by @casper-hansen in #552
Release 026 by @casper-hansen in #546
pin torch==2.3.1 by @casper-hansen in #554
Remove ROCm build and only build for PyPI by @casper-hansen in #555

Full Changelog: v0.2.5...v0.2.6