What's Changed
- Cohere Support by @TechxGenus in #457
- Add phi3 support by @pprp in #481
- Support Weight-Only quantization on CPU device with QBits backend by @PenghuiCheng in #437
- Fix typo by @wanyaworld in #486
- Add updates + sponsorship by @casper-hansen in #495
- Update README.md by @casper-hansen in #497
- Update doc by @imba-tjd in #499
- Add support for OpenBMB/MiniCPM by @LDLINGLINGLING in #504
- Update RunPod support by @casper-hansen in #514
- Add DeepSeek V2 support by @TechxGenus in #508
- Fix NaN problem in Qwen2-72B quantization by @baoyf4244 in #519
- Qwen NaN fix by @baoyf4244 in #522
- Fix DeepSeek V2 input feat by @TechxGenus in #524
- Batched quantization by @casper-hansen in #516
- Fix step size when computing clipping by @casper-hansen in #531
- Pin torch version to 2.3.1 by @devin-ai-integration in #542
- Revert "Pin torch version to 2.3.1 (#542)" by @casper-hansen in #547
- CLI example + Runpod launch script by @casper-hansen in #548
- Print warning if AutoAWQ cannot load extensions by @casper-hansen in #515
- Remove progress bars by @casper-hansen in #550
- Add test for chunked methods by @casper-hansen in #551
- Llama with inputs_embeds only (LLaVA-v1.5 bug fixed) and LLaVA-v1.6 support by @WanBenLe in #471
- Better CLI + RunPod Script by @casper-hansen in #552
- Release 026 by @casper-hansen in #546
- pin torch==2.3.1 by @casper-hansen in #554
- Remove ROCm build and only build for PyPI by @casper-hansen in #555
New Contributors
- @pprp made their first contribution in #481
- @PenghuiCheng made their first contribution in #437
- @wanyaworld made their first contribution in #486
- @imba-tjd made their first contribution in #499
- @LDLINGLINGLING made their first contribution in #504
- @baoyf4244 made their first contribution in #519
- @devin-ai-integration made their first contribution in #542
- @WanBenLe made their first contribution in #471
Full Changelog: v0.2.5...v0.2.6