Skip to content

v0.3.1: Patch Release

Compare
Choose a tag to compare
@shimizust shimizust released this 01 Oct 20:55
· 63 commits to main since this release
1520999

Summary

This patch release brings important updates and fixes to Liger-Kernel. Notable changes include:

  • KLDiv calculation fix: KLDiv now functions correctly with larger vocab sizes
  • SwiGLU/GeGLU casting fix: Program IDs are now cast to int64 in SwiGLU/GeGLU kernels to prevent memory errors with larger dimensions.
  • AutoLigerKernelForCausalLM fix: The model now properly passes through all original keyword arguments
  • Post-init model patching fix: Fix to post-init model patching to ensure HF Trainer integration works correctly
  • Relaxed transformers dependency: Improve compatibility with a broader range of versions.

What's Changed

New Contributors

Full Changelog: v0.3.0...v0.3.1