Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support dequantizing GGUF FP16 format #31783

Merged
merged 5 commits into from
Jul 24, 2024
Merged

Commits on Jul 3, 2024

  1. support gguf fp16

    PenutChen committed Jul 3, 2024
    Configuration menu
    Copy the full SHA
    5430803 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    02261d3 View commit details
    Browse the repository at this point in the history

Commits on Jul 24, 2024

  1. Configuration menu
    Copy the full SHA
    66f391b View commit details
    Browse the repository at this point in the history
  2. add gguf f16 test

    PenutChen committed Jul 24, 2024
    Configuration menu
    Copy the full SHA
    2c42437 View commit details
    Browse the repository at this point in the history
  3. remove bf16

    PenutChen committed Jul 24, 2024
    Configuration menu
    Copy the full SHA
    a831005 View commit details
    Browse the repository at this point in the history