Skip to content
View ochougul's full-sized avatar
🧪
🧪

Block or report ochougul

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. quic/efficient-transformers quic/efficient-transformers Public

    This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transformers library) into inference-ready formats that run efficien…

    Python 39 26

  2. QLLM QLLM Public

    Forked from wejoncy/QLLM

    A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.

    Python

  3. wanda wanda Public

    Forked from locuslab/wanda

    A simple and effective LLM pruning approach.

    Python