
Add support for closed-source models in the Generalized Knowledge Distillation Trainer #2179

Open
imrankh46 opened this issue Oct 5, 2024 · 3 comments

@imrankh46

Feature request

Closed-source teacher model support for GKD, e.g. OpenAI GPT-4o, Claude, etc.

from datasets import Dataset
from trl import GKDConfig, GKDTrainer
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
)

NUM_DUMMY_SAMPLES = 100

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2-0.5B-Instruct")
# The model to optimise
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-0.5B-Instruct")
# The teacher model to calculate the KL divergence against
teacher_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2-1.5B-Instruct")

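# Toy conversational datasets in the "messages" chat format expected by GKDTrainer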
train_dataset = Dataset.from_dict(
    {
        "messages": [
            [
                {"role": "user", "content": "Hi, how are you?"},
                {"role": "assistant", "content": "I'm great thanks"},
            ]
        ]
        * NUM_DUMMY_SAMPLES
    }
)
eval_dataset = Dataset.from_dict(
    {
        "messages": [
            [
                {"role": "user", "content": "What colour is the sky?"},
                {"role": "assistant", "content": "The sky is blue"},
            ]
        ]
        * NUM_DUMMY_SAMPLES
    }
)

args = GKDConfig(output_dir="gkd-model", per_device_train_batch_size=1)
trainer = GKDTrainer(
    model=model,
    teacher_model=teacher_model,
    args=args,
    tokenizer=tokenizer,
    train_dataset=train_dataset,
    eval_dataset=eval_dataset,
)
trainer.train()

Motivation

Your contribution

@imrankh46
Author

@kashif @lewtun

@qgallouedec qgallouedec added the ✨ enhancement New feature or request label Oct 7, 2024
@kashif kashif added the 🏋 GKD Related to GKD label Oct 7, 2024
@kashif
Collaborator

kashif commented Oct 7, 2024

Since the logits/vocabulary need to match between the teacher and student models, I don't think it's possible to train with closed models.
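
For context, GKD's objective is a generalized Jensen–Shannon divergence between the student's and teacher's full per-token distributions, so both models must expose logits over the same vocabulary. Below is a minimal sketch of that computation (illustrative only, not TRL's exact implementation; beta in (0, 1) interpolates between the two KL directions):

import math

import torch
import torch.nn.functional as F


def generalized_jsd(student_logits, teacher_logits, beta=0.5):
    # Both tensors must be (batch, seq_len, vocab_size) over the SAME vocabulary;
    # a closed API that returns only text (or a handful of top logprobs) cannot supply this.
    student_log_probs = F.log_softmax(student_logits, dim=-1)
    teacher_log_probs = F.log_softmax(teacher_logits, dim=-1)
    # Log of the mixture M = beta * P_teacher + (1 - beta) * P_student
    mixture_log_probs = torch.logsumexp(
        torch.stack(
            [teacher_log_probs + math.log(beta), student_log_probs + math.log(1 - beta)]
        ),
        dim=0,
    )
    # KL(teacher || M) and KL(student || M), combined with weight beta
    kl_teacher = F.kl_div(mixture_log_probs, teacher_log_probs, log_target=True, reduction="batchmean")
    kl_student = F.kl_div(mixture_log_probs, student_log_probs, log_target=True, reduction="batchmean")
    return beta * kl_teacher + (1 - beta) * kl_student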

@August-murr
Contributor

Since the logits/vocabulary need to match between the teacher and student models, I don't think it's possible to train with closed models.

The Anthropic API doesn't output any logits or logprobs, and they have no plans to add them, while OpenAI only returns at most 20 top logprobs per token. It seems like they really don't want you to distill.
OpenAI recently announced a distillation service, but it's only for their own models and is not open source.
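
To make the limitation concrete, here is a minimal sketch using the OpenAI Python SDK (the model name is just an example): you can request at most 20 alternatives per token via top_logprobs, which is nowhere near the full vocabulary distribution (on the order of hundreds of thousands of entries for modern tokenizers) that a distillation loss needs.

from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o",  # example closed-source teacher
    messages=[{"role": "user", "content": "Hi, how are you?"}],
    logprobs=True,
    top_logprobs=20,  # 20 is the API's maximum; full-vocabulary logits are never exposed
)
first_token = response.choices[0].logprobs.content[0]
print(first_token.token, [(t.token, t.logprob) for t in first_token.top_logprobs])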

@qgallouedec qgallouedec added the ⏳ needs more info Additional information or clarification is required to proceed label Oct 8, 2024