Remove JSON config mangling for Gemma ckpt #124

Merged
lsy323 merged 1 commit into AI-Hypercomputer:main from lsiyuan/update-gemma-convert on Jun 13, 2024

Conversation

@lsy323 (Collaborator) commented on Jun 12, 2024

The fix for the invalid JSON config file in the HF Gemma PyTorch checkpoints has been merged upstream, so we no longer need to patch the invalid JSON in the convert script.

https://huggingface.co/google/gemma-7b-it-pytorch/discussions/2#6667fe530ec7ed4422eb070c
https://huggingface.co/google/gemma-7b-pytorch/discussions/2#6667fdbf647001c39240a47e
https://huggingface.co/google/gemma-2b-pytorch/discussions/2#6667fdab51545a8b46c3a121

The other Gemma PyTorch checkpoints are being handled by HF staff as well.
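
For context, the mangling being removed was a workaround that patched the config on the fly before parsing it. The sketch below is hypothetical: the function name and the assumption that the breakage was Python-style single quotes are illustrative only and do not reflect the actual code in convert_checkpoints.

import json

def load_gemma_config(path: str) -> dict:
    # With the upstream HF checkpoints fixed, config.json parses directly.
    with open(path) as f:
        raw = f.read()
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        # Hypothetical repair of an invalid config (e.g. single-quoted keys);
        # this is the kind of fallback the convert script no longer needs.
        return json.loads(raw.replace("'", '"'))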

Test

export input_ckpt_dir=/mnt/disks/lsiyuan/gemma_weight/gemma-7b-pytorch-it
export output_ckpt_dir=/mnt/disks/lsiyuan/gemma_weight/gemma-7b-pytorch-it-bf16
export model_name="gemma"
export quantize_weights=False
export quantize_type="int8_per_channel"
export from_hf=False
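# Convert the Gemma 7B (it) PyTorch checkpoint to bf16, with quantization disabled.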
python -m convert_checkpoints --model_name=$model_name \
    --input_checkpoint_dir=$input_ckpt_dir \
    --output_checkpoint_dir=$output_ckpt_dir \
    --quantize_weights=$quantize_weights \
    --quantize_type=$quantize_type \
    --from_hf=$from_hf
export tokenizer_path=/mnt/disks/lsiyuan/gemma_weight/gemma-7b-pytorch-it/tokenizer.model
export size="7b"
export quantize_weights=False
export quantize_activation=False
export quantize_kv_cache=False

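# Run interactive decoding against the converted checkpoint to sanity-check generation.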
python run_interactive.py --model_name=$model_name --size=$size --batch_size=2 --max_cache_length=2048 \
    --checkpoint_path=$output_ckpt_dir \
    --tokenizer_path=$tokenizer_path \
    --quantize_kv_cache=$quantize_kv_cache \
    --quantize_weights=$quantize_weights \
    --quantize_type=$quantize_type

@lsy323 requested a review from qihqi on June 12, 2024 22:26
@lsy323 changed the title from "update gemma convert" to "Remove JSON config mangling for Gemma ckpt" on Jun 12, 2024
@wang2yn84 (Collaborator) commented:

Thank you very much for raising the issue and getting it solved!

@lsy323 merged commit fe8dbde into AI-Hypercomputer:main on Jun 13, 2024
4 checks passed
@lsy323 deleted the lsiyuan/update-gemma-convert branch on June 13, 2024 17:20
@wang2yn84 requested review from wang2yn84 and removed request for wang2yn84 on June 13, 2024 17:21