
Why is group_by_modality_length set to False in the custom_finetune.sh script? #89

Open
ggcr opened this issue Jun 23, 2024 · 3 comments

ggcr commented Jun 23, 2024

Hi,

I am exploring different experiments, such as training the projector only, and I've noticed that the default script provided for custom finetuning has the group_by_modality_length flag set to False.

DATA_PATH="/home/ai/data/llava/dataset/text_files/llava_v1_5_mix665k.json"
IMAGE_PATH="/home/ai/data/llava/dataset"
MODEL_MAX_LENGTH=3072
OUTPUT_DIR="/mnt/data/sata/yinghu/checkpoints/llava_factory/custom-finetune-TinyLLaVA-Phi-2-SigLIP-3.1B-lora"
deepspeed --include localhost:0,1,2,3 --master_port 29501 tinyllava/train/custom_finetune.py \
    --deepspeed ./scripts/zero2.json \
    --data_path $DATA_PATH \
    --image_folder $IMAGE_PATH \
    --is_multimodal True \
    --conv_version phi \
    --mm_vision_select_layer -2 \
    --image_aspect_ratio square \
    --fp16 True \
    --training_recipe lora \
    --tune_type_llm lora \
    --tune_type_vision_tower frozen \
    --tune_vision_tower_from_layer 0 \
    --tune_type_connector full \
    --lora_r 128 \
    --lora_alpha 256 \
    --group_by_modality_length False \
    --pretrained_model_path "tinyllava/TinyLLaVA-Phi-2-SigLIP-3.1B" \
    --output_dir $OUTPUT_DIR

Since the data path points to llava_v1_5_mix665k.json, this is not a uni-modal dataset: it contains conversations with the <image> multimodal token.
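For illustration, the two kinds of entries in that file look roughly like this (a hand-written sketch following the LLaVA conversation schema; the ids, paths, and text are made up):

{
  "id": "000000123456",
  "image": "coco/train2017/000000123456.jpg",
  "conversations": [
    {"from": "human", "value": "<image>\nWhat is shown in the picture?"},
    {"from": "gpt", "value": "A street scene with several buses."}
  ]
}
{
  "id": "sharegpt-000001",
  "conversations": [
    {"from": "human", "value": "Explain the difference between TCP and UDP."},
    {"from": "gpt", "value": "TCP is connection-oriented and reliable; UDP is connectionless."}
  ]
}

Only the first kind carries an "image" key and the <image> token; the second kind is pure text.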

Can you clarify when we should set the group_by_modality_length flag to True?

I am running into errors that are caused by this:

E.g.

File "/root/TinyLLaVA_Factory/tinyllava/train/tinyllava_trainer.py", line 47, in get_modality_length_grouped_indices
    lang_indices, lang_lengths = zip(*[(i, -l) for i, l in enumerate(lengths) if l < 0])
ValueError: not enough values to unpack (expected 2, got 0)
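
For context, the grouped sampler encodes modality in the sign of each sample's length: positive for samples with an image, negative for text-only ones. With a purely multimodal dataset the text-only partition is empty, and unpacking zip() over an empty list raises exactly this error. A minimal standalone sketch of the failure mode (the lengths are made up):

# Sign convention used by the modality-grouped sampler:
# positive length -> multimodal sample, negative length -> text-only sample.
lengths = [128, 256, 97]  # purely multimodal dataset: no negative entries

mm = [(i, l) for i, l in enumerate(lengths) if l > 0]
lang = [(i, -l) for i, l in enumerate(lengths) if l < 0]  # empty list

mm_indices, mm_lengths = zip(*mm)        # fine
lang_indices, lang_lengths = zip(*lang)  # ValueError: not enough values to unpack (expected 2, got 0)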

Thanks in advance.

ggcr closed this as completed Jun 24, 2024
ggcr reopened this Jun 25, 2024

ggcr commented Jun 25, 2024

There is an inconsistency: in some (full) finetuning scripts it is set to False, and in others to True.


ggcr commented Jun 25, 2024

I have a dataset of purely multimodal data, that is, every question ("from": "human") contains the <image> token. Hence, this error should not occur.

The LLaVA author addresses it in this issue.

I created a PR with the same fix that the main LLaVA repo currently uses.
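
For reference, the guard in upstream LLaVA looks roughly like this (paraphrased from llava/train/llava_trainer.py; get_length_grouped_indices is the plain length-grouped sampler used when only one modality is present):

def get_modality_length_grouped_indices(lengths, batch_size, world_size, generator=None):
    assert all(l != 0 for l in lengths), "Should not have zero length."
    if all(l > 0 for l in lengths) or all(l < 0 for l in lengths):
        # Single-modality dataset: fall back to plain length grouping instead of
        # splitting into an (empty) multimodal or text-only partition.
        return get_length_grouped_indices(lengths, batch_size, world_size, generator=generator)
    mm_indices, mm_lengths = zip(*[(i, l) for i, l in enumerate(lengths) if l > 0])
    lang_indices, lang_lengths = zip(*[(i, -l) for i, l in enumerate(lengths) if l < 0])
    # ... the rest of the mixed-modality shuffling is unchanged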

ZhangXJ199 (Collaborator) commented

Thanks for your question. group_by_modality_length needs to be set according to the data type: for data that contains only multimodal QA, group_by_modality_length needs to be set to False; for data that contains both multimodal QA and pure-text QA (such as SQA), group_by_modality_length can be set to True.
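
For context on how the trainer tells the modalities apart: the dataset exposes per-sample lengths whose sign encodes the modality. The logic is roughly the following (rewritten here as a standalone function for illustration; in the trainer it lives on the dataset as a modality_lengths property):

def modality_lengths(list_data_dict):
    """Rough per-sample lengths; the sign encodes modality:
    positive for samples with an 'image' key, negative for text-only samples."""
    lengths = []
    for sample in list_data_dict:
        cur_len = sum(len(conv['value'].split()) for conv in sample['conversations'])
        lengths.append(cur_len if 'image' in sample else -cur_len)
    return lengths

With mixed data both signs occur, so the grouped sampler can batch each modality separately; with multimodal-only data every length is positive, which is exactly the case where the unguarded zip(*[]) above fails.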
