You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am exploring on doing different experiments, by training the projector only, and I've seen that the default provided script for custom finetuning has the group_by_modality_length flag set to False.
Because it has the llava_v1_5_mix665k.json as Data Path, it is not a uni-modal dataset, as it contains conversations with the <image> mm token.
Can you clarify when we should set the group_by_modality_length flag to True?
I am running in errors that are due to this:
E.g.
File "/root/TinyLLaVA_Factory/tinyllava/train/tinyllava_trainer.py", line 47, in get_modality_length_grouped_indices
lang_indices, lang_lengths = zip(*[(i, -l) for i, l in enumerate(lengths) if l < 0])
ValueError: not enough values to unpack (expected 2, got 0)
Thanks in advance.
The text was updated successfully, but these errors were encountered:
Thanks for your question, group_by_modality_length needs to be set according to the data type: for data that only contains multimodal QA, group_by_modality_length needs to be set to False; for data that contains both multimodal QA and pure text QA (such as SQA), group_by_modality_length can be set to True.
Hi,
I am exploring on doing different experiments, by training the projector only, and I've seen that the default provided script for custom finetuning has the
group_by_modality_length
flag set to False.TinyLLaVA_Factory/scripts/train/custom_finetune.sh
Lines 1 to 24 in f5c2ded
Because it has the
llava_v1_5_mix665k.json
as Data Path, it is not a uni-modal dataset, as it contains conversations with the<image>
mm token.Can you clarify when we should set the
group_by_modality_length
flag to True?I am running in errors that are due to this:
E.g.
Thanks in advance.
The text was updated successfully, but these errors were encountered: