Are all video-based checkpoints trained with 2 tokens? #82

haodi19 · 2024-04-12T16:17:12Z

Hello, thank you for your great work. I noticed that in the open checkpoints, every checkpoint trained on video data has its compress type set to 'mean' (or 'mean_concat', though I couldn't find the corresponding logic in the code). Are all video-based checkpoints trained with 2 tokens, regardless of whether the training data is short or long videos?
