VLMs: small clean-up for cache class #32417

zucchini-nlp · 2024-08-05T04:32:11Z

What does this PR do?

VideoLLaVa was generating garbage in beam search after moving to the new cache format, as reported by some users. This PR fixes it: past_key_values is no longer None at any generation step, except when caching isn't enabled at all.

Also a little clean-up on VLMs: after removing the old cache format for all major LLMs, we no longer need the reorder cache method.
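
For context, here is a rough sketch of what reordering an old-format (tuple-of-tuples) cache during beam search amounts to. It is illustrative only: plain Python lists stand in for tensors, one entry per beam, and the name reorder_legacy_cache is made up for this example rather than taken from the library.

```python
from typing import List, Sequence, Tuple

# Toy stand-in for one layer of an old-format cache: (keys, values).
# Each "tensor" is a list with one entry per beam (the batch dimension).
LayerCache = Tuple[List[str], List[str]]


def reorder_legacy_cache(
    past_key_values: Sequence[LayerCache], beam_idx: Sequence[int]
) -> Tuple[Tuple[List[str], ...], ...]:
    """Hypothetical helper: for every layer, keep the rows of the selected beams."""
    return tuple(
        tuple([state[i] for i in beam_idx] for state in layer)
        for layer in past_key_values
    )


# Two layers, three beams.
past = (
    (["k0_b0", "k0_b1", "k0_b2"], ["v0_b0", "v0_b1", "v0_b2"]),
    (["k1_b0", "k1_b1", "k1_b2"], ["v1_b0", "v1_b1", "v1_b2"]),
)

# After a beam search step, beams 2, 2 and 0 are the ones that continue.
reordered = reorder_legacy_cache(past, [2, 2, 0])
```

Once every model goes through the cache class, a per-model hook like this is what the clean-up drops.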

HuggingFaceDocBuilderDev · 2024-08-05T04:50:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts

Thanks for fixing @zucchini-nlp!

Are there slow tests which should have caught this? If so, can we run them here to make sure they all pass? If not, can we add some?

ArthurZucker

I only see the cleanup but LGTM

zucchini-nlp · 2024-08-16T04:06:07Z

Yep, after resolving conflicts the fix wasn't needed anymore. I'll rename the PR and merge it :)

fix beam search in video llava

e9f4e08

zucchini-nlp requested a review from LysandreJik August 5, 2024 04:32

zucchini-nlp mentioned this pull request Aug 5, 2024

Video-LLaVa now available in the Transformers library! PKU-YuanGroup/Video-LLaVA#156

Open

zucchini-nlp requested a review from ArthurZucker August 9, 2024 05:43

zucchini-nlp changed the title ~~VideLLaVA: fix beam search~~ VideLLaVA: fix generation Aug 14, 2024

zucchini-nlp mentioned this pull request Aug 14, 2024

Video-LLaVA-7B-hf doesn't work (returns nonsense) #32655

Closed

4 tasks

amyeroberts reviewed Aug 14, 2024


zucchini-nlp added the run-slow label Aug 14, 2024

zucchini-nlp added 2 commits August 14, 2024 12:13

[run-slow] video_llava

cc0c37b

Merge branch 'main' into videollava

2ffd344

ArthurZucker approved these changes Aug 15, 2024


zucchini-nlp changed the title ~~VideLLaVA: fix generation~~ VLMs; small clean-up after cache class Aug 16, 2024

zucchini-nlp changed the title ~~VLMs; small clean-up after cache class~~ VLMs; small clean-up for cache class Aug 16, 2024

zucchini-nlp changed the title ~~VLMs; small clean-up for cache class~~ VLMs: small clean-up for cache class Aug 16, 2024

zucchini-nlp merged commit f3c8b18 into huggingface:main Aug 16, 2024
21 checks passed
