[Bugfix][Model] Add base class for vision-language models #4809

DarkLight1337 · 2024-05-14T10:35:33Z

This PR adds a base class VLMBase to avoid importing LlavaForConditionalGeneration in vllm/model_executor/model_loader/loader.py, thus solving #4807.

Along the way, I have also ported the improved error handling logic regarding image_feature_size for LLaVA model.

FIX #4807

vllm/model_executor/models/base.py

vllm/model_executor/models/llava.py

rkooo567

I think the change looks good to me. cc @simon-mo

rkooo567 · 2024-05-16T08:11:59Z

vllm/model_executor/models/llava.py

@@ -172,7 +174,7 @@ def forward(
                image_features = image_input
            vision_embeddings = self.multi_modal_projector(image_features)
            inputs_embeds = self.language_model.get_input_embeddings(input_ids)
-            _merge_vision_embeddings(
+            inputs_embeds = _merge_vision_embeddings(


Is this a bug?

No, it is just to make explicit the fact that inputs_embeds is modified.

vllm/model_executor/models/vlm_base.py

ywang96

LGTM - thanks for the fix!

DarkLight1337 · 2024-05-17T02:14:00Z

@simon-mo The models-test keeps getting interrupted, causing the CI to fail.

rkooo567

let me retry one more time

WoosukKwon · 2024-05-19T06:57:12Z

@DarkLight1337 @ywang96 @rkooo567 Is this PR ready for merge?

DarkLight1337 · 2024-05-19T06:59:55Z

@DarkLight1337 @ywang96 @rkooo567 Is this PR ready for merge?

Yes.

…ct#4809)

DarkLight1337 added 2 commits May 14, 2024 10:23

Update LLaVA model with is_vlm tag and clean up type annotations

50bb1a6

Add import test

df2765d

DarkLight1337 changed the title ~~[Bugfix][Model] Use ClassVar to indicate vision models~~ [Bugfix][Model] Use ClassVar to indicate vision models and improve error handling when incorrect image_feature_size is passed May 14, 2024

simon-mo assigned ywang96 May 14, 2024

Merge branch 'upstream' into vlm-tag

3e478d5

DarkLight1337 changed the title ~~[Bugfix][Model] Use ClassVar to indicate vision models and improve error handling when incorrect image_feature_size is passed~~ [Bugfix][Model] Add base class for vision-language models. May 15, 2024

DarkLight1337 force-pushed the vlm-tag branch from 4ef4287 to 77a97b0 Compare May 15, 2024 03:29

DarkLight1337 changed the title ~~[Bugfix][Model] Add base class for vision-language models.~~ [Bugfix][Model] Add base class for vision-language models May 15, 2024

Use a base class instead of ClassVar

12eea44

DarkLight1337 force-pushed the vlm-tag branch from 77a97b0 to 12eea44 Compare May 15, 2024 03:47

hiyouga mentioned this pull request May 15, 2024

[Bugfix] Avoid circular import in model loader #4828

Closed

ywang96 reviewed May 15, 2024

View reviewed changes

vllm/model_executor/models/base.py Outdated Show resolved Hide resolved

ywang96 reviewed May 15, 2024

View reviewed changes

vllm/model_executor/models/llava.py Show resolved Hide resolved

Rename base.py -> vlm_base.py

c14812d

rkooo567 reviewed May 16, 2024

View reviewed changes

DarkLight1337 added 2 commits May 16, 2024 08:19

Merge branch 'upstream' into vlm-tag

8bbb2d3

Make naming more explicit

7607c89

ywang96 approved these changes May 16, 2024

View reviewed changes

rkooo567 approved these changes May 17, 2024

View reviewed changes

WoosukKwon merged commit f68470e into vllm-project:main May 19, 2024
55 checks passed

robertgshaw2-neuralmagic pushed a commit to neuralmagic/nm-vllm that referenced this pull request May 19, 2024

[Bugfix][Model] Add base class for vision-language models (vllm-proje…

c79bcb7

…ct#4809)

DarkLight1337 deleted the vlm-tag branch May 20, 2024 02:11

dtrifiro pushed a commit to dtrifiro/vllm that referenced this pull request May 21, 2024

[Bugfix][Model] Add base class for vision-language models (vllm-proje…

fc3cc45

…ct#4809)

DarkLight1337 mentioned this pull request May 24, 2024

[Model] Add base class for LoRA-supported models #5018

Merged

Temirulan pushed a commit to Temirulan/vllm-whisper that referenced this pull request Sep 6, 2024

[Bugfix][Model] Add base class for vision-language models (vllm-proje…

bcb49a9

…ct#4809)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bugfix][Model] Add base class for vision-language models #4809

[Bugfix][Model] Add base class for vision-language models #4809

DarkLight1337 commented May 14, 2024 •

edited

Loading

rkooo567 left a comment

rkooo567 May 16, 2024

DarkLight1337 May 16, 2024

ywang96 left a comment

DarkLight1337 commented May 17, 2024

rkooo567 left a comment

WoosukKwon commented May 19, 2024

DarkLight1337 commented May 19, 2024

[Bugfix][Model] Add base class for vision-language models #4809

[Bugfix][Model] Add base class for vision-language models #4809

Conversation

DarkLight1337 commented May 14, 2024 • edited Loading

rkooo567 left a comment

Choose a reason for hiding this comment

rkooo567 May 16, 2024

Choose a reason for hiding this comment

DarkLight1337 May 16, 2024

Choose a reason for hiding this comment

ywang96 left a comment

Choose a reason for hiding this comment

DarkLight1337 commented May 17, 2024

rkooo567 left a comment

Choose a reason for hiding this comment

WoosukKwon commented May 19, 2024

DarkLight1337 commented May 19, 2024

DarkLight1337 commented May 14, 2024 •

edited

Loading