[Feature]: microsoft/Phi-3-vision-128k-instruct Vision support #4958

pseudotensor · 2024-05-21T17:46:24Z

🚀 The feature, motivation and pitch

https://huggingface.co/microsoft/Phi-3-vision-128k-instruct

Alternatives

No response

Additional context

vLLM is somewhat behind in vision support. idefics2 is supported by TGI, and LLaVA-NeXT has been out for months and is not supported yet. There is a PR; is it close?

Isotr0py · 2024-05-22T15:52:55Z

vLLM's multi-modality support is still being refactored:

[RFC]: Multi-modality Support Refactoring #4194

So we need to wait for some necessary refactoring work (like ImageProcessor support) to be finished before we add new vision models.

pseudotensor added the feature request label May 21, 2024

Isotr0py mentioned this issue May 22, 2024

[Model] Initialize Phi-3-vision support #4986

Merged

3 tasks

DarkLight1337 mentioned this issue May 31, 2024

[RFC]: Multi-modality Support Refactoring #4194

Open

81 tasks

ywang96 closed this as completed in #4986 Jun 18, 2024
