LlamaIndex Multi_Modal_Llms Integration: Huggingface #16133

g-hano · 2024-09-20T20:27:51Z

Description

This PR adds a LlamaIndex integration for Hugging Face multimodal (vision-language) models, so they can be used from LlamaIndex for tasks such as image captioning and visual question answering.

Fixes #16056

Features

Seamless integration of Hugging Face multimodal models with LlamaIndex
Support for multiple state-of-the-art vision-language models and their finetunes:
Easy-to-use interface for multimodal tasks like image captioning and visual question answering
Configurable model parameters for fine-tuned performance
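
The idea in the feature list above of several vision-language models behind one interface can be sketched as a small factory. This is a minimal illustration under stated assumptions, not the package's actual code. Every class and function name here (`VisionModelStub`, `QwenStyleStub`, `LlamaStyleStub`, `from_model_name`) is hypothetical, the prompt formats are placeholders, and no model is loaded. The model IDs are used only as example strings.

```python
from dataclasses import dataclass
from typing import Dict, List, Type


@dataclass
class VisionModelStub:
    """Hypothetical shared interface; a stand-in, not the package's real class."""

    model_name: str
    max_new_tokens: int = 256  # example of a configurable generation parameter

    def build_prompt(self, prompt: str, image_paths: List[str]) -> str:
        raise NotImplementedError

    def complete(self, prompt: str, image_paths: List[str]) -> str:
        # A real backend would run the model here. The stub only returns the
        # formatted request so the sketch runs without any downloads.
        return self.build_prompt(prompt, image_paths)


class QwenStyleStub(VisionModelStub):
    def build_prompt(self, prompt: str, image_paths: List[str]) -> str:
        # Placeholder format: one tag per image, then the text prompt.
        tags = "".join(f"<image:{path}>" for path in image_paths)
        return f"{tags}{prompt}"


class LlamaStyleStub(VisionModelStub):
    def build_prompt(self, prompt: str, image_paths: List[str]) -> str:
        # Placeholder format: image count prefix, then the text prompt.
        return f"[{len(image_paths)} image(s)] {prompt}"


# Hypothetical registry: substring of the model ID -> handler class.
REGISTRY: Dict[str, Type[VisionModelStub]] = {
    "qwen2-vl": QwenStyleStub,
    "llama-3.2": LlamaStyleStub,
}


def from_model_name(model_name: str, **kwargs) -> VisionModelStub:
    """Pick a handler class from the model ID and pass parameters through."""
    lowered = model_name.lower()
    for key, cls in REGISTRY.items():
        if key in lowered:
            return cls(model_name=model_name, **kwargs)
    raise ValueError(f"Unsupported model: {model_name}")


model = from_model_name("Qwen/Qwen2-VL-2B-Instruct", max_new_tokens=64)
print(model.complete("Describe this image.", ["cat.png"]))
```

Callers see the same `complete` call regardless of which model family is selected, and per-model differences such as prompt formatting stay inside the subclass.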

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

Yes
No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

Yes
No

Type of Change

Please delete options that are not relevant.

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Please describe the tests that you ran to verify your changes. Provide instructions so we can reproduce. Please also list any relevant details for your test configuration.

Added new unit/integration tests
Added new notebook (that tests end-to-end)
I stared at the code and made sure it makes sense

Suggested Checklist:

I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
I have added Google Colab support for the newly added notebooks.
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I ran make format; make lint to appease the lint gods

review-notebook-app · 2024-09-20T20:27:56Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

g-hano added 5 commits September 20, 2024 22:41

Add files via upload

f1b2de8

add comments

af6356f

huggingface multimodal llm example

6939407

huggingface multimodal llm example

1638c6a

Update README.md

b15fa8a

dosubot bot added the size:XL label (This PR changes 500-999 lines, ignoring generated files.) Sep 20, 2024

g-hano mentioned this pull request Sep 20, 2024

[Question]: How to use VLMs from HuggingFace for Multimodal rag? #16056

Closed

1 task

g-hano added 3 commits September 20, 2024 23:59

Merge branch 'main' into main

625f415

Fix

3b155e5

Create BUILD

3ad0590

logan-markewich reviewed Sep 23, 2024

llama-index-integrations/multi_modal_llms/llama-index-multi-modal-llms-huggingface/README.md (outdated, resolved)

logan-markewich reviewed Sep 23, 2024

llama-index-integrations/multi_modal_llms/llama-index-multi-modal-llms-huggingface/README.md (outdated, resolved)

g-hano and others added 17 commits September 24, 2024 08:17

fix imports

34b6fe4

linting

4ac57eb

build files

d30dfda

deps

ba14350

forgot about notebook

6042e5c

more deps

648d0a6

Add llama3.2 multimodal class

2aa184a

linting

3e3bd0d

Delete llama-index-integrations/multi_modal_llms/llama-index-multi-modal-llms-huggingface/llama_index/multi_modal_llms/huggingface/__pycache__ directory

2f189cb

Update test_multi_modal_llms_huggingface.py

e546b43

add qwen-vl-utils dependency

7be71e7

resolve linting

0154206

delete example notebook

e388226

add dependencies

28984b2

Update pyproject.toml

1324e2c

Update BUILD

0328174

Update BUILD

772f684

logan-markewich approved these changes Sep 29, 2024

dosubot bot added the lgtm label (This PR has been approved by a maintainer) Sep 29, 2024

logan-markewich enabled auto-merge (squash) September 29, 2024 20:48

logan-markewich added 2 commits September 30, 2024 12:08

tailor

777a9fe

fix tests

cb6a390

logan-markewich merged commit dc04c69 into run-llama:main Oct 1, 2024
10 checks passed

raspawar pushed a commit to raspawar/llama_index that referenced this pull request Oct 7, 2024

LlamaIndex Multi_Modal_Llms Integration: Huggingface (run-llama#16133)

e47e59d
