adding embedding support for CLIP based models for VideoRAGQnA example for v0.9 #538

Merged: 28 commits merged into opea-project:main on Sep 4, 2024

srinarayan-srikanthan
Collaborator

Description

Add support for CLIP-based embeddings for the multimodal VideoRAG use case.

Issues

List the issue or RFC link this PR is working on. If there is no such link, please mark it as n/a.
The RFC is under review: opea-project/docs#49

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

List the newly introduced 3rd party dependency if exists.

Tests

Describe the tests that you ran to verify your changes.

@srinarayan-srikanthan
Collaborator Author

@chensuyue, can you please tag this PR with the 'V0.9' milestone?


codecov bot commented Aug 21, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Files with missing lines Coverage Δ
comps/cores/proto/docarray.py 99.12% <100.00%> (+<0.01%) ⬆️

@srinarayan-srikanthan changed the title from "adding embedding support for CLIP based models for VideoRAGQnA example" to "adding embedding support for CLIP based models for VideoRAGQnA example for v0.9" on Aug 21, 2024
@BaoHuiling
Collaborator

@XuhuiRen @lvliang-intel Could you help review the code? Thanks.

@lvliang-intel
Collaborator

@srinarayan-srikanthan,
Please also check why the example test failed.

@BaoHuiling added this to the v1.0 milestone on Aug 29, 2024
@BaoHuiling mentioned this pull request on Aug 29, 2024 (3 tasks)
@ttrigui

ttrigui commented Aug 29, 2024

@lvliang-intel, the example test failure seems to be unrelated to the PR changes. This test seems to fail in another PR too.

@XuhuiRen
Collaborator

Please fix the naming conflict with #575 before merging.

@BaoHuiling
Collaborator

For the failed test "Example-test", here is the comment from @chensuyue:

The test failed at the megaservice step because the output from curl did not match the expected keyword.
[image]

To fix this, please run this script locally to see what happens. Note that you need to modify line 19 to check out the branch under test (for example, git clone -b sri-clip-embedding https://github.com/siddhivelankar23/GenAIComps.git).
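
The kind of check described above (comparing a service response against an expected keyword) can be sketched roughly as follows. This is only an illustrative Python sketch, not the project's actual test script; the function names, the step name, and the sample strings are all hypothetical.

```python
def response_contains_keyword(response_text: str, expected_keyword: str) -> bool:
    """Return True when the expected keyword appears in the response body."""
    return expected_keyword in response_text


def check_step(step_name: str, response_text: str, expected_keyword: str) -> str:
    """Build a pass/fail message for one validation step (hypothetical helper)."""
    if response_contains_keyword(response_text, expected_keyword):
        return f"[{step_name}] passed"
    return (
        f"[{step_name}] failed: expected keyword "
        f"{expected_keyword!r} not found in output"
    )


if __name__ == "__main__":
    # Hypothetical response bodies, standing in for the curl output.
    print(check_step("megaservice", '{"text": "hello world"}', "hello"))
    print(check_step("megaservice", '{"error": "timeout"}', "hello"))
```

Printing the full response next to the failure message usually makes it easier to see whether the service returned an error or simply different wording than the test expects.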

Signed-off-by: srinarayan-srikanthan <[email protected]>
for more information, see https://pre-commit.ci

Signed-off-by: Sun, Xuehao <[email protected]>
* fix get file error & refine logs

Signed-off-by: letonghan <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: letonghan <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: srinarayan-srikanthan <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: srinarayan-srikanthan <[email protected]>
BaoHuiling and others added 12 commits September 1, 2024 10:36
Signed-off-by: BaoHuiling <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: srinarayan-srikanthan <[email protected]>
Signed-off-by: XuhuiRen <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
Signed-off-by: Xinyu Ye <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lkk <[email protected]>
Co-authored-by: test <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: Letong Han <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
Signed-off-by: chensuyue <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
* Use Gaudi base images from Dockerhub

Signed-off-by: Abolfazl Shahbazi <[email protected]>

* Fixing the malformed tag

Signed-off-by: Abolfazl Shahbazi <[email protected]>

* fix another malformed tag

Signed-off-by: Abolfazl Shahbazi <[email protected]>

---------

Signed-off-by: Abolfazl Shahbazi <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
* Add toxicity detection microservice

Signed-off-by: Qun Gao <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* Modification to toxicity plugin PR  (opea-project#432)

* changed microservice to use Service.GUARDRAILS and input/output to TextDoc

Signed-off-by: Tyler Wilbers <[email protected]>

* simplify dockerfile to use langchain

Signed-off-by: Tyler Wilbers <[email protected]>

* sort requirements

Signed-off-by: Tyler Wilbers <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Tyler Wilbers <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

* Minor SPDX header update (opea-project#434)

Signed-off-by: Abolfazl Shahbazi <[email protected]>

* Remove 'langsmith' per code review (opea-project#534)

Signed-off-by: Abolfazl Shahbazi <[email protected]>

* Add toxicity detection microservices with E2E testing

Signed-off-by: Qun Gao <[email protected]>

---------

Signed-off-by: Qun Gao <[email protected]>
Signed-off-by: Tyler Wilbers <[email protected]>
Signed-off-by: Abolfazl Shahbazi <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <[email protected]>
Co-authored-by: Tyler W <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
Signed-off-by: BaoHuiling <[email protected]>
Signed-off-by: srinarayan-srikanthan <[email protected]>
for more information, see https://pre-commit.ci

@XuhuiRen
Collaborator

XuhuiRen commented Sep 3, 2024

@BaoHuiling @srinarayan-srikanthan please consider choosing a better name to distinguish the folder from your previous PR, https://github.com/opea-project/GenAIComps/tree/main/comps/embeddings/multimodal_embeddings/multimodal_langchain, from the folder "comps/embeddings/langchain_multimodal/" in this PR. The two names are too similar. Each name should clearly represent what the folder contains; otherwise, it will cause confusion.

@ttrigui

ttrigui commented Sep 3, 2024

@lvliang-intel, can we please merge this PR since all issues are resolved?

Signed-off-by: srinarayan-srikanthan <[email protected]>
@yinghu5 self-requested a review on September 4, 2024 00:54
@yinghu5 merged commit 2a53e25 into opea-project:main on Sep 4, 2024
19 checks passed
lkk12014402 pushed a commit that referenced this pull request Sep 19, 2024