Optimize mega flow by removing microservice wrapper #582

Spycsh · 2024-08-30T03:20:50Z

Description

We will remove the microservice wrappers (e.g. for LLM, TEI) and forward the messages directly to servers. This aims to help users define a whole pipeline more easily, decrease network overhead, and also improve scaling-in scaling-out capabilities.

Users can rewrite the align_inputs, align_outputs to align the format consistency between microservices.

Issues

opea-project/GenAIExamples#700

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Dependencies

None

Tests

Local Test

for more information, see https://pre-commit.ci

codecov · 2024-08-30T03:22:53Z

Codecov Report

Attention: Patch coverage is 80.95238% with 8 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
comps/cores/mega/orchestrator.py	80.00%	6 Missing ⚠️
comps/cores/mega/gateway.py	33.33%	2 Missing ⚠️

Files with missing lines	Coverage Δ
comps/cores/proto/docarray.py	`99.18% <100.00%> (+0.06%)`	⬆️
comps/cores/mega/gateway.py	`22.61% <33.33%> (-0.17%)`	⬇️
comps/cores/mega/orchestrator.py	`91.13% <80.00%> (-1.62%)`	⬇️

lkk12014402 · 2024-09-03T08:27:57Z

link this pr: #324

for more information, see https://pre-commit.ci

* update tgi version Signed-off-by: Xinyao Wang <[email protected]> * add k8s for faq Signed-off-by: Xinyao Wang <[email protected]> * add benchmark for faq Signed-off-by: Xinyao Wang <[email protected]> * refine k8s for faq Signed-off-by: Xinyao Wang <[email protected]> * add tuning for faq Signed-off-by: Xinyao Wang <[email protected]> * add prompts with different length for faq Signed-off-by: Xinyao Wang <[email protected]> * add tgi docker for llama3.1 Signed-off-by: Xinyao Wang <[email protected]> * remove useless code Signed-off-by: Xinyao Wang <[email protected]> * remove nodeselector Signed-off-by: Xinyao Wang <[email protected]> * remove hg token Signed-off-by: Xinyao Wang <[email protected]> * refine code structure Signed-off-by: Xinyao Wang <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix readme Signed-off-by: Xinyao Wang <[email protected]> --------- Signed-off-by: Xinyao Wang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

Spycsh added 2 commits August 29, 2024 19:15

refactor orchestrator

eb78b98

fix

397d9ec

Spycsh requested a review from lvliang-intel as a code owner August 30, 2024 03:20

[pre-commit.ci] auto fixes from pre-commit.com hooks

8c296c7

for more information, see https://pre-commit.ci

Spycsh marked this pull request as draft August 30, 2024 03:23

Spycsh mentioned this pull request Aug 30, 2024

Add megaservice definition without microservice wrappers opea-project/GenAIExamples#700

Merged

4 tasks

Spycsh and others added 5 commits August 29, 2024 23:37

remove no_wrapper

f3a448e

fix

15e2d2d

fix

944d8b6

add align_gen

2ad86ad

Merge branch 'main' into rm_wrapper

4ab33e9

Spycsh marked this pull request as ready for review September 3, 2024 02:22

hshen14 requested a review from lkk12014402 September 3, 2024 11:13

Spycsh and others added 6 commits September 3, 2024 22:07

add retriever and rerank params

668f76c

[pre-commit.ci] auto fixes from pre-commit.com hooks

8155a30

for more information, see https://pre-commit.ci

add fake test for customize params

d5ec7ca

[pre-commit.ci] auto fixes from pre-commit.com hooks

8b19573

for more information, see https://pre-commit.ci

fix dep

a4a2ff6

Merge branch 'main' into rm_wrapper

e6c39df

lkk12014402 approved these changes Sep 4, 2024

View reviewed changes

lvliang-intel approved these changes Sep 4, 2024

View reviewed changes

lvliang-intel merged commit 0bb69ac into opea-project:main Sep 4, 2024
11 checks passed

lvliang-intel mentioned this pull request Sep 6, 2024

Fix LVM streaming issue #637

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize mega flow by removing microservice wrapper #582

Optimize mega flow by removing microservice wrapper #582

Spycsh commented Aug 30, 2024 •

edited

Loading

codecov bot commented Aug 30, 2024 •

edited

Loading

lkk12014402 commented Sep 3, 2024

Optimize mega flow by removing microservice wrapper #582

Optimize mega flow by removing microservice wrapper #582

Conversation

Spycsh commented Aug 30, 2024 • edited Loading

Description

Issues

Type of change

Dependencies

Tests

codecov bot commented Aug 30, 2024 • edited Loading

Codecov Report

lkk12014402 commented Sep 3, 2024

Spycsh commented Aug 30, 2024 •

edited

Loading

codecov bot commented Aug 30, 2024 •

edited

Loading