Optimize mega flow by removing microservice wrapper #582

Merged
14 commits merged on Sep 4, 2024

Conversation

Spycsh
Member

@Spycsh Spycsh commented Aug 30, 2024

Description

This PR removes the microservice wrappers (e.g. for LLM and TEI) and forwards messages directly to the underlying servers. The goal is to make it easier for users to define a whole pipeline, to reduce network overhead, and to improve scale-in and scale-out capabilities.

Users can override align_inputs and align_outputs to keep data formats consistent between microservices.
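To illustrate the idea of the two hooks, here is a minimal, self-contained sketch. It does not use the project's real classes: `MiniOrchestrator`, `ChatOrchestrator`, `fake_llm_server`, the node name `"llm"`, and the simplified hook signatures are all hypothetical, and the request/response shapes (`inputs`, `parameters`, `generated_text`) are assumed to resemble what a text-generation server expects. The actual signatures in comps/cores/mega/orchestrator.py may differ.

```python
from typing import Any, Callable, Dict

Payload = Dict[str, Any]


class MiniOrchestrator:
    """Hypothetical stand-in for a pipeline orchestrator (not the project's class)."""

    def align_inputs(self, inputs: Payload, node: str) -> Payload:
        # Default hook: pass the payload through unchanged.
        return inputs

    def align_outputs(self, data: Payload, node: str) -> Payload:
        # Default hook: return the server response unchanged.
        return data

    def call(self, node: str, inputs: Payload, server: Callable[[Payload], Payload]) -> Payload:
        # Without a wrapper service, the orchestrator itself adapts the format
        # right before and right after talking to the server.
        request = self.align_inputs(inputs, node)
        response = server(request)
        return self.align_outputs(response, node)


class ChatOrchestrator(MiniOrchestrator):
    def align_inputs(self, inputs: Payload, node: str) -> Payload:
        if node == "llm":
            # Assumed server-side request shape; adjust to the real server.
            return {
                "inputs": inputs["query"],
                "parameters": {"max_new_tokens": inputs.get("max_new_tokens", 128)},
            }
        return inputs

    def align_outputs(self, data: Payload, node: str) -> Payload:
        if node == "llm":
            # Map the assumed server response back to the pipeline's own format.
            return {"text": data["generated_text"]}
        return data


def fake_llm_server(request: Payload) -> Payload:
    # Stand-in for an HTTP call to the model server.
    return {"generated_text": "echo: " + request["inputs"]}


result = ChatOrchestrator().call("llm", {"query": "hi"}, fake_llm_server)
print(result)
```

In this sketch the format conversion that a wrapper microservice used to perform lives in the two overridden methods, so the request takes one network hop to the server instead of two.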

Issues

opea-project/GenAIExamples#700

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Dependencies

None

Tests

Local Test


codecov bot commented Aug 30, 2024

Codecov Report

Attention: Patch coverage is 80.95238% with 8 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| comps/cores/mega/orchestrator.py | 80.00% | 6 Missing ⚠️ |
| comps/cores/mega/gateway.py | 33.33% | 2 Missing ⚠️ |

| Files with missing lines | Coverage Δ |
|---|---|
| comps/cores/proto/docarray.py | 99.18% <100.00%> (+0.06%) ⬆️ |
| comps/cores/mega/gateway.py | 22.61% <33.33%> (-0.17%) ⬇️ |
| comps/cores/mega/orchestrator.py | 91.13% <80.00%> (-1.62%) ⬇️ |

@Spycsh Spycsh marked this pull request as ready for review September 3, 2024 02:22
@lkk12014402
Collaborator

link this pr: #324

@lvliang-intel lvliang-intel merged commit 0bb69ac into opea-project:main Sep 4, 2024
11 checks passed
@lvliang-intel lvliang-intel mentioned this pull request Sep 6, 2024
4 tasks
lkk12014402 pushed a commit that referenced this pull request Sep 19, 2024
* update tgi version

Signed-off-by: Xinyao Wang <[email protected]>

* add k8s for faq

Signed-off-by: Xinyao Wang <[email protected]>

* add benchmark for faq

Signed-off-by: Xinyao Wang <[email protected]>

* refine k8s for faq

Signed-off-by: Xinyao Wang <[email protected]>

* add tuning for faq

Signed-off-by: Xinyao Wang <[email protected]>

* add prompts with different length for faq

Signed-off-by: Xinyao Wang <[email protected]>

* add tgi docker for llama3.1

Signed-off-by: Xinyao Wang <[email protected]>

* remove useless code

Signed-off-by: Xinyao Wang <[email protected]>

* remove nodeselector

Signed-off-by: Xinyao Wang <[email protected]>

* remove hg token

Signed-off-by: Xinyao Wang <[email protected]>

* refine code structure

Signed-off-by: Xinyao Wang <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix readme

Signed-off-by: Xinyao Wang <[email protected]>

---------

Signed-off-by: Xinyao Wang <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
3 participants