[HPU] [Serve] [experimental] Add vllm HPU support in vllm example #45893

KepingYan · 2024-06-12T09:00:08Z

Why are these changes needed?

This PR adds vLLM HPU support to the vLLM example (#45430). The added code checks whether an HPU device exists before allocating resources to the vLLM actors. If one exists, HPU resources are used; otherwise, GPU resources are used as before.
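The selection logic described above could look roughly like the following. This is a minimal sketch, not the code merged in this PR: the helper names `pick_accelerator` and `build_bundles` are hypothetical, and the bundle layout (one CPU-only bundle plus one accelerator bundle per tensor-parallel worker) is an assumption. The resources dict is assumed to have the shape returned by `ray.cluster_resources()`, with HPUs reported under the `"HPU"` key.

```python
from typing import Dict, List


def pick_accelerator(cluster_resources: Dict[str, float]) -> str:
    """Return "HPU" if the cluster reports any HPU resource, else "GPU".

    Hypothetical helper: `cluster_resources` is assumed to look like the
    dict returned by `ray.cluster_resources()`.
    """
    return "HPU" if cluster_resources.get("HPU", 0) > 0 else "GPU"


def build_bundles(
    cluster_resources: Dict[str, float], tensor_parallel_size: int
) -> List[Dict[str, float]]:
    """Build placement-group bundles for the vLLM actors.

    Assumed layout: one CPU-only bundle for the deployment replica,
    followed by one bundle per tensor-parallel worker that requests
    a single accelerator of the selected kind.
    """
    accelerator = pick_accelerator(cluster_resources)
    bundles: List[Dict[str, float]] = [{"CPU": 1.0}]
    bundles += [
        {"CPU": 1.0, accelerator: 1.0} for _ in range(tensor_parallel_size)
    ]
    return bundles
```

In a Serve application, the resulting list would typically be passed as `placement_group_bundles` when configuring the deployment; treat that wiring as an assumption to verify against the example file.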

Related issue number

Checks

- I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
  - I've added any new APIs to the API Reference. For example, if I added a
    method in Tune, I've added it in doc/source/tune/api/ under the
    corresponding .rst file.
- I've made sure the tests are passing. Note that there might be a few flaky tests; see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

Signed-off-by: KepingYan <[email protected]>

can-anyscale

It has been several weeks. @akshay-anyscale, @edoakes, do you mind having a look? Thanks.

doc/source/serve/doc_code/vllm_openai_example.py

Signed-off-by: KepingYan <[email protected]>

KepingYan requested review from edoakes, shrekris-anyscale, zcin, GeneDer, akshay-anyscale and a team as code owners June 12, 2024 09:00

KepingYan added 5 commits June 14, 2024 15:34

add hpu support for vllm example

a4b502b

Signed-off-by: KepingYan <[email protected]>

upd doc

2e806ce

Signed-off-by: KepingYan <[email protected]>

fix lint check

2ac6fb8

Signed-off-by: KepingYan <[email protected]>

Merge remote-tracking branch 'upstream/master' into add_vllm_hpu

83eb712

add device param

37bfddb

Signed-off-by: KepingYan <[email protected]>

KepingYan force-pushed the add_vllm_hpu branch from df9a271 to 37bfddb Compare June 14, 2024 08:22

fix ci

b3abb07

Signed-off-by: KepingYan <[email protected]>

KepingYan force-pushed the add_vllm_hpu branch from 2992c7d to b3abb07 Compare June 14, 2024 09:41

KepingYan changed the title ~~[HPU] [Serve] Add vllm HPU support in vllm example~~ [HPU] [Serve] [experimental] Add vllm HPU support in vllm example Jun 21, 2024

can-anyscale assigned akshay-anyscale and edoakes Jun 27, 2024

can-anyscale reviewed Jul 2, 2024

Merge branch 'master' into add_vllm_hpu

e68d823

akshay-anyscale approved these changes Jul 2, 2024

doc/source/serve/doc_code/vllm_openai_example.py (outdated, resolved)

doc/source/serve/doc_code/vllm_openai_example.py (outdated, resolved)

KepingYan added 3 commits July 4, 2024 17:18

address comments

2f5bfa5

Signed-off-by: KepingYan <[email protected]>

Merge remote-tracking branch 'origin/add_vllm_hpu' into add_vllm_hpu

094acc0

fix ci

48f2a32

Signed-off-by: KepingYan <[email protected]>

anyscalesam added the serve (Ray Serve Related Issue) label Jul 15, 2024

edoakes added the go (add ONLY when ready to merge, run all tests) label Jul 16, 2024

KepingYan added 2 commits August 13, 2024 18:03

merge latest code

7bae18f

compatible with the latest vLLM version

cd391ed

Signed-off-by: KepingYan <[email protected]>

KepingYan closed this Aug 13, 2024

KepingYan force-pushed the add_vllm_hpu branch from 1ff2528 to 0e03169 Compare August 13, 2024 11:50

KepingYan reopened this Aug 13, 2024

KepingYan force-pushed the add_vllm_hpu branch from 1ff2528 to cd391ed Compare August 13, 2024 12:01

fix lint

7395f9e

Signed-off-by: KepingYan <[email protected]>

akshay-anyscale reviewed Aug 13, 2024

doc/source/serve/doc_code/vllm_openai_example.py (outdated, resolved)

KepingYan added 6 commits August 14, 2024 09:50

remove hpu env check

4af9ca1

Signed-off-by: KepingYan <[email protected]>

Merge remote-tracking branch 'upstream/master' into add_vllm_hpu

8863b19

remove unused package

de9eefd

Signed-off-by: KepingYan <[email protected]>

Merge remote-tracking branch 'upstream/master' into add_vllm_hpu

31d211c

Merge remote-tracking branch 'upstream/master' into add_vllm_hpu

ceaa8a9

Merge remote-tracking branch 'upstream/master' into add_vllm_hpu

3068ced

anyscalesam merged commit c46c2e5 into ray-project:master Aug 19, 2024
4 of 5 checks passed
