
add benchmark for fix length input and output #5857

Merged: 26 commits, Jul 7, 2024

Conversation

haichuan1221
Contributor

vLLM only supports benchmarking with the sonnet and sharegpt datasets, where the input and output lengths are not fixed. In this contribution, I use random sampling to benchmark with fixed input and output lengths.

@ywang96 ywang96 self-assigned this Jun 26, 2024
@DarkLight1337
Member

To speed up the CI queue, I've cancelled the distributed tests for the latest CI run in this PR since they won't pass anyway until #5905 has been merged. Now that it has been merged, please merge main into your branch so that the CI can pass once again.

@haichuan1221
Contributor Author


I have merged main into my branch.

Member

@ywang96 ywang96 left a comment


Hey @haichuan1221! Sorry for the late review and thank you for the contribution!

Overall LGTM, and I have left a few comments/suggestions.

@@ -185,6 +184,29 @@ def sample_sonnet_requests(
return sampled_requests


def sample_random_requests(input_len, output_len, num_prompts, range_ratio,
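For context, here is a minimal self-contained sketch of how such a random sampler can produce (near-)fixed-length requests. The token-ID generation, the `vocab_size` default, and the return type are illustrative assumptions for this sketch, not the exact merged vLLM code:

```python
import random
from typing import List, Tuple


def sample_random_requests(
    input_len: int,
    output_len: int,
    num_prompts: int,
    range_ratio: float,
    vocab_size: int = 32000,  # assumed placeholder, not vLLM's value
) -> List[Tuple[List[int], int, int]]:
    """Generate prompts of (near-)fixed length from random token IDs.

    Lengths are drawn uniformly from [int(len * range_ratio), len],
    so range_ratio=1.0 yields exactly fixed input/output lengths.
    Returns (prompt_token_ids, input_len, output_len) tuples.
    """
    requests: List[Tuple[List[int], int, int]] = []
    for _ in range(num_prompts):
        in_len = random.randint(int(input_len * range_ratio), input_len)
        out_len = random.randint(int(output_len * range_ratio), output_len)
        # Random token IDs stand in for real text prompts.
        prompt_token_ids = [random.randrange(vocab_size) for _ in range(in_len)]
        requests.append((prompt_token_ids, in_len, out_len))
    return requests
```

With `range_ratio=1.0` every request has exactly `input_len` input tokens and `output_len` output tokens, which is the fixed-length benchmark this PR is about.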
Member


Please add type hints for the parameters in the function signature.

Contributor Author


Just fixed it, please check again.

Member


Updated the formatting issues for you. For future reference, you can run the format.sh script we provide in the repo to easily format your code changes.

Contributor Author


OK, I will. Thanks for the tip.

parser.add_argument("--random-input-len",
type=int,
default=1024,
help="random sample input length")
Member


Suggested change
help="random sample input length")
help="Number of randomly sampled input tokens per request, used only for random dataset")

parser.add_argument("--random-output-len",
type=int,
default=128,
help="random sample output length")
Member


Please update the help message per suggestion above.

parser.add_argument("--random-range-ratio",
type=float,
default=1.0,
help="random sample range ratio")
Member


Please update the help message per suggestion above.
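Putting the reviewer's three suggestions together, the argument definitions might read as follows. This is a sketch of the suggested help texts, not necessarily the final merged wording:

```python
import argparse

parser = argparse.ArgumentParser(description="vLLM serving benchmark (sketch)")
parser.add_argument(
    "--random-input-len",
    type=int,
    default=1024,
    help="Number of randomly sampled input tokens per request, "
    "used only for the random dataset.")
parser.add_argument(
    "--random-output-len",
    type=int,
    default=128,
    help="Number of output tokens per request, "
    "used only for the random dataset.")
parser.add_argument(
    "--random-range-ratio",
    type=float,
    default=1.0,
    help="Ratio of the lower bound to the fixed length when sampling "
    "input/output lengths; 1.0 keeps lengths exactly fixed.")

# Parse with no CLI arguments to demonstrate the defaults.
args = parser.parse_args([])
```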

@ywang96 ywang96 enabled auto-merge (squash) July 7, 2024 05:59
@ywang96 ywang96 merged commit 333306a into vllm-project:main Jul 7, 2024
70 checks passed
xjpang pushed a commit to xjpang/vllm that referenced this pull request Jul 24, 2024