[Serve] Add http request latency #32839

sihanwang41 · 2023-02-25T00:50:50Z

Why are these changes needed?

Add http request latency metrics.

Related issue number

Closes #32711

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

doc/source/serve/production-guide/monitoring.md

python/ray/serve/_private/http_proxy.py

architkulkarni · 2023-02-28T17:40:09Z

python/ray/serve/tests/test_metrics.py

@@ -56,9 +56,9 @@ def verify_metrics(do_assert=False):
            "serve_deployment_request_counter",
            "serve_deployment_replica_starts",
            # histogram
-            "deployment_processing_latency_ms_bucket",
-            "deployment_processing_latency_ms_count",
-            "deployment_processing_latency_ms_sum",


Do you know why this test was passing before if these metrics didn't exist yet?

Sorry I misunderstood, this is a different metric. Still want to know if it was passing or not running though

It is same metric, the reason it can pass before because we check whether the deployment_processing_latency_ms_sum string is inside the metrics blob.
For this change, I just want to make the test check more restrict.

Ah makes sense!

architkulkarni

Looks good to me pending Edward's suggestions, and it would be good to double check that the test is running in CI

Signed-off-by: Sihan Wang <[email protected]>

shrekris-anyscale · 2023-03-01T18:05:05Z

doc/source/serve/production-guide/monitoring.md

@@ -279,6 +279,9 @@ The following metrics are exposed by Ray Serve:
       * error_code
       * method
     - The number of non-200 HTTP responses returned by each deployment.
+   * - ``serve_http_request_latency_ms`` [*]
+     - * endpoint
+     - The end-to-end latency of HTTP requests (measured from the Serve HTTP proxy).


Suggested change

- The end-to-end latency of HTTP requests (measured from the Serve HTTP proxy).

- A histogram of end-to-end latencies for HTTP requests (measured from the Serve HTTP proxy).

I will stick to the original one since it is following our current words convention.

sihanwang41 · 2023-03-01T20:15:01Z

@edoakes ping for merge!

Add http request latency metrics. Signed-off-by: Jack He <[email protected]>

Add http request latency metrics. Signed-off-by: Edward Oakes <[email protected]>

Add http request latency metrics.

Add http request latency metrics. Signed-off-by: elliottower <[email protected]>

Add http request latency metrics. Signed-off-by: Jack He <[email protected]>

sihanwang41 force-pushed the http_proxy_latency branch from fd015c9 to ff3fdc5 Compare February 26, 2023 20:38

sihanwang41 marked this pull request as ready for review February 28, 2023 17:27

sihanwang41 requested review from edoakes, simon-mo, jiaodong, shrekris-anyscale, architkulkarni and a team as code owners February 28, 2023 17:27

sihanwang41 assigned sihanwang41, edoakes, zcin and shrekris-anyscale Feb 28, 2023

edoakes reviewed Feb 28, 2023

View reviewed changes

doc/source/serve/production-guide/monitoring.md Outdated Show resolved Hide resolved

python/ray/serve/_private/http_proxy.py Outdated Show resolved Hide resolved

python/ray/serve/_private/http_proxy.py Outdated Show resolved Hide resolved

architkulkarni reviewed Feb 28, 2023

View reviewed changes

architkulkarni approved these changes Feb 28, 2023

View reviewed changes

sihanwang41 force-pushed the http_proxy_latency branch from e4bff52 to 265bba0 Compare February 28, 2023 18:39

edoakes approved these changes Feb 28, 2023

View reviewed changes

sihanwang41 added 2 commits March 1, 2023 08:48

[Serve] Add http request latency

146a8d8

Signed-off-by: Sihan Wang <[email protected]>

Address comments

5d89517

Signed-off-by: Sihan Wang <[email protected]>

sihanwang41 force-pushed the http_proxy_latency branch from 265bba0 to 5d89517 Compare March 1, 2023 16:54

shrekris-anyscale approved these changes Mar 1, 2023

View reviewed changes

edoakes merged commit 0c02ae8 into ray-project:master Mar 1, 2023

ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request Mar 21, 2023

[Serve] Add http request latency (ray-project#32839)

a1e1c32

Add http request latency metrics. Signed-off-by: Jack He <[email protected]>

edoakes pushed a commit to edoakes/ray that referenced this pull request Mar 22, 2023

[Serve] Add http request latency (ray-project#32839)

44d2433

Add http request latency metrics. Signed-off-by: Edward Oakes <[email protected]>

peytondmurray pushed a commit to peytondmurray/ray that referenced this pull request Mar 22, 2023

[Serve] Add http request latency (ray-project#32839)

66542b2

Add http request latency metrics.

elliottower pushed a commit to elliottower/ray that referenced this pull request Apr 22, 2023

[Serve] Add http request latency (ray-project#32839)

f40874e

Add http request latency metrics. Signed-off-by: elliottower <[email protected]>

ProjectsByJackHe pushed a commit to ProjectsByJackHe/ray that referenced this pull request May 4, 2023

[Serve] Add http request latency (ray-project#32839)

6ad00b3

Add http request latency metrics. Signed-off-by: Jack He <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Serve] Add http request latency #32839

[Serve] Add http request latency #32839

sihanwang41 commented Feb 25, 2023 •

edited

Loading

architkulkarni Feb 28, 2023

architkulkarni Feb 28, 2023

sihanwang41 Feb 28, 2023

architkulkarni Feb 28, 2023

architkulkarni left a comment

shrekris-anyscale Mar 1, 2023

sihanwang41 Mar 1, 2023

sihanwang41 commented Mar 1, 2023

	- The end-to-end latency of HTTP requests (measured from the Serve HTTP proxy).
	- A histogram of end-to-end latencies for HTTP requests (measured from the Serve HTTP proxy).

[Serve] Add http request latency #32839

[Serve] Add http request latency #32839

Conversation

sihanwang41 commented Feb 25, 2023 • edited Loading

Why are these changes needed?

Related issue number

Checks

architkulkarni Feb 28, 2023

Choose a reason for hiding this comment

architkulkarni Feb 28, 2023

Choose a reason for hiding this comment

sihanwang41 Feb 28, 2023

Choose a reason for hiding this comment

architkulkarni Feb 28, 2023

Choose a reason for hiding this comment

architkulkarni left a comment

Choose a reason for hiding this comment

shrekris-anyscale Mar 1, 2023

Choose a reason for hiding this comment

sihanwang41 Mar 1, 2023

Choose a reason for hiding this comment

sihanwang41 commented Mar 1, 2023

sihanwang41 commented Feb 25, 2023 •

edited

Loading