[serve] Cherry-pick fix for router queue length metric #38020

Merged
merged 1 commit into from
Aug 8, 2023

Commits on Aug 2, 2023

  1. [Serve] Decrement ray_serve_deployment_queued_queries when client disconnects (ray-project#37965)
    
    The `ray_serve_deployment_queued_queries` metric tracks the number of queries that have not yet been assigned a replica. If a client disconnects after the metric has counted its query but before the query has been assigned a replica, the query terminates but the metric is never decremented.
    
    This change decrements `ray_serve_deployment_queued_queries` when a client disconnects while its request is still queued.
    
    Signed-off-by: Edward Oakes <[email protected]>
    shrekris-anyscale authored and edoakes committed Aug 2, 2023
    97f4964
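
The behavior described in the commit message can be illustrated with a small, self-contained sketch. This is not the Ray Serve router code: `QueuedQueriesGauge`, `assign_replica`, and `simulate_disconnect` are hypothetical names, and a cancelled asyncio task is assumed as a stand-in for a client disconnect. It only shows the general pattern of pairing the increment with a decrement that also runs when the queued request is abandoned.

```python
import asyncio


class QueuedQueriesGauge:
    """Hypothetical stand-in for the ray_serve_deployment_queued_queries gauge."""

    def __init__(self) -> None:
        self.value = 0

    def inc(self) -> None:
        self.value += 1

    def dec(self) -> None:
        self.value -= 1


async def assign_replica(gauge: QueuedQueriesGauge, replica_available: asyncio.Event) -> str:
    """Wait for a replica; the query counts as queued only while waiting."""
    gauge.inc()
    try:
        await replica_available.wait()
        return "replica-0"  # hypothetical replica identifier
    finally:
        # Runs on normal assignment and on cancellation (modelled disconnect),
        # so the gauge does not keep counting a query that has gone away.
        gauge.dec()


async def simulate_disconnect() -> int:
    gauge = QueuedQueriesGauge()
    replica_available = asyncio.Event()  # never set: no replica frees up
    task = asyncio.create_task(assign_replica(gauge, replica_available))
    await asyncio.sleep(0)  # let the task start and increment the gauge
    assert gauge.value == 1
    task.cancel()  # model the client disconnecting while still queued
    try:
        await task
    except asyncio.CancelledError:
        pass
    return gauge.value


if __name__ == "__main__":
    print(asyncio.run(simulate_disconnect()))
```

Without the `finally` block, the gauge in this sketch would stay at 1 after the cancellation, which mirrors the stale metric value the commit message describes.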