You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The ray_serve_deployment_queued_queries metric tracks the number of ServeHandle requests that are queued for a particular deployment. It increments whenever a query starts being assigned, and it decrements once a query starts being processed.
However, when a client disconnects before a request is assigned to a replica, the request gets canceled, but the metric never decrements for that request. This causes the metric to increase over time as clients disconnect.
I expect the metric to decrement when a client disconnects and the request is canceled.
Versions / Dependencies
Ray on the latest master.
Reproduction script
# bad_metrics.pyimporttimeimportloggingfromrayimportservelogger=logging.getLogger("ray.serve")
@serve.deployment(max_concurrent_queries=100, # 1graceful_shutdown_timeout_s=0.0001,)defwait(self, *args):
logger.info("Started a request...")
time.sleep(100000)
logger.info("Finished a request...")
app=wait.bind()
Run this app on a Ray cluster that exposes prometheus metrics (e.g. ray start --head --metrics-export-port=8080). Submit a few curl requests from different terminal windows. Then kill the requests. The metric does not decrease.
Issue Severity
Medium: It is a significant difficulty but I can work around it.
The text was updated successfully, but these errors were encountered:
What happened + What you expected to happen
The
ray_serve_deployment_queued_queries
metric tracks the number of ServeHandle requests that are queued for a particular deployment. It increments whenever a query starts being assigned, and it decrements once a query starts being processed.However, when a client disconnects before a request is assigned to a replica, the request gets canceled, but the metric never decrements for that request. This causes the metric to increase over time as clients disconnect.
I expect the metric to decrement when a client disconnects and the request is canceled.
Versions / Dependencies
Ray on the latest
master
.Reproduction script
Run this app on a Ray cluster that exposes prometheus metrics (e.g.
ray start --head --metrics-export-port=8080
). Submit a fewcurl
requests from different terminal windows. Then kill the requests. The metric does not decrease.Issue Severity
Medium: It is a significant difficulty but I can work around it.
The text was updated successfully, but these errors were encountered: