[PERF] Update number of cores on every iteration #1480
Conversation
LGTM, thank you!
```python
# This call takes about 0.3ms and hits a locally in-memory cached record of cluster resources
cores: int = int(ray.cluster_resources()["CPU"]) - self.reserved_cores
max_inflight_tasks = cores + self.max_task_backlog

while True:  # Loop: Dispatch (get tasks -> batch dispatch).
    tasks_to_dispatch: list[PartitionTask] = []
```
You might even want to do it here; this is where batches are dispatched up to the limit.
GitHub UI is being unclear, but I mean the last line of that block, 458/456.
Moved it into the inner loop and guarded it with a TTL.
Codecov Report
Additional details and impacted files:

```diff
@@           Coverage Diff           @@
##             main    #1480   +/-   ##
========================================
+ Coverage   74.70%   74.86%   +0.15%
========================================
  Files          60       60
  Lines        6061     6102      +41
========================================
+ Hits         4528     4568      +40
- Misses       1533     1534       +1
```
Updates the number of cores available before/after every batch dispatch.
This should let us take better advantage of Ray cluster autoscaling: as the cluster scales up, we will schedule larger batches of tasks and allow more total inflight tasks.
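The refresh described above (re-reading the core count inside the dispatch loop, guarded by a TTL so it runs at most once per interval) can be sketched roughly as follows. This is a hedged illustration, not the PR's actual code: `CoreCountCache`, `fetch_cores`, and `ttl_seconds` are hypothetical names, and `fetch_cores` stands in for a call like `ray.cluster_resources()["CPU"]`.

```python
import time
from typing import Callable, Optional


class CoreCountCache:
    """TTL-guarded cache for a cluster core count (illustrative sketch).

    The TTL keeps the refresh cheap on the dispatch hot path: between
    refreshes, get() returns the cached value without calling the fetcher.
    """

    def __init__(self, fetch_cores: Callable[[], int], ttl_seconds: float = 1.0):
        self._fetch_cores = fetch_cores
        self._ttl = ttl_seconds
        self._cached: Optional[int] = None
        self._last_refresh: float = float("-inf")

    def get(self) -> int:
        now = time.monotonic()
        # Refresh only when the cached value is missing or stale.
        if self._cached is None or now - self._last_refresh >= self._ttl:
            self._cached = self._fetch_cores()
            self._last_refresh = now
        return self._cached


# Usage: inside a tight dispatch loop, the fetcher is hit at most once per TTL.
calls = []


def fake_fetch() -> int:
    calls.append(1)
    return 8  # pretend the cluster currently has 8 CPUs


cache = CoreCountCache(fake_fetch, ttl_seconds=60.0)
cores_seen = [cache.get() for _ in range(100)]
```

With a 60-second TTL, all 100 loop iterations above read the cached value and the underlying fetch runs only once; a shorter TTL trades a little per-iteration cost for picking up autoscaler changes sooner.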