[Serve][Deployment Graph] Let .bind return ray DAGNode types and remove exposing DeploymentNode as public #24065

jiaodong · 2022-04-21T00:04:23Z

Why are these changes needed?

See dag layering summary in #24061

We need to clean up and correct the layering between the Ray DAG and the Serve DAG: .bind() can be called on a @serve.deployment-decorated class or function, but it returns only a raw Ray DAGNode type that Ray Core can execute. A serve_dag is available only after Serve-specific transformations.

Thus this PR removes the publicly exposed Serve DAGNode types, such as DeploymentNode.

It also removes the class.bind().bind() syntax, which returned a DeploymentMethodNode defaulting to __call__, so that the behavior matches Ray DAG building.
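The layering described above can be sketched with toy stand-ins. This is only an illustration, not Ray's or Serve's actual code: `ToyClassNode`, `ToyDeploymentNode`, `toy_deployment`, and `to_serve_dag` are hypothetical names invented for this example. The point it tries to show is that `.bind()` hands back only a generic node, and the serve-specific node appears only after a separate transformation step.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Tuple


@dataclass
class ToyClassNode:
    """Hypothetical generic DAG node: records the class and its bound args."""
    cls: type
    args: Tuple[Any, ...] = ()
    kwargs: Dict[str, Any] = field(default_factory=dict)


@dataclass
class ToyDeploymentNode:
    """Hypothetical serve-specific node, created only by the transform step."""
    name: str
    source: ToyClassNode


def toy_deployment(cls: type) -> type:
    """Hypothetical decorator: attaches a .bind() that returns a generic node."""
    def bind(*args: Any, **kwargs: Any) -> ToyClassNode:
        return ToyClassNode(cls, args, kwargs)

    cls.bind = staticmethod(bind)
    return cls


def to_serve_dag(node: ToyClassNode) -> ToyDeploymentNode:
    """Hypothetical serve-specific transformation applied after DAG building."""
    return ToyDeploymentNode(name=node.cls.__name__, source=node)


@toy_deployment
class Model:
    def __init__(self, weight: int):
        self.weight = weight


generic = Model.bind(2)             # generic node only, no serve types exposed
serve_node = to_serve_dag(generic)  # serve-specific node exists only from here on
```

In this sketch, user code that builds the graph never touches `ToyDeploymentNode`; only the transformation step does, which mirrors the intent of keeping the serve node types out of the public surface.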

closes #24062

Next steps

Refactor the DeploymentNode implementation so that a single node no longer bundles the DAGNode, Deployment creation, config handling, and handle replacement. We need all of this functionality, but it shouldn't live in one place.

Concrete example: all Ray DAGNode types use _copy to handle function calls recursively, which re-instantiates DAGNode instances upon replacement; this is also the mechanism used to execute the DAG. But we don't want to redo everything on each call in DeploymentNode, not just for abstraction, but also for performance.
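A toy illustration of the cost mentioned above, under stated assumptions: `ToyNode` and `walk` are hypothetical names, and only the idea of a `_copy` that re-instantiates nodes during recursive traversal comes from the description. The counter shows that every traversal re-runs the constructor of every node, so any heavy work placed in a node's constructor would be repeated on each pass.

```python
class ToyNode:
    """Hypothetical DAG node whose traversal copies every node it visits."""

    init_calls = 0  # counts constructor runs across all instances

    def __init__(self, name, children=()):
        ToyNode.init_calls += 1
        self.name = name
        self.children = tuple(children)

    def _copy(self, new_children):
        # Re-instantiates the node, so any work done in __init__ is repeated.
        return type(self)(self.name, new_children)

    def walk(self, fn):
        # Replace the children first, then copy this node with the new children.
        new_children = [child.walk(fn) for child in self.children]
        return fn(self._copy(new_children))


leaf_a = ToyNode("a")
leaf_b = ToyNode("b")
root = ToyNode("root", [leaf_a, leaf_b])
built = ToyNode.init_calls           # constructors run while building the graph

root.walk(lambda node: node)         # one identity traversal
after_one_pass = ToyNode.init_calls  # building plus one traversal
```

If a node's constructor also created a deployment or parsed config, that work would run again on each traversal in this sketch, which is one way to read the motivation for splitting those responsibilities out of the node.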

Checks

- I've run scripts/format.sh to lint the changes in this PR.
- I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
- Testing Strategy
  - Unit tests
  - Release tests
  - This PR is not tested :(

jiaodong · 2022-04-21T00:09:29Z

python/ray/serve/deployment_graph.py

-from ray.experimental.dag.class_node import ClassNode
-from ray.experimental.dag.function_node import FunctionNode
-from ray.experimental.dag import DAGNode
+from ray.experimental.dag.class_node import ClassNode  # noqa: F401


These imports happen at the deployment_graph level, so api.py imports only from the serve module rather than directly exposing the experimental folder.

Ideally we should have a separate effort to clean up imports while eliminating circular imports.

jiaodong · 2022-04-21T02:14:54Z

python/ray/experimental/dag/class_node.py

@@ -79,6 +79,12 @@ def _contains_input_node(self) -> bool:
        return False

    def __getattr__(self, method_name: str):
+        # User trying to call .bind() without a bind class method
+        if method_name == "bind" and "bind" not in dir(self._body):


@ericl addressed: we no longer have the extra .bind() on @serve.deployment, which is now consistent with Ray Core's .bind.

The only small addition here is to detect when a user tries Actor.bind().bind() and surface the same DAGNode base exception message; previously we only surfaced "type object 'Actor' has no attribute 'bind'".
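A self-contained sketch of the guard discussed here, for readers without the full file at hand. Only the `method_name == "bind" and "bind" not in dir(self._body)` check is taken from the diff; `ToyClassNode`, `ToyMethodBinder`, `ToyMethodNode`, and the shortened error text are assumptions made for illustration and are not Ray's implementation.

```python
class ToyMethodNode:
    """Hypothetical node representing a bound method call on a class node."""

    def __init__(self, parent, method_name, args, kwargs):
        self.parent = parent
        self.method_name = method_name
        self.args = args
        self.kwargs = kwargs


class ToyMethodBinder:
    """Hypothetical helper returned by attribute access; exposes .bind()."""

    def __init__(self, parent, method_name):
        self._parent = parent
        self._method_name = method_name

    def bind(self, *args, **kwargs):
        return ToyMethodNode(self._parent, self._method_name, args, kwargs)


class ToyClassNode:
    """Hypothetical class node wrapping a user class (the "body")."""

    def __init__(self, body):
        self._body = body

    def __getattr__(self, method_name):
        # __getattr__ runs only when normal attribute lookup fails.
        # A second .bind() on the node is rejected with a DAG-specific message,
        # unless the wrapped class really defines its own bind method.
        if method_name == "bind" and "bind" not in dir(self._body):
            raise AttributeError(
                f".bind() cannot be used again on {type(self).__name__}."
            )
        # Raises the usual AttributeError if the class has no such method.
        getattr(self._body, method_name)
        return ToyMethodBinder(self, method_name)


class Actor:
    def ping(self):
        return "pong"


node = ToyClassNode(Actor)
method_node = node.ping.bind()  # supported: bind a named method

try:
    node.bind()                 # rejected: equivalent of Actor.bind().bind()
    message = None
except AttributeError as exc:
    message = str(exc)
```

The `dir(self._body)` check leaves room for a user class that legitimately defines a method named `bind`, in which case attribute access falls through to the normal method-binding path.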

edoakes

LGTM, let's wait for @ericl to sign off

edoakes · 2022-04-21T15:31:57Z

python/ray/experimental/dag/class_node.py

+        if method_name == "bind" and "bind" not in dir(self._body):
+            raise AttributeError(
+                f".bind() cannot be used again on {type(self)} "
+                f"(args: {self.get_args()}, kwargs: {self.get_kwargs()})."


Why print args and kwargs here? They don't seem relevant and might be spammy.

edoakes · 2022-04-21T16:11:43Z

@jiaodong please fix the error message before we merge.

jiaodong added 7 commits April 13, 2022 15:07

wip

b25e5f5

Merge branch 'master' of https://github.com/ray-project/ray

c435936

Merge branch 'master' of https://github.com/ray-project/ray

a11ae9b

Merge branch 'master' of https://github.com/ray-project/ray

c1dba9f

Merge branch 'master' of https://github.com/ray-project/ray

76144f5

Merge branch 'master' of https://github.com/ray-project/ray

cc4ba55

test_pipeline_dag all passed

3c4fd2d

jiaodong requested review from shrekris-anyscale, edoakes and simon-mo April 21, 2022 00:04

jiaodong assigned edoakes, simon-mo and shrekris-anyscale Apr 21, 2022

jiaodong commented Apr 21, 2022

edoakes reviewed Apr 21, 2022

python/ray/serve/deployment_graph.py Outdated

python/ray/serve/pipeline/deployment_node.py Outdated

jiaodong added 2 commits April 20, 2022 18:22

make .bind() default to __call__ on ClassNode

efa7535

move handle back

4097a4b

jiaodong commented Apr 21, 2022

python/ray/experimental/dag/class_node.py Outdated

jiaodong assigned ericl Apr 21, 2022

jiaodong requested a review from ericl April 21, 2022 01:28

ericl requested changes Apr 21, 2022

python/ray/experimental/dag/class_node.py Outdated

ericl added the @author-action-required label Apr 21, 2022

remove default to __call__ magic

33e0c75

jiaodong commented Apr 21, 2022

fix cli

b8f3cf9

jiaodong removed the @author-action-required label Apr 21, 2022

jiaodong requested review from ericl and edoakes April 21, 2022 04:26

jiaodong added the tests-ok and flaky-test labels Apr 21, 2022

edoakes approved these changes Apr 21, 2022

edoakes removed the flaky-test label Apr 21, 2022

ericl approved these changes Apr 21, 2022

error message changes and ray dag test fixes

46a138b

ericl merged commit f0071d3 into ray-project:master Apr 21, 2022
