-
Notifications
You must be signed in to change notification settings - Fork 5.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deflakey test advanced 9 #34883
Deflakey test advanced 9 #34883
Changes from 8 commits
222b728
331bda5
9b1282e
bf27138
afce57e
03eb252
08c0622
ee7755d
a2245c9
ee24009
2e16e2c
487e5a4
f7a7a8a
eb7738e
627688b
5e3b76f
e431de6
669d5e9
c7b0cef
2c6ab64
5afe594
31b06cb
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -784,8 +784,8 @@ void CoreWorker::Exit( | |
detail = std::move(detail), | ||
creation_task_exception_pb_bytes]() { | ||
rpc::DrainServerCallExecutor(); | ||
Disconnect(exit_type, detail, creation_task_exception_pb_bytes); | ||
KillChildProcs(); | ||
Disconnect(exit_type, detail, creation_task_exception_pb_bytes); | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. hmm, why does this order matter? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It doesn't matter since raylet has code handling failure. This is only to make the async_wait shorter. Not very important. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Can we add a comment to this effect? |
||
Shutdown(); | ||
}, | ||
"CoreWorker.Shutdown"); | ||
|
@@ -829,10 +829,10 @@ void CoreWorker::ForceExit(const rpc::WorkerExitType exit_type, | |
const std::string &detail) { | ||
RAY_LOG(WARNING) << "Force exit the process. " | ||
<< " Details: " << detail; | ||
Disconnect(exit_type, detail); | ||
|
||
KillChildProcs(); | ||
|
||
Disconnect(exit_type, detail); | ||
|
||
// NOTE(hchen): Use `QuickExit()` to force-exit this process without doing cleanup. | ||
// `exit()` will destruct static objects in an incorrect order, which will lead to | ||
// core dumps. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: add comment? Is this waiting for the socket to be cleaned up (
TIME_WAIT
)?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is to wait until a connection terminated.
Ideally, the fix should monitor the pid. But this needs more updates. So instead, we monitor the tcp.
I'll add comment for this.