[Core] Modify the low frequency log level of some critical path from DEBUG to INFO or WARNING #14205
Can we avoid using WARNING? We shouldn't use it unless the message clearly indicates the symptoms and the action items, because warnings are streamed to user drivers. It is also desirable not to print implementation details. In any case, I will review the PR soon.
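The rule of thumb behind this comment, together with the later remarks about high-frequency paths, can be written down as a small decision function. This is only a sketch of one reading of the review guidance, not an official Ray policy: `Severity`, `LogSite`, `ChooseSeverity`, and `SeverityName` are hypothetical names defined here for illustration, and the choice to keep a high-frequency site at DEBUG even when it is user-actionable is an assumption.

```cpp
#include <cassert>

// Hypothetical severity levels. They mirror the names passed to RAY_LOG in
// this PR but are defined locally so the sketch stands alone.
enum class Severity { kDebug, kInfo, kWarning };

// Hypothetical description of a log site, used only for this illustration.
struct LogSite {
  bool high_frequency;   // emitted on a periodic or per-request hot path
  bool user_actionable;  // message states a symptom and what the user can do
};

// One reading of the review guidance (an assumption, not Ray's rule):
// hot-path messages stay at DEBUG, WARNING is reserved for messages a user
// can act on, and everything else is INFO.
inline Severity ChooseSeverity(const LogSite &site) {
  if (site.high_frequency) return Severity::kDebug;
  if (site.user_actionable) return Severity::kWarning;
  return Severity::kInfo;
}

// Human-readable name for a severity level.
inline const char *SeverityName(Severity s) {
  switch (s) {
    case Severity::kDebug: return "DEBUG";
    case Severity::kInfo: return "INFO";
    case Severity::kWarning: return "WARNING";
  }
  assert(false && "unknown severity");
  return "";
}
```

Under this reading, a low-frequency message that a user cannot act on ends up at INFO, which matches the "Please change it to INFO" requests in the review.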
@@ -313,8 +313,8 @@ Status GcsActorManager::CreateActor(const ray::rpc::CreateActorRequest &request,

 auto iter = registered_actors_.find(actor_id);
 if (iter == registered_actors_.end()) {
-RAY_LOG(DEBUG) << "Actor " << actor_id
-<< " may be already destroyed, job id = " << actor_id.JobId();
+RAY_LOG(WARNING) << "Actor " << actor_id
Please change it to INFO
There's nothing users can do when they look at this log.
@@ -834,8 +834,8 @@ void GcsActorManager::Initialize(const GcsInitData &gcs_init_data) {
 // Notify raylets to release unused workers.
 gcs_actor_scheduler_->ReleaseUnusedWorkers(node_to_workers);

-RAY_LOG(DEBUG) << "The number of registered actors is " << registered_actors_.size()
-<< ", and the number of created actors is " << created_actors_.size();
+RAY_LOG(INFO) << "The number of registered actors is " << registered_actors_.size()
Instead of changing this to INFO, can we have a more general log message in the initialization step, like what we do for DebugString?
@@ -49,7 +49,7 @@ void GcsResourceManager::HandleUpdateResources(
 const rpc::UpdateResourcesRequest &request, rpc::UpdateResourcesReply *reply,
 rpc::SendReplyCallback send_reply_callback) {
 NodeID node_id = NodeID::FromBinary(request.node_id());
-RAY_LOG(DEBUG) << "Updating resources, node id = " << node_id;
+RAY_LOG(INFO) << "Updating resources, node id = " << node_id;
This is high frequency, isn't it?
Should be updated every 100ms
@@ -84,7 +84,7 @@ void GcsResourceManager::HandleUpdateResources(
 nullptr));

 GCS_RPC_SEND_REPLY(send_reply_callback, reply, status);
-RAY_LOG(DEBUG) << "Finished updating resources, node id = " << node_id;
+RAY_LOG(INFO) << "Finished updating resources, node id = " << node_id;
same here
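To make the frequency concern concrete, here is a back-of-the-envelope estimate of the log volume these two INFO lines would produce. It is a sketch under stated assumptions: that the handler is invoked once per node every 100 ms (as the comment above suggests) and that each call emits both the "Updating resources" and the "Finished updating resources" lines. `LogLinesPerSecond` is a hypothetical helper, not part of Ray.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: estimated log lines per second for a periodic handler.
//   num_nodes       - nodes that each trigger the handler (assumption)
//   interval_ms     - period between calls per node, in milliseconds
//   lines_per_call  - log lines emitted by one handler invocation
inline std::int64_t LogLinesPerSecond(std::int64_t num_nodes,
                                      std::int64_t interval_ms,
                                      std::int64_t lines_per_call) {
  assert(interval_ms > 0);
  // Each node calls the handler 1000 / interval_ms times per second.
  return num_nodes * lines_per_call * 1000 / interval_ms;
}
```

With these assumptions, `LogLinesPerSecond(1, 100, 2)` gives 20 lines per second for a single node, and `LogLinesPerSecond(100, 100, 2)` gives 2000 lines per second for a 100-node cluster.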
@@ -101,7 +101,7 @@ void GcsResourceManager::HandleDeleteResources(
 const rpc::DeleteResourcesRequest &request, rpc::DeleteResourcesReply *reply,
 rpc::SendReplyCallback send_reply_callback) {
 NodeID node_id = NodeID::FromBinary(request.node_id());
-RAY_LOG(DEBUG) << "Deleting node resources, node id = " << node_id;
+RAY_LOG(INFO) << "Deleting node resources, node id = " << node_id;
I don't think we use this endpoint anymore. Isn't it for dynamic resources? It is deprecated.
@@ -135,7 +135,7 @@ Process WorkerPool::StartWorkerProcess(
 RAY_CHECK(!job_id.IsNil());
 auto it = all_jobs_.find(job_id);
 if (it == all_jobs_.end()) {
-RAY_LOG(DEBUG) << "Job config of job " << job_id << " are not local yet.";
+RAY_LOG(WARNING) << "Job config of job " << job_id << " are not local yet.";
This also shouldn't be a warning.
-RAY_LOG(DEBUG) << "Worker not started, " << starting_workers
-<< " workers of language type " << static_cast<int>(language)
-<< " pending registration";
+RAY_LOG(WARNING) << "Worker not started, " << starting_workers
This also shouldn't be warning. There's no way any user will know what to do after looking at this from their driver.
@@ -283,8 +283,8 @@ Process WorkerPool::StartWorkerProcess(
 auto duration = std::chrono::duration_cast<std::chrono::milliseconds>(end - start);
 stats::ProcessStartupTimeMs.Record(duration.count());

-RAY_LOG(DEBUG) << "Started worker process of " << workers_to_start
-<< " worker(s) with pid " << proc.GetId();
+RAY_LOG(INFO) << "Started worker process of " << workers_to_start
Starting -> Started -> Registered are duplicated logs. Can you just have one log message?
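One way to act on this request is to collect the startup facts and emit them in a single line once the process has started. The following is a minimal sketch, assuming the values already present in the hunk above (`workers_to_start`, the process id, and the measured startup duration in milliseconds) are the ones worth keeping. `WorkerStartupRecord` and `FormatStartupSummary` are hypothetical names and are not part of `WorkerPool`.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Hypothetical record of one worker process startup. The fields correspond to
// values visible in the diff, but the struct itself is illustrative only.
struct WorkerStartupRecord {
  int pid;               // process id of the started worker process
  int workers_to_start;  // number of workers the process will host
  long startup_ms;       // measured startup duration in milliseconds
};

// Builds a single summary line in place of separate per-phase log messages.
inline std::string FormatStartupSummary(const WorkerStartupRecord &r) {
  assert(r.workers_to_start > 0);
  std::ostringstream os;
  os << "Started worker process with pid " << r.pid << " for "
     << r.workers_to_start << " worker(s) in " << r.startup_ms << " ms";
  return os.str();
}
```

The resulting string could then be passed to a single `RAY_LOG(INFO)` call, so that each worker process startup produces one line instead of three.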
@@ -736,8 +747,8 @@ void NodeManager::NodeRemoved(const NodeID &node_id) {

 // Remove the node from the resource map.
 if (!cluster_resource_scheduler_->RemoveNode(node_id.Binary())) {
-RAY_LOG(DEBUG) << "Received NodeRemoved callback for an unknown node: " << node_id
-<< ".";
+RAY_LOG(WARNING) << "Received NodeRemoved callback for an unknown node: " << node_id
Please indicate what's the side effect of this warning message.
-RAY_LOG(DEBUG) << "[ResourceCreateUpdated] received callback from node id " << node_id
-<< " with created or updated resources: "
-<< createUpdatedResources.ToString() << ". Updating resource map.";
+RAY_LOG(INFO) << "[ResourceCreateUpdated] received callback from node id " << node_id
Also the log message is too low level. Same as below.
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
Hi again! The issue will be closed because there has been no more activity in the 14 days since the last message. Please feel free to reopen or open a new issue if you'd still like it to be addressed. Again, you can always ask for help on our discussion forum or Ray's public slack channel. Thanks again for opening the issue!