You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The script at the bottom failed because it pass evaluation/episode_reward_mean as the metric to optimize. In the reported dict result, episode_reward_mean is nested under evaluation.
After digging a bit into the source code, I think this kind of metric could be optimize in tune. But, the validation process failed because it use nested result.
Ray version and other system information (Python version, TensorFlow version, OS): ray: 2.0.0
Reproduction (REQUIRED)
Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):
If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".
I have verified my script runs in a clean environment and reproduces the issue.
I have verified the issue also occurs with the latest wheels.
The text was updated successfully, but these errors were encountered:
ghost
added
bug
Something that is supposed to be working; but isn't
triage
Needs triage (eg: priority, bug/not-bug, and owning component)
labels
Feb 26, 2021
What is the problem?
The script at the bottom failed because it pass
evaluation/episode_reward_mean
as the metric to optimize. In the reported dictresult
,episode_reward_mean
is nested underevaluation
.After digging a bit into the source code, I think this kind of metric could be optimize in tune. But, the validation process failed because it use nested result.
Ray version and other system information (Python version, TensorFlow version, OS):
ray: 2.0.0
Reproduction (REQUIRED)
Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):
If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".
The text was updated successfully, but these errors were encountered: