[tune] Cannot optimize a metric nested in result #14374

ghost · 2021-02-26T04:10:20Z

What is the problem?

The script at the bottom fails because it passes evaluation/episode_reward_mean as the metric to optimize. In the reported result dict, episode_reward_mean is nested under evaluation.
After digging a bit into the source code, I think this kind of metric can be optimized in Tune, but the validation step fails because it uses the nested result.
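
To illustrate the mismatch, here is a minimal sketch in plain Python. It assumes the validation looks the metric name up among the top-level keys of the nested result; the `flatten` helper and the sample `result` values are hypothetical and are not Ray's actual code or output.

```python
def flatten(result, delimiter="/", prefix=""):
    """Hypothetical helper: turn nested dicts into delimiter-joined keys."""
    flat = {}
    for key, value in result.items():
        path = f"{prefix}{delimiter}{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into sub-dicts, carrying the joined path as the new prefix.
            flat.update(flatten(value, delimiter, path))
        else:
            flat[path] = value
    return flat


# Made-up result shaped like the one described above.
result = {
    "episode_reward_mean": 12.0,
    "evaluation": {"episode_reward_mean": 18.5},
}
metric = "evaluation/episode_reward_mean"

# A lookup on the nested dict only sees the top-level keys.
found_in_nested = metric in result
# The same lookup on the flattened dict finds the nested metric.
found_in_flat = metric in flatten(result)

print(found_in_nested, found_in_flat)
```

Under that assumption, a check against the nested dict rejects the metric even though the value is present, while a check against the flattened dict accepts it.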

Ray version and other system information (Python version, TensorFlow version, OS):
ray: 2.0.0

Reproduction (REQUIRED)

Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):

import ray
from ray.rllib.agents.dqn import dqn
from ray import tune


if __name__ == "__main__":
    ray.init(local_mode=True, num_cpus=2)

    config = {
        "env": "CartPole-v1",
        "framework": "torch",

        "timesteps_per_iteration": 10,
        "evaluation_interval": 1,
        "evaluation_num_episodes": 1,
    }

    analysis = tune.run(
        dqn.DQNTrainer,
        config=config,
        metric="evaluation/episode_reward_mean",
        mode="max",
        num_samples=10,
        stop={
            "evaluation/episode_reward_mean": 20
        }
    )

    ray.shutdown()

If the code snippet cannot be run by itself, the issue will be closed with "needs-repro-script".

I have verified my script runs in a clean environment and reproduces the issue.
I have verified the issue also occurs with the latest wheels.


ghost added the bug and triage labels Feb 26, 2021

ghost mentioned this issue Feb 26, 2021

[tune] Correctly validate nested metrics #14375

Merged


krfricke closed this as completed in #14375 Feb 26, 2021

Juno-T mentioned this issue Aug 9, 2022

[tune] Error saving checkpoint based on nested metric score #27701

Closed
