[rllib] Fix error messages and example for dataset writer #23419
Conversation
Btw @gjoliver, it seems like final batches are not written, because samples are only flushed once max_num_samples_per_file is reached. Thus in the example I set max_num_samples_per_file to 1. Is this something we can fix? I.e. is there a destructor/cleanup method we can use for this?
@@ -44,7 +44,7 @@ def __init__(
         output_config: Dict = ioctx.output_config
         assert (
             "format" in output_config
-        ), "output_config.type must be specified when using Dataset output."
+        ), "output_config.format must be specified when using Dataset output."
ah thanks for catching this. went through iterations ...
@@ -1,7 +1,7 @@
 # To generate training data, first run:
 # $ ./train.py --run=PPO --env=CartPole-v0 \
 #     --stop='{"timesteps_total": 50000}' \
-#     --config='{"output": "dataset", "output_config": {"type": "json", "path": "/tmp/out"}, "batch_mode": "complete_episodes"}'
+#     --config='{"output": "dataset", "output_config": {"format": "json", "path": "/tmp/out", "max_num_samples_per_file": 1}, "batch_mode": "complete_episodes"}'
just to double check, you want to write 1 sample per file??
Well, 1 sample = 1 episode here, so it usually has a few hundred timesteps. When I ran this I ended up with 39 files.
yeah, but still, why write 1 episode per file? :)
the default half-cheetah json dataset is a single file of maybe a few thousand episodes.
is there any efficiency consideration behind this?
anyways though, this is just an example, so whatever you see fit. I am just curious what made you make this change.
The main reason is that currently remaining samples are not written at all - they are only written once at least max_num_samples_per_file samples have been collected during training. I.e. we don't flush remaining results when training is over, and I didn't see a great way to make that happen.
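Roughly, the writer behaves like this (a simplified sketch with made-up names, not the actual dataset_writer.py code):

```python
import json


class BufferedJsonWriter:
    """Simplified sketch of a writer that only flushes full shards."""

    def __init__(self, path_prefix, max_num_samples_per_file):
        self.path_prefix = path_prefix
        self.max_num_samples_per_file = max_num_samples_per_file
        self.buffer = []
        self.file_index = 0

    def write(self, sample):
        self.buffer.append(sample)
        # A flush only happens when the buffer is full ...
        if len(self.buffer) >= self.max_num_samples_per_file:
            self._flush()

    def _flush(self):
        path = f"{self.path_prefix}-{self.file_index:06d}.json"
        with open(path, "w") as f:
            json.dump(self.buffer, f)
        self.buffer = []
        self.file_index += 1

    # ... so a partially filled buffer is silently lost when training ends.
```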
oh this is pretty bad. please add a Note or TODO in dataset_writer.py
this is another case where we are using a workaround to fix underlying problems. and per eric's message, we shouldn't do this long-term.
let's intercept a signal somewhere and flush the last shard before things go down.
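Something along these lines, just as an untested sketch (reusing the BufferedJsonWriter sketch above; whether atexit/signal handlers actually fire inside Ray worker processes would need to be checked):

```python
import atexit
import signal


def install_flush_on_shutdown(writer):
    """Best-effort: flush the last (partial) shard when the process exits."""

    def final_flush():
        if writer.buffer:  # don't write an empty shard
            writer._flush()

    # Normal interpreter exit.
    atexit.register(final_flush)

    # SIGINT/SIGTERM: flush, then re-raise the default behavior.
    def handler(signum, frame):
        final_flush()
        signal.signal(signum, signal.SIG_DFL)
        signal.raise_signal(signum)

    signal.signal(signal.SIGINT, handler)
    signal.signal(signal.SIGTERM, handler)
```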
Ah yeah, that's what I mentioned in my first comment on this PR. Ok, will add a todo in the file. I've tried to flush on __del__, but that doesn't seem to work with actors, so no quick fix. Probably we need to call some kind of destructor/cleanup method manually.
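E.g. something like this (a hypothetical sketch, not the actual RolloutWorker/writer code), where the owner explicitly calls a cleanup method before the actor goes away:

```python
import ray

ray.init(ignore_reinit_error=True)


@ray.remote
class WriterActor:
    def __init__(self):
        self.buffer = []

    def write(self, sample):
        self.buffer.append(sample)

    def stop(self):
        # Explicit cleanup method, since __del__ is not reliably
        # invoked for actors: flush whatever is still buffered.
        if self.buffer:
            print(f"flushing {len(self.buffer)} remaining samples")
            self.buffer = []


writer = WriterActor.remote()
writer.write.remote({"obs": 0})
ray.get(writer.stop.remote())  # the owner has to call this explicitly
ray.kill(writer)
```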
ah, it all comes back. I didn't understand what you meant at the time :)
cool, interesting topic ...
feel free to merge btw.
Why are these changes needed?
Currently the error message and example refer to a field `type` that is actually `format`.

Related issue number
Checks

I've run scripts/format.sh to lint the changes in this PR.