CQL rllib 1.7.2 backport #170

dmlyubim · 2023-01-11T01:07:48Z

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

…proper train iteration size)

The test passes for me in command line but fails in the pipeline where it fails to locate the json data file.

…it help?

* set recursive mod 777 on /home/vsts/work/_temp/_bazel_vsts directory prior to build * use $TEST_TMPDIR env variable instead of literal directory name

…rectly for CQL

…o dmlyubim/cql-1.7.2-port

This reverts commit 84bf8ae.

* set recursive mod 777 on /home/vsts/work/_temp/_bazel_vsts directory prior to build * use $TEST_TMPDIR env variable instead of literal directory name * explicitly set MACOSX_DEPLOYMENT_TARGET env variable * removed minor version of Python; renamed steps to relect correct Python version * get latest pip version to test MacOs wheels * updated hash * undid changes to info,yml * unbounded setuptools * undid change * Fix MacOs version if bdist_wheel generates incorrect MacOS version tag for wheel * undid changes * undid changes * undid changes * force reinstall tune and upstream requirements * updatd CI hash * updated dependencies * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated ci folder hash * updated requirements * updated requirements * updates CI hash * updated requirements * updated requirements * updated requirements * updated requirements * undid requirement changes * updated ci folder hash * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated dependencies * updated requirements * updated dependencies * apt update * fixed GCC download, set Ubuntu 20.04 as default OS for pipeline * updated requirements * updated requirements * fixed setup.py * updated ci hash * fixed setup.py * fixed setup.py * fixed setup.py * updated requirements * fixed setup.py * force reintall of torch and torchvision * updated ci hash * fixed rllib requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated requirements * updated dependencies * updated dependencies * updated requirements * updated requirements * updated requirements * explicitly set locale in MacOS to fix test_signal

…l-1.7.2-port

abhiksingla · 2023-02-08T01:56:54Z

rllib/agents/trainer_factory.py

-    CQL_SAC = (cql.CQLSACTrainer, cql.CQLSAC_DEFAULT_CONFIG)
-    CQL_APEX_SAC = (cql.CQLApexSACTrainer, cql.CQLAPEXSAC_DEFAULT_CONFIG)
-    CQL_DQN = (cql.CQLDQNTrainer, cql.CQLDQN_DEFAULT_CONFIG)
+    CQL = (cql.CQLTrainer, cql.CQL_DEFAULT_CONFIG)


RLlib documentation mentions that CQL does not support discrete actions. Are we supporting discrete actions?

i don't think so. this is backported code. I am not sure exactly how rllib hanldes that restriction, but we have ability to restrict it elsewhere in the outer code. I would not deviate from original rllib coding unless absolutely incorrect, makes further backporting merges easier.

abhiksingla · 2023-02-08T02:05:20Z

rllib/agents/cql/cql_tf_policy.py

-    action_dist_class = _get_dist_class(policy, policy.config,
-                                        policy.action_space)
+    action_dist_class = _get_dist_class(
+        # policy,


Good to clean this to avoid confusion later.

abhiksingla · 2023-02-08T02:27:07Z

rllib/models/tf/tf_action_dist.py

            [cat.deterministic_sample() for cat in self.cats], axis=1)
+        if isinstance(self.action_space, gym.spaces.Box):


Given that it is categorical distribution and will be used for discrete action, is this statement valid?

Also, it is not clear to me why extra dim is required for Box space only but not for others.

abhiksingla · 2023-02-08T02:28:59Z

rllib/models/tf/tf_action_dist.py


    @override(ActionDistribution)
    def logp(self, actions: TensorType) -> TensorType:
        # If tensor is provided, unstack it into list.
        if isinstance(actions, tf.Tensor):
+            if isinstance(self.action_space, gym.spaces.Box):


Same comment as above

abhiksingla · 2023-02-08T02:32:02Z

rllib/models/tf/tf_action_dist.py


    @staticmethod
    @override(ActionDistribution)
    def required_model_output_shape(
            action_space: gym.Space,
            model_config: ModelConfigDict) -> Union[int, np.ndarray]:
-        return np.sum(action_space.nvec)
+        # Int Box.
+        if isinstance(action_space, gym.spaces.Box):


same comment as above.

abhiksingla · 2023-02-08T05:01:01Z

dashboard/client/package-lock.json

  "requires": true,
+  "packages": {


I am not aware what this is. Ignoring it. Will suggest to get this reviewed by Ruofan or Kiko.

dmlyubim added 11 commits January 10, 2023 10:47

syncing to 1.7.2

5db8b9a

common public rllib cql renames

55bc018

patching sac dist class get

31b77f5

retrofitting rllib/offline package to 1.7.2

844dba4

retrofit space_utils 1.7.2

ace3f85

retrofit ray.tune.registry to 1.7.2 (add input registry)

6640af3

test changes

a9b7a56

cql test pendulum data

e09c33d

in 1.3, replay buffer isn't reworked to track capacity vs. current size

313f88a

Updating metrics to 1.7.2 (update sampled count on request to enable …

1e5159a

…proper train iteration size)

slight test refactoring to enable intermediate debugging

5f12afa

dmlyubim requested a review from a team as a code owner January 11, 2023 01:07

dmlyubim requested review from RuofanKong and removed request for a team January 11, 2023 01:07

dmlyubim changed the title ~~Dmlyubim/cql 1.7.2 port~~ CQL rllib 1.7.2 backport Jan 11, 2023

fixing bazel test //rllib:test_cql

8598a97

dmlyubim force-pushed the dmlyubim/cql-1.7.2-port branch from 07481dc to 8598a97 Compare January 11, 2023 01:13

dmlyubim added 3 commits January 11, 2023 09:14

additional cql_sac cleanup

0c20a1b

removing cql apex sac tests

2dbfb9d

rolling back non-existent policy call signature in offline component

016bde6

dmlyubim requested a review from a team as a code owner January 12, 2023 21:53

dmlyubim force-pushed the dmlyubim/cql-1.7.2-port branch from d09381a to acc2dde Compare January 12, 2023 22:03

trying to fix macos python verison at 3.8.15

e099a1d

dmlyubim force-pushed the dmlyubim/cql-1.7.2-port branch from acc2dde to e099a1d Compare January 12, 2023 22:38

dmlyubim and others added 6 commits January 12, 2023 17:25

changing bazel definition for test_cql.

625bf4b

The test passes for me in command line but fails in the pipeline where it fails to locate the json data file.

parity with BUILD for test_cql in 1.7.2 (removing data glob) -- does …

d5abccb

…it help?

fixes -- this now runs with the benchmark

d56abda

Rolling back cql_dqn cleanup

fb7ef1a

trying to add data label to test

90660d0

Kiko/cql 1.7.2 port (#172)

e2f9e7f

* set recursive mod 777 on /home/vsts/work/_temp/_bazel_vsts directory prior to build * use $TEST_TMPDIR env variable instead of literal directory name

dmlyubim added 3 commits January 25, 2023 14:47

brining more changes from 1.13.0 to update timesteps_total metric cor…

f809b8f

…rectly for CQL

Merge branch 'dmlyubim/cql-1.7.2-port' of github.com:BonsaiAI/ray int…

65ddbce

…o dmlyubim/cql-1.7.2-port

REVERTING TO PYTHON 3.8 FOR MAC

bf7c81d

dmlyubim force-pushed the dmlyubim/cql-1.7.2-port branch from 15df96a to 08bc679 Compare January 30, 2023 23:47

trying the checksum it wants for grpc jar

84bf8ae

dmlyubim force-pushed the dmlyubim/cql-1.7.2-port branch from 08bc679 to 84bf8ae Compare January 30, 2023 23:58

dmlyubim and others added 3 commits January 30, 2023 16:10

Revert "trying the checksum it wants for grpc jar"

a44aff0

This reverts commit 84bf8ae.

Merge remote-tracking branch 'origin/releases/1.3.0' into dmlyubim/cq…

7b5c907

…l-1.7.2-port

abhiksingla reviewed Feb 8, 2023

View reviewed changes

dmlyubim requested a review from abhiksingla February 8, 2023 19:51

abhiksingla approved these changes Feb 8, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CQL rllib 1.7.2 backport #170

CQL rllib 1.7.2 backport #170

dmlyubim commented Jan 11, 2023

abhiksingla Feb 8, 2023

dmlyubim Feb 8, 2023

abhiksingla Feb 8, 2023

abhiksingla Feb 8, 2023

abhiksingla Feb 8, 2023

abhiksingla Feb 8, 2023

abhiksingla Feb 8, 2023

abhiksingla Feb 8, 2023

		[cat.deterministic_sample() for cat in self.cats], axis=1)
		if isinstance(self.action_space, gym.spaces.Box):

CQL rllib 1.7.2 backport #170

Are you sure you want to change the base?

CQL rllib 1.7.2 backport #170

Conversation

dmlyubim commented Jan 11, 2023

Why are these changes needed?

Related issue number

Checks

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment