[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent #5683

ericl · 2019-09-10T20:42:36Z

AmplabJenkins · 2019-09-10T22:26:57Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16948/

AmplabJenkins · 2019-09-11T04:01:33Z

Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16963/

AmplabJenkins · 2019-09-11T11:01:55Z

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/Ray-PRB/16969/

richardliaw · 2019-09-11T18:54:09Z

rllib/examples/custom_tf_policy.py

+                         sample_batch,
+                         other_agent_batches=None,
+                         episode=None):
+    sample_batch["advantages"] = discount(sample_batch["rewards"], 0.99)


This should try to fetch gamma from the policy config instead, right?

Not for the example.
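For readers outside the example, the reviewer's suggestion could be sketched roughly as follows. This is a minimal, dependency-free illustration rather than the code in this PR: `discount` here is a plain-Python stand-in for the helper used in the diff, and `postprocess_advantages` with its `config` dict argument is a hypothetical name chosen for this sketch. It assumes the discount factor lives under a `"gamma"` key and falls back to the hard-coded 0.99 from the diff.

```python
def discount(rewards, gamma):
    """Discounted cumulative sum: out[t] = sum over k of gamma**k * rewards[t + k]."""
    out = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the already-discounted tail.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


def postprocess_advantages(sample_batch, config):
    """Hypothetical postprocessing step that reads gamma from a config dict.

    Assumption: `config` is a plain dict; 0.99 mirrors the value
    hard-coded in the example under review.
    """
    gamma = config.get("gamma", 0.99)
    sample_batch["advantages"] = discount(sample_batch["rewards"], gamma)
    return sample_batch
```

As the reply notes, the example itself keeps the constant for simplicity; a config lookup like this mainly matters when the same postprocessing function is reused across differently configured policies.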

richardliaw · 2019-09-11T18:57:03Z

rllib/optimizers/sync_samples_optimizer.py

@@ -29,17 +32,22 @@ def __init__(self,
                 workers,
                 num_sgd_iter=1,
                 train_batch_size=1,
-                 sgd_minibatch_size=0):
+                 sgd_minibatch_size=0,
+                 standardize_fields=[]):


richardliaw · 2019-09-11T18:57:10Z

rllib/optimizers/sync_samples_optimizer.py

        PolicyOptimizer.__init__(self, workers)

        self.update_weights_timer = TimerStat()
+        self.standardize_fields = standardize_fields


Changed to frozenset.
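The likely motivation for switching the default is Python's shared mutable default argument behavior. The sketch below uses two hypothetical stand-in classes (not the actual `SyncSamplesOptimizer`) to show the difference between a list default and an immutable `frozenset()` default.

```python
class ListDefaultOptimizer:
    """Hypothetical stand-in: the default list is created once and shared."""

    def __init__(self, standardize_fields=[]):
        # Every instance built without an argument holds the SAME list object.
        self.standardize_fields = standardize_fields


class FrozensetDefaultOptimizer:
    """Hypothetical stand-in: an immutable default cannot be mutated by accident."""

    def __init__(self, standardize_fields=frozenset()):
        # frozenset has no append/add, so the shared default stays empty.
        self.standardize_fields = standardize_fields
```

With the list default, appending a field name through one instance is visible in every later instance constructed without an explicit argument. With the `frozenset()` default, such in-place mutation is not possible, while callers can still pass their own collection of field names.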

ericl added 2 commits September 10, 2019 13:40

wip

ff20a2c

remove

ac49731

ericl assigned richardliaw Sep 10, 2019

add adv

ac6f56d

ericl added 2 commits September 10, 2019 23:50

fix indent

847f9c7

fix fetch

94ba30a

ericl added the tests-ok label ("The tagger certifies test failures are unrelated and assumes personal liability.") Sep 11, 2019

richardliaw reviewed Sep 11, 2019

richardliaw approved these changes Sep 11, 2019

Update sync_samples_optimizer.py

4e51521

ericl merged commit bc6a95d into ray-project:master Sep 11, 2019
