feat: support policy evaluation #137

Gaiejj · 2023-03-07T16:06:27Z

Description

feat: support policy evaluation

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds core functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!

I have read the CONTRIBUTION guide. (required)
My change requires a change to the documentation.
I have updated the tests accordingly. (required for a bug fix or a new feature)
I have updated the documentation accordingly.
I have reformatted the code using make format. (required)
I have checked the code using make lint. (required)
I have ensured make test pass. (required)

friedmainfunction · 2023-03-08T15:40:59Z

omnisafe/__init__.py

@@ -17,6 +17,7 @@
 from omnisafe import algorithms
 from omnisafe.algorithms import ALGORITHMS
 from omnisafe.algorithms.algo_wrapper import AlgoWrapper as Agent
+from omnisafe.evaluator import Evaluator

 # from omnisafe.algorithms.env_wrapper import EnvWrapper as Env


Suggested change

# from omnisafe.algorithms.env_wrapper import EnvWrapper as Env

friedmainfunction · 2023-03-08T15:52:59Z

omnisafe/evaluator.py

+            print(f'Episode cost: {ep_cost}')
+            print(f'Episode length: {length}')
+
+        print('#' * 50)


friedmainfunction · 2023-03-08T15:55:03Z

omnisafe/evaluator.py

+            done = False
+            while not done and step <= 2000:  # a big number to make sure the episode will end
+                with torch.no_grad():
+                    act = self._actor.predict(obs, deterministic=False)


why deterministic = False?

zmsn-2077 · 2023-03-08T16:56:02Z

omnisafe/common/normalizer.py

@@ -14,7 +14,7 @@
 # ==============================================================================
 """Implementation of Vector Buffer."""


Vector Buffer should not appear in this file.

zmsn-2077 · 2023-03-08T16:57:59Z

omnisafe/evaluator.py

+        self.__set_render_mode(play, save_replay)
+
+    def __set_render_mode(self, play: bool = True, save_replay: bool = True):
+        """Set the render mode.

        Args:


I think the Args in the comment do not match the real function arguments.

zmsn-2077 · 2023-03-08T16:58:24Z

omnisafe/evaluator.py


        Args:
-            env (gym.Env): The environment.
-            actor (omnisafe.actor.Actor): The actor.
+            save_dir (str): directory where the model is saved.


the same problems

zmsn-2077 · 2023-03-08T16:59:14Z

omnisafe/evaluator.py


+        Returns:
+            episode_rewards (list): list of episode rewards.


need to fix,
return ( episode_rewards, episode_costs, )

zmsn-2077

.

zmsn-2077

LGTM.

Co-authored-by: ruiyang sun <[email protected]>

Gaiejj added 3 commits March 8, 2023 00:04

feat: support policy evaluation

badc42e

wip

abdd611

refactor: change evaluator building

2042d07

Gaiejj requested review from rockmagma02 and zmsn-2077 and removed request for rockmagma02 March 8, 2023 02:21

rockmagma02 and others added 3 commits March 8, 2023 16:13

refactor(evaluate)

f6f4208

fix(normalize): fix normalize can't load correctly

25f9dd2

Merge pull request #3 from rockmagma02/pr/Gaiejj/137

648ba4c

friedmainfunction reviewed Mar 8, 2023

View reviewed changes

zmsn-2077 reviewed Mar 8, 2023

View reviewed changes

zmsn-2077 requested changes Mar 8, 2023

View reviewed changes

Gaiejj added 2 commits March 9, 2023 01:37

wip

75ba123

refactor: clean the code

53dd79a

rockmagma02 approved these changes Mar 9, 2023

View reviewed changes

Gaiejj requested a review from zmsn-2077 March 10, 2023 01:48

zmsn-2077 approved these changes Mar 10, 2023

View reviewed changes

Gaiejj merged commit 5f37e02 into PKU-Alignment:dev Mar 10, 2023

Gaiejj deleted the dev-eval branch March 14, 2023 02:43

zmsn-2077 pushed a commit to zmsn-2077/omnisafe_zmsn that referenced this pull request Mar 14, 2023

feat: support policy evaluation (PKU-Alignment#137)

de04bb2

Co-authored-by: ruiyang sun <[email protected]>

zmsn-2077 pushed a commit to zmsn-2077/omnisafe_zmsn that referenced this pull request Mar 14, 2023

feat: support policy evaluation (PKU-Alignment#137)

116e7b9

Co-authored-by: ruiyang sun <[email protected]>

zmsn-2077 pushed a commit to zmsn-2077/omnisafe_zmsn that referenced this pull request Mar 14, 2023

feat: support policy evaluation (PKU-Alignment#137)

6401c63

Co-authored-by: ruiyang sun <[email protected]>

zmsn-2077 pushed a commit that referenced this pull request Mar 14, 2023

feat: support policy evaluation (#137)

06dcbdb

Co-authored-by: ruiyang sun <[email protected]>

zmsn-2077 pushed a commit that referenced this pull request Mar 15, 2023

feat: support policy evaluation (#137)

3de924f

Co-authored-by: ruiyang sun <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support policy evaluation #137

feat: support policy evaluation #137

Gaiejj commented Mar 7, 2023

friedmainfunction Mar 8, 2023

friedmainfunction Mar 8, 2023

friedmainfunction Mar 8, 2023

zmsn-2077 Mar 8, 2023

zmsn-2077 Mar 8, 2023

zmsn-2077 Mar 8, 2023

zmsn-2077 Mar 8, 2023

zmsn-2077 left a comment

zmsn-2077 left a comment

		@@ -14,7 +14,7 @@
		# ==============================================================================
		"""Implementation of Vector Buffer."""

feat: support policy evaluation #137

feat: support policy evaluation #137

Conversation

Gaiejj commented Mar 7, 2023

Description

Types of changes

Checklist

friedmainfunction Mar 8, 2023

Choose a reason for hiding this comment

friedmainfunction Mar 8, 2023

Choose a reason for hiding this comment

friedmainfunction Mar 8, 2023

Choose a reason for hiding this comment

zmsn-2077 Mar 8, 2023

Choose a reason for hiding this comment

zmsn-2077 Mar 8, 2023

Choose a reason for hiding this comment

zmsn-2077 Mar 8, 2023

Choose a reason for hiding this comment

zmsn-2077 Mar 8, 2023

Choose a reason for hiding this comment

zmsn-2077 left a comment

Choose a reason for hiding this comment

zmsn-2077 left a comment

Choose a reason for hiding this comment