Commit

Merge pull request #2 from zmsn-2077/first-order
refactor: change architecture of omnisafe (PKU-MARL#121)
muchvo authored Mar 14, 2023
2 parents 1d28386 + 7937894 commit ed6448f
Showing 1 changed file with 0 additions and 5 deletions.
5 changes: 0 additions & 5 deletions omnisafe/algorithms/on_policy/README.md
@@ -11,15 +11,10 @@ Supported algorithms are listed below:
**First-Order**

- **[NIPS 1999]** [Policy Gradient (PG)](https://papers.nips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf)

- [Proximal Policy Optimization (PPO)](https://arxiv.org/pdf/1707.06347.pdf)

- [The Lagrange version of PPO (PPO-Lag)](https://cdn.openai.com/safexp-short.pdf)

- **[IJCAI 2022]** [Penalized Proximal Policy Optimization for Safe Reinforcement Learning (P3O)](https://arxiv.org/pdf/2205.11814.pdf)

- **[NeurIPS 2020]** [First Order Constrained Optimization in Policy Space (FOCOPS)](https://arxiv.org/abs/2002.06506)

- **[NeurIPS 2022]** [Constrained Update Projection Approach to Safe Policy Optimization (CUP)](https://arxiv.org/abs/2209.07089)

**Second-Order**