RL_LunarLanderContinuous_v2

This project uses the PPO reinforcement learning algorithm from PARL to solve LunarLanderContinuous-v2, a gym Box2D environment.

LunarLanderContinuous-v2 is one of the gym Box2D environments. The goal is to train a lander that starts with no knowledge of the environment to touch down safely at the target position on the moon.

This repository contains a user-friendly Jupyter notebook version of the solution. The code is written with PaddlePaddle and uses the PPO algorithm from PARL.
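For readers new to PPO, the core idea is the clipped surrogate objective, which limits how far one update can move the policy. The sketch below is a generic, dependency-free illustration of that objective. It is not PARL's implementation and is not taken from the notebook; the function names and the default `clip_eps=0.2` are assumptions made for this example.

```python
def clipped_surrogate(ratio, advantage, clip_eps=0.2):
    """Per-sample PPO clipped surrogate term (illustrative, not PARL code).

    ratio     -- pi_new(a|s) / pi_old(a|s) for the sampled action
    advantage -- advantage estimate for that action
    clip_eps  -- clipping range; 0.2 is an assumed default for this sketch
    """
    # Restrict the probability ratio to [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the more pessimistic of the unclipped and clipped objectives.
    return min(ratio * advantage, clipped_ratio * advantage)


def ppo_policy_loss(ratios, advantages, clip_eps=0.2):
    """Negative mean of the clipped surrogate over a batch (to be minimized)."""
    if len(ratios) != len(advantages) or not ratios:
        raise ValueError("ratios and advantages must be non-empty and equal length")
    terms = [clipped_surrogate(r, a, clip_eps) for r, a in zip(ratios, advantages)]
    return -sum(terms) / len(terms)
```

With a positive advantage, a ratio above `1 + clip_eps` earns no extra objective, so the update has no incentive to push the policy further in that direction. With a negative advantage, the unclipped term is kept whenever it is the lower of the two, so moves that make things worse are still penalized in full.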

After about 3000 training episodes, the best evaluation reward so far is about 313 points per episode. This is well above the 200-point mark usually quoted as the pass threshold for this environment.
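The comparison against the pass threshold can be sketched as a small helper that averages per-episode evaluation rewards. This is an illustrative assumption about how such a check might look, not code from the notebook. The helper names are hypothetical, and the reward values in the usage lines are made up for the example rather than measured results.

```python
def mean_reward(episode_rewards):
    """Average total reward over a list of evaluation episodes."""
    if not episode_rewards:
        raise ValueError("need at least one evaluation episode")
    return sum(episode_rewards) / len(episode_rewards)


def passes_threshold(episode_rewards, threshold=200.0):
    """True if the mean evaluation reward reaches the pass threshold.

    threshold defaults to the 200-point mark mentioned in this README.
    """
    return mean_reward(episode_rewards) >= threshold


# Hypothetical rewards, for illustration only (not results from this repository).
example_rewards = [310.0, 295.5, 320.5]
print(mean_reward(example_rewards), passes_threshold(example_rewards))
```

Averaging over several evaluation episodes rather than reporting one episode makes the figure less sensitive to a lucky or unlucky landing.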

Files in this repository:

LunarLanderContinuous_v2.ipynb
README.md
eval.png
requirements.txt
result.gif
train.png


eepgxxy/RL_LunarLanderContinuous_v2
