DRLND

This repo contains project code for Udacity's Deep Reinforcement Learning Nanodegree. Each project is a self-contained Jupyter notebook that solves a (modified) environment from the Unity ML-Agents example learning environments.

Banana Collectors

The agent needs to navigate a 3D space to collect as many yellow bananas as possible while trying to avoid blue bananas.

The environment is solved with Deep Q-Learning plus several "Rainbow" extensions, including prioritized experience replay, noisy networks, double Q-learning, and dueling networks.
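As a rough illustration of two of these extensions, the sketch below shows how a dueling head typically combines a state value with per-action advantages, and how a double Q-learning target is usually formed. It is plain Python on lists, not code taken from the notebook; the function names and arguments are hypothetical.

```python
def dueling_q_values(state_value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a).

    The mean advantage is subtracted so that V and A are identifiable.
    """
    mean_adv = sum(advantages) / len(advantages)
    return [state_value + a - mean_adv for a in advantages]


def double_q_target(reward, done, gamma, next_q_online, next_q_target):
    """Double Q-learning target for a single transition.

    The online network's Q-values choose the next action, and the target
    network's Q-values evaluate it, which reduces overestimation bias.
    """
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]
```

For example, `double_q_target(1.0, False, 0.9, [0.1, 0.5], [2.0, 1.0])` picks action 1 from the online values and bootstraps from the target network's value for that action, not from the target network's own maximum.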

Reacher

The agent needs to control a double-jointed arm to track a moving target in a 3D environment.

The environment is solved with Deep Deterministic Policy Gradient (DDPG), with extensions from Twin Delayed DDPG (TD3) and prioritized experience replay.
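The README does not say which TD3 extensions the notebook adopts. The sketch below illustrates two standard ones, target policy smoothing and the clipped double-Q target, under that assumption. The function name, the critic callables, and the default hyperparameters are hypothetical and chosen only for illustration.

```python
import random


def td3_target(reward, done, gamma, next_action, q1_target, q2_target,
               noise_std=0.2, noise_clip=0.5,
               action_low=-1.0, action_high=1.0, rng=random):
    """TD3-style bootstrap target for a single transition.

    next_action: action proposed by the target actor for the next state.
    q1_target, q2_target: callables mapping an action to a scalar Q-value
    (stand-ins for the two target critics at the next state).
    """
    # Target policy smoothing: add clipped Gaussian noise, then clip the
    # result back into the valid action range.
    smoothed = []
    for a in next_action:
        noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, noise_std)))
        smoothed.append(max(action_low, min(action_high, a + noise)))
    # Clipped double Q: bootstrap from the smaller of the two critics.
    q_min = min(q1_target(smoothed), q2_target(smoothed))
    return reward + gamma * (1.0 - float(done)) * q_min
```

Taking the minimum of two critics counteracts the value overestimation that a single DDPG critic tends to accumulate.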

Tennis

A pair of agents needs to control rackets to play toy tennis with each other.

The environment is solved with Multi-Agent DDPG (MADDPG), again with extensions from TD3 and prioritized experience replay.
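Prioritized experience replay is used in all three projects. As a minimal sketch of the proportional variant, assuming priorities are stored in a flat list (an efficient implementation would typically use a sum tree), sampling and importance weighting might look like the following. The function names and the `alpha` and `beta` defaults are hypothetical and not taken from the notebooks.

```python
import random


def per_probabilities(priorities, alpha=0.6):
    """Turn raw priorities into sampling probabilities p_i^alpha / sum_k p_k^alpha."""
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    return [s / total for s in scaled]


def per_sample(priorities, batch_size, alpha=0.6, beta=0.4, rng=random):
    """Sample transition indices and their importance-sampling weights."""
    probs = per_probabilities(priorities, alpha)
    n = len(priorities)
    # Sample with replacement, proportionally to the scaled priorities.
    indices = rng.choices(range(n), weights=probs, k=batch_size)
    # Importance weights (N * P(i))^(-beta) correct the sampling bias.
    weights = [(n * probs[i]) ** (-beta) for i in indices]
    # Normalize by the largest weight in the batch so weights only scale down.
    max_w = max(weights)
    return indices, [w / max_w for w in weights]
```

The returned weights would multiply each sample's TD error in the critic loss, and the priorities would then be refreshed from the new absolute TD errors.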

