Skip to content

Deep Q Network reinformcement learning experiment with Unity.

Notifications You must be signed in to change notification settings

ekponk/banana_rl

Repository files navigation

Banana Game Reinforcement Learning

This is an example of Deep Q Network reinformcement learning in a simple game environment. The goal is to pick up as many of the yellow bananas as possible. Yellow bananas are rewarded positively, while Blue bananas are penalized.

Some pre-trained model weights are provided. A GPU is preferred for training.

For more information, please see the Introduction to the original github project [here] ( https://github.com/udacity/deep-reinforcement-learning/tree/master/p1_navigation ).

Environment

This is an example of Deep Q Network reinformcement learning in a simple game environment. The goal is to pick up as many of the yellow bananas as possible. Yellow bananas are rewarded positively, while Blue bananas are penalized.

The environment is provided as a custom Unity project from Udacity, found here in the Banana_Linux folder. The original Linux environment can be downloaded [here] (https://s3-us-west-1.amazonaws.com/udacity-drlnd/P1/Banana/Banana_Linux.zip). The environment has 37 states. Four actions are available: forward, backward, left and right.

Two improvements to the basic Q-Learning algorithm are implemented: experience replay, and fixed Q-targets.

Manifest

File Description
navigation_project.ipynb Main ipython file
Banana_Linux Unity code repository
dqn_agent.py Learning agent
model.py PyTorch network model
checkpoint.pth Model weights
install.sh Installation
requirements.txt Necessary modules
report.md Report on the project
training.png Image
README.md This file

Installation

  1. Clone the assocated repository
  2. Run the install.sh file source install.sh
  3. jupyter-notebook navigation_project.ipynb
  4. Follow the instructions in the notebook

About

Deep Q Network reinformcement learning experiment with Unity.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published