Rational Reinforcement Learning Repository

Using Rational Networks in Simple Reinforcement Learning (Atati task so far). This repository is used in the Recurrent Rational Networks publication.

Dependencies

This Repository depends on:

MushroomRL for managing agents, environments, ... etc
Rational Activation Functions for the learnable rational activation functions
Pytorch - For the neural network part

Installation

First, please clone this repo and go into it:

git clone https://github.com/ml-research/rational_rl
cd rational_rl

A Dockerfile is provided, to create a docker image, please run:

docker build -t rationalrl . # to create a docker image
docker run -ti --gpus all -v $(pwd):/home/rl_paus rationalrl bash

This last command will instantiate a container from your image and run bash into it.*

*You need to have nvidia-docker installed to run docker containers with GPU and CUDA support (otherwise, please drop --gpu all).

Watch a trained agent play:

To watch a trained Recurrent Rational agent on Kangaroo, please provide its path:
python3 rendering_atari.py updated_agents/DQN_recrat_Kangaroo_s0_e500.zip Hereafter are provided some compiled example of DQN Agents (left with Leaky ReLU, center with Rational and right with Recurrent Rationals)

Enduro 🚘
Kangaroo 🌀
SpaceInvaders 👾
Tennis 🎾
*Agent is orange
TimePilot ✈️
Tutankham 💍

you can find more gifs in videos/gifs_files/optim/Asterix

Usage

To train a DQN agent on Space Invaders, with recurrent rational and seed set to 0:
python3 train.py -g SpaceInvaders -alg DQN -af rpau -s 0
To make the scores plot of the agent on Asterix and store it:
python3 scores_evolutions_graph.py -g Asterix -s
Creating the following image:
To get the raw scores on all activation functions and all game:
python3 scores_table.py --all
To get the bar plot comparing rational agents and original [Leaky ReLU] agent. python3 bar_plot_human_compare.py -h

To get the trained agents, please contact Quentin Delfosse

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
configs		configs
images		images
scores		scores
scores_tables		scores_tables
trained_functions		trained_functions
videos		videos
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
activation_functions.py		activation_functions.py
bar_plot_human_compare.py		bar_plot_human_compare.py
dqn_net.py		dqn_net.py
evaluate_agent.py		evaluate_agent.py
evo_scores_with_rainbow.py		evo_scores_with_rainbow.py
generate_videos.bash		generate_videos.bash
heatmap.py		heatmap.py
networks.py		networks.py
parsers.py		parsers.py
plot_all_afs.py		plot_all_afs.py
populate_histograms.py		populate_histograms.py
render_agent.py		render_agent.py
requirements.txt		requirements.txt
scores_evolutions_graph.py		scores_evolutions_graph.py
scores_table.py		scores_table.py
scores_table_sl.py		scores_table_sl.py
train.py		train.py
utils.py		utils.py
visualize_net.py		visualize_net.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rational Reinforcement Learning Repository

Dependencies

Installation

Watch a trained agent play:

Enduro 🚘

Kangaroo 🌀

SpaceInvaders 👾

Tennis 🎾

TimePilot ✈️

Tutankham 💍

Usage

About

Releases

Packages

Contributors 2

Languages

ml-research/rational_rl

Folders and files

Latest commit

History

Repository files navigation

Rational Reinforcement Learning Repository

Dependencies

Installation

Watch a trained agent play:

Enduro 🚘

Kangaroo 🌀

SpaceInvaders 👾

Tennis 🎾

TimePilot ✈️

Tutankham 💍

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages