GitHub - hedrox/random-network-distillation: Code associated with the paper "Ego Networks"

Ego Networks

Andrei Marin, Traian Rebedea, Ionel Hosu
Politehnica University of Bucharest

Abstract

Games on the Atari 2600 platform have served as a benchmark for reinforcement learning algorithms in recent years, and while deep reinforcement learning approaches make progress on most games, there are still some games that the majority of these algorithms struggle with. These are called hard exploration games. We introduce two new developments for the Random Network Distillation (RND) architecture. We apply self-attention and the mechanism of ego motion on the RND architecture and we evaluate them on three hard exploration tasks from the Atari platform. We find that the proposed ego network model improve the baseline of the RND architecture on these tasks.

Installation Guide

First install the conda environment

conda create --name <env_name> --file conda_requirements.txt

Then install dependencies that cannot be installed with conda

pip install -r pip_requirements.txt

Usage

To train an Ego RND agent on Montezuma's Revenge, run the following command

python run_atari.py --save_model

Acknowledgement

This work is based on OpenAI's Exploration by Random Network Distillation by Yuri Burda, Harri Edwards, Amos Storkey, Oleg Klimov

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
policies		policies
.gitignore		.gitignore
README.md		README.md
aggregate_runs.py		aggregate_runs.py
atari_wrappers.py		atari_wrappers.py
cmd_util.py		cmd_util.py
conda_requirements.txt		conda_requirements.txt
console_util.py		console_util.py
load_log.py		load_log.py
monitor.py		monitor.py
mpi_util.py		mpi_util.py
pip_requirements.txt		pip_requirements.txt
plot_graphs.py		plot_graphs.py
ppo_agent.py		ppo_agent.py
recorder.py		recorder.py
replayer.py		replayer.py
run_atari.py		run_atari.py
stochastic_policy.py		stochastic_policy.py
tf_util.py		tf_util.py
utils.py		utils.py
vec_env.py		vec_env.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Ego Networks

Abstract

Installation Guide

Usage

Acknowledgement

About

Releases

Packages

Languages

hedrox/random-network-distillation

Folders and files

Latest commit

History

Repository files navigation

Ego Networks

Abstract

Installation Guide

Usage

Acknowledgement

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages