Human intervention reinforcement learning

Research code for the paper "Trial without Error: Towards Safe Reinforcement Learning via Human Intervention" (arxiv) (2017)

Contributors (alphabetical): Owain Evans, Vlad Firoiu, Girish Sastry, William Saunders

Overview

This repository contains the code for human intervention reinforcement learning in Atari environments (based on OpenAI's Gym). The humanrl package contains various Gym environment wrappers and utilities that allow modifying Atari environments to include catastrophes.

scripts/human_feedback.py is a script that allows a human to intervene during offline or online training of an RL agent.

Installation and use

To label and run the code locally, first create an Anaconda environment with our packages:

conda env create
source activate humanrl

See the human feedback README for directions on providing human feedback with the OpenAI universe starter agent.

See the catastrophe wrapper for a general purpose way to add catastrophes to Gym environments.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
docker		docker
humanrl		humanrl
models		models
scripts		scripts
train		train
universe-starter-agent		universe-starter-agent
.atomignore		.atomignore
.dockerignore		.dockerignore
.gitignore		.gitignore
DOCKER-INSTRUCTIONS.md		DOCKER-INSTRUCTIONS.md
LICENSE		LICENSE
README.md		README.md
base.docker		base.docker
build_docker.sh		build_docker.sh
build_docker_gpu.sh		build_docker_gpu.sh
environment.yml		environment.yml
mac_run_docker.sh		mac_run_docker.sh
main.docker		main.docker
setup_conda.sh		setup_conda.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Human intervention reinforcement learning

Overview

Installation and use

About

Releases

Packages

Languages

License

gsastry/human-rl

Folders and files

Latest commit

History

Repository files navigation

Human intervention reinforcement learning

Overview

Installation and use

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages