Skip to content

Code for human intervention reinforcement learning

License

Notifications You must be signed in to change notification settings

gsastry/human-rl

Repository files navigation

Human intervention reinforcement learning

Research code for the paper "Trial without Error: Towards Safe Reinforcement Learning via Human Intervention" (arxiv) (2017)

Contributors (alphabetical): Owain Evans, Vlad Firoiu, Girish Sastry, William Saunders

Overview

This repository contains the code for human intervention reinforcement learning in Atari environments (based on OpenAI's Gym). The humanrl package contains various Gym environment wrappers and utilities that allow modifying Atari environments to include catastrophes.

scripts/human_feedback.py is a script that allows a human to intervene during offline or online training of an RL agent.

Installation and use

To label and run the code locally, first create an Anaconda environment with our packages:

conda env create
source activate humanrl

See the human feedback README for directions on providing human feedback with the OpenAI universe starter agent.

See the catastrophe wrapper for a general purpose way to add catastrophes to Gym environments.

About

Code for human intervention reinforcement learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published