Skip to content

Maximum Causal Entropy Inverse Reinforcement Learning

License

Notifications You must be signed in to change notification settings

DavidJunL/max-causal-ent-irl

 
 

Repository files navigation

Maximum Causal Entropy Inverse Reinforcement Learning

Python implementation of Maximum Causal Entropy Inverse Reinforcement Learning from Brian Ziebart's PhD thesis (2010): http://www.cs.cmu.edu/~bziebart/publications/thesis-bziebart.pdf

Main file: max_causal_ent_irl.py.

Dependencies: numpy, gym.

Algorithm notes

The finite horizon version of the algorithm is consistent and works as it should by Ziebart (2010).

The discounted infinite horizon version used in e.g Levine's Gaussian Process IRL is work in progress. Current TODOs:

  1. Compute P_0 for the occupancy measure for the gradient of the infinite horizon discounted version of the algorithm as in section A of Levine's GPIRL supplement http://graphics.stanford.edu/projects/gpirl/gpirl_supplement.pdf.

About

Maximum Causal Entropy Inverse Reinforcement Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%