Skip to content

Latest commit

 

History

History
31 lines (23 loc) · 958 Bytes

README.md

File metadata and controls

31 lines (23 loc) · 958 Bytes

Reinforcement Learning

Reinforcement learning algorithms implementation. Custom OpenCV based environment to test codes.

Implemented Algorithms:

  • Value Iteration
  • Policy Iteration
  • Q-Learning
  • Deep Q-Learning
  • Vanilla Policy Gradient - REINFORCE
  • A2C
  • A3C
  • PPO
  • DDPG

Results

Value and Policy Iteration

Method Deterministic Frozen Lake Stochastic Frozen Lake
Value Iteration 7 8
Policy Iteration 7 3
Q-Learning

Deep Q-Learning