Mountain Car Actor-Critic

This is an implementation of the Actor-Critic reinforcement learning algorithm as a solution to the Mountain Car environment in OpenAI Gym.

The agent explores the environment and improves its policy based on the rewards it receives, eventually arriving at an optimal policy.
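To make the idea concrete, here is a minimal sketch of a single one-step Actor-Critic update. It is not taken from this repository: it assumes the discrete-action Mountain Car variant (three actions, state given as position and velocity), a linear critic, and a softmax actor. The names `features`, `actor_critic_step`, `theta`, and `w`, as well as the learning rates, are hypothetical choices made for illustration.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array of action preferences."""
    z = x - np.max(x)
    e = np.exp(z)
    return e / e.sum()


def features(state):
    """Hypothetical feature map: position, velocity, and a bias term."""
    position, velocity = state
    return np.array([position, velocity, 1.0])


def actor_critic_step(theta, w, state, action, reward, next_state, done,
                      gamma=0.99, alpha_actor=0.01, alpha_critic=0.1):
    """Apply one one-step Actor-Critic update.

    theta: actor weights, shape (n_actions, n_features)
    w:     critic weights, shape (n_features,)
    Returns the updated (theta, w) and the TD error.
    """
    x = features(state)
    x_next = features(next_state)

    # Critic: TD error of the linear state-value estimate.
    v = w @ x
    v_next = 0.0 if done else w @ x_next
    td_error = reward + gamma * v_next - v

    # Critic update: semi-gradient TD(0).
    w = w + alpha_critic * td_error * x

    # Actor update: gradient of log pi(action | state) for a softmax policy,
    # scaled by the TD error.
    probs = softmax(theta @ x)
    grad_log_pi = -np.outer(probs, x)
    grad_log_pi[action] += x
    theta = theta + alpha_actor * td_error * grad_log_pi

    return theta, w, td_error


# Example: one transition with reward -1, starting from zero weights.
theta = np.zeros((3, 3))  # 3 actions x 3 features (assumed sizes)
w = np.zeros(3)
theta, w, td_error = actor_critic_step(
    theta, w, state=(-0.5, 0.0), action=2, reward=-1.0,
    next_state=(-0.49, 0.01), done=False,
)
print(td_error)
```

The critic estimates how good a state is, and the TD error tells the actor whether the chosen action turned out better or worse than expected. A real agent would repeat this update over many episodes and would likely need richer features (for example tile coding or a neural network) to solve Mountain Car.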