
energy_py

energy_py supports running reinforcement learning experiments on energy environments.

energy_py is built and maintained by Adam Green - [email protected].

Basic usage

The most common access point for a user is running an experiment. Experiments are set up using config files that live in energy_py/experiments/configs. An experiment is run by passing the experiment name and run name as arguments:

$ cd energy_py/experiments

$ python experiment.py example dqn
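
The expt.ini and runs.ini files seen in the results listing below are plain INI files. A minimal sketch of what a config pair might contain - the section and key names here are illustrative assumptions, not the actual schema (see energy_py/experiments/configs for real examples):

;  a hypothetical expt.ini - key names are assumptions
[expt]
env_id = battery

;  a hypothetical runs.ini - one section per run
[dqn]
agent_id = dqn
total_steps = 1000000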

energy_py provides a simple and familiar gym-style low-level API for initializing agents and environments and running their interactions:

import energy_py

env = energy_py.make_env(env_id='battery')

agent = energy_py.make_agent(
    agent_id='dqn',
    env=env,
    total_steps=1000000,
)

#  run a single episode
observation = env.reset()
done = False

while not done:
    action = agent.act(observation)
    next_observation, reward, done, info = env.step(action)
    training_info = agent.learn()
    observation = next_observation

Results for this run are then available at

$ cd energy_py/experiments/results/example/dqn

$ ls
agent_args.txt
debug.log
env_args.txt
env_histories
ep_rewards.csv
expt.ini
info.log
runs.ini

The progress of an experiment can be watched with TensorBoard

$ tensorboard --logdir='./energy_py/experiments/results'

Learning performance of the DQN agent on the flexibility environment across two random seeds. The red line is a naive benchmark agent. The reward per episode can be directly interpreted as dollars per day.


Installation

The main dependencies of energy_py are TensorFlow, numpy, pandas and matplotlib.

To install energy_py using an Anaconda virtual environment

$ conda create --name energy_py python=3.5.2

$ activate energy_py         # windows
$ source activate energy_py  # unix

$ git clone https://github.com/ADGEfficiency/energy_py.git

$ cd energy_py

#  currently the package only works when installed as develop -
#  this is due to how the dataset csvs are loaded - see
#  https://github.com/ADGEfficiency/energy_py/issues/34
$ python setup.py develop

$ pip install --ignore-installed -r requirements.txt

Philosophy

The aim of energy_py is to provide

  • high quality implementations of agents suited to solving energy problems
  • energy environments based on decarbonization problems
  • tools to run experiments

The design philosophies of energy_py are

  • simplicity
  • iterative design
  • simple class hierarchy (maximum of two levels)
  • ideally code will document itself - comments only used when needed
  • utilize the Python standard library (deques, namedtuples etc.)
  • utilize TensorFlow and TensorBoard
  • provide sensible defaults for args

Preference is given to improving and iterating on existing designs over implementing new agents or environments.

energy_py was heavily influenced by OpenAI baselines and gym.

Agents

energy_py is currently focused on a high quality implementation of DQN (Mnih et al. 2015), along with naive and heuristic agents for comparison.

DQN was chosen because

  • most energy environments have low dimensional action spaces, making discretization tractable. Discretization discards the continuous shape of the action space, but at low dimensionality the number of discrete actions stays reasonable (see the sketch after this list)
  • highly extensible (DDQN, prioritized experience replay, dueling architectures, n-step returns - see Rainbow for a summary)
  • able to learn off policy
  • established algorithm, with many implementations on GitHub
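
As a rough illustration of the discretization point above - a minimal sketch using numpy and itertools, with made-up action bounds rather than energy_py's internals:

import itertools

import numpy as np

#  one continuous action dimension: battery rate in MW (bounds assumed)
discrete = np.linspace(-2.0, 2.0, 5)

#  the table grows as bins ** dims, so it stays tractable only
#  while the action space is low dimensional
actions = list(itertools.product(discrete, repeat=2))
assert len(actions) == 5 ** 2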

Naive agents include one that randomly samples the action space, independent of the observation. Heuristic agents are usually custom built for a specific environment - for example, taking actions based on the time of day or on the values of a forecast.
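
A minimal sketch of a time of day heuristic - the class and its interface are hypothetical, not energy_py's agent API:

class TimeOfDayAgent:
    """Heuristic that discharges during assumed evening peak hours."""

    def __init__(self, peak_hours=(17, 18, 19, 20)):
        self.peak_hours = set(peak_hours)

    def act(self, hour):
        #  positive action = discharge, negative = charge (a convention
        #  assumed for illustration)
        return 1.0 if hour in self.peak_hours else -1.0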

Environments

energy_py provides custom built models of energy environments and wraps around OpenAI gym. Support for basic gym environments is included to allow debugging of agents with familiar environments.

Beware that gym handles action space randomness in particular ways - the v0 versions of gym environments ignore the selected action 25% of the time and repeat the previous action (to make the environment more stochastic), while the v4 versions remove this randomness.

CartPole-v0

Classic cartpole balancing - gym - energy_py

Pendulum-v0

Inverted pendulum swingup - gym - energy_py

MountainCar-v0

An exploration problem - gym - energy_py

Electric battery storage

Dispatch of a battery arbitraging wholesale prices - energy_py

The battery is defined by a capacity and maximum rates of charge and discharge, with a round trip efficiency applied to stored energy.
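
A minimal sketch of one step of such a battery model - the function and its arguments are illustrative assumptions, not energy_py's implementation:

def battery_step(charge, action, capacity, max_rate, efficiency):
    """Positive action charges, negative discharges (MW over one step)."""
    rate = min(max(action, -max_rate), max_rate)
    if rate > 0:
        #  round trip efficiency applied to energy entering storage
        return min(charge + rate * efficiency, capacity)
    return max(charge + rate, 0.0)

#  charge a half full 4 MWh battery at 1 MW with 90% efficiency
battery_step(charge=2.0, action=1.0, capacity=4.0, max_rate=2.0, efficiency=0.9)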

Demand side flexibility

Dispatch of price responsive demand side flexibility - energy_py

The flexible asset is a chiller system, with an action space over the return temperature setpoint. Increasing the setpoint reduces electrical demand; decreasing it increases demand.
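
An illustrative sketch of that relationship - the linear response and the numbers are assumptions, not the environment's actual model:

def chiller_demand(baseline_mw, setpoint_delta, sensitivity=0.5):
    """A raised return temperature setpoint lowers electrical demand."""
    return baseline_mw - sensitivity * setpoint_delta

chiller_demand(baseline_mw=2.0, setpoint_delta=1.0)   # 1.5 MW - demand falls
chiller_demand(baseline_mw=2.0, setpoint_delta=-1.0)  # 2.5 MW - demand rises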
