Optimizing Energy Production Using Policy Search and Predictive State Representations.

Yuri Grinberg, Doina Precup, Michel Gendreau

Published in: NIPS (2014)

Keyphrases

policy search
predictive state representations
partially observable Markov decision processes
dynamical systems
reinforcement learning
continuous state
stochastic systems
temporal difference
reinforcement learning algorithms
dynamic programming
decision problems
finite state
planning problems
reward function
optimal policy
neural network
state space
policy gradient