Optimizing Energy Production Using Policy Search and Predictive State Representations.
Yuri Grinberg, Doina Precup, Michel Gendreau
Published in: NIPS (2014)
Keyphrases
- policy search
- predictive state representations
- partially observable Markov decision processes
- dynamical systems
- reinforcement learning
- continuous state
- stochastic systems
- temporal difference
- reinforcement learning algorithms
- dynamic programming
- decision problems
- finite state
- planning problems
- reward function
- optimal policy
- neural network
- state space
- policy gradient