Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy.
Robin Spiess
Felix Berkenkamp
Jan Poland
Andreas Krause
Published in:
CoRR (2018)
Keyphrases
optimal policy
reinforcement learning
learning algorithm
Markov decision processes
average reward reinforcement learning
infinite horizon
Bayesian reinforcement learning
decision making
dynamic programming
Monte Carlo
long run
finite horizon