Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy.
Robin Spiess
Felix Berkenkamp
Andreas Krause
Jan Poland
Published in:
ECC (2019)
Keyphrases
optimal policy
reinforcement learning
learning algorithm
state space
average reward reinforcement learning
infinite horizon
multistage
decision problems
state dependent
decision making
dynamic programming
linear programming
Markov decision processes