Login / Signup
Least Squares Policy Iteration with Instrumental Variables vs. Direct Policy Search: Comparison Against Optimal Benchmarks Using Energy Storage.
Warren R. Scott
Warren B. Powell
Somayeh Moazehi
Published in:
CoRR (2014)
Keyphrases
</>
direct policy search
reinforcement learning
dynamic programming
optimal control
genetic algorithm
optimal solution
support vector
state variables
model free