Login / Signup
Simple random search of static linear policies is competitive for reinforcement learning.
Horia Mania
Aurelia Guy
Benjamin Recht
Published in:
NeurIPS (2018)
Keyphrases
</>
random search
reinforcement learning
optimal policy
simulated annealing
function approximation
parameter optimization
learning algorithm
evolutionary algorithm
genetic algorithm
dynamic programming
state space
policy search