Login / Signup
An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search.
Kyunghyun Lee
Byeong-Uk Lee
Ukcheol Shin
In So Kweon
Published in:
CoRR (2020)
Keyphrases
</>
policy search
objective function
dynamic programming
model free
theoretical analysis
machine learning
reinforcement learning
support vector machine
convergence rate
markov model