Login / Signup
Accelerating Model-Free Policy Optimization Using Model-Based Gradient: A Composite Optimization Perspective.
Yansong Li
Shuo Han
Published in:
L4DC (2022)
Keyphrases
</>
model free
optimization problems
reinforcement learning
search space
learning algorithm
pattern recognition
small number
image classification
policy iteration