Login / Signup
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem.
Hesameddin Mohammadi
Armin Zare
Mahdi Soltanolkotabi
Mihailo R. Jovanovic
Published in:
CoRR (2019)
Keyphrases
</>
model free
sample complexity
reinforcement learning
linear quadratic
data mining
learning problems
density estimation