Login / Signup

Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity.

Bo LiuIan GempMohammad GhavamzadehJi LiuSridhar MahadevanMarek Petrik
Published in: J. Artif. Intell. Res. (2018)
Keyphrases