Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity.

Bo Liu, Ian Gemp, Mohammad Ghavamzadeh, Ji Liu, Sridhar Mahadevan, Marek Petrik
Published in: J. Artif. Intell. Res. (2018)
Keyphrases