Login / Signup

Variance-constrained actor-critic algorithms for discounted and average reward MDPs.

Prashanth L. A.Mohammad Ghavamzadeh
Published in: Mach. Learn. (2016)
Keyphrases