Login / Signup
Provably Convergent Policy Gradient Methods for Model-Agnostic Meta-Reinforcement Learning.
Alireza Fallah
Aryan Mokhtari
Asuman E. Ozdaglar
Published in:
CoRR (2020)
Keyphrases
</>
reinforcement learning
mathematical model
model free
neural network
dynamic programming
decision theoretic