CoMPS: Continual Meta Policy Search.
Glen Berseth
Zhiwei Zhang
Grace Zhang
Chelsea Finn
Sergey Levine
Published in:
CoRR (2021)
Keyphrases
policy search
reinforcement learning
continuous state
dynamic programming
reinforcement learning algorithms
continuous action
reward function
policy gradient
Markov decision processes
partially observable Markov decision processes
Markov decision problems