Login / Signup
Memory Based Trajectory-conditioned Policies for Learning from Sparse Rewards.
Yijie Guo
Jongwook Choi
Marcin Moczulski
Shengyu Feng
Samy Bengio
Mohammad Norouzi
Honglak Lee
Published in:
NeurIPS (2020)
Keyphrases
</>
reinforcement learning
learning process
supervised learning
learning systems
learning algorithm
learning problems
neural network
mobile devices
active learning
online learning
markov decision processes
learning community