Login / Signup
On Learning Intrinsic Rewards for Policy Gradient Methods.
Zeyu Zheng
Junhyuk Oh
Satinder Singh
Published in:
CoRR (2018)
Keyphrases
</>
reinforcement learning
learning algorithm
learning process
supervised learning
learning problems
actor critic
learning tasks
policy gradient methods
pose estimation