On Learning Intrinsic Rewards for Policy Gradient Methods.

Zeyu Zheng, Junhyuk Oh, Satinder Singh

Published in: CoRR (2018)

Keyphrases

reinforcement learning
learning algorithm
learning process
supervised learning
learning problems
actor critic
learning tasks
policy gradient methods
pose estimation