Meta-Gradient Reinforcement Learning with an Objective Discovered Online.
Zhongwen Xu
Hado van Hasselt
Matteo Hessel
Junhyuk Oh
Satinder Singh
David Silver
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
online learning
real time
state space
function approximation
machine learning
optimal policy
balancing exploration and exploitation
website
learning process
optimal control
multiple objectives
structure tensor
policy gradient