Login / Signup
Improving Deep Reinforcement Learning With Mirror Loss.
Jian Zhao
Weide Shu
Youpeng Zhao
Wengang Zhou
Houqiang Li
Published in:
IEEE Trans. Games (2023)
Keyphrases
</>
reinforcement learning
function approximation
real time
machine learning
model free
learning algorithm
viewpoint
state space
optimal policy
markov decision processes
reinforcement learning algorithms
temporal difference
learning classifier systems
least squares
computer vision
information retrieval
neural network