Improving Deep Reinforcement Learning With Mirror Loss.

Jian Zhao Weide Shu Youpeng Zhao Wengang Zhou Houqiang Li

Published in: IEEE Trans. Games (2023)

Keyphrases

reinforcement learning
function approximation
real time
machine learning
model free
learning algorithm
viewpoint
state space
optimal policy
markov decision processes
reinforcement learning algorithms
temporal difference
learning classifier systems
least squares
computer vision
information retrieval
neural network