Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards.

Zhao-Yang Fu, De-Chuan Zhan, Xin-Chun Li, Yi-Xing Lu

Published in: IJCAI (2019)

Keyphrases

reinforcement learning
Markov decision processes
function approximation
machine learning
learning algorithm
optimal policy
case study
neural network
real world
multi agent
state space
data driven
reinforcement learning algorithms