Automatic Successive Reinforcement Learning with Multiple Auxiliary Rewards.
Zhao-Yang Fu
De-Chuan Zhan
Xin-Chun Li
Yi-Xing Lu
Published in:
IJCAI (2019)
Keyphrases
reinforcement learning
markov decision processes
function approximation
machine learning
learning algorithm
optimal policy
case study
neural network
real world
multi agent
state space
data driven
reinforcement learning algorithms