Active Deep Q-learning with Demonstration.

Si-An Chen Voot Tangkaratt Hsuan-Tien Lin Masashi Sugiyama

Published in: CoRR (2018)

Keyphrases

reinforcement learning
cooperative
function approximation
model free
state space
multi agent
learning algorithm
neural network
temporal difference learning
reinforcement learning algorithms
supervised learning
bayesian networks
data mining
search algorithm
probabilistic model
image sequences
case study
machine learning
real time
stochastic approximation