Active Deep Q-learning with Demonstration.
Si-An ChenVoot TangkarattHsuan-Tien LinMasashi SugiyamaPublished in: CoRR (2018)
Keyphrases
- reinforcement learning
- cooperative
- function approximation
- model free
- state space
- multi agent
- learning algorithm
- neural network
- temporal difference learning
- reinforcement learning algorithms
- supervised learning
- bayesian networks
- data mining
- search algorithm
- probabilistic model
- image sequences
- case study
- machine learning
- real time
- stochastic approximation