Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning.
Shang-Yu SuXiujun LiJianfeng GaoJingjing LiuYun-Nung ChenPublished in: EMNLP (2018)
Keyphrases
- learning process
- learning algorithm
- learning problems
- domain independent
- unsupervised learning
- reinforcement learning
- deep learning
- learning systems
- prior knowledge
- generative model
- online learning
- active learning
- function approximation
- decision theoretic
- discriminative learning
- natural language
- temporal difference learning