Login / Signup
Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning.
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Kam-Fai Wong
Published in:
ACL (1) (2018)
Keyphrases
</>
reinforcement learning
learning process
learning algorithm
active learning
mixed initiative
knowledge acquisition
learning problems
neural network
natural language
supervised learning
learning systems
heuristic search
learning tasks
action selection