Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning.

Baolin Peng Xiujun Li Jianfeng Gao Jingjing Liu Kam-Fai Wong

Published in: ACL (1) (2018)

Keyphrases

reinforcement learning
learning process
learning algorithm
active learning
mixed initiative
knowledge acquisition
learning problems
neural network
natural language
supervised learning
learning systems
heuristic search
learning tasks
action selection