Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems.
Pei-Hao Su
David Vandyke
Milica Gasic
Nikola Mrksic
Tsung-Hsien Wen
Steve J. Young
Published in:
SIGDIAL Conference (2015)
Keyphrases
recurrent neural networks
recurrent networks
learning algorithm
reinforcement learning
spoken dialogue systems
supervised learning
learning process
dialogue system
echo state networks
natural language
knowledge acquisition
learning tasks
markov decision process
policy gradient