Sign in

Reward Shaping with Recurrent Neural Networks for Speeding up On-Line Policy Learning in Spoken Dialogue Systems.

Pei-Hao SuDavid VandykeMilica GasicNikola MrksicTsung-Hsien WenSteve J. Young
Published in: SIGDIAL Conference (2015)
Keyphrases