Sign in

On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems.

Pei-Hao SuMilica GasicNikola MrksicLina Maria Rojas-BarahonaStefan UltesDavid VandykeTsung-Hsien WenSteve J. Young
Published in: ACL (1) (2016)
Keyphrases