On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems.

Published in: ACL (1) (2016)

Keyphrases