Login / Signup

Off-policy learning in large-scale POMDP-based dialogue systems.

Lucie DaubigneyMatthieu GeistOlivier Pietquin
Published in: ICASSP (2012)
Keyphrases
  • dialogue system
  • reinforcement learning
  • learning algorithm
  • knowledge acquisition
  • ground truth
  • human computer
  • continuous state
  • predictive state representations