Login / Signup
Sample-efficient batch reinforcement learning for dialogue management optimization.
Olivier Pietquin
Matthieu Geist
Senthilkumar Chandramohan
Hervé Frezza-Buet
Published in:
ACM Trans. Speech Lang. Process. (2011)
Keyphrases
</>
dialogue management
reinforcement learning
machine learning
domain specific
optimal policy
function approximation
knowledge base
dynamic programming
information extraction
natural language processing
model free