• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Sample-efficient batch reinforcement learning for dialogue management optimization.

Olivier PietquinMatthieu GeistSenthilkumar ChandramohanHervé Frezza-Buet
Published in: ACM Trans. Speech Lang. Process. (2011)
Keyphrases
  • dialogue management
  • reinforcement learning
  • machine learning
  • domain specific
  • optimal policy
  • function approximation
  • knowledge base
  • dynamic programming
  • information extraction
  • natural language processing
  • model free