Login / Signup

Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization.

Yangyang ZhaoZhenyu WangMehdi DastaniShihan Wang
Published in: CoRR (2023)
Keyphrases
  • dialogue system
  • dead ends
  • tutorial dialogue
  • optimization problems
  • reinforcement learning
  • multi dimensional
  • optimal policy
  • message passing