Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization.
Yangyang Zhao
Zhenyu Wang
Mehdi Dastani
Shihan Wang
Published in:
CoRR (2023)
Keyphrases
dialogue system
dead ends
tutorial dialogue
optimization problems
reinforcement learning
multi dimensional
optimal policy
message passing