Login / Signup
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems.
Yihao Feng
Shentao Yang
Shujian Zhang
Jianguo Zhang
Caiming Xiong
Mingyuan Zhou
Huan Wang
Published in:
CoRR (2023)
Keyphrases
</>
dialogue system
reinforcement learning
tutorial dialogue
learning process
learning algorithm
human computer
natural language
knowledge acquisition
machine learning
prior knowledge
user interaction
natural language generation
dialogue management