Login / Signup
Multi-Action Dialog Policy Learning from Logged User Feedback.
Shuo Zhang
Junzhou Zhao
Pinghui Wang
Tianxiang Wang
Zi Liang
Jing Tao
Yi Huang
Junlan Feng
Published in:
CoRR (2023)
Keyphrases
</>
user feedback
machine learning
learning algorithm
active learning
user interaction
action selection
search engine
training data
spoken dialog
information retrieval
user interface
supervised learning
information retrieval systems