Login / Signup
Adaptive Dialog Policy Learning with Hindsight and User Modeling.
Yan Cao
Keting Lu
Xiaoping Chen
Shiqi Zhang
Published in:
SIGdial (2020)
Keyphrases
</>
user modeling
learning process
learning systems
learning algorithm
adaptive hypermedia
optimal policy
adaptive systems
adaptive learning
user profiles
supervised learning
training data
information retrieval systems
domain independent
general purpose
action selection
secondary school
reinforcement learning