Login / Signup
Adaptive Dialog Policy Learning with Hindsight and User Modeling.
Yan Cao
Keting Lu
Xiaoping Chen
Shiqi Zhang
Published in:
CoRR (2020)
Keyphrases
</>
user modeling
adaptive hypermedia
learning process
learning systems
learning algorithm
adaptive learning
reinforcement learning
user interface
optimal policy
active learning
supervised learning
general purpose
actor critic
user model
online learning
website
artificial intelligence