C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Replicating Complex Dialogue Policy of Humans via Offline Imitation Learning with Supervised Regularization.
Zhoujian Sun
Chenyang Zhao
Zhengxing Huang
Nai Ding
Published in:
CoRR (2023)
Keyphrases
</>
imitation learning
real time
feature selection
human teacher
learning algorithm
reinforcement learning
pairwise
active learning
supervised learning
multi modal
maximum margin