Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning.
Baolin Peng
Xiujun Li
Jianfeng Gao
Jingjing Liu
Yun-Nung Chen
Kam-Fai Wong
Published in:
CoRR (2017)
Keyphrases
actor critic
reinforcement learning
learning algorithm
mathematical model
learning problems
machine learning
supervised learning
neuro fuzzy
policy gradient