Efficient Exploration for Dialog Policy Learning with Deep BBQ Networks & Replay Buffer Spiking.
Zachary C. Lipton
Jianfeng Gao
Lihong Li
Xiujun Li
Faisal Ahmed
Li Deng
Published in:
CoRR (2016)
Keyphrases
learning process
learning algorithm
connectionist networks
learning scheme
reinforcement learning
bio inspired
spiking neural networks
social networks
online learning
unsupervised learning
learning tasks
deep learning
recurrent networks
conversational agents