Batch Policy Gradient Methods for Improving Neural Conversation Models.
Kirthevasan Kandasamy
Yoram Bachrach
Ryota Tomioka
Daniel Tarlow
David Carter
Published in:
CoRR (2017)
Keyphrases
neural network
Monte Carlo