Login / Signup
End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning.
Jason D. Williams
Geoffrey Zweig
Published in:
CoRR (2016)
Keyphrases
</>
end to end
reinforcement learning
optimal control
learning algorithm
multipath
ad hoc networks
admission control
wireless ad hoc networks
congestion control
high bandwidth
rate allocation
transport layer
markov decision processes
computer networks
control policy
packet loss rate