End-to-end LSTM-based dialog control optimized with supervised and reinforcement learning.

Jason D. Williams, Geoffrey Zweig

Published in: CoRR (2016)

Keyphrases

end to end
reinforcement learning
optimal control
learning algorithm
multipath
ad hoc networks
admission control
wireless ad hoc networks
congestion control
high bandwidth
rate allocation
transport layer
Markov decision processes
computer networks
control policy
packet loss rate