The Predictron: End-To-End Learning and Planning.
David Silver
Hado van Hasselt
Matteo Hessel
Tom Schaul
Arthur Guez
Tim Harley
Gabriel Dulac-Arnold
David P. Reichert
Neil C. Rabinowitz
André Barreto
Thomas Degris
Published in:
CoRR (2016)
Keyphrases
end to end
reinforcement learning
admission control
real world
motion estimation
ad hoc networks
content delivery