Login / Signup
Combining Q-Learning and Search with Amortized Value Estimates.
Jessica B. Hamrick
Victor Bapst
Alvaro Sanchez-Gonzalez
Tobias Pfaff
Theophane Weber
Lars Buesing
Peter W. Battaglia
Published in:
ICLR (2020)
Keyphrases
</>
search algorithm
reinforcement learning
search efficiency
search space
neural network
learning algorithm
search methods
search strategy
search strategies
cooperative
state space
worst case
function approximation
search tree