Combining Q-Learning and Search with Amortized Value Estimates.

Jessica B. Hamrick Victor Bapst Alvaro Sanchez-Gonzalez Tobias Pfaff Theophane Weber Lars Buesing Peter W. Battaglia

Published in: ICLR (2020)

Keyphrases