Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark.

Published in: NeurIPS (2021)

Keyphrases