Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark.

Published in: CoRR (2021)

Keyphrases