Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark.
Stefan O'Toole
Nir Lipovetzky
Miquel Ramírez
Adrian R. Pearce
Published in:
NeurIPS (2021)
Keyphrases
optimal policy
heuristic search
lower bound
search algorithm
dynamic programming