Login / Signup
Width-based Lookaheads with Learnt Base Policies and Heuristics Over the Atari-2600 Benchmark.
Stefan O'Toole
Nir Lipovetzky
Miquel Ramírez
Adrian R. Pearce
Published in:
CoRR (2021)
Keyphrases
</>
heuristic search
search algorithm
learning algorithm
cooperative
least squares
dynamic environments
optimal policy