Monte-Carlo Tree Search as Regularized Policy Optimization.
Jean-Bastien Grill
Florent Altché
Yunhao Tang
Thomas Hubert
Michal Valko
Ioannis Antonoglou
Rémi Munos
Published in:
CoRR (2020)
Keyphrases
monte carlo tree search
monte carlo
bayesian reinforcement learning
tree search algorithm
evaluation function
monte carlo search
optimal policy
least squares
dynamic programming
upper bound
alpha beta search