Scalable Safe Policy Improvement via Monte Carlo Tree Search.
Alberto Castellini
Federico Bianchi
Edoardo Zorzi
Thiago D. Simão
Alessandro Farinelli
Matthijs T. J. Spaan
Published in:
ICML (2023)
Keyphrases
Monte Carlo tree search
Bayesian reinforcement learning
Monte Carlo
tree search algorithm
Monte Carlo search
evaluation function
optimal policy
temporal difference learning
temporal difference
game tree