Stabilized Nested Rollout Policy Adaptation.
Tristan Cazenave
Jean-Baptiste Sevestre
Matthieu Toulemont
Published in:
CoRR (2021)
Keyphrases
optimal policy
monte carlo search
monte carlo tree search
adaptation process
approximate policy iteration
reinforcement learning
evaluation function
case study
expert systems
action selection
markov decision process
information retrieval
monte carlo
markov decision processes
policy makers