Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning.
Yuxi Xie
Anirudh Goyal
Wenyue Zheng
Min-Yen Kan
Timothy P. Lillicrap
Kenji Kawaguchi
Michael Shieh
Published in:
CoRR (2024)
Keyphrases
monte carlo tree search
preference learning
monte carlo
ordinal regression
gaussian processes
evaluation function
recommender systems
pairwise comparison
active learning
temporal difference learning
ranking functions
data mining
learning algorithm
multi-agent
multiple criteria
game tree