Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning.

Yuxi Xie, Anirudh Goyal, Wenyue Zheng, Min-Yen Kan, Timothy P. Lillicrap, Kenji Kawaguchi, Michael Shieh
Published in: CoRR (2024)
Keyphrases