Teaching Large Language Models to Reason with Reinforcement Learning.

Alex Havrilla, Yuqing Du, Sharath Chandra Raparthy, Christoforos Nalmpantis, Jane Dwivedi-Yu, Maksym Zhuravinskyi, Eric Hambro, Sainbayar Sukhbaatar, Roberta Raileanu
Published in: CoRR (2024)
Keyphrases