
Learning to Learn Faster from Human Feedback with Language Model Predictive Control.

Jacky Liang, Fei Xia, Wenhao Yu, Andy Zeng, Montserrat Gonzalez Arenas, Maria Attarian, Maria Bauzá, Matthew Bennice, Alex Bewley, Adil Dostmohamed, Chuyuan Kelly Fu, Nimrod Gileadi, Marissa Giustina, Keerthana Gopalakrishnan, Leonard Hasenclever, Jan Humplik, Jasmine Hsu, Nikhil J. Joshi, Ben Jyenis, J. Chase Kew, Sean Kirmani, Tsang-Wei Edward Lee, Kuang-Huei Lee, Assaf Hurwitz Michaely, Joss Moore, Ken Oslund, Dushyant Rao, Allen Z. Ren, Baruch Tabanpour, Quan Vuong, Ayzaan Wahid, Ted Xiao, Ying Xu, Vincent Zhuang, Peng Xu, Erik Frey, Ken Caluwaerts, Tingnan Zhang, Brian Ichter, Jonathan Tompson, Leila Takayama, Vincent Vanhoucke, Izhak Shafran, Maja J. Mataric, Dorsa Sadigh, Nicolas Heess, Kanishka Rao, Nik Stewart, Jie Tan, Carolina Parada
Published in: CoRR (2024)
Keyphrases