Sign in

AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.

Yann DuboisXuechen LiRohan TaoriTianyi ZhangIshaan GulrajaniJimmy BaCarlos GuestrinPercy LiangTatsunori B. Hashimoto
Published in: CoRR (2023)
Keyphrases
  • main contribution
  • preprocessing
  • machine learning methods
  • qualitative and quantitative
  • real time
  • feature extraction
  • benchmark datasets
  • theoretical framework
  • simulation environment