Sign in
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback.
Yann Dubois
Xuechen Li
Rohan Taori
Tianyi Zhang
Ishaan Gulrajani
Jimmy Ba
Carlos Guestrin
Percy Liang
Tatsunori B. Hashimoto
Published in:
CoRR (2023)
Keyphrases
</>
main contribution
preprocessing
machine learning methods
qualitative and quantitative
real time
feature extraction
benchmark datasets
theoretical framework
simulation environment