Login / Signup
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.
Shun Zhang
Zhenfang Chen
Sunli Chen
Yikang Shen
Zhiqing Sun
Chuang Gan
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
mathematical model
statistical model
computational model
experimental data
management system
machine learning
genetic algorithm
probabilistic model
computational models
learning algorithm
artificial neural networks