C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.
Shun Zhang
Zhenfang Chen
Sunli Chen
Yikang Shen
Zhiqing Sun
Chuang Gan
Published in:
CoRR (2024)
Keyphrases
</>
reinforcement learning
mathematical model
statistical model
computational model
experimental data
management system
machine learning
genetic algorithm
probabilistic model
computational models
learning algorithm
artificial neural networks