Login / Signup
Towards Comprehensive Preference Data Collection for Reward Modeling.
Yulan Hu
Qingyang Li
Sheng Ouyang
Ge Chen
Kaihui Chen
Lijun Mei
Xucheng Ye
Fuzheng Zhang
Yong Liu
Published in:
CoRR (2024)
Keyphrases
</>
data collection
reinforcement learning
data analysis
modeling method
data sets
multiscale
user preferences
information retrieval
computer vision
knowledge base
website
clustering algorithm
sensor networks
collecting data