Login / Signup
OPTune: Efficient Online Preference Tuning.
Lichang Chen
Jiuhai Chen
Chenxi Liu
John Kirchenbauer
Davit Soselia
Chen Zhu
Tom Goldstein
Tianyi Zhou
Heng Huang
Published in:
CoRR (2024)
Keyphrases
</>
neural network
online learning
computationally efficient
data sets
control system
real time
real world
artificial intelligence
reinforcement learning
data structure
lower bound