Login / Signup

OPTune: Efficient Online Preference Tuning.

Lichang ChenJiuhai ChenChenxi LiuJohn KirchenbauerDavit SoseliaChen ZhuTom GoldsteinTianyi ZhouHeng Huang
Published in: CoRR (2024)
Keyphrases
  • neural network
  • online learning
  • computationally efficient
  • data sets
  • control system
  • real time
  • real world
  • artificial intelligence
  • reinforcement learning
  • data structure
  • lower bound