Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward.

Ruohong Zhang, Liangke Gui, Zhiqing Sun, Yihao Feng, Keyang Xu, Yuanhan Zhang, Di Fu, Chunyuan Li, Alexander Hauptmann, Yonatan Bisk, Yiming Yang
Published in: CoRR (2024)
Keyphrases