Login / Signup
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback.
Tianyu Yu
Yuan Yao
Haoye Zhang
Taiwen He
Yifeng Han
Ganqu Cui
Jinyi Hu
Zhiyuan Liu
Hai-Tao Zheng
Maosong Sun
Tat-Seng Chua
Published in:
CoRR (2023)
Keyphrases
</>
fine grained
coarse grained
human behavior
tightly coupled
access control
massively parallel
human subjects
human teacher
web search
human operators
data provenance