Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective.
Tianyi Qiu
Fanzhi Zeng
Jiaming Ji
Dong Yan
Kaile Wang
Jiayi Zhou
Han Yang
Josef Dai
Xuehai Pan
Yaodong Yang
Published in: CoRR (2024)
Keyphrases
graph theory
neural network
social network analysis
NP-complete
semantic information
information flow