Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective.

Published in: CoRR (2024)

Keyphrases