HAF-RM: A Hybrid Alignment Framework for Reward Model Training.

Published in: CoRR (2024)

Keyphrases