Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment.

Published in: CoRR (2024)
