Login / Signup

Reinforcement Learning from LLM Feedback to Counteract Goal Misgeneralization.

Houda Nait El BarjThéophile Sautory
Published in: CoRR (2024)
Keyphrases