Aligning a medium-size GPT model in English to a small closed domain in Spanish using reinforcement learning.

Published in: CoRR (2023)

Keyphrases