Login / Signup

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment.

Zhaofeng WuAnanth BalashankarYoon KimJacob EisensteinAhmad Beirami
Published in: CoRR (2024)
Keyphrases
  • cross lingual
  • data mining
  • probabilistic model
  • prior knowledge
  • document clustering