RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs.
John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker
Published in: CoRR (2024)
Keyphrases
- language independent
- cross lingual
- multi lingual
- optimization problems
- multilingual information retrieval
- global optimization
- multilingual documents
- optimization algorithm
- constrained optimization
- language specific
- language resources
- expressive power
- digital libraries
- databases
- optimization methods
- preference relations
- soft constraints
- multi objective
- linguistic resources
- cross lingual information retrieval