Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment.
Zewen Chi, Li Dong, Bo Zheng, Shaohan Huang, Xian-Ling Mao, Heyan Huang, Furu Wei
Published in: CoRR (2021)
Keyphrases
- cross lingual
- language modeling
- word alignment
- language model
- translation model
- statistical machine translation
- n gram
- probabilistic model
- language independent
- retrieval model
- cross language
- query expansion
- parallel corpus
- document retrieval
- information retrieval
- supervised learning
- machine translation
- test collection
- vector space model
- training data
- context sensitive
- text classification
- relevance model
- machine learning
- transfer learning
- pairwise
- natural language