mPLM-Sim: Unveiling Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
Peiqin LinChengzhi HuZheyu ZhangAndré F. T. MartinsHinrich SchützePublished in: CoRR (2023)
Keyphrases
- cross lingual
- language modeling
- language model
- cross lingual information retrieval
- language independent
- cross language
- probabilistic model
- retrieval model
- information retrieval
- n gram
- similarity measure
- pseudo feedback
- query expansion
- document retrieval
- translation model
- context sensitive
- parallel corpus
- test collection
- transfer learning
- distance measure
- cross language retrieval
- parallel corpora
- word segmentation
- query terms
- machine translation
- query translation
- machine learning
- pseudo relevance feedback
- linguistic resources
- language modeling framework
- vector space model
- retrieval effectiveness
- document clustering
- generative model
- text classification