Corpora for Cross-Language Information Retrieval in Six Less-Resourced Languages.
Ilya ZavorinAric BillsCassian CoreyMichelle MorrisonAudrey TongRichard TongPublished in: CLSSTS@LREC (2020)
Keyphrases
- cross language information retrieval
- comparable corpora
- linguistic resources
- parallel corpora
- query translation
- statistical machine translation
- machine translation
- chinese english
- parallel corpus
- bilingual dictionaries
- terminology extraction
- cross language
- language resources
- bilingual lexicon
- english chinese
- multilingual information retrieval
- text corpora
- structured queries
- news articles
- translation model
- machine translation system
- word pairs
- source language
- cross lingual
- language modeling
- query terms
- language independent
- cross language retrieval
- parallel texts
- information retrieval systems
- question answering
- character n grams
- text documents
- out of vocabulary
- n gram
- semantic similarity