WikiMulti: a Corpus for Cross-Lingual Summarization.
Pavel Tikhonov, Valentin Malykh
Published in: CoRR (2022)
Keyphrases
- cross lingual
- parallel corpus
- mono lingual
- word sense
- parallel corpora
- statistical machine translation
- machine translation
- language modeling
- text classification
- language independent
- cross lingual information retrieval
- cross language
- translation model
- event extraction
- machine translation system
- multi document summarization
- language model
- document clustering
- news articles
- sentiment classification
- coreference resolution
- machine learning
- cross language information retrieval
- transfer learning
- active learning
- open domain
- image retrieval
- chinese english
- natural language
- information retrieval