CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs.
Tahmid HasanAbhik BhattacharjeeWasi Uddin AhmadYuan-Fang LiYong-Bin KangRifat ShahriyarPublished in: CoRR (2021)
Keyphrases
- cross lingual
- text summarization
- parallel corpus
- machine translation
- language specific
- natural language processing
- information extraction
- target language
- indian languages
- named entity recognition
- source language
- machine translation system
- cross language
- comparable corpora
- linguistic resources
- natural language
- cross lingual information retrieval
- language independent
- language modeling
- mono lingual
- cross language information retrieval
- bilingual dictionaries
- question answering
- multi document summarization
- query translation
- monolingual retrieval
- parallel corpora
- pairwise
- text mining
- query expansion
- word alignment
- machine learning
- text classification
- named entities
- word pairs
- translation model
- document clustering
- word sense
- news articles
- transfer learning
- data mining
- out of vocabulary
- statistical machine translation
- information retrieval systems
- chinese english
- knowledge base