WikiLingua: A New Benchmark Dataset for Cross-Lingual Abstractive Summarization.
Faisal LadhakEsin DurmusClaire CardieKathleen R. McKeownPublished in: CoRR (2020)
Keyphrases
- benchmark datasets
- cross lingual
- language modeling
- machine translation
- cross lingual information retrieval
- language independent
- cross language
- text classification
- event extraction
- parallel corpus
- translation model
- text summarization
- language model
- multi document summarization
- mono lingual
- document clustering
- retrieval model
- test collection
- indian languages
- transfer learning
- parallel corpora
- query expansion
- natural language processing
- information retrieval