ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization.
Jiaan WangFandong MengZiyao LuDuo ZhengZhixu LiJianfeng QuJie ZhouPublished in: CoRR (2022)
Keyphrases
- benchmark datasets
- cross lingual
- machine translation
- language modeling
- cross lingual information retrieval
- language independent
- cross language
- event extraction
- text classification
- parallel corpus
- natural language
- parallel corpora
- news articles
- mono lingual
- multi document summarization
- document clustering
- query translation
- translation model
- word sense
- transfer learning
- text summarization
- statistical machine translation
- vector space
- digital libraries
- feature selection