NEWSFARM: A Large-Scale Chinese Corpus of Long News Summarization.
Shunan Zang, Chuang Zhang, Xiaojun Liu, Xiaojun Chen, Peng Zhang, Jie Liu
Published in: ICPR (2022)
Keyphrases
- text summarization
- event extraction
- open domain
- news corpus
- topic tracking
- automatic summarization
- monolingual
- person names
- Chinese web
- news articles
- multi-document summarization
- dictionary learning for sparse
- lexical chains
- small scale
- social media
- topic detection and tracking
- writing style
- named entity recognition
- web corpora
- news items
- word segmentation
- keyphrase extraction
- sentence level
- Chinese word segmentation
- co-occurrence
- natural language processing
- real world