KPTimes: A Large-Scale Dataset for Keyphrase Generation on News Documents.
Ygor Gallina, Florian Boudin, Béatrice Daille
Published in: INLG (2019)
Keyphrases
- keyphrase extraction
- keyphrases
- news articles
- keywords
- news stories
- text documents
- person names
- news items
- document retrieval
- information retrieval
- information retrieval systems
- document collections
- web documents
- real life
- xml documents
- benchmark datasets
- relevant documents
- feature set
- real world
- metadata
- document clustering
- synthetic datasets
- news corpus
- document classification
- social media
- domain specific
- term frequency
- document representation
- topic detection
- multimedia
- query terms
- vector space