Text Augmentation Techniques for Document Vector Generation from Russian News Articles.
Christoffer AminoffAleksei RomanenkoOnni KosomaaJouko VankkaPublished in: ICIST (2018)
Keyphrases
- news articles
- text documents
- textual content
- text mining
- newspaper articles
- keywords
- text classification
- text categorization
- news corpus
- document clustering
- text data
- wordnet
- keyphrases
- topic models
- information extraction
- document representation
- news stories
- related words
- web documents
- named entities
- news items
- online news
- text corpus
- web news
- blog entries
- bag of words
- news sites
- news sources
- news feeds
- text corpora
- information retrieval
- feature vectors
- natural language