Enhancing News Articles Clustering using Word N-Grams.
Christos BourasVassilis TsogkasPublished in: DATA (2013)
Keyphrases
- n gram
- news articles
- language model
- text documents
- text classification
- bag of words
- language modeling
- newspaper articles
- variable length
- clustering algorithm
- online news
- language independent
- news corpus
- news sites
- k means
- word segmentation
- news stories
- document clustering
- data points
- part of speech
- unsupervised learning
- document representation
- news events
- information retrieval
- neural network
- document retrieval
- text categorization
- natural language processing
- out of vocabulary