Clustering Narrow-Domain Short Texts Using K-Means, Linguistic Patterns and LSI.
Svetlana PopovaVera DanilovaArtem EgorovPublished in: AIST (2014)
Keyphrases
- k means
- short texts
- short text
- clustering algorithm
- clustering method
- linguistic patterns
- cluster analysis
- topic detection
- document clustering
- information extraction
- domain independent
- vector space
- data sets
- numerical data
- expectation maximization
- domain specific
- vector space model
- latent topics
- unsupervised learning
- web pages