Enhancing clustering blog documents by utilizing author/reader comments.
Beibei Li, Shuting Xu, Jun Zhang
Published in: ACM Southeast Regional Conference (2007)
Keyphrases
- document clustering
- blog posts
- clustering method
- clustering algorithm
- k-means
- document collections
- writing style
- relevant documents
- xml documents
- web documents
- data clustering
- text clustering
- document classification
- text documents
- keywords
- metadata
- information retrieval
- self organizing maps
- information retrieval systems
- text mining
- cosine similarity
- hierarchical clustering
- unsupervised learning
- similarity measure
- spectral clustering
- document analysis
- mutual reinforcement
- multi document summarization
- vector space model
- similarity function
- data objects
- document retrieval
- data points
- feature selection