Exploiting tag and word correlations for improved webpage clustering.
Anusua Trivedi, Piyush Rai, Scott L. DuVall, Hal Daumé III
Published in: SMUC@CIKM (2010)
Keyphrases
- clustering algorithm
- k-means
- clustering method
- search engine
- keywords
- hierarchical clustering
- unsupervised learning
- tag information
- word sense disambiguation
- cluster analysis
- information-theoretic
- n-gram
- image retrieval
- data mining
- language model
- distance metric
- web pages
- social networks
- data clustering
- spectral clustering
- information retrieval
- data sets