Detecting Content-Bearing Words by Serial Clustering.
Abraham BooksteinShmuel T. KleinTimo RaitaPublished in: SIGIR (1995)
Keyphrases
- k means
- clustering algorithm
- clustering method
- hierarchical clustering
- semantic meaning
- content similarity
- metadata
- multimedia
- keywords
- textual features
- data clustering
- data mining
- related words
- word segmentation
- n gram
- spectral clustering
- web content
- outlier detection
- short text
- web documents
- document content
- data points
- data sets