Human Performance on Clustering Web Pages: A Preliminary Study.
Sofus A. MacskassyArunava BanerjeeBrian D. DavisonHaym HirshPublished in: KDD (1998)
Keyphrases
- web pages
- clustering algorithm
- search engine
- k means
- web objects
- web search
- web content
- data clustering
- data objects
- hierarchical clustering
- human interaction
- web documents
- clustering method
- web page classification
- document clustering
- spectral clustering
- google search engine
- web content mining
- web page prediction
- cluster analysis
- unsupervised learning
- website
- data sets
- ranking algorithm
- hierarchical structure
- human subjects
- categorical data
- high dimensional data
- web logs
- data extraction
- keywords
- geographical locations
- feature selection
- neural network