Bulk-Synchronous On-Line Crawling on Clusters of Computers.
Mauricio Marín, Carolina Bonacic
Published in: PDP (2008)
Keyphrases
- clustering algorithm
- search engine
- hierarchical clustering
- agglomerative hierarchical clustering
- data points
- computer systems
- data clustering
- cluster analysis
- arbitrary shape
- computer technology
- web mining
- self organizing maps
- feature selection
- genetic algorithm
- hierarchical structure
- neural network
- document clustering
- data objects
- k means
- clustering framework
- focused crawling
- overlapping clusters
- web crawlers
- information retrieval