Novel meta-heuristic algorithms for clustering web documents.
Mehrdad MahdaviMorteza Haghir ChehreghaniHassan AbolhassaniRana ForsatiPublished in: Appl. Math. Comput. (2008)
Keyphrases
- web documents
- content similarity
- semi structured
- information extraction
- web pages
- clustering algorithm
- web search engines
- k means
- keywords
- document classification
- clustering method
- returned by a search engine
- html documents
- web data
- link structure
- prefetching
- vector space model
- web content
- machine learning
- topic specific
- document clustering
- web logs
- association rules
- website
- data points
- training set