Web documents clustering with interest links.
Zifeng CuiBaowen XuWeifeng ZhangJunling XuPublished in: SOSE (2005)
Keyphrases
- web documents
- content similarity
- link structure
- semi structured
- information extraction
- web pages
- web search engines
- clustering algorithm
- web content
- document classification
- k means
- clustering method
- keywords
- vector space model
- prefetching
- document representation
- textual information
- focused crawling
- information retrieval
- database systems
- search engine
- data points
- structured documents
- web logs
- knowledge discovery