Tunneling enhanced by web page content block partition for focused crawling.
Tao PengChangli ZhangWanli ZuoPublished in: Concurr. Comput. Pract. Exp. (2008)
Keyphrases
- focused crawling
- web page content
- anchor text
- web pages
- web documents
- web search
- search engine
- website
- topic specific
- web sources
- web mining
- semantic information
- test collection
- web content
- document representation
- search tasks
- query logs
- semantic relationships
- link structure
- search tools
- machine learning
- web search engines
- image classification
- information retrieval