A topic-specific crawling strategy based on semantic similarity.
YaJun Du, QiangQiang Pen, ZhaoQiong Gao
Published in: Data Knowl. Eng. (2013)
Keyphrases
- topic specific
- web crawling
- focused crawling
- web crawler
- search engine
- topic modeling
- web documents
- semantic information
- web pages
- similarity measure
- web queries
- web crawlers
- word pairs
- focused crawler
- similarity measurement
- semantic similarity
- similarity function
- information retrieval systems
- document collections
- probabilistic model
- website