PDD Crawler: A focused web crawler using link and content analysis for relevance prediction.
Prashant DahiwaleMukesh M. RaghuwanshiLatesh G. MalikPublished in: CoRR (2014)
Keyphrases
- content analysis
- web crawler
- web crawlers
- deep web
- topic specific
- collaborative learning
- search tools
- online discussion
- website
- focused crawler
- search engine
- web crawling
- relevant web pages
- video content
- web forums
- online communities
- relevance feedback
- web pages
- machine learning
- web sources
- databases
- feature space
- social networks