Web page download scheduling policies for green web crawling.
Vassiliki HatziBerkant Barla CambazogluIordanis KoutsopoulosPublished in: SoftCOM (2014)
Keyphrases
- web crawling
- scheduling policies
- search engine
- web pages
- link analysis
- web data
- topic specific
- web crawlers
- load balancing
- scheduling algorithm
- deep web
- web mining
- queueing networks
- focused crawling
- round robin
- web documents
- website
- data mining and machine learning
- data mining
- web content
- web search engines
- heavy traffic
- web crawler
- web search
- response time
- quality of service
- ranking algorithm
- web queries
- text categorization
- query logs
- text classification
- data mining techniques
- web users
- information extraction
- information retrieval
- neural network