Crawling the Infinite Web: Five Levels Are Enough.
Ricardo A. Baeza-Yates, Carlos Castillo
Published in: WAW (2004)
Keyphrases
- web pages
- web mining
- web crawling
- web crawlers
- website
- web applications
- semantic web
- web documents
- information sources
- web crawler
- focused crawling
- web content
- web graph
- digital libraries
- search engine
- information space
- link analysis
- data extraction
- web scale
- user generated content
- web users
- database
- neural network
- databases
- web information retrieval
- focused crawler
- web communities
- web sources
- levels of abstraction
- web technologies
- web resources
- web data
- data model