Crawling the web for structured documents.
Julián UrbanoJuan LlorénsYorgos AndreadakisMónica MarreroPublished in: CIKM (2010)
Keyphrases
- structured documents
- web documents
- web pages
- structured document retrieval
- web mining
- web crawling
- focused crawling
- xml documents
- web crawlers
- information retrieval
- information retrieval systems
- search engine
- web crawler
- document structure
- web graph
- focused crawler
- web search engines
- query language
- relevant documents
- image classification
- information extraction
- domain knowledge
- keywords
- data mining
- database