RoadRunner: Towards Automatic Data Extraction from Large Web Sites.
Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo
Published in: VLDB (2001)
Keyphrases
- data extraction
- website
- web pages
- html pages
- web sources
- web data extraction
- semi structured
- national laboratory
- data integration
- wrapper generation
- search engine
- information extraction
- web server
- web documents
- web search
- query interface
- data records
- databases
- web content
- web users
- web databases
- web search engines
- user interaction