RoadRunner: automatic data extraction from data-intensive web sites.
Valter Crescenzi, Giansalvatore Mecca, Paolo Merialdo
Published in: SIGMOD Conference (2002)
Keyphrases
- data intensive
- data extraction
- wrapper generation
- website
- web pages
- html pages
- data management
- data integration
- semi structured
- web data extraction
- web services
- data access
- information extraction
- big data
- database
- grid computing
- structured data
- web databases
- web content
- data processing
- search engine
- machine learning
- data sets