Automatic wrapper induction from hidden-web sources with domain knowledge.
Pierre SenellartAvin MittalDaniel MuschickRémi GilleronMarc TommasiPublished in: WIDM (2008)
Keyphrases
- wrapper generation
- web sources
- wrapper induction
- domain knowledge
- data extraction
- semi structured
- information sources
- information integration
- semi automatic
- web information extraction
- web data
- automatic extraction
- multiple sources
- website
- information extraction
- web pages
- deep web
- web users
- data sources
- search engine
- information retrieval
- domain ontology
- data sets
- structured data
- web documents
- key technologies
- data model
- multi view
- information retrieval systems