Interactive web-wrapper construction for extracting relational information from web documents.
Tsuyoshi SugibuchiYuzuru TanakaPublished in: WWW (Special interest tracks and posters) (2005)
Keyphrases
- web documents
- relational information
- wrapper induction
- information extraction
- web pages
- data extraction
- semi structured
- relational learning
- document classification
- web content
- html documents
- web data
- web information extraction
- keywords
- relational data
- website
- background knowledge
- feature selection
- statistical relational learning
- extraction rules
- multi relational
- natural language processing
- active learning
- machine learning
- multidimensional databases