Keyphrases
- data extraction
- structured documents
- semi structured
- structured document retrieval
- web documents
- web data extraction
- html documents
- information retrieval systems
- data integration
- xml documents
- web pages
- query language
- information retrieval
- information extraction
- databases
- query interface
- structured data
- relevant documents
- text mining
- active learning
- data model
- clustering algorithm
- website
- data mining
- database