A two-phase sampling technique for information extraction from hidden web databases.
Yih-Ling HedleyMuhammad YounasAnne E. JamesMark SandersonPublished in: WIDM (2004)
Keyphrases
- web databases
- information extraction
- structured data
- semi structured
- data extraction
- database systems
- deep web
- databases
- natural language processing
- user queries
- text mining
- query interface
- database technology
- query relaxation
- database
- database server
- information retrieval
- data management
- search engine
- web mining
- web pages
- machine learning
- web documents
- key technologies
- data sources
- data warehousing
- data types
- web data
- web users
- data integration
- user interaction
- object oriented