Query-related data extraction of hidden web documents.
Yih-Ling HedleyMuhammad YounasAnne E. JamesMark SandersonPublished in: SIGIR (2004)
Keyphrases
- web documents
- data extraction
- semi structured
- related web pages
- web pages
- information extraction
- web data extraction
- keywords
- web search engines
- semistructured data
- query interface
- web databases
- html pages
- information integration
- user queries
- database
- web sources
- wrapper generation
- data sources
- query processing
- tree structured patterns
- data integration
- link analysis
- semi structured data
- web content
- query expansion
- web users
- web data
- html documents
- website
- databases
- natural language processing
- relevance feedback
- data mining
- machine learning
- search engine
- data structure
- search queries