Ranking-Constrained Keyword Sequence Extraction from Web Documents.
Ding-Yi ChenXue LiJing LiuXia ChenPublished in: ADC (2009)
Keyphrases
- web documents
- keywords
- information extraction
- keyword search
- content similarity
- web search engines
- semi structured
- web pages
- keyword queries
- semantic association
- textual information
- search engine
- ranking algorithm
- document representation
- link structure
- topic specific
- web search
- search queries
- vector space model
- link analysis
- html documents
- wrapper induction
- focused crawling
- social annotations
- structured documents
- structured data
- web content
- natural language processing
- unstructured documents
- active learning
- web information extraction
- anchor text
- automatic extraction
- information retrieval systems
- text documents