Bootstrapping Information Extraction from Semi-structured Web Pages.
Andrew Carlson, Charles Schafer
Published in: ECML/PKDD (1) (2008)
Keyphrases
- semi structured
- information extraction
- web documents
- data extraction
- web information extraction
- web pages
- web data
- html pages
- web data extraction
- named entity recognition
- structured data
- web data sources
- natural language processing
- relation extraction
- free text
- text mining
- semi structured data
- information integration
- web sources
- wrapper generation
- information retrieval
- website
- web mining
- web search engines
- semi structured documents
- machine learning
- search engine
- unstructured text
- data sets
- natural language
- web content
- textual data
- web databases
- web users
- data collections
- html documents
- knowledge rich
- structured knowledge
- named entities
- keywords
- unstructured data
- deep web
- web search
- information extraction systems
- text documents
- content and structure