Information Extraction from Semi-structured Web Documents.
Bo-Hyun YunChang-Ho SeoPublished in: KSEM (2006)
Keyphrases
- semi structured
- web documents
- information extraction
- natural language processing
- free text
- data extraction
- web data
- structured data
- text mining
- information integration
- relation extraction
- named entities
- web search engines
- information retrieval
- semistructured data
- semi structured data
- html documents
- structured knowledge
- natural language
- unstructured text
- machine learning
- wrapper generation
- web content
- web pages
- web sources
- textual data
- web data sources
- text documents
- unstructured data
- unstructured documents
- textual information
- web information extraction
- xml databases
- tree structured patterns
- semistructured documents
- database systems
- relational databases
- web search
- structured documents
- link structure
- web databases
- domain specific
- web mining