Keyphrases
- semi structured
- information extraction
- free text
- text mining
- web documents
- structured data
- textual data
- unstructured text
- text documents
- information extraction systems
- text processing
- natural language text
- data extraction
- information retrieval
- content and structure
- ontology based information extraction
- open domain
- natural language processing
- wrapper generation
- information integration
- semi structured documents
- named entity recognition
- semi structured data
- text summarization
- html pages
- relation extraction
- linguistic patterns
- web data
- unstructured data
- data model
- web data sources
- structured knowledge
- data collections
- text data
- web data extraction
- web sources
- question answering
- text classification
- web mining
- textual information
- machine learning
- extraction rules
- data analysis
- search engine