Information Extraction for Semi-structured Email Corpora.
Hendrik AdamPhilipp SchaerPublished in: LWDA (2019)
Keyphrases
- semi structured
- information extraction
- natural language processing
- data collections
- text mining
- structured data
- linguistic patterns
- free text
- data extraction
- spam filtering
- web documents
- information integration
- named entity recognition
- email messages
- enron email
- semi structured data
- web data
- relation extraction
- question answering
- named entities
- semi structured documents
- text data
- information retrieval
- machine learning
- extraction rules
- web sources
- unstructured text
- textual data
- web data extraction
- machine translation
- text documents
- structured knowledge
- wrapper generation
- wordnet
- knowledge rich
- web data sources
- data model
- natural language
- mailing lists
- unstructured data
- web mining
- knowledge discovery
- information extraction systems
- data sets
- database