A Framework for Robust Information Extraction from Free Text Documents.
Frank MengCraig A. MoriokaJames W. SayrePublished in: AMIA (2016)
Keyphrases
- text documents
- information extraction
- text mining
- named entities
- keywords
- topic models
- machine learning
- natural language processing
- text classification
- wordnet
- structured data
- document classification
- document clustering
- named entity recognition
- web documents
- text data
- relation extraction
- data mining
- data sets
- information extraction systems
- bag of words
- text categorization
- semi supervised learning
- pairwise
- training data