On the Effects of Low-Quality Training Data on Information Extraction from Clinical Reports.
Diego MarcheggianiFabrizio SebastianiPublished in: CoRR (2015)
Keyphrases
- low quality
- information extraction
- training data
- high quality
- natural language processing
- precision and recall
- data sets
- named entity recognition
- decision trees
- test data
- machine learning
- training set
- clinical practice
- classification accuracy
- supervised learning
- clinical data
- learning algorithm
- training process
- structured data
- named entities
- free text
- patient data
- poor quality
- low quality images
- principal component analysis
- information retrieval
- training examples
- text mining
- active learning
- multiresolution
- computer vision