The Impacts of Low-Quality Training Data on Information Extraction from Clinical Reports.
Diego MarcheggianiFabrizio SebastianiPublished in: ERCIM News (2018)
Keyphrases
- low quality
- information extraction
- training data
- high quality
- precision and recall
- natural language processing
- text mining
- data sets
- question answering
- free text
- training set
- learning algorithm
- training examples
- classification accuracy
- machine learning
- named entity recognition
- decision trees
- clinical practice
- labeled data
- test data
- named entities
- training process
- text summarization
- clinical trials
- poor quality
- image features
- structured data
- ground truth
- feature space
- image sequences
- image processing
- clinical data
- fingerprint images
- information retrieval