Coping with Noisy Training Data Labels in Paraphrase Detection.
Teemu Vahtola, Mathias Creutz, Eetu Sjöblom, Sami Itkonen
Published in: W-NUT (2021)
Keyphrases
- training data
- training set
- automatic detection
- training examples
- detection accuracy
- data sets
- noisy data
- class labels
- detection method
- detection algorithm
- object detection
- label noise
- false alarms
- noisy environments
- decision trees
- supervised learning
- low signal to noise ratio
- incomplete data
- weakly labeled
- test data
- prior knowledge
- speech recognition
- test set
- training samples
- anomaly detection
- training dataset
- classification accuracy
- learning algorithm
- classification models
- generalization error
- training process
- unlabeled data
- training instances
- word level
- information retrieval