Scalability of Continuous Active Learning for Reliable High-Recall Text Classification.
Gordon V. CormackMaura R. GrossmanPublished in: CIKM (2016)
Keyphrases
- high recall
- text classification
- active learning
- high precision
- labeled data
- machine learning
- precision and recall
- text categorization
- unlabeled data
- transfer learning
- feature selection
- bag of words
- text mining
- fault tolerance
- text documents
- data cleaning
- n gram
- text data
- semantic features
- naive bayes
- text classification tasks
- text classifiers
- active learning strategies
- knn
- semi supervised
- learning algorithm
- selective sampling
- random sampling
- databases
- achieve high precision
- sentiment analysis
- multi label
- learning strategies
- training examples
- support vector machine
- learning process
- training set
- e learning