Fast Uncertainty Sampling for Labeling Large E-mail Corpora.
Richard B. Segal, Ted Markowitz, William Arnold
Published in: CEAS (2006)
Keyphrases
- uncertainty sampling
- active learning
- cost-sensitive
- random sampling
- experimental design
- semi-supervised
- learning algorithm
- learning process
- machine learning
- labeled data
- training examples
- supervised learning
- text mining
- semi-supervised learning
- natural language processing
- class imbalance
- unlabeled data
- generalization error
- minimize total