Towards minimizing the annotation cost of certified text classification.
Mossaab BagdouriWilliam WebberDavid D. LewisDouglas W. OardPublished in: CIKM (2013)
Keyphrases
- text classification
- feature engineering
- text categorization
- bag of words
- text mining
- naive bayes
- n gram
- feature selection
- high cost
- semantic features
- manual annotation
- data cleaning
- machine learning
- data mining
- knn
- cost sensitive
- multi label
- neural network
- expected cost
- semantic annotation
- sentiment analysis
- labeled data
- databases
- unlabeled data
- text classifiers
- metadata
- active learning
- data warehouse