A new text classification technique using small training sets.
Fabio ClariziaFrancesco ColaceMassimo De SantoLuca GrecoPaolo NapoletanoPublished in: ISDA (2011)
Keyphrases
- text classification
- bag of words
- feature selection
- text documents
- text categorization
- machine learning
- labeled data
- text classifiers
- naive bayes
- n gram
- text data
- multi label
- text mining
- semantic features
- document classification
- sentiment analysis
- data cleaning
- unlabeled data
- information extraction
- feature reduction
- k nearest neighbor
- database
- knowledge discovery
- semi supervised learning
- support vector machine