Research Paper: Enhancing Text Categorization with Semantic-enriched Representation and Training Data Augmentation.
Xinghua LuBin ZhengAtulya VelivelliChengXiang ZhaiPublished in: J. Am. Medical Informatics Assoc. (2006)
Keyphrases
- text categorization
- text representation
- training data
- feature selection
- naive bayes
- text classification
- semantic browsing
- unlabeled data
- knn
- multi label
- training documents
- semi supervised learning
- information gain
- k nearest neighbor
- feature weighting
- reuters corpus
- learning algorithm
- text documents
- data sets
- supervised learning
- term frequency
- automated text categorization
- automatic text categorization
- classification accuracy
- image representation
- training set
- document categorization
- decision trees
- multi instance multi label learning
- document frequency
- term weighting
- text classifiers
- target domain
- semantic information
- feature selection for text categorization