The use of bigrams to enhance text categorization.
Chade-Meng TanYuan-Fang WangChan-Do LeePublished in: Inf. Process. Manag. (2002)
Keyphrases
- text categorization
- text classification
- feature selection
- knn
- term frequency
- multi label
- naive bayes
- text documents
- semi supervised learning
- k nearest neighbor
- automated text categorization
- feature weighting
- information gain
- reuters corpus
- text classifiers
- automatic text categorization
- term weighting
- training documents
- semantic browsing
- feature selections
- multi instance multi label learning
- distributional clustering
- term selection
- classification accuracy
- learning algorithm
- named entities
- nearest neighbor
- text mining
- language model