A comparative study on text representation schemes in text categorization.
Fengxi Song, Shuhai Liu, Jing-Yu Yang
Published in: Pattern Anal. Appl. (2005)
Keyphrases
- text categorization
- text documents
- text collections
- document categorization
- text classifiers
- automatic categorization
- textual data
- text clustering
- text classification
- knn
- text representation
- multi label
- k nearest neighbor
- feature selection
- information gain
- document classification
- text mining
- reuters corpus
- text data
- naive bayes
- text retrieval
- automated text categorization
- tf idf
- word frequency
- feature selection for text categorization
- multi instance multi label learning
- feature selections
- term frequency
- document clustering
- unlabeled data
- natural language processing
- feature generation
- information retrieval
- data sets