Representation of Texts into String Vectors for Text Categorization.
Taeho JoPublished in: J. Comput. Sci. Eng. (2010)
Keyphrases
- term frequency
- text categorization
- text documents
- text classification
- tf idf
- feature selection
- knn
- multi label
- reuters corpus
- automatic text categorization
- naive bayes
- k nearest neighbor
- information gain
- automated text categorization
- semi supervised learning
- document classification
- feature weighting
- feature generation
- image representation
- document categorization
- feature vectors
- text classifiers
- machine learning
- vector space
- unlabeled data
- natural language processing
- data analysis
- feature extraction
- multi instance multi label learning
- data mining
- feature selections
- feature selection for text categorization