An analysis on Frequency of terms for Text Categorization.
Edgar Moyotl-HernándezHéctor Jiménez-SalazarPublished in: Proces. del Leng. Natural (2004)
Keyphrases
- text categorization
- term frequency
- document frequency
- knn
- multi label
- text classification
- text documents
- semi supervised learning
- feature selection
- feature selection for text categorization
- reuters corpus
- term weighting
- information gain
- co occurrence
- k nearest neighbor
- data analysis
- text collections
- feature weighting
- machine learning
- term selection
- neural network
- naive bayes
- tf idf
- training data
- automatic text categorization
- automated text categorization
- data sets