A non-redundant feature selection method for text categorization based on term co-occurrence frequency and mutual information.
Farek LazharAmira BenaidjaPublished in: Multim. Tools Appl. (2024)
Keyphrases
- text categorization
- mutual information
- feature selection
- document frequency
- information gain
- feature set
- similarity measure
- text classification
- knn
- term weighting
- pairwise
- classification accuracy
- support vector machine
- term frequency
- feature selections
- tf idf
- information theoretic
- text documents
- web search engines
- machine learning
- k nearest neighbor
- feature extraction
- image registration
- probabilistic model