Optimal Feature Subset Selection Based on Combining Document Frequency and Term Frequency for Text Classification.
Thirumoorthy Karpagalingam, Muneeswaran Karuppaiah
Published in: Comput. Informatics (2020)
Keyphrases
- term frequency
- text classification
- document frequency
- feature selection
- text categorization
- feature subset
- bag of words
- text documents
- term weighting
- naive bayes
- text mining
- machine learning
- n-gram
- labeled data
- k nearest neighbor
- text data
- knn
- retrieval model
- tf-idf
- average precision
- information gain
- language modeling
- data analysis
- semantic information
- support vector
- mutual information
- keywords