Fusing Gini Index and Term Frequency for Text Feature Selection.
Lin Wu, Yongbin Wang, Shengyan Zhang, Yannan Zhang
Published in: BigMM (2017)
Keyphrases
- term frequency
- text categorization
- feature selection
- text documents
- document frequency
- text classification
- information gain
- tf-idf
- text mining
- bag of words
- machine learning
- average precision
- information retrieval
- document representation
- retrieval model
- mutual information
- keywords
- naive bayes
- n-gram
- knn
- support vector
- topic models
- document clustering
- query terms
- search engine
- decision trees
- feature extraction
- feature space
- support vector machine
- unlabeled data
- k nearest neighbor
- nearest neighbor
- labeled data
- wordnet
- feature ranking
- dimensionality reduction