Feature Ranking Based on Information Gain for Large Classification Problems with MapReduce.
Eftim ZdravevskiPetre LameskiAndrea KulakovBoro JakimovskiSonja FiliposkaDimitar TrajanovPublished in: TrustCom/BigDataSE/ISPA (2) (2015)
Keyphrases
- information gain
- feature ranking
- feature selection
- text categorization
- feature subset
- mutual information
- decision trees
- classification accuracy
- feature subset selection
- naive bayes
- text classification
- high dimensionality
- feature set
- feature space
- support vector
- classification models
- feature selection algorithms
- selected features
- attribute selection
- multi class
- support vector machine
- knn
- machine learning
- logistic regression
- multi label
- dimensionality reduction
- random forest
- prior knowledge
- image classification
- fold cross validation
- genetic algorithm
- data mining