Improved information gain feature selection method for Chinese text classification based on word embedding.
Lei ZhuGuijun WangXianchun ZouPublished in: ICSCA (2017)
Keyphrases
- feature selection
- information gain
- text classification
- text categorization
- document frequency
- mutual information
- chi squared
- classification accuracy
- unsupervised learning
- feature set
- chi square
- term frequency
- support vector machine
- optimization method
- bag of words
- n gram
- decision trees
- neural network
- similarity measure
- correlation coefficient
- machine learning
- naive bayes
- feature subset
- computer vision
- active learning
- supervised learning
- feature reduction
- genetic algorithm