Feature Selection for Text Classification Based on Gini Coefficient of Inequality.
Sanasam Ranbir Singh, Hema A. Murthy, Timothy A. Gonsalves
Published in: FSDM (2010)
Keyphrases
- text classification
- feature selection
- text categorization
- naive bayes
- bag of words
- feature weighting
- text classifiers
- machine learning
- n gram
- text mining
- sentiment analysis
- feature set
- mutual information
- data cleaning
- text data
- labeled data
- text documents
- knn
- feature engineering
- web page classification
- term frequency
- high dimensionality
- support vector machine
- text classification tasks
- information gain
- feature extraction
- support vector
- dimensionality reduction
- k nearest neighbor
- multi label
- classification accuracy
- data analysis
- feature reduction
- feature selection algorithms
- feature space
- decision trees
- neural network
- supervised feature selection
- irrelevant features
- bayes classifier
- model selection
- selected features
- semantic features
- unsupervised learning
- microarray data
- similarity measure
- information theoretic