Feature selection based on a normalized difference measure for text classification.
Abdur Rehman, Kashif Javed, Haroon A. Babri
Published in: Inf. Process. Manag. (2017)
Keyphrases
- text classification
- feature selection
- text categorization
- similarity measure
- web page classification
- naive bayes
- bag of words
- n gram
- machine learning
- feature weighting
- text data
- labeled data
- text classifiers
- text mining
- feature engineering
- feature set
- support vector
- unlabeled data
- text documents
- mutual information
- support vector machine
- data cleaning
- classification accuracy
- feature space
- pointwise mutual information
- feature subset
- k nearest neighbor
- dimensionality reduction
- knn
- neural network
- high dimensionality
- multi label
- databases
- data sets