Improved Document Feature Selection with Categorical Parameter for Text Classification.
Fen WangXiaoxuan LiXiaotao HuangLing KangPublished in: MSPN (2016)
Keyphrases
- text classification
- feature selection
- document classification
- text documents
- text classifiers
- text categorization
- term frequency
- bag of words
- topic discovery
- automatic text classification
- training documents
- text mining
- document representation
- machine learning
- text data
- knn
- information retrieval
- feature weighting
- naive bayes
- document clustering
- supervised feature selection
- information retrieval systems
- classify documents
- support vector
- sentiment analysis
- n gram
- classification accuracy
- document images
- tf idf
- web documents
- document collections
- labeled data
- multi label
- semantic features
- text classification tasks
- data cleaning
- dimensionality reduction
- feature space
- mutual information
- information extraction
- feature engineering
- unlabeled data
- feature reduction
- clustering algorithm
- feature extraction
- categorical data
- feature selection algorithms