Dissimilarity based feature selection for text classification: a cluster based approach.
S. ManjunathB. S. HarishDevanur S. GuruPublished in: ICWET (2011)
Keyphrases
- text classification
- feature selection
- text categorization
- distributional clustering
- naive bayes
- machine learning
- web page classification
- feature weighting
- text documents
- bag of words
- clustering algorithm
- labeled data
- knn
- text mining
- feature engineering
- text classifiers
- text data
- feature reduction
- mutual information
- cluster analysis
- k nearest neighbor
- high dimensionality
- data clustering
- n gram
- semantic features
- feature subset
- feature set
- text classification tasks
- data points
- classification accuracy
- selected features
- support vector
- neural network
- microarray data
- multi label
- unlabeled data
- unsupervised learning
- multi class
- information extraction
- support vector machine
- feature space