Kernel PCA based clustering for inducing features in text categorization.
Zsolt MinierLehel CsatóPublished in: ESANN (2007)
Keyphrases
- text categorization
- feature generation
- feature weighting
- feature reduction
- linear svm
- information gain
- text classification
- feature selection
- knn
- pattern recognition and machine learning
- feature space
- multi label
- distributional clustering
- automated text categorization
- training documents
- feature selection for text categorization
- text documents
- clustering method
- feature extraction
- k means
- classification accuracy
- text classifiers
- neural network
- feature selections
- decision trees
- reuters corpus
- feature vectors
- document clustering
- feature set
- image classification
- naive bayes
- k nearest neighbor
- tf idf
- co occurrence
- training set
- support vector
- automatic text categorization
- multi instance multi label learning
- clustering algorithm
- unlabeled data
- semi supervised learning