Interactive feature selection for document clustering.
Yeming Hu, Evangelos E. Milios, James Blustein
Published in: SAC (2011)
Keyphrases
- document clustering
- feature selection
- text mining
- clustering algorithm
- document representation
- text documents
- text categorization
- document collections
- non-negative matrix factorization
- clustering method
- document clusters
- text classification
- machine learning
- tf-idf
- k-means
- vector space model
- topic extraction
- knn
- model selection
- tolerance rough set
- feature extraction
- data mining
- unsupervised learning
- databases
- dimensionality reduction
- support vector machine
- feature space
- data analysis
- training data
- similarity measure
- document corpus
- ant based clustering