A Soft Subspace Clustering Method for Text Data Using a Probability Based Feature Weighting Scheme.
Abdul Wahid, Xiaoying Gao, Peter Andreae
Published in: WISE (2) (2015)
Keyphrases
- clustering method
- weighting scheme
- text data
- subspace clustering
- high dimensional data
- high dimensional
- tf idf
- text documents
- text classification
- text mining
- document clustering
- clustering algorithm
- structured data
- cluster analysis
- low dimensional
- spectral clustering
- document representation
- similarity measure
- document collections
- visual words
- k means
- bag of words
- principal component analysis
- data sets
- image features
- feature vectors
- training data
- information extraction
- named entities
- data points
- digital libraries
- text categorization
- knn
- feature extraction