A Very Fast Method for Clustering Big Text Datasets.
Frank LinWilliam W. CohenPublished in: ECAI (2010)
Keyphrases
- clustering method
- cost function
- computational cost
- detection method
- k means
- significant improvement
- experimental evaluation
- high accuracy
- computational complexity
- support vector machine
- synthetic and real datasets
- clustering algorithm
- synthetic data
- similarity function
- feature set
- objective function
- clustering approaches
- neural network
- high dimensional datasets
- spectral clustering
- outlier detection
- segmentation algorithm
- text mining
- principal component analysis
- similarity measure