Design and analysis of novel similarity measure for clustering and classification of high dimensional text documents.
G. SureshReddyT. V. RajinikanthA. Ananda RaoPublished in: CompSysTech (2014)
Keyphrases
- text documents
- similarity measure
- text classification
- text clustering
- document classification
- high dimensional
- document clustering
- text analysis
- text categorization
- text mining
- clustering method
- machine learning
- clustering algorithm
- feature space
- data points
- text data
- k means
- unsupervised learning
- feature vectors
- information extraction
- data analysis
- similarity search
- training set
- data sets
- high dimensional data
- topic models
- image classification
- hierarchical clustering
- support vector
- supervised learning
- semi supervised