A Similarity Measure for Text Classification and Clustering.
Yung-Shen Lin, Jung-Yi Jiang, Shie-Jue Lee
Published in: IEEE Trans. Knowl. Data Eng. (2014)
Keyphrases
- text classification
- similarity measure
- clustering method
- clustering algorithm
- similarity computation
- similarity function
- similarity calculation
- measuring similarity
- hierarchical clustering
- cluster analysis
- hierarchical clustering algorithm
- unsupervised learning
- feature selection
- spectral clustering
- topic discovery
- bag of words
- n-gram
- text categorization
- text mining
- machine learning
- data clustering
- similarity assessment
- k-means
- feature extraction
- information theoretic
- mutual information
- text data
- distributional clustering
- pairwise
- cosine similarity
- dissimilarity measure
- similarity search
- naive bayes
- multi-label
- text classifiers
- data points
- training data
- self-organizing maps