CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and MapReduce.
Victor Giannakouris-Salalidis, Antonia Plerou, Spyros Sioutas
Published in: AIAI Workshops (2014)
Keyphrases
- cosine similarity
- text clustering
- document clustering
- k-means
- vector space model
- clustering algorithm
- text mining
- tf-idf
- document representation
- document collections
- text documents
- clustering method
- similarity measure
- cluster analysis
- vector space
- text categorization
- information retrieval
- machine learning
- text data
- image classification
- language model