Distance Measures and Stemming Impact on Arabic Document Clustering.
Qusay Walid BsoulEiman Al-ShamariMasnizah MohdJaffar AtwanPublished in: AIRS (2014)
Keyphrases
- document clustering
- distance measure
- cosine similarity
- text mining
- document collections
- euclidean distance
- similarity measure
- text documents
- document clusters
- clustering algorithm
- tf idf
- dynamic time warping
- clustering method
- vector space
- document representation
- distance function
- vector space model
- distance metric
- k means
- information retrieval
- knowledge discovery
- bayesian networks
- computer vision