Comparison of Distance Measures for Graph-Based Clustering of Documents.
Adam SchenkerMark LastHorst BunkeAbraham KandelPublished in: GbRPR (2003)
Keyphrases
- distance measure
- cosine similarity
- document clustering
- vector space
- distance metric
- dissimilarity measure
- proximity measures
- euclidean distance
- similarity measure
- nearest neighbor classification
- distance function
- distance calculation
- document collections
- clustering algorithm
- k means
- clustering method
- dynamic time warping
- information retrieval
- similarity function
- document representation
- text documents
- graph model
- relevant documents
- kullback leibler
- bhattacharyya distance
- web documents
- document retrieval
- query terms
- information retrieval systems
- co occurrence
- user queries
- tf idf
- data points
- keywords
- similarity estimation