Distance Measures for Clustering of Documents in a Topic Space.
Tomasz WalkowiakMateusz GniewkowskiPublished in: DepCoS-RELCOMEX (2019)
Keyphrases
- distance measure
- cosine similarity
- vector space
- concept space
- distance metric
- document clustering
- dissimilarity measure
- similarity measure
- euclidean distance
- nearest neighbor classification
- distance function
- k means
- clustering method
- information retrieval
- proximity measures
- dynamic time warping
- document representation
- clustering algorithm
- vector space model
- multi document summarization
- reproducing kernel hilbert space
- similarity function
- topic modeling
- kullback leibler
- text documents
- information retrieval systems
- bhattacharyya distance
- topic models
- high dimensional data
- text mining
- histogram intersection