Clustering of Documents Via Similarity Measures.
Hana RezankováDusan HúsekJan SmidVáclav SnáselPublished in: Communications in Computing (2003)
Keyphrases
- similarity measure
- document clustering
- cosine similarity
- clustering method
- clustering algorithm
- similarity function
- k means
- information retrieval
- web documents
- similarity assessment
- xml documents
- data clustering
- hierarchical clustering
- similarity computation
- information retrieval systems
- text clustering
- dissimilarity measure
- cosine measure
- document representation
- vector space model
- euclidean distance
- data objects
- document retrieval
- document classification
- text documents
- keywords
- spectral clustering
- user queries
- similarity search
- high dimensional data
- unsupervised learning
- mutual information