Self-Similarity Metric for Index Pruning in Conceptual Vector Space Models.
Dario Bonino, Fulvio Corno
Published in: DEXA Workshops (2008)
Keyphrases
- vector space model
- vector space
- agglomerative hierarchical clustering
- information retrieval
- language model
- semantic similarity
- average precision
- tf-idf
- document clustering
- latent semantic indexing
- web documents
- semantic information
- retrieval model
- database
- relational databases
- text mining
- dimensionality reduction
- clustering algorithm
- model based clustering