Scalable Textual Similarity Search on Large Document Collections Through Random Indexing and K-means Clustering.
Ali CevahirPublished in: PAKDD Workshops (2014)
Keyphrases
- similarity search
- efficient search
- indexing techniques
- multimedia databases
- similarity queries
- indexing structure
- similarity retrieval
- efficient indexing
- nearest neighbor queries
- metric space
- similarity search in high dimensional
- indexing schemes
- high dimensional
- distance function
- metric access methods
- indexing methods
- similarity measure
- similarity search in metric spaces
- indexing scheme
- indexing method
- cross view
- content based multimedia retrieval
- knn
- triangle inequality
- high dimensional data
- distance computation
- r tree
- nearest neighbor search
- content based retrieval
- locality sensitive hashing
- similarity searching
- query processing
- space partitioning
- database
- efficient similarity search
- sequential scan
- keywords
- multimedia
- approximate similarity search
- metadata
- dynamic time warping
- hash functions
- mass spectra
- access methods
- databases