Semi-Supervised SimHash for Efficient Document Similarity Search.
Qixia JiangMaosong SunPublished in: ACL (2011)
Keyphrases
- similarity search
- semi supervised
- efficient search
- indexing structure
- efficient similarity search
- metric space
- distance function
- high dimensional
- knn
- query processing
- uncertain trajectories
- multimedia databases
- similarity measure
- vector space
- efficient indexing
- distance computation
- indexing techniques
- similarity retrieval
- similarity searching
- high dimensional data
- r tree
- locality sensitive hashing
- information retrieval systems
- cross view
- space partitioning
- approximate similarity search
- document clustering
- databases
- neural network
- low dimensional
- data structure
- search engine
- data mining