Scalable Protein Sequence Similarity Search using Locality-Sensitive Hashing and MapReduce.
Freddie SunarsoSrikumar VenugopalFederico M. LauroPublished in: CoRR (2013)
Keyphrases
- similarity search
- locality sensitive hashing
- protein sequences
- metric space
- protein structure
- nearest neighbor search
- indexing techniques
- distance function
- similarity measure
- high dimensional
- approximate similarity search
- multimedia databases
- hash functions
- query processing
- approximate nearest neighbor search
- vector space
- biological sequences
- approximate nearest neighbor
- similarity queries
- kd tree
- high dimensional data
- indexing structure
- knn
- r tree
- binary codes
- multimedia information retrieval
- machine learning