Approximate Similarity Search in Genomic Sequence Databases Using Landmark-Guided Embedding.
Ahmet SacanIsmail Hakki TorosluPublished in: SISAP (2008)
Keyphrases
- sequence databases
- approximate similarity search
- similarity search
- sequence data
- vector space
- metric space
- binary codes
- high dimensional
- distance function
- locality sensitive hashing
- similarity measure
- protein sequences
- high throughput
- multimedia databases
- high dimensional data
- knn
- query processing
- r tree
- database
- sequential patterns
- indexing techniques
- biological data
- indexing structure
- dynamic programming
- pattern recognition
- databases