Designing seeds for similarity search in genomic DNA.
Jeremy BuhlerUri KeichYanni SunPublished in: RECOMB (2003)
Keyphrases
- similarity search
- sequence databases
- metric space
- genomic sequences
- distance function
- similarity measure
- high dimensional
- knn
- multimedia databases
- similarity searching
- query processing
- high throughput
- high throughput sequencing
- indexing techniques
- cross view
- high dimensional data
- similarity queries
- dna sequences
- r tree
- triangle inequality
- genome sequences
- dynamic time warping
- sequence data
- efficient similarity search
- locality sensitive hashing
- similarity retrieval
- dna copy number
- efficient indexing
- approximate similarity search
- mass spectra
- biological sequences
- nearest neighbor search
- indexing structure
- databases
- searching in metric spaces
- data mining
- neural network