Minimally overlapping words for sequence similarity search.
Martin C. Frith, Laurent Noé, Gregory Kucherov
Published in: Bioinform. (2021)
Keyphrases
- similarity search
- metric space
- sequence databases
- distance function
- multimedia databases
- high dimensional
- similarity measure
- r tree
- similarity searching
- efficient similarity search
- query processing
- cross view
- high dimensional data
- knn
- multimodal data
- indexing techniques
- triangle inequality
- similarity retrieval
- vector space
- locality sensitive hashing
- data sets
- keywords
- nearest neighbor search
- similarity queries
- access methods
- database
- databases
- biological sequences
- feature selection
- binary codes
- low dimensional
- sequential data
- indexing structure
- hash functions
- multimedia
- data analysis
- principal component analysis