Convex hulls in hamming space enable efficient search for similarity and clustering of genomic sequences.
David S. Campo, Yury Khudyakov
Published in: BMC Bioinform. (2020)
Keyphrases
- efficient search
- convex hull
- genomic sequences
- similarity search
- Hamming space
- binary codes
- similarity measure
- data points
- Hamming distance
- search problems
- distance function
- hyperplane
- high dimensional data
- DNA sequences
- distance computation
- closest points
- metric space
- previously unknown
- kNN
- learning algorithm
- orders of magnitude
- data streams
- hash functions
- point sets
- pattern mining
- Euclidean distance