UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches.
Baris E. SuzekYuqi WangHongzhan HuangPeter B. McGarveyCathy H. WuPublished in: Bioinform. (2015)
Keyphrases
- similarity search
- similarity searching
- distance function
- data partitioning
- input data
- high dimensional
- knn
- metric space
- clustering algorithm
- multimedia databases
- similarity measure
- high dimensional data
- highly scalable
- indexing techniques
- dimensional vector
- sequence databases
- efficient search
- efficient similarity search
- cluster analysis
- query processing
- r tree
- vector space
- data distribution
- data sets
- hierarchical clustering
- multi dimensional
- image data
- data points
- data structure
- decision trees
- machine learning