SpaRC: scalable sequence clustering using Apache Spark.
Lizhen Shi, Xiandong Meng, Elizabeth Tseng, Michael Mascagni, Zhong Wang
Published in: Bioinform. (2019)
Keyphrases
- clustering algorithm
- clustering method
- open source
- open source software
- parameter-free
- k-means
- unsupervised learning
- self-organizing maps
- data mining
- data clustering
- database
- distance metric
- fuzzy clustering
- anomaly detection
- case study
- cluster analysis
- web server
- website
- spectral clustering
- search engine
- categorical data
- graph-theoretic
- completely unsupervised