cl-dash: rapid configuration and deployment of Hadoop clusters for bioinformatics research in the cloud.
Paul Hodor, Amandeep Chawla, Andrew Clark, Lauren Neal
Published in: Bioinform. (2016)
Keyphrases
- cloud computing
- data analytics
- map reduce
- big data
- clustering algorithm
- open source
- data center
- cloud computing platform
- cloud computing environment
- data management
- hierarchical clustering
- parallel computation
- distributed systems
- data points
- fuzzy clustering
- theorem proving
- data mining techniques
- input data
- data sets
- subspace clustering
- data clustering
- case study
- data mining
- hierarchical structure
- unsupervised clustering
- mapreduce framework
- distributed computing
- community detection
- cluster analysis
- service providers
- pairwise
- similarity measure
- database systems
- image segmentation
- information retrieval
- databases