Distributed, Scalable Clustering for Detecting Halos in Terascale Astronomy Datasets.
Srivatsava DaruruSankari DhandapaniGunjan GuptaIlian IlievWeijia XuPaul A. NavrátilNena M. MarinJoydeep GhoshPublished in: ICDM Workshops (2010)
Keyphrases
- clustering algorithm
- k means
- distributed systems
- parameter free
- data mining tasks
- synthetic datasets
- synthetic and real datasets
- clustering method
- scalable distributed
- high dimensional datasets
- distributed environment
- multi agent
- clustering approaches
- data clustering
- hierarchical clustering
- cluster analysis
- self organizing maps
- peer to peer
- data sets
- distributed storage
- high scalability
- cooperative
- fault tolerant
- communication cost
- scientific data
- distance metric
- data intensive
- computer networks
- data points
- lightweight
- web scale
- database
- neural network
- anomaly detection
- gene expression profiles
- high dimensional data
- distributed data
- information theoretic
- document clustering