Reengineering High-throughput Molecular Datasets for Scalable Clustering Using MapReduce.
Trilce EstradaBoyu ZhangMichela TauferPietro CicottiRoger S. ArmenPublished in: HPCC-ICESS (2012)
Keyphrases
- high throughput
- microarray
- genome wide
- mass spectrometry
- biological data
- living cells
- systems biology
- low latency
- virtual screening
- microarray gene
- k means
- genomic data
- gene expression profiles
- protein protein
- clustering method
- clustering algorithm
- interaction networks
- mass spectrometry data
- protein protein interactions
- microarray datasets
- microarray data analysis
- data acquisition
- cluster analysis
- gene expression
- high dimensionality
- experimental conditions
- dna sequencing
- gene expression analysis
- drug discovery
- real time
- proteomic data
- spectral data