A new statistic for identifying batch effects in high-throughput genomic data that uses guided principal component analysis.
Sarah E. ReeseKellie J. ArcherTerry M. TherneauElizabeth J. AtkinsonCeline M. VachonMariza de AndradeJean-Pierre A. KocherJeanette E. Eckel-PassowPublished in: Bioinform. (2013)
Keyphrases
- genomic data
- high throughput
- principal component analysis
- biological data
- genome wide
- microarray
- systems biology
- principal components
- protein protein interactions
- dimensionality reduction
- data acquisition
- face recognition
- gene expression data
- independent component analysis
- molecular biology
- gene expression
- low dimensional
- high speed
- feature extraction
- data mining
- gene expression patterns