SeqyClean: A Pipeline for High-throughput Sequence Data Preprocessing.
Ilya Y. ZhbannikovSamuel S. HunterJames A. FosterMatthew L. SettlesPublished in: BCB (2017)
Keyphrases
- high throughput
- data preprocessing
- preprocessing
- microarray
- genome wide
- preprocessing step
- biological data
- data mining
- feature selection
- web usage mining
- data acquisition
- genomic data
- data cleaning
- mass spectrometry
- gene expression data
- proteomic data
- high dimensionality
- data mining techniques
- cancer diagnosis
- sequence similarity