A new scalable approach for missing value imputation in high-throughput microarray data on apache spark.
Madhuri GuptaBharat GuptaPublished in: Int. J. Data Min. Bioinform. (2020)
Keyphrases
- high throughput
- missing values
- microarray
- microarray data
- gene expression
- data imputation
- missing data
- biological data
- genome wide
- gene selection
- gene expression data
- systems biology
- gene ontology
- incomplete data
- microarray data analysis
- analysis of gene expression
- microarray technology
- experimental conditions
- high dimensionality
- data sets
- gene expression profiles
- meta analysis
- high dimensional data
- regulatory networks
- microarray datasets
- genomic data
- molecular biology
- imputation methods
- cancer classification
- cancer diagnosis
- dna microarray
- gene expression patterns
- gene sets
- mass spectrometry
- gene expression analysis
- gene expression levels
- biological processes
- biological networks
- high dimensional
- saccharomyces cerevisiae
- biologically relevant
- bayesian networks
- feature selection