Impact of missing data imputation methods on gene expression clustering and classification.
Marcílio Carlos Pereira de Souto, Pablo A. Jaskowiak, Ivan G. Costa
Published in: BMC Bioinform. (2015)
Keyphrases
- missing data
- gene expression
- microarray gene
- imputation methods
- gene expression profiles
- missing values
- multiple imputation
- microarray
- microarray data analysis
- gene expression analysis
- microarray data
- data imputation
- gene expression data
- colon cancer
- statistical databases
- incomplete data
- clustering analysis
- clustering algorithm
- biological processes
- gene expression patterns
- high dimensionality
- pattern recognition
- DNA microarray
- binding sites
- gene selection
- cancer classification
- machine learning
- high throughput
- unsupervised learning
- data clustering
- k nearest neighbour
- cluster analysis
- classification accuracy
- matrix factorization
- text classification
- clustering method
- database
- feature extraction
- semi supervised
- maximum likelihood
- biclustering algorithms
- self organizing maps
- document clustering
- biological networks