Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes.
Yue WangWilson Wen Bin GohLimsoon WongGiovanni MontanaPublished in: BMC Bioinform. (2013)
Keyphrases
- random forests
- genome wide association studies
- genome wide
- complex diseases
- random forest
- decision trees
- biological data
- ensemble methods
- logistic regression
- human genome
- machine learning algorithms
- high throughput
- decision tree ensembles
- database
- prediction accuracy
- single nucleotide polymorphisms
- risk factors
- microarray data
- query processing
- relational databases
- support vector
- machine learning
- data sets