Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.
Thanh-Tung Nguyen, Joshua Zhexue Huang, Qingyao Wu, Thuy Thi Nguyen, Mark Junjie Li
Published in: BMC Genom. (2015)
Keyphrases
- genome wide
- random forests
- data sets
- data processing
- training data
- high throughput
- data collection
- decision trees
- data sources
- data points
- data mining
- linkage disequilibrium
- semi supervised
- data analysis
- text mining
- text classification
- supervised learning
- training samples
- prediction accuracy
- benchmark datasets
- data acquisition
- classification accuracy
- feature vectors
- complex diseases
- support vector