Inherent Nonlinear Distribution of High-Dimensional Genotypic Data Identified as a Possible Source of Confounding Factors in Population Structure Analysis.
Meng Wang
Published in: IEEE ACM Trans. Comput. Biol. Bioinform. (2022)
Keyphrases
- data analysis
- statistical analysis
- high dimensional
- data sets
- data collection
- empirical data
- data points
- synthetic data
- data distribution
- data structure
- data sources
- probability distribution
- knowledge discovery
- input space
- data quality
- raw data
- correlation analysis
- database
- data acquisition
- descriptive statistics
- noisy data
- missing data
- data processing
- input data
- multi dimensional
- image data
- spatial data
- bayesian networks
- sparse data
- training data
- genetic algorithm