Scalable imputation of genetic data with a discrete fragmentation-coagulation process.
Lloyd T. Elliott, Yee Whye Teh
Published in: NIPS (2012)
Keyphrases
- data sets
- missing data
- missing values
- end users
- database
- original data
- data analysis
- data collection
- redundant data
- training data
- synthetic data
- data mining techniques
- complex data
- high quality
- data mining
- nearest neighbor
- image data
- data processing
- knowledge discovery
- statistical analysis
- data points
- high dimensional
- data structure
- incomplete data
- query decomposition
- databases
- massive scale
- human errors