ContEst: estimating cross-contamination of human samples in next-generation sequencing data.
Kristian Cibulskis, Aaron McKenna, Tim Fennell, Eric Banks, Mark A. DePristo, Gad Getz
Published in: Bioinform. (2011)
Keyphrases
- data collection
- data sets
- data samples
- high quality
- database
- training examples
- training data
- data sources
- small number
- raw data
- data analysis
- input data
- image data
- computer systems
- data processing
- data acquisition
- training dataset
- databases
- statistical analysis
- synthetic data
- missing data
- spatial data
- human activities
- data quality