Inferring missing genotypes in large SNP panels using fast nearest-neighbor searches over sliding windows.
Adam RobertsLeonard McMillanWei WangJoel ParkerIvan RusynDavid ThreadgillPublished in: ISMB/ECCB (Supplement of Bioinformatics) (2007)
Keyphrases
- sliding window
- nearest neighbor
- single nucleotide polymorphisms
- genome wide
- data streams
- complex diseases
- knn
- fixed size
- high throughput
- missing data
- training set
- variable size
- space efficient
- continuous queries
- high dimensional
- limited memory
- window size
- human genome
- genetic variation
- streaming data
- index structure
- high dimensional data
- association studies
- data points
- walsh hadamard transform
- genome wide association studies
- genomic data
- statistical methods
- computational approaches
- data sets
- sequence data
- window sizes
- sensor data
- sensor networks
- feature selection