ADS-HCSpark: A scalable HaplotypeCaller leveraging adaptive data segmentation to accelerate variant calling on Spark.
Anghong XiaoZongze WuShoubin DongPublished in: BMC Bioinform. (2019)
Keyphrases
- data sets
- high dimensional data
- experimental data
- original data
- computer systems
- data analysis
- multiscale
- database
- data structure
- high quality
- feature space
- data sources
- data quality
- image segmentation
- synthetic data
- missing data
- prior information
- intensity images
- raw data
- statistical analysis
- data processing
- training data
- clustering algorithm