BiSpark: a Spark-based highly scalable aligner for bisulfite sequencing data.
Seokjun SoeYoonjae ParkHeejoon ChaePublished in: BMC Bioinform. (2018)
Keyphrases
- highly scalable
- data sets
- raw data
- data analysis
- high quality
- data sources
- data points
- data processing
- complex data
- data collection
- data mining techniques
- training data
- input data
- missing values
- synthetic data
- computer systems
- knowledge discovery
- website
- end users
- social media
- xml documents
- search engine
- original data
- noisy data
- data structure
- database