Hadoop-BAM: directly manipulating next generation sequencing data in the cloud.
Matti NiemenmaaAleksi KallioAndré SchumacherPetri KlemeläEija KorpelainenKeijo HeljankoPublished in: Bioinform. (2012)
Keyphrases
- data analysis
- data sets
- training data
- cloud computing
- big data
- database
- original data
- data collection
- high quality
- data processing
- synthetic data
- data structure
- small number
- open source
- input data
- prior knowledge
- missing data
- experimental data
- data acquisition
- data mining techniques
- probability distribution
- raw data
- data quality