Optimizing a MapReduce module of preprocessing high-throughput DNA sequencing data.
Wei-Chun Chung, Yu-Jung Chang, Chien-Chih Chen, Der-Tsai Lee, Jan-Ming Ho
Published in: IEEE BigData (2013)
Keyphrases
- high throughput
- DNA sequencing
- genomic data
- data sets
- preprocessing
- data acquisition
- data processing
- data analysis
- data collection
- data sources
- database
- biological data
- statistical analysis
- data mining techniques
- systems biology
- low latency
- knowledge discovery
- proteomic data
- DNA sequences
- low cost
- sensor networks
- data streams
- databases
- real time