Parallel algorithm for indexing large DNA sequences using MapReduce on Hadoop.
Freeson Kaniwa, Otlhapile Dinakenyane, Venu Madhav Kuthadi
Published in: BIBM (2017)
Keyphrases
- parallel algorithm
- dna sequences
- map reduce
- parallel computation
- cloud computing
- parallel programming
- mapreduce framework
- distributed computing
- tandem repeats
- human genome
- data analytics
- binding sites
- big data
- motif discovery
- distributed systems
- dna computing
- shared memory
- binary search trees
- parallel implementations
- sequence patterns
- biological sequences
- dna sequencing
- genomic sequences
- parallel version
- coding regions
- database
- cluster of workstations
- gene structure prediction
- transcription factors
- rna sequences