Advancing next-generation sequencing data analytics with scalable distributed infrastructure.
Joohyun KimSharath MaddineniShantenu JhaPublished in: Concurr. Comput. Pract. Exp. (2014)
Keyphrases
- scalable distributed
- data analytics
- commodity hardware
- big data
- open source
- business intelligence
- data mining techniques
- data analysis
- cloud computing
- file system
- internet search
- unstructured data
- keyword search
- data mining
- association rules
- information technology
- parallel computing
- parallel processing
- web search engines
- data sets