GDedup: Distributed File System Level Deduplication for Genomic Big Data.
Paul BartusEmmanuel ArzuagaPublished in: BigData Congress (2018)
Keyphrases
- big data
- data analysis
- cloud computing
- unstructured data
- big data analytics
- analytic tools
- massive data
- vast amounts of data
- data processing
- data warehousing
- data visualization
- data intensive
- data stores
- social media
- data management
- knowledge discovery
- business intelligence
- health informatics
- data science
- predictive modeling
- high volume
- data analytics
- huge data
- commodity hardware