FASTA/Q data compressors for MapReduce-Hadoop genomics: space and time savings made easy.
Umberto Ferraro PetrilloFrancesco PaliniGiuseppe CattaneoRaffaele GiancarloPublished in: BMC Bioinform. (2021)
Keyphrases
- data sets
- data collection
- training data
- data structure
- data objects
- database
- statistical analysis
- high quality
- data analysis
- data intensive
- big data
- data quality
- raw data
- data distribution
- synthetic data
- computer systems
- dimensional data
- high dimensional data
- cloud computing
- data mining
- knowledge discovery
- data points
- end users
- data sources
- case study