Fast Processing and Querying of 170TB of Genomics Data via a Repeated And Merged BloOm Filter (RAMBO).
Gaurav GuptaMinghao YanBenjamin ColemanBryce KilleRyan A. Leo ElworthTharun MediniTodd J. TreangenAnshumali ShrivastavaPublished in: CoRR (2019)
Keyphrases
- data processing
- data structure
- data sets
- raw data
- training data
- data collection
- statistical analysis
- data analysis
- input data
- bloom filter
- recent advances
- database
- high quality
- computer systems
- data quality
- real time
- image data
- knowledge discovery
- database systems
- high dimensional data
- synthetic data
- machine learning
- data acquisition
- high throughput
- data mining