aHDFS: An Erasure-Coded Data Archival System for Hadoop Clusters.
Yuanqi Chen, Yi Zhou, Shubbhi Taneja, Xiao Qin, Jianzhong Huang
Published in: IEEE Trans. Parallel Distributed Syst. (2017)
Keyphrases
- data sets
- raw data
- database
- high quality
- data analysis
- synthetic data
- data points
- data processing
- big data
- data distribution
- input data
- data objects
- data samples
- image data
- knowledge discovery
- data collection
- statistical analysis
- end users
- data structure
- input space
- data management
- high dimensional data
- probability distribution
- missing data
- original data
- data quality
- databases