Efficient scheme for compressing and transferring data in Hadoop clusters.
Seungyeon Lee, Jusuk Lee, Yongmin Kim, Kicheol Park, Jiman Hong, Junyoung Heo
Published in: SAC (2020)
Keyphrases
- data sets
- data analysis
- synthetic data
- training data
- data sources
- data points
- image data
- statistical analysis
- big data
- data objects
- database
- data processing
- data samples
- input space
- open source
- prior knowledge
- data structure
- data collection
- computer systems
- data mining techniques
- data distribution
- raw data
- data quality
- feature space
- cluster centers
- high quality
- compressed data