DigitHist: a Histogram-Based Data Summary with Tight Error Bounds.
Michael ShekelyanAnton DignösJohann GamperPublished in: Proc. VLDB Endow. (2017)
Keyphrases
- error bounds
- database
- data collection
- data sources
- data sets
- raw data
- high quality
- data analysis
- input data
- neural network
- training data
- original data
- data mining
- synthetic data
- worst case
- lower bound
- data structure
- knowledge discovery
- image data
- upper bound
- data points
- data mining techniques
- probability distribution
- data processing
- social networks
- high dimensional data
- database systems
- data distribution
- high dimensional