Algorithmic Gaussianization through Sketching: Converting Data into Sub-gaussian Random Designs.
Michal DerezinskiPublished in: CoRR (2022)
Keyphrases
- data sets
- database
- data sources
- data collection
- data distribution
- experimental data
- synthetic data
- raw data
- statistical analysis
- historical data
- image data
- data analysis
- high quality
- training data
- data mining
- knowledge discovery
- data objects
- data processing
- end users
- data quality
- poisson distribution
- missing data
- high dimensional data
- maximum likelihood
- data warehouse
- small number
- feature space