VIP Hashing - Adapting to Skew in Popularity of Data on the Fly (extended version).
Aarati Kakaraparthy, Jignesh M. Patel, Brian P. Kroth, Kwanghyun Park
Published in: CoRR (2022)
Keyphrases
- data sources
- data sets
- data analysis
- data quality
- image data
- data structure
- high quality
- database
- raw data
- data collection
- data mining techniques
- data processing
- experimental data
- statistical analysis
- training set
- original data
- high dimensional
- missing values
- data distribution
- attribute values
- missing data
- synthetic data
- prior knowledge
- relational databases
- nearest neighbor
- machine learning
- data points
- website
- social media