Optimizing data popularity conscious bloom filters.
Ming ZhongPin LuKai ShenJoel I. SeiferasPublished in: PODC (2008)
Keyphrases
- data sets
- data collection
- synthetic data
- original data
- raw data
- data distribution
- training data
- data processing
- data sources
- xml documents
- high quality
- computer systems
- learning algorithm
- prior knowledge
- database
- probability distribution
- data analysis
- statistical analysis
- clustering algorithm
- social networks
- databases
- complex data
- record linkage