Techniques for data-parallel searching for duplicate elements.
Brenton LessleyKenneth MorelandMatthew LarsenHank ChildsPublished in: LDAV (2017)
Keyphrases
- data sets
- statistical analysis
- data analysis
- data collection
- data processing
- computer systems
- raw data
- high quality
- small number
- input data
- database
- decision trees
- missing data
- experimental data
- complex data
- data cleaning
- data transfer
- historical data
- application domains
- high dimensional data
- probability distribution
- prior knowledge
- training set