Login / Signup
Data Deduplication Cluster Based on Similarity-Locality Approach.
Xingyu Zhang
Jian Zhang
Published in:
GreenCom/iThings/CPScom (2013)
Keyphrases
</>
data collection
data sets
data quality
data analysis
similarity measure
original data
data processing
distance measure
historical data
statistical analysis
small number
image data
end users
social media
high quality
database
input data
missing values
data structure
noisy data
training data
spatial locality