MapDupReducer: detecting near duplicates over massive datasets.
Chaokun Wang
Jianmin Wang
Xuemin Lin
Wei Wang
Haixun Wang
Hongsong Li
Wanpeng Tian
Jun Xu
Rui Li
Published in:
SIGMOD Conference (2010)
Keyphrases
massive datasets
massive data
text data
big data
computationally challenging
databases
methods require
data sets
real world
data mining
multimedia
relational databases
spatial data
stored data