Login / Signup
Detecting near-duplicates for web crawling.
Gurmeet Singh Manku
Arvind Jain
Anish Das Sarma
Published in:
WWW (2007)
Keyphrases
</>
web crawling
web mining
search engine
data mining
topic specific
data mining and machine learning
web data
deep web
link analysis
text categorization
semi structured