Optimizing Apache Nutch for domain specific crawling at large scale.
Luis A. Lopez
Ruth E. Duerr
Siri Jodha Singh Khalsa
Published in:
IEEE BigData (2015)
Keyphrases
domain specific
web crawling
apache nutch
general purpose
search engine
real world
web mining
topic specific
focused crawling
link analysis
database
data mining
web pages
high dimensional
knowledge discovery
ranking algorithm