Login / Signup
An Agent-Based Focused Crawling Framework for Topic- and Genre-Related Web Document Discovery.
Nikolaos Pappas
Georgios Katsimpras
Efstathios Stamatatos
Published in:
ICTAI (2012)
Keyphrases
</>
focused crawling
web documents
topic specific
focused crawler
web pages
web search engines
information extraction
keywords
knowledge discovery
semi structured
text content
domain knowledge
probabilistic model
data mining techniques
web content
latent dirichlet allocation
web logs
search engine
machine learning