Login / Signup
A modular open-source focused crawler for mining monolingual and bilingual corpora from the web.
Vassilis Papavassiliou
Prokopis Prokopidis
Gregor Thurmair
Published in:
BUCC@ACL (2013)
Keyphrases
</>
focused crawler
open source
focused crawling
web mining
web logs
website
web documents
web pages
question answering
information retrieval
data mining
machine translation
pattern mining
web data
link structure
document retrieval
text mining
web users
end users
query expansion
cross lingual
keywords
relevant pages