Focused web crawling in the acquisition of comparable corpora.
Tuomas Talvensaari
Ari Pirkola
Kalervo Järvelin
Martti Juhola
Jorma Laurikkala
Published in:
Inf. Retr. (2008)
Keyphrases
web crawling
comparable corpora
cross language information retrieval
web mining
search engine
topic specific
web data
deep web
link analysis
language modeling
data mining
machine translation
word pairs
text corpora
focused crawling
text categorization
topic modeling
information extraction
information retrieval