RIDIRE-CPI: an Open Source Crawling and Processing Infrastructure for Supervised Web-Corpora Building.
Alessandro Panunzi
Marco Fabbri
Massimo Moneglia
Lorenzo Gregori
Samuele Paladini
Published in:
LREC (2012)
Keyphrases
web corpora
open source
search engine
semi-supervised
supervised learning
web pages
query expansion
query translation
learning algorithm
comparable corpora
machine learning