Login / Signup
CommonCOW: Massively Huge Web Corpora from CommonCrawl Data and a Method to Distribute them Freely under Restrictive EU Copyright Laws.
Roland Schäfer
Published in:
LREC (2016)
Keyphrases
</>
data analysis
similarity measure
statistical model
information retrieval
digital libraries
data sources
clustering method
prior information
data mining
learning algorithm
knowledge discovery
semi supervised
concept hierarchy