Login / Signup
Extracting General Lists from Web Documents: A Hybrid Approach.
Fabio Fumarola
Tim Weninger
Rick Barber
Donato Malerba
Jiawei Han
Published in:
IEA/AIE (1) (2011)
Keyphrases
</>
web documents
web pages
web search engines
information extraction
html documents
keywords
semi structured
web data
focused crawling
textual information
vector space model
data mining
geographic information
document classification
web content
website
learning algorithm
database
semistructured documents