URL tree: efficient unsupervised content extraction from streams of web documents.
Borut Sluban
Miha Grcar
Published in:
CIKM (2013)
Keyphrases
web documents
html documents
content extraction
web pages
semi structured
keywords
tree structured patterns
information extraction
web search engines
web data
textual information
web content
document representation
link structure
semi supervised
semistructured data
data model