Identifying Content Blocks from Web Documents.
Sandip Debnath
Prasenjit Mitra
C. Lee Giles
Published in:
ISMIS (2005)
Keyphrases
web documents
web content
information extraction
semi structured
web pages
content similarity
web data
textual information
keywords
document classification
web search engines
prefetching
web logs
focused crawling
vector space model
structured documents
html documents
topic specific
information retrieval systems