Login / Signup
Automatic Genre Detection of Web Documents.
Chul Su Lim
Kong Joo Lee
Gil-Chang Kim
Published in:
IJCNLP (2004)
Keyphrases
</>
web documents
semi structured
web search engines
information extraction
web pages
keywords
document classification
web content
html documents
vector space model
focused crawling
structured documents
relational databases
textual information
web data
unstructured documents
document representation
metadata