Login / Signup
Automatic Identification of Specific Web Documents by Using Centroid Technique.
Udomsit Sukakanya
Kriengkrai Porkaew
Published in:
WEBIST (2005)
Keyphrases
</>
web documents
automatic identification
semi structured
information extraction
web pages
web search engines
link structure
web content
vector space model
keywords
web data
focused crawling
html documents
databases
domain specific
document representation
dynamically generated
tree structured patterns