Discovering Semantic Sibling Groups from Web Documents with XTREEM-SG.
Marko BrunzelMyra SpiliopoulouPublished in: EKAW (2006)
Keyphrases
- web documents
- semantic association
- unstructured documents
- semi structured
- information extraction
- web pages
- web search engines
- keywords
- textual information
- vector space model
- related web pages
- html documents
- document classification
- semantic web
- semantic information
- web content
- link structure
- machine learning
- web data
- semantic similarity
- semantic features
- domain specific
- document representation
- topic specific
- web directories
- focused crawling
- high level