Toward a taxonomy of concepts using web documents structure.
Rim ZarradNarjes DoggazEzzeddine ZagroubaPublished in: iiWAS (2012)
Keyphrases
- web documents
- information extraction
- semi structured
- web pages
- keywords
- web search engines
- document classification
- hierarchical structure
- web data
- textual information
- link structure
- vector space model
- document representation
- focused crawling
- domain knowledge
- xml documents
- logical structure
- content similarity
- unstructured documents