Automatic Annotation of Content-Rich HTML Documents: Structural and Semantic Analysis.
Saikat MukherjeeGuizhen YangI. V. RamakrishnanPublished in: ISWC (2003)
Keyphrases
- semantic analysis
- semantic information
- html documents
- automatic annotation
- low level features
- visual information
- high level
- wordnet
- web content
- content based retrieval
- semantic annotation
- metadata
- image annotation
- web documents
- low level
- domain knowledge
- semantic similarity
- contextual information
- keywords
- structured documents
- semantic concepts
- domain ontology
- visual concepts
- natural language processing
- xml documents
- natural language
- background knowledge
- image content
- database
- visual content
- image data
- artificial intelligence