Automatic Discovery of Semantic Structures in HTML Documents.
Saikat MukherjeeGuizhen YangWenfang TanI. V. RamakrishnanPublished in: ICDAR (2003)
Keyphrases
- automatic discovery
- semantic structures
- html documents
- web documents
- automatic extraction
- web pages
- semantic information
- web services
- semantic network
- semi structured
- semantic web services
- structured documents
- web content
- semistructured data
- knowledge domains
- latent semantic indexing
- xml documents
- information extraction
- wordnet
- information retrieval
- knowledge base
- keywords
- structured data
- relational databases
- domain specific
- probabilistic model