Embedding Knowledge in Web Documents: CGs versus XML-based Metadata Languages.
Philippe MartinPeter W. EklundPublished in: ICCS (1999)
Keyphrases
- web documents
- metadata
- semi structured
- web pages
- information extraction
- domain knowledge
- databases
- web search engines
- web content
- unstructured text
- information resources
- knowledge discovery
- textual information
- wrapper induction
- structured data
- web data
- digital libraries
- vector space model
- domain specific
- document representation
- geographic information
- website
- web directories
- focused crawling
- dynamically generated
- unstructured documents