Record Location and Reconfiguration in Unstructured Multiple-Record Web Documents.
David W. Embley, Li Xu
Published in: WebDB (Selected Papers) (2000)
Keyphrases
- web documents
- semi-structured
- information extraction
- unstructured documents
- web pages
- web search engines
- web data
- unstructured text
- database
- structured data
- keywords
- document classification
- textual information
- link structure
- data records
- topic specific
- html documents
- information retrieval systems
- xml documents
- data representation
- vector space model