Using Factual Density to Measure Informativeness of Web Documents.
Christopher HornAlisa ZhilaAlexander F. GelbukhRoman KernElisabeth LexPublished in: NODALIDA (2013)
Keyphrases
- web documents
- information extraction
- semi structured
- web pages
- web search engines
- document classification
- vector space model
- html documents
- textual information
- similarity measure
- keywords
- database
- document representation
- web content
- unstructured documents
- geographic information
- structured documents
- databases
- web logs
- web search