XPACK: A High-Performance WEB Document Encoding.
Daniel RoccoJames CaverleeLing LiuPublished in: WEBIST (2005)
Keyphrases
- web documents
- information extraction
- semi structured
- web pages
- keywords
- prefetching
- web search engines
- web logs
- textual information
- dynamically generated
- web content
- vector space model
- web data
- information retrieval systems
- machine learning
- text classification
- metadata
- fractal image compression
- topic specific
- databases