A REST protocol and composite format for interactive web documents.
John M. Boyer, Charles Wiecha, Rahul P. Akolkar
Published in: ACM Symposium on Document Engineering (2009)
Keyphrases
- web documents
- information extraction
- web pages
- semi structured
- web search engines
- document classification
- databases
- textual information
- keywords
- wrapper induction
- vector space model
- metadata
- focused crawling
- relational databases
- document representation
- web logs
- structured documents
- html documents
- web search
- text documents
- database systems
- machine learning
- link structure
- database