Identifying contents page of documents.
Qin LuoTakahiro WatanabeTakeshi NakayamaPublished in: ICPR (1996)
Keyphrases
- www pages
- page layout
- html pages
- textual content
- keywords
- metadata
- website
- web pages
- information retrieval
- wikipedia pages
- document collections
- database
- document type
- information retrieval systems
- web information
- text documents
- web documents
- text content
- relevant documents
- document structure
- xml documents
- vector space model
- document clustering
- document retrieval
- logical structure
- vector space
- structured documents
- document classification
- retrieval systems
- html documents
- web browsing
- retrieval model
- multimedia
- user queries
- page contents
- textual contents