Estimating content concreteness for finding comprehensible documents.
Shinya Tanaka, Adam Jatowt, Makoto P. Kato, Katsumi Tanaka
Published in: WSDM (2013)
Keyphrases
- metadata
- web documents
- textual content
- document content
- multimedia documents
- content and structure
- information retrieval
- semantic tags
- semantic content
- text content
- document retrieval
- relevant content
- multimedia
- structured documents
- document classification
- electronic documents
- document structure
- multimedia content
- semi-structured documents
- information retrieval systems
- content similarity
- semantic information
- xml documents
- pdf files
- keywords
- related documents
- black box
- user generated content
- digital objects
- document collections
- logical structure
- semantic relevance
- html pages
- topic-specific
- text information
- textual information
- web content
- query terms
- text documents
- relevant documents
- digital libraries
- web pages