Faceted documents: describing document characteristics using semantic lenses.
Silvio Peroni, David M. Shotton, Fabio Vitali
Published in: ACM Symposium on Document Engineering (2012)
Keyphrases
- document content
- unstructured documents
- document collections
- document space
- semantic information
- document classification
- text documents
- relevant documents
- information retrieval systems
- document clustering
- web documents
- digital documents
- document-centric
- information retrieval
- document representation
- electronic documents
- retrieval systems
- document analysis
- vector space model
- semi-structured documents
- document retrieval
- structured documents
- semantically related
- search interface
- document processing
- document type
- document similarity
- related documents
- document ranking
- semantic similarity
- keywords
- multimedia documents
- text representation
- relevance ranking
- textual content
- semantic features
- document set
- metadata
- term frequency
- retrieved documents
- information extraction
- digital libraries
- similar documents
- semantic structure
- xml documents
- semantic content
- document summarization
- latent semantic analysis
- scientific documents
- document repository
- document structure
- automatic text classification
- text mining
- semantic relationships
- wordnet
- concept space
- index terms
- word frequency
- query terms
- search engine
- natural language
- text classification
- query expansion
- document images
- document relevance
- tf-idf
- text classifiers
- semantic search
- manually constructed
- test collection
- text categorization
- semantic annotation
- text lines
- document level