Towards a canonical specification of document structures.
Michael G. HincheyTony CahillPublished in: SIGDOC (1992)
Keyphrases
- information retrieval
- document images
- retrieval systems
- document collections
- document classification
- information retrieval systems
- web documents
- document clustering
- vector space model
- document processing
- high level
- database
- document retrieval
- complex structures
- formal specification
- document representation
- tf idf
- query processing
- relevant documents
- semantic information
- probabilistic model