Assessing the readability of clinical documents in a document engineering environment.
Mark TruranGersende GeorgMarc CavazzaDong ZhouPublished in: ACM Symposium on Document Engineering (2010)
Keyphrases
- document collections
- document classification
- web documents
- relevant documents
- document processing
- information retrieval systems
- document clustering
- information retrieval
- text documents
- retrieval systems
- document representation
- document content
- semi structured documents
- digital documents
- document retrieval
- keywords
- document similarity
- electronic documents
- document type
- document analysis
- document summarization
- vector space model
- structured documents
- digital libraries
- retrieved documents
- document repository
- index terms
- patient records
- multimedia documents
- document set
- free text
- document structure
- scientific documents
- user queries
- similar documents
- semantic information
- related documents
- unstructured documents
- document centric
- training documents
- medical records
- text collections
- document archives
- document level
- printed documents
- pdf documents
- scanned documents
- text mining
- xml documents
- text categorization
- xml format
- document images
- ranked list
- textual content
- inverted index
- text classifiers
- topic hierarchy
- query based sampling
- keyword extraction
- test collection
- query terms
- logical structure
- term frequency
- retrieval strategies
- latent semantic analysis