Version-Aware Word Documents.
Stephen M. Coakley, Jacob Mischka, Cheng Thao
Published in: DChanges@DocEng (2014)
Keyphrases
- word spotting
- word frequencies
- information retrieval
- natural language text
- sentence level
- index terms
- keywords
- printed documents
- document collections
- web documents
- term frequency
- spoken documents
- linguistic information
- word pairs
- related words
- co-occurrence
- word frequency
- information retrieval systems
- word similarity
- word co-occurrence
- page layout
- multiword
- latent topics
- text corpus
- training corpus
- metadata
- xml documents
- document clustering
- document retrieval
- term weighting
- n-gram
- concept space
- document classification
- retrieval systems
- related documents
- sentiment analysis
- document space
- document analysis
- text documents
- handwritten documents
- information extraction
- semantic information
- semantic similarity
- word sense disambiguation
- character recognition
- document level
- vector space model
- word sense