Complete Statistical Indexing of Text by Overlapping Word Fragments.
Clinton P. Mah, Raymond J. D'Amore
Published in: SIGIR Forum (1982)
Keyphrases
- string matching
- compressed text
- text retrieval
- information retrieval
- document analysis
- document indexing
- text indexing
- keywords
- english words
- text corpus
- related words
- word counts
- text input
- printed documents
- database
- word spotting
- pattern matching
- sentence level
- page layout
- word pairs
- chinese text
- english text
- syntactic categories
- linguistic information
- natural language text
- lexical features
- multiword
- word level
- text documents
- controlled vocabulary
- document image retrieval
- n gram
- text segments
- chinese text retrieval
- sentence similarity
- punctuation marks
- stop words
- handwritten words
- syntactic analysis
- syntactic information
- index terms
- word sense
- word recognition
- named entity recognizer
- text mining
- machine translation system
- co occurrence
- content based retrieval
- document images
- information retrieval systems
- retrieval engine
- unknown words
- semantic information
- lexical information