Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus.
Mikio YamamotoKenneth Ward ChurchPublished in: VLC@COLING/ACL (1998)
Keyphrases
- term frequency
- document frequency
- text categorization
- tf idf
- retrieval model
- text classification
- average precision
- information gain
- bag of words
- feature selection
- term weighting
- document representation
- text documents
- n gram
- vector space model
- retrieved documents
- text data
- image classification
- retrieval effectiveness
- pseudo relevance feedback
- information retrieval systems
- test collection
- query expansion
- image content
- retrieval systems
- decision trees