Improving Term Frequency Normalization for Multi-topical Documents, and Application to Language Modeling Approaches.
Seung-Hoon Na, In-Su Kang, Jong-Hyeok Lee
Published in: CoRR (2015)
Keyphrases
- term frequency
- language modeling approaches
- language modeling
- language model
- tf idf
- text categorization
- text classification
- text documents
- average precision
- retrieval model
- bag of words
- sentence retrieval
- information retrieval
- document representation
- smoothing methods
- keywords
- information retrieval systems
- machine learning
- document collections
- document clustering
- probabilistic model
- xml documents
- image retrieval
- feature space