Random texts exhibit Zipf's-law-like word frequency distribution.
Wentian LiPublished in: IEEE Trans. Inf. Theory (1992)
Keyphrases
- frequency distribution
- natural language text
- english words
- keywords
- text input
- punctuation marks
- linguistic information
- training corpus
- co occurrence
- text corpus
- n gram
- natural language
- word sense
- word sense disambiguation
- syntactic analysis
- text collections
- text retrieval
- text documents
- text categorization
- multi dimensional
- active learning