Words and Echoes: Assessing and Mitigating the Non-Randomness Problem in Word Frequency Distribution Modeling.
Marco Baroni, Stefan Evert
Published in: ACL (2007)
Keyphrases
- frequency distribution
- n-gram
- related words
- English words
- word meaning
- word recognition
- word segmentation
- word sense disambiguation
- unknown words
- word pairs
- text corpus
- statistical language modeling
- keywords
- word frequencies
- stop words
- syntactic categories
- linguistic information
- multiword
- word spotting
- noun phrases
- word sense
- text documents
- language model