Beyond N-Grams: Can Linguistic Sophistication Improve Language Modeling?
Eric BrillRadu FlorianJohn C. HendersonLidia ManguPublished in: COLING-ACL (1998)
Keyphrases
- n gram
- language modeling
- language model
- text classification
- language modelling
- language independent
- cross lingual
- bag of words
- part of speech
- statistical language modeling
- retrieval model
- word segmentation
- document retrieval
- probabilistic model
- query expansion
- information retrieval
- variable length
- web documents
- vector space model
- text mining
- expert finding
- knn
- natural language
- relevance model
- translation model
- feature selection
- character n grams
- finite state transducers