Hybrid Selection of Language Model Training Data Using Linguistic Information and Perplexity.
Antonio ToralPublished in: HyTra@ACL (2013)
Keyphrases
- language model
- linguistic information
- training data
- multiword
- language modeling
- n gram
- document retrieval
- probabilistic model
- structural information
- speech recognition
- part of speech
- information retrieval
- test collection
- retrieval model
- linguistic features
- training set
- semantic information
- query expansion
- word error rate
- context sensitive
- domain knowledge
- prior knowledge
- supervised learning
- translation model
- query terms
- vector space model
- cross lingual
- pseudo relevance feedback