Getting more from automatic transcripts for semi-supervised language modeling.
Scott NovotneyRichard M. SchwartzSanjeev KhudanpurPublished in: Comput. Speech Lang. (2016)
Keyphrases
- language modeling
- semi supervised
- language model
- information retrieval
- query expansion
- retrieval model
- probabilistic model
- cross lingual
- n gram
- semi supervised learning
- document retrieval
- supervised learning
- relevance model
- text classification
- unlabeled data
- label propagation
- sentence retrieval
- statistical language models
- retrieval effectiveness
- learning algorithm
- labeled data
- search engine
- vector space
- test collection
- speech recognition
- web search
- data points
- pairwise
- pseudo feedback
- dirichlet prior