Sentence-Select: Large-Scale Language Model Data Selection for Rare-Word Speech Recognition.
W. Ronny Huang, Cal Peyser, Tara N. Sainath, Ruoming Pang, Trevor Strohman, Shankar Kumar
Published in: CoRR (2022)
Keyphrases
- speech recognition
- language model
- n-gram
- language modeling
- speech recognition systems
- hidden Markov models
- speech recognizer
- information retrieval
- test collection
- word error rate
- speech synthesis
- speech signal
- pattern recognition
- automatic speech recognition
- translation model
- probabilistic model
- mixture model
- query expansion