Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition.
Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw
Published in: CoRR (2022)
Keyphrases
- voice activity detection
- speech recognition
- noisy environments
- hidden markov models
- language model
- automatic speech recognition
- speech synthesis
- speech processing
- speech recognizer
- pattern recognition
- speech understanding
- speech signal
- image processing
- speech recognition technology
- speaker recognition
- speech enhancement
- speech recognition systems
- speech retrieval
- multiscale
- information retrieval