Silent versus Modal Multi-Speaker Speech Recognition from Ultrasound and Video.
Manuel Sam RibeiroAciel EshkyKorin RichmondSteve RenalsPublished in: Interspeech (2021)
Keyphrases
- speech recognition
- automatic speech recognition
- hidden markov models
- speech processing
- video sequences
- digital video library
- speech signal
- video content
- speaker identification
- language model
- pattern recognition
- noisy environments
- multimedia
- video data
- speech recognition technology
- speech recognizer
- speaker dependent
- speaker independent
- speech synthesis
- video frames
- speech recognition systems
- speaker diarization
- speech recognizers
- speech retrieval
- speaker recognition
- speaker adaptation
- broadcast news
- video search
- spontaneous speech
- speaker verification
- cepstral coefficients
- noise reduction
- character recognition
- visual data