Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis.
Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard
Published in: CoRR (2022)
Keyphrases
- audio visual
- speech enhancement
- multi modal
- visual information
- emotion recognition
- multi stream
- visual data
- multimedia
- speech signal
- noise reduction
- speaker verification
- audio visual speech recognition
- noisy environments
- signal to noise ratio
- audio features
- background noise
- speech recognition
- multi channel
- single channel
- low level